Re: Call from French orgs from the cultural industry for transparent AI in EU from Laurent Le Meur on 2023-10-03 (public-tdmrep@w3.org from October 2023)

From: Laurent Le Meur <laurent@edrlab.org>
Date: Tue, 3 Oct 2023 13:03:13 +0200
To: Claudia Russo <Claudia@stm-assoc.org>
Cc: "public-tdmrep@w3.org" <public-tdmrep@w3.org>
Message-Id: <E4A58408-35F0-4465-94F7-FEEA352CB013@edrlab.org>

Hi Claudia, 

ISCC is a new thing, there is nothing equivalent, from what I know. It will become an ISO Standard. AI companies have not put in place anything so far for identifying content, and content industries are not the fastest animals in the forest when it comes to technology.  

Sebastian Posth, one of its main promotors, presented it during the last W3C TPAC for its potential usage against ebook counterfeit; and before that at the EDRLab Digital Publishing Summit 2023. I think it would make a good fit for what we are looking for. 

Best regards
Laurent

 

> Le 3 oct. 2023 à 11:46, Claudia Russo <Claudia@stm-assoc.org> a écrit :
> 
> Thank you Laurent
> 
> I am personally new to the ISCC, is this much in use by either content industries or AI companies?
> 
> 
> Best regards
> Claudia
> From: Laurent Le Meur <laurent@edrlab.org <mailto:laurent@edrlab.org>>
> Sent: Tuesday, October 3, 2023 10:14
> To: public-tdmrep@w3.org <mailto:public-tdmrep@w3.org> <public-tdmrep@w3.org <mailto:public-tdmrep@w3.org>>
> Subject: Call from French orgs from the cultural industry for transparent AI in EU
>  
> EXTERNAL EMAIL
> The article below was printed in Le Monde on Sept 29th. It is focusing on Generative AI  and the EU AI Act. 
> 
> <Image2.jpeg>
> Tribune : Construisons dès aujourd’hui une Intelligence Artificielle de rang mondial respectueuse de la propriété littéraire et artistique - Syndicat national de l'édition
> sne.fr
>  <https://www.sne.fr/actu/tribune-construisons-des-aujourdhui-une-intelligence-artificielle-de-rang-mondial-respectueuse-de-la-propriete-litteraire-et-artistique/>Tribune : Construisons dès aujourd’hui une Intelligence Artificielle de rang mondial respectueuse de la propriété littéraire et artistique - Syndicat national de l'édition <https://www.sne.fr/actu/tribune-construisons-des-aujourdhui-une-intelligence-artificielle-de-rang-mondial-respectueuse-de-la-propriete-litteraire-et-artistique/>
> sne.fr <https://www.sne.fr/actu/tribune-construisons-des-aujourdhui-une-intelligence-artificielle-de-rang-mondial-respectueuse-de-la-propriete-litteraire-et-artistique/>
> 
> In summary, it calls for the EU to go further than simply requesting AI companies to publish summaries of copyrighted data used for training (this is the current trend). The request is to obtain total transparency through a detailed list of all works used by Generative AI systems for training, and their sources. 
> 
> This request is shared by many practitioners, in the EU but also in the US.
> 
> Personal thinking: Providing URLs would not be sufficient, because many works appear on multiple URLs that are not managed by rights owners, and many URLs are transient. Such repositories of training sources should therefore index for each training source an ISCC code <https://iscc.foundation/iscc/>,  a date of import, a source url (if any), and optionally a few other metadata (some title). And they should be searchable by ISCC (or title). 
> 
> This would make it easy to check that an opt-out has been respected, even if a work / content has been syndicated through multiple locations / websites. 
> 
> What is your opinion on this? 
> 
> Best regards
> Laurent

Received on Tuesday, 3 October 2023 11:03:31 UTC