R: Call from French orgs from the cultural industry for transparent AI in EU

Dear Claudia,
should you need more information on the ISCC, you can also contact my colleague Paola Mazzucchi who follows the development of this standard on behalf of AIE and can put you in touch with Sebastian in case.
Best
Giulia

Da: Laurent Le Meur <laurent@edrlab.org>
Inviato: martedì 3 ottobre 2023 13:03
A: Claudia Russo <Claudia@stm-assoc.org>
Cc: public-tdmrep@w3.org
Oggetto: Re: Call from French orgs from the cultural industry for transparent AI in EU

Hi Claudia,

ISCC is a new thing, there is nothing equivalent, from what I know. It will become an ISO Standard. AI companies have not put in place anything so far for identifying content, and content industries are not the fastest animals in the forest when it comes to technology.

Sebastian Posth, one of its main promotors, presented it during the last W3C TPAC for its potential usage against ebook counterfeit; and before that at the EDRLab Digital Publishing Summit 2023. I think it would make a good fit for what we are looking for.

Best regards
Laurent




Le 3 oct. 2023 à 11:46, Claudia Russo <Claudia@stm-assoc.org<mailto:Claudia@stm-assoc.org>> a écrit :

Thank you Laurent

I am personally new to the ISCC, is this much in use by either content industries or AI companies?


Best regards
Claudia
________________________________
From: Laurent Le Meur <laurent@edrlab.org<mailto:laurent@edrlab.org>>
Sent: Tuesday, October 3, 2023 10:14
To: public-tdmrep@w3.org<mailto:public-tdmrep@w3.org> <public-tdmrep@w3.org<mailto:public-tdmrep@w3.org>>
Subject: Call from French orgs from the cultural industry for transparent AI in EU

EXTERNAL EMAIL
The article below was printed in Le Monde on Sept 29th. It is focusing on Generative AI  and the EU AI Act.

<Image2.jpeg>
Tribune : Construisons dès aujourd’hui une Intelligence Artificielle de rang mondial respectueuse de la propriété littéraire et artistique - Syndicat national de l'édition<https://www.sne.fr/actu/tribune-construisons-des-aujourdhui-une-intelligence-artificielle-de-rang-mondial-respectueuse-de-la-propriete-litteraire-et-artistique/>
sne.fr<https://www.sne.fr/actu/tribune-construisons-des-aujourdhui-une-intelligence-artificielle-de-rang-mondial-respectueuse-de-la-propriete-litteraire-et-artistique/>

In summary, it calls for the EU to go further than simply requesting AI companies to publish summaries of copyrighted data used for training (this is the current trend). The request is to obtain total transparency through a detailed list of all works used by Generative AI systems for training, and their sources.

This request is shared by many practitioners, in the EU but also in the US.

Personal thinking: Providing URLs would not be sufficient, because many works appear on multiple URLs that are not managed by rights owners, and many URLs are transient. Such repositories of training sources should therefore index for each training source an ISCC code<https://iscc.foundation/iscc/>,  a date of import, a source url (if any), and optionally a few other metadata (some title). And they should be searchable by ISCC (or title).

This would make it easy to check that an opt-out has been respected, even if a work / content has been syndicated through multiple locations / websites.

What is your opinion on this?

Best regards
Laurent

________________________________

Network Confidentiality Notice

Il presente messaggio, e ogni eventuale documento a questo allegato, potrebbe contenere informazioni da considerarsi strettamente riservate ad esclusivo utilizzo del destinatario in indirizzo. Chiunque ricevesse questo messaggio per errore o comunque lo leggesse senza esserne legittimato è avvertito che trattenerlo, copiarlo, divulgarlo, distribuirlo a persone diverse dal destinatario è severamente proibito ed è pregato di darne notizia immediatamente al mittente oltre che cancellare il messaggio e i suoi eventuali allegati dal proprio sistema.
Ai sensi del Regolamento UE 2016/679, il Titolare del trattamento garantisce la massima riservatezza ed il pieno rispetto degli obblighi previsti dalla normativa nazionale e comunitaria in merito alla protezione dei dati personali.

This message, and any attached file transmitted with it, contains information that may be confidential or privileged for the sole use of the intended recipient. If you are not the intended recipient of this e-mail or read it without entitlement be advised that keeping, copying, disseminating or distributing this message to persons other than the intended recipient is strictly forbidden. You are to notify immediately to the sender and to delete this message and any file attached from your system.
In accordance with EU Reg. 2016/679 (GDPR), the Data Controller guarantees the maximum level of confidentiality and full respect of all obligations provided for by the national and the EU legislation currently in force with regard to protection of personal data..

Received on Tuesday, 3 October 2023 13:37:40 UTC