- From: ProjectParadigm-ICT-Program <metadataportals@yahoo.com>
- Date: Mon, 16 Jan 2023 17:32:14 +0000 (UTC)
- To: public-lod <public-lod@w3.org>, W3C AIKR CG <public-aikr@w3.org>, Public-cogai <public-cogai@w3.org>, semantic-web <semantic-web@w3.org>, W3c Semweb HCLS <public-semweb-lifesci@w3.org>, "public-philoweb@w3.o" <public-philoweb@w3.org>
- Message-ID: <1123117665.1154421.1673890334075@mail.yahoo.com>
Chat GPT is the new buzz in town. And according to the news media, millions have already engaged the chat bot. I am curious to find out if anyone has tested the chat bot to find out how much it knows about the person chatting with it. According to Open AI the data set, which is largely unspecified, runs until Dec 31, 2021 in terms of the data used, collected (scraped) from the Internet. Hundreds of thousands of websites exist which have openly accessible information about people with accounts on them ranging from work, academic, professional, trade, industry, corporate and non-profit domains. Quite a large portion of these have Terms and Conditions and User Agreements that state that the websites do not sell the data to third parties. But most of these website also may have fora, chat groups and messaging systems, of which some content can be made publicly available on the Internet, if the account holder so desires and chooses such an option. Most AI algorithms owned and developed by large Internet companies, and definitely not only Meta, Google, Amazon, Microsoft, but even startups with substantial investor funding are creating large data sets which remain hidden from scrutiny and oversight. It is common knowledge that in quite a few countries around the world combining data sets to collect personal data in public administrations is carefully monitored, regulated and supervised. The same does not hold for the Internet and Internet companies. The current standard for data privacy protection is the General Data Protection Regulation from the European Union, but even this GDPR is far from perfect and being a "gold standard". Therefore I think engaging chat bots with the intent of finding out how much they know about our lives, and to limit the scope for now, to the work, academic, professional, trade, industry, corporate and non-profit domains should yield some clues to how far reaching this scraping for data to include in data sets is going. And it would be nice if this could be done in such a way e.g. via templates or predetermined question sets to make this possible for analysis in a project setting. Too much is being left to chance, investors and market frenzy and less to scrutiny and informed debate with regard to the potential of AI products like ChatGPT. Milton Ponson GSM: +297 747 8280 PO Box 1154, Oranjestad Aruba, Dutch Caribbean Project Paradigm: Bringing the ICT tools for sustainable development to all stakeholders worldwide through collaborative research on applied mathematics, advanced modeling, software and standards development
Received on Monday, 16 January 2023 17:33:14 UTC