It's now increasingly easy to auto generate convincing content using various large-scale language models. Earlier tools were fairly simple, and their limited knowledge kept them from generating convincing content.
Some examples are here:
[This GPT-3 based predictive text generator, known as ChatGPT, also has some data indexed about Hive.]
Problem statement
How do we address auto generated text that is neither plagiarized nor easily detectable, and that is posted to the Hive blockchain for monetization?
In the past we had expensive bots like @cheetah and some others looking for plagiarized content. Now that content can be generated by newer predictive text generation methods based on large language models, identifying it is no longer easy.
Here is a rather cute example of a subject about which content is generated:
Prompt: Please write an article which I can post to Hive blockchain. The article must be about the tip of a safety pin
Is there a way to identify auto generated content?
There is evolving work to identify auto generated content. OpenAI themselves have published a tool here: https://platform.openai.com/ai-text-classifier . Other auto generators, including the older ones, will need different technologies and APIs.
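Since different generators may need different detectors, one way to think about it is a small screening step that accepts any detector as a plug-in. The sketch below is only an illustration under assumptions: `Post`, `flag_likely_generated`, `dummy_score`, and the 0.9 threshold are hypothetical names and values, not part of Hive or of any OpenAI API. The scoring function stands in for whatever classifier is eventually used, and is assumed to return a probability between 0 and 1.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Tuple


@dataclass
class Post:
    """Hypothetical minimal view of a Hive post, for illustration only."""
    author: str    # Hive account name
    permlink: str  # post identifier within the account
    body: str      # post text to be scored


def flag_likely_generated(
    posts: Iterable[Post],
    score_fn: Callable[[str], float],
    threshold: float = 0.9,  # assumed cut-off, would need tuning
) -> List[Tuple[Post, float]]:
    """Return (post, score) pairs whose detector score meets the threshold."""
    flagged = []
    for post in posts:
        score = score_fn(post.body)
        # The detector is assumed to return a probability in [0, 1].
        if not 0.0 <= score <= 1.0:
            raise ValueError(f"score out of range for {post.permlink}: {score}")
        if score >= threshold:
            flagged.append((post, score))
    return flagged


def dummy_score(text: str) -> float:
    """Placeholder, NOT a real detector: returns fixed scores for the demo."""
    return 0.95 if "safety pin" in text.lower() else 0.1


if __name__ == "__main__":
    sample = [
        Post("alice", "pin-article", "The tip of a safety pin is sharp."),
        Post("bob", "weekend-hike", "My weekend hike in the hills."),
    ]
    for post, score in flag_likely_generated(sample, dummy_score):
        print(f"@{post.author}/{post.permlink} flagged with score {score}")
```

Swapping `dummy_score` for a real classifier call would leave the rest unchanged. A flag from such a step is best treated as a hint for human review, since detectors of this kind can be wrong in both directions.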
Auto generated images
The same applies to auto generated images from AI generators such as Stable Diffusion, DALL·E, and many others.
Questions
- Do we, as Hive, have a tool to identify auto generated content?
- Are we planning to allow such content, i.e. auto generated content?