OpenAI has built a text watermarking method to detect ChatGPT-written content

From Tom's Hardware: OpenAI has already built and tested a tool to detect whether any written content has been created using ChatGPT. However, the Wall Street Journal reports that the company is holding back the tool from public release because of several concerns.

The tool adds a pattern to how the large language model (LLM) writes its output, allowing OpenAI to detect whether ChatGPT created it. The pattern is imperceptible to human readers, so it does not degrade the quality of the LLM’s output. Internal documentation says that the tool is 99.9% effective in detecting ChatGPT’s output, but OpenAI has yet to release it.
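
The article does not describe how the pattern is embedded or detected. As a rough illustration only, here is a minimal Python sketch of one publicly discussed idea, sometimes called a "green list" watermark: a hash of the previous token marks about half of all possible next tokens as "green", the generator favors green tokens, and a detector checks whether a text contains more green tokens than chance would explain. This is not OpenAI's method. The function names (is_green, generate_watermarked, green_z_score), the gamma parameter, and the toy vocabulary are all hypothetical.

```python
import hashlib
import math


def is_green(prev_token: str, token: str, gamma: float = 0.5) -> bool:
    """Deterministically place `token` on the green list seeded by `prev_token`.

    Assumption: a SHA-256 hash stands in for a secret keyed function.
    Roughly a fraction `gamma` of all tokens ends up green for any given prefix.
    """
    digest = hashlib.sha256(f"{prev_token}|{token}".encode("utf-8")).digest()
    value = int.from_bytes(digest[:8], "big") / 2**64  # map hash to [0, 1]
    return value < gamma


def generate_watermarked(vocab: list[str], length: int, start: str) -> list[str]:
    """Toy generator: always emit the first green word in `vocab`.

    A real model would only nudge probabilities toward green tokens;
    this stand-in has no language model at all.
    """
    tokens = [start]
    for _ in range(length):
        prev = tokens[-1]
        choice = next((t for t in vocab if is_green(prev, t)), vocab[0])
        tokens.append(choice)
    return tokens


def green_z_score(tokens: list[str], gamma: float = 0.5) -> float:
    """Z-score of the green-token count against the chance rate `gamma`.

    Expects 0 < gamma < 1. Scores near 0 are consistent with unmarked text;
    large positive scores suggest the watermark is present.
    """
    n = len(tokens) - 1  # number of (previous, current) token pairs
    if n < 1:
        return 0.0
    green = sum(is_green(a, b, gamma) for a, b in zip(tokens, tokens[1:]))
    return (green - gamma * n) / math.sqrt(n * gamma * (1 - gamma))


if __name__ == "__main__":
    vocab = [f"w{i}" for i in range(50)]  # hypothetical toy vocabulary
    marked = generate_watermarked(vocab, 200, "w0")
    print(round(green_z_score(marked), 2))
```

With gamma at 0.5, a text in which every one of n token transitions is green scores the square root of n, so longer texts give a stronger signal. That is one reason a statistical detector of this kind could be very accurate on long passages while remaining invisible to readers.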

While text watermarking is highly effective for detecting content written by ChatGPT, it does not work on output from other LLMs such as Gemini or Llama 3. Furthermore, the technique is easy to circumvent. For example, you can paste ChatGPT’s output into Google Translate, translate it into another language, and then translate it back into English, which effectively strips the watermark.
