For Machine Learning

Don’t Train Your Models on Bad Data

Whether you’re sourcing training data, developing a foundational LLM, or fine-tuning on your own data, you need to ensure AI-generated text does not make it into your training set. We can help.

Trusted by the industry

“We're using GPTZero to augment our in-house signals to ensure data we create is free of any third-party LLM usage. We trust it so much that anything deemed by GPTZero to be LLM-generated is blocked from submission, and we retroactively use its scores to better tune our in-house ML quality signals and cheat-detection models.”

- Senior Product Manager at Unicorn Data Labeler

Identify AI-written content across the internet

We developed the most comprehensive and accurate AI checker solution for machine learning engineers, and we use it to filter the datasets we source ourselves.

Improve interpretability with our premium API

Rather than produce a binary evaluation of whether text was generated by an LLM, our premium model combines recent breakthroughs in AI detection to offer granular details and interpretability available only on GPTZero.

Fast and easy to implement

Our public REST API is the same one that powers our web application, so you know we’re eating our own dog food. Our response time averages 0.4 s for a 700-word document.

curl --request POST \
  --url https://api.gptzero.me/v2/predict/text \
  --header 'Accept: application/json' \
  --header 'Content-Type: application/json' \
  --header 'x-api-key: ' \
  --data '{
  "document": "string",
  "version": "string"
}'
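For pipelines written in Python, the same call can be sketched with the standard library alone. This is a minimal sketch, not an official client: the endpoint, headers, and the "document" and "version" fields come from the curl command above, while the helper names (build_predict_request, predict), the timeout, and the choice to omit "version" when it is empty are assumptions made for illustration.

```python
import json
import urllib.request

# Endpoint taken from the curl example above.
API_URL = "https://api.gptzero.me/v2/predict/text"


def build_predict_request(document: str, api_key: str, version: str = "") -> urllib.request.Request:
    """Build (but do not send) a POST request mirroring the curl call.

    Hypothetical helper: leaving out "version" when it is empty is an
    assumption, not documented behavior.
    """
    payload = {"document": document}
    if version:
        payload["version"] = version
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Accept": "application/json",
            "Content-Type": "application/json",
            "x-api-key": api_key,
        },
        method="POST",
    )


def predict(document: str, api_key: str, version: str = "", timeout: float = 10.0) -> dict:
    """Send the request and return the decoded JSON body (needs network access and a valid key)."""
    request = build_predict_request(document, api_key, version)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Keeping request construction separate from the network call makes the payload and headers easy to inspect or unit-test before any text is sent.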

Don’t worry about security with our on-prem solution

We understand some customers prefer to keep their data on premises. Run our API on your own infrastructure so your data never leaves your servers.

GPTZero reviews

“The best AI checker” according to our customers and community

TechCrunch

Kyle Wiggins

GPTZero was the only consistent performer, classifying AI-generated text correctly. As for the rest … not so much.

CEO, Transit Pluto Studios

Samantha Warren

GPTZero has been incomparably more accurate than any of the other AI checkers. For me, it’s the best solution to build trust with my clients.

President, American Federation of Teachers

Randi Weingarten

This tool is a magnifying glass to help teachers get a closer look behind the scenes of a document, ultimately creating a better exchange of ideas that can help kids learn.

Director of Learning

Catalina Ramirez

The granular detail provided by GPTZero allows administrators to observe AI usage across the institution. This data is helping guide us on what type of education, parameters, and policies need to be in place to promote an innovative and healthy use of AI.

Teaching Assistant, University of Minnesota

Helen Dombrock

After talking to the class, each student we compiled with GPTZero as possibly using AI ended up telling us they did, which made us extremely confident in GPTZero’s capabilities.

Assistant Professor of Economics, University of Toronto

Kristina McElheran

Harvard Law School

Justin Curl

I'm a huge fan of the writing reports that let me verify my documents are human-written. The writing video, in particular, is a great way to visualize the writing process!

Investor, Washington Partners

Frank Li

Excellent chrome extension. I ran numerous tests on human written content and the results were 100% accurate.

Writer

Nate Walker

Outstanding! This is an extraordinary tool to not only assess the end result but to view the real-time process it took to write the document.

Outlook News

GPTZero is the best AI detection tool for teachers and educators.

FAQs about GPTZero

Everything you need to know about GPTZero and our ChatGPT detector.

Can’t find an answer? You can talk to our customer service team.

How does GPTZero detect AI-generated text?

GPTZero uses deep learning models that keep pace with AI advancements to deliver precise, reliable results that help you understand and interpret the origin of a piece of text. A sentence-by-sentence classifier calculates the probability that a text was created by AI, giving you both document-level and granular insights.

How accurate is GPTZero at detecting AI text?

Independent and internal benchmarking show that GPTZero outperforms competitors at detecting mixed documents (where both AI and human writing have been used), with 96.5% accuracy. Our false positive rate is under 1%, making it one of the most reliable detectors available.

Can GPTZero detect text from the latest AI models?

Yes. GPTZero is continually updated to recognize text from the newest large language models, including ChatGPT (GPT-3, GPT-4, GPT-5), Google Gemini, LLaMA, Claude, and others. As new models are released, we adapt and update our detection system.

How does GPTZero reduce false positives, especially for ESL students?

Since 2022, we've focused on reducing bias for ESL (English as a Second Language) writers. By tagging educational data, adding representative datasets, and using text pre-classification, we've reduced the false positive rate on TOEFL essays to just 1.1%.

How does GPTZero handle paraphrased or modified AI content?

Many try to bypass AI detectors by paraphrasing or using homoglyph substitutions. Our model's Paraphraser Shield technology is designed to catch these cases. Even if AI content has been altered to look more human-like, GPTZero can detect it.

What are the known limitations of AI detection?

No AI detector is 100% accurate, and AI itself is changing constantly. GPTZero performs best on longer texts and English prose. We encourage using it as a conversation starter, not as the final verdict. GPTZero is the only detector specifically de-biased for ESL writers, lowering false positives to 1%.

What data did you train your model on?

Our model is trained on millions of documents spanning various domains of writing, including creative writing, scientific writing, blogs, news articles, and more. We test our models on a never-before-seen set of human and AI articles from a section of our large-scale dataset, in addition to a smaller set of challenging articles that are outside the training distribution.

How do I use and interpret the results from your API?

When you run text through our API, the response includes a document_classification field that labels the text as HUMAN_ONLY, MIXED, or AI_ONLY. Each classification comes with probabilities (the class_probabilities field) and a confidence_category field, which can be high, medium, or low. When the confidence is "high", error rates are below 1%. The API also provides sentence-level highlights through the highlight_sentence_for_ai field, so you can see exactly which parts of the text contributed most to the classification.
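To make these fields concrete for a data-filtering pipeline, here is a minimal Python sketch that keeps a document only when it is labeled HUMAN_ONLY with high confidence and lists the sentences flagged as AI. The field names document_classification, class_probabilities, confidence_category, and highlight_sentence_for_ai come from the answer above. Everything else is an assumption for illustration: the flat result dictionary, the "sentences" list with a "sentence" key, the probability keys, and the helper names. The real response may nest these fields differently.

```python
from typing import Any


def keep_for_training(result: dict[str, Any]) -> bool:
    """Keep a document only if it is labeled human with high confidence."""
    return (
        result.get("document_classification") == "HUMAN_ONLY"
        and result.get("confidence_category") == "high"
    )


def flagged_sentences(result: dict[str, Any]) -> list[str]:
    """Return sentences highlighted as AI (assumes a hypothetical 'sentences' list)."""
    return [
        s["sentence"]
        for s in result.get("sentences", [])
        if s.get("highlight_sentence_for_ai")
    ]


def most_likely_class(result: dict[str, Any]) -> str:
    """Return the key with the largest value in class_probabilities."""
    probabilities = result["class_probabilities"]
    return max(probabilities, key=probabilities.get)


# Hand-written sample in the assumed shape; not real API output.
sample = {
    "document_classification": "MIXED",
    "class_probabilities": {"human": 0.2, "mixed": 0.7, "ai": 0.1},
    "confidence_category": "medium",
    "sentences": [
        {"sentence": "I wrote this part myself.", "highlight_sentence_for_ai": False},
        {"sentence": "This part was generated.", "highlight_sentence_for_ai": True},
    ],
}

print(keep_for_training(sample))   # False: not HUMAN_ONLY with high confidence
print(flagged_sentences(sample))   # ['This part was generated.']
print(most_likely_class(sample))   # mixed
```

Requiring both the HUMAN_ONLY label and high confidence is a deliberately strict policy for training data; a looser pipeline might route medium-confidence documents to manual review instead of dropping them.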

Are you storing data from API calls?

No. We do not store or collect the documents passed into any calls to our API. For dashboard users, only aggregate inputs are stored to improve the service. See our privacy policy for full details.
