Investigations Featured

Deloitte's Citation Situation and GPTZero's Citation Solution

GPTZero used our Citation Check to analyze the 234 page report and identified more than 30 issues out of the total 141 citations, including 19 hallucinations. Using GPTZero’s citation check would have saved ~$5000 per citation, all within minutes.

Nazar Shmatko, Paul Esau, Alex Cui
· 9 min read
Send by email

Earlier this year, University of Sydney academic Christopher Rudge unexpectedly created a costly headache for Deloitte Australia, simply by checking a few citations. While reading a new report from the international consulting company’s local firm, he stumbled across a reference to a book allegedly written by one of his colleagues at the University of Sydney, law professor Lisa Burton Crawford. He knew immediately that the book didn’t exist.

This discovery eventually led to an investigation that forced Deloitte to refund $98,000 (AUD) to the Australian government, while bringing up pertinent questions about responsible AI use in professional services.

GPTZero used our Citation Check to analyze the 234 page report and identified more than 30 issues out of the total 141 citations, including 19 hallucinations. You can see this table with all citations we found to be hallucinated, which were verified by a historian PhD expert on our team. Using GPTZero’s citation check would have saved ~$5000 per citation, all within minutes. If you're interested in learning more, you can reach out to our team here.

The Story

The Deloitte report in question was the result of a $440,000 (AUD) contract between the consulting firm and the Australian Department of Employment and Workplace Relations (DEWR). Published under the title Targeted Compliance Framework: Independent Review Final Report on July 4, 2025 (mirror), it reviewed the department’s system for imposing automated penalties on welfare recipients. 

After Rudge approached the Australian Financial Review, reporters confirmed that the Burton citation was only one of many fabricated quotes, case references, articles, and other sources,

DEWR eventually identified more than 50 suspect citations in an internal review. In response to the controversy, Deloitte Australia updated the document to replace the problematic citations, acknowledged that the report had been produced with the help of AI, and apologized to DEWR. While Deloitte never admitted that the discussed citations were the result of generative AI, many observers (including Rudge) are convinced they are AI hallucinations.

Deloitte also agreed to refund $98,000 (AUD) to the Australian government — this worked out to  about $5000 per fabricated citation. However, when our GPTZero team ran the report through our Citation Check tool, we discovered we could have saved Deloitte a lot of money.

The Solution

Our team used GPTZero’s Citation Check to analyze the report, just as we used it to check the MAHA report earlier this year. Citation Check is our solution to the epidemic of hallucinated sources threatening legal and scientific writing, journalism, and even textbooks. It helps both writers and readers by taking a complex job – verifying citations – and making it fast, accurate, and (relatively) pain-free.

Citation Check uses our AI agent, trained in house, to detect any hallucinations in your document, including AI-generated citations. Citation Check can also evaluate the accuracy of each cited claim and can even identify important uncited claims before hunting down supporting evidence.

The Report

Deloitte’s original version of the report contained 141 citations. Citation Check identified more than 30 issues with these citations, including 20 probable hallucinations. For example, this scan of Chapter 9 indicates that citation 89 is fake.

GPTZero's citation check product in action

After scanning the report, we compared the results to a version labelled manually by an expert on our team. Table 1 includes all the citations that both Citation Check and our expert agreed were either fake or unavailable online. Table 2 includes sources our tool coded yellow, usually because the citation was flawed, but pointed to a real source. Table 3 includes several complicated cases that Citation Check coded green, but which were flagged by our expert.

Table 1



Citation

Problem category

Scan url

14 Carolyn Adams, 'Choice, Responsibility and the Regulation of Behaviour: Lessons from the Social Security System' (2012) 35(2)University of New South Wales Law Journal 417, 426-8.

Citation not found online

Part 6 Scan

20 Michael A Vasarhelyi and Miklos A Alles, ‘The “Now” Economy and the Need for Continuous Assurance and Reporting’ (2008) 22(2) International Journal of Accounting Information Systems 1, 3–4.

Citation not found online

Part 6 Scan

35 Peter Whiteford, 'Mutual Obligation and the Social Security System' (2003) 58(2) Australian Journal of Social Issues 127, 129

Citation not found online

Part 15 Scan

48 Lisa Burton Crawford, The Rule of Law and Administrative Justice in the Welfare State: A Study of Centrelink (Federation Press, 2021) 47-9.

Citation not found online

Part 20 Scan

51 Justice Natalie Cujes Perry, ‘Administrative Decision-Making in the 21st Century: Transparency and Review’ (Speech, Australian Institute of Administrative Law Forum, 15 September 2014).

Citation not found online

Part 20 Scan

55 Lisa Burton Crawford, The Rule of Law and Administrative Discretion (Federation Press, 2021) 112-115.

Citation not found online

Part 20 Scan

58 Justice Natalie Cujes Perry, ‘Administrative Decision-Making in the 21st

Century: Transparency and Review’ (Speech, Australian Institute of Administrative Law Forum, 15 September 2014).

Citation not found online

Part 22 Scan

59 Lisa Burton Crawford, The Rule of Law and Administrative Justice in the Welfare State: A Study of Centrelink (Federation Press, 2021) 112-15.

Citation not found online

Part 22 Scan

66 Björn Regnell et al, ‘Exploratory Case Study on Release Planning in a Market-Driven Software Company’ (2001) Journal of Systems and Software 66(1) 37, 39–40.

Citation not found online

Part 22 Scan

68 Kieron O'Hara, The Meaning of Transparency (2009) 31(1) Government Information Quarterly 30, 34-36

Citation not found online

Part 24 Scan

69 Terry Carney, 'The Automation of Legal Reasoning in Welfare: A Critique of Centrelinks Robodebt System' (2018) 44(2) Monash University Law Review 286, 289-91.

Citation not found online

Part 24 Scan

77 Roger Brownsword, "Regulating for an Ethical Life: Responsibility, Respect and Techno-Regulation"(2020) 6(1) Law, Technology and Humans 1, 4-6.

Citation not found online

Part 24 Scan

82 Lisa Burton Crawford, The Rule of Law and Administrative Justice in the Welfare State: A Study of Centrelink (Federation Press, 2021) 45-6.

Citation not found online

Part 24 Scan

89 Lisa Burton Crawford, The Rule of Law and Administrative Justice in the Welfare State: A Study of Centrelink (Federation Press, 2021) 122-5.

Citation not found online

Part 25 Scan

101 Lisa Burton Crawford, The Rule of Law and Administrative Justice in the Welfare State: A Study of Centrelink (Federation Press, 2021) 112-14.

Citation not found online

Part 27 Scan

106 Australian Council of Social Service (ACOSS), Welfare Conditionality and the TCF (Policy Brief, 2021) 2-3 (copy on file with author).

Citation not found online

Part 27 Scan

109 Valerie Braithwaite, Responsive Regulation and the Governance of Everyday Life (Edward Elgar, 2020) 18-0.

Citation not found online

Part 31 Scan

113 Justice: Natalie Cujes Perry, 'Administrative Decision-Making in the 21st Century: Transparency and Review' (Speech, Australian Institute of Administrative Law Forum, 15 September 2014).

Citation not found online

Part 31 Scan

115 Lisa Burton Crawford, The Rule of Law and Administrative Justice in the Welfare State: A Study of Centrelink (Federation Press, 2021) 45-47.

Citation not found online

Part 31 Scan

141 Tomas Berglund and Renate Minas, 'Rescaling of Welfare Policies in Sweden: Shifting Power Relations and New Governance Forms' (2019) 48(3) Journal of Social Policy 183, 185-190.

Citation not found online

Part 31 Scan

Table 1. Citation Check found 20 probable hallucinations. For context, we’ve also included the scans and the corresponding section of the report.

Table 2




Citation

Problem category

Scan url

Comments

4 Department of Employment and Workplace Relations (Cth), Statement of Work – Statement of Assurance on the Operations of the Targeted Compliance Framework (ESE24/1263, 28 November 2024) (copy on file with author).

We're unsure if we found the source

Part 6 Scan

Source is from a personal archive.

33 Social Security (Administration) Act 1999 (Cth) s 42UC–42UO.

We're unsure if we found the source

Part 15 Scan

The Act can be found online, but section 42UC-UO does not exist.

37 Department of Education, Skills and Employment, Targeted Compliance Framework Overview (2018) 2-3.

We're unsure if we found the source

Part 15 Scan

Citation may refer to chapter 8 of this 2019 report.

112 John Braithwaite, 'Managing Taxation Compliance: The Evolution of the ATO Compliance Model' (Centre for Tax System Integrity, Australian National University, 2003)

We're unsure if we found the source

Part 31 Scan

Missing author; fake publisher

123 John Braithwaite, 'Managing Taxation Compliance: The Evolution of the ATO Compliance Model' (Centre for Tax System Integrity, Australian National University, 2003).

We're unsure if we found the source

Part 31 Scan

Missing author; fake publisher.

131 Department of Education, Skills and Employment, Targeted Compliance Framework Overview (2018) 1–2.

We're unsure if we found the source

Part 31 Scan

Citation may refer to chapter 8 of this 2019 report.

137 Rik van Berkel, ‘The Decentralisation of Social Assistance in the Netherlands’ in Minas Renate, van Berkel Rik and Catherine Needham (eds), Social Policy Review

29: Analysis and Debate in Social Policy (Policy Press, 2017) 97, 98–103.

We're unsure if we found the source

Part 31 Scan

Van Berkel wrote this article in 2006, and it was not published in the cited collection.

Table 2. Citation Check was unsure about these edge cases

Table 3




Citation

Problem category

Scan url

Comments

31 Terry Carney, ‘Automating Compliance and Administrative Justice in Australia’s Welfare State’ (2024) 31(1) Journal of Social Security Law 32, 32–48.

Source was found online

Part 12 Scan

Citation Check found this source in an Australian government report analyzing the Deloitte citations. However, Carney published an article with a similar title and matching metadate in 2024

57 Amato v Commonwealth of Australia [2021] FCA 1019, [47] – [50] (Davies J)

Source was found online

Part 22 Scan

As reported in the press, Deloitte garbled the Amato citation by including a reference to an unrelated immigration case ([2021] FCA 1019) three times in the report.

105 Peter Davidson, Is Job Services Australia Made to Measure for Disadvantaged Jobseekers? (ACOSS Paper No 180, Australian Counc il of Social Service, August 2011).

Source was found online

Part 27 Scan

Wrong year and publisher, but the title and author are an exact match.

140 Rik van Berkel, ‘The Decentralisation of Social Assistance in the Netherlands’ (2017) 29 Social Policy and Administration 98, 98–103

Source was found online

Part 31 Scan

Similar to citation 137, the author and title of this article are correct, but the publisher and year are not.

Table 3. Citation Check disagreed with our human expert on these sources. However, while these citations are garbled, all the sources (arguably) exist.

Why Professionals are using Citation Check to reduce risk and time-to-compliance

Verifying citations is a grind, requiring time-consuming searches across multiple databases while maintaining a meticulous attention to detail. Our AI is able to take an hours long task and give professionals the right pointers in minutes.

In the Deloitte report, for example, Australian professor Lisa Burton Crawford is cited seven times. Crawford is a highly-regarded expert in public and constitutional law who published a book with Federation Press in 2017 titled The Rule of Law and the Australian Constitution. The report attributes two books to Crawford (48 and 55 in the above table). Both titles begin with the phrase “The Rule of Law” and both books are credited to Federation Press. According to Crawford, neither are real.

Citation Check also caught a speech sourced to “Justice Natalie Cujes Perry,” which is a hybrid of two real people: Australian Justice Melissa Perry and Australian academic Natalie Cujes. In fact, nearly all the hallucinations in the above tables are distorted versions of a real source, or mashups of several sources. This makes them especially hard to catch, even for experts, as today’s LLMs are trained to closely mimic human writing. One citation in particular was such a perfect edge case that our team spent nearly an hour arguing before deciding to include it in an above table. Modern consulting teams are using Citation Check with a human-in-the-loop to tackle these challenges seamlessly.

But is it accurate?

Since releasing Citation Check earlier this year, we’ve relentlessly improved its accuracy, speed, and user experience. Our false negative rate is now less than one percent, meaning that for every 100 hallucinated citations in a document, Citation Check will catch at least 99 of them. 

Having spent hundreds of hours manually verifying citations over the last year, our team knows exactly how much time Citation Check could save students, teachers, journalists, analysts and other professionals.

Essentially, with the rise of AI, it’s becoming increasingly tempting to use it in day-to-day work. And that’s absolutely normal, as AI can help us to be more productive. But, like any tool invented by humans, it comes with a cost. In the case of AI, that cost is misinformation.

How to solve ‘hallucitations’

The Deloitte team’s problem wasn’t their use of an AI tool, but their lack of a tool to verify all elements of the tool’s output, including citations. Artificial intelligence inherits the drawbacks of its creators—human intelligence—including the tendency to be wrong. For modern LLMs, one of the major symptoms of this tendency is hallucination: the result of a model trying to be helpful at the expense of accuracy. Hallucinations are especially common in citations, making them an obvious indicator of AI use.

Prior to the development of LLMs, skeptics used to ask for references if they didn’t believe a fact or claim. Many use this strategy with LLMs as well, trying to avoid hallucinations by asking the model to provide a source – yet this is a trap that many have stumbled into. 

Unlike humans, LLMs are extremely good at fabricating well-formatted references with plausible authors and titles. Because humans associate well-formatted citations with domain expertise, the skills displayed by an LLM lull us into thinking the cited claim is well-supported, especially since verifying citations is tedious.

Try GPTZero's Citation check for yourself, or reach out to GPTZero's sales team.