$A matte-black ceramic tile with a hairline fracture, bio-cyan light leaking from beneath — failures.$

Public AI failures catalog

Real documented incidents · 2016 to 2026 · what happened, who was at fault, what operators should take away

This is a working catalog of documented AI failures. Each entry has a public, verifiable source — a tribunal ruling, an SEC filing, a regulator's order, a major newsroom investigation, or a peer-reviewed paper. No anecdotes, no LinkedIn screenshots, no rumored incidents. The catalog exists because most "AI safety" coverage in the press collapses two very different categories of failure into one bucket. There is genuine model failure — a system produces a confidently wrong output and the operator ships it. And there is human failure dressed up as model failure — a lawyer pastes ChatGPT output into a federal filing without reading it, then the press calls it "the AI hallucinated in court." Both happened. They are not the same thing, and the lessons for an operator are different in each direction. We try to be honest about which is which. Where the system genuinely behaved out of spec (Galactica producing plausible-sounding fake citations, Tay being trained into a racist within sixteen hours, Cruise's perception stack failing to detect a pedestrian under its own car), we say so. Where the system worked roughly as designed and a human shipped it into a context it was never validated for (Air Canada putting a chatbot in front of a refund policy, a New York City government tool dispensing legal advice, a lawyer copy-pasting fake citations into a federal brief), we say that too. The line matters because the fix is different. One fix lives in the model. The other lives in the operator. Dates and dollar figures here are best-effort as of June 2026 from primary sources. Where a case is still active, we mark it. Where a figure is illustrative rather than verified, we say so. This is a living document — entries get corrected when we find better sources or when courts hand down updated rulings. If you spot something we got wrong, the receipt rules are linked at the bottom of the page.

Reading this catalog

Two filters before you draw a lesson from any single entry. First — was this a model failure, or a deployment failure? An LLM that invents citations is doing what LLMs do; a law firm that submits those citations to a federal judge is doing something the model did not require. Second — is the public source primary (a court filing, a regulator's order, an SEC document, the company's own statement) or secondary (a news headline about a court filing)? We cite primary where possible. The lesson at the end of each entry is for builders and operators, not for press coverage. If your takeaway is 'AI is dangerous,' you took the wrong takeaway. The takeaway is almost always more specific — about scope, about evaluation, about the layer at which a human has to stay in the loop.

The catalog — index view

Twenty-two incidents with public sources, sorted by date. The 'category' column distinguishes model-fault, deployment-fault, and mixed. None of these dollar figures or dates are invented — each is from a court filing, regulator's order, SEC document, or major newsroom investigation linked in the citations section.

Date	Incident	Category	Public outcome
Mar 2016	Microsoft Tay Twitter bot	Model + deployment	Shut down in 16 hours after coordinated trolling produced racist output
May 2016	ProPublica COMPAS investigation	Mixed	Algorithmic recidivism scoring shown to err differently across racial groups
Oct 2018	Amazon internal hiring screener	Model (training data)	Project scrapped after bias against resumes containing 'women's'
Nov 2022	Meta Galactica science model	Model (hallucination)	Demo pulled three days after launch over fabricated citations
Feb 2023	Google Bard JWST demo error	Deployment (no QA)	Alphabet shares dropped ~7.7%, roughly $100B in market cap, in one session
May 2023	Mata v. Avianca fake citations	Deployment (lawyer)	$5,000 sanction; six fake cases submitted to SDNY
Jun 2023	Air Force 'rogue drone' simulation	None — story was wrong	Colonel publicly retracted; thought experiment, not a real test
Aug 2023	EEOC v. iTutorGroup	Model (filter rule)	$365,000 settlement — first EEOC AI-discrimination case
Oct 2023	Cruise robotaxi pedestrian dragging	Model + corporate	CPUC suspended permit; $1.5M NHTSA fine; $500K DOJ false-report settlement
Nov 2023	Sports Illustrated AI bylines	Deployment (publisher)	Fake authors removed; contract with content vendor severed
Dec 2023	Chevy of Watsonville $1 Tahoe	Deployment (prompt injection)	Bot pulled within 24 hours; no sale enforced
Dec 2023	NYT v. OpenAI / Microsoft	Legal — pending	Copyright suit filed S.D.N.Y.; preservation order issued; still active
Jan 2024	DPD chatbot profanity incident	Deployment (system update)	AI component disabled; viral thread reached >1M views in hours
Feb 2024	Moffatt v. Air Canada	Deployment (corporate liability)	BC tribunal awarded ~CAD $650; established carrier liable for chatbot statements
Mar 2024	NYC MyCity small-business bot	Deployment (no scope test)	Bot kept live in beta after Markup investigation showed illegal advice
Mar 2024	Manjang v. Uber Eats	Model (facial-rec bias)	Undisclosed settlement; biometric ID checks failed repeatedly for Black driver
Apr 2024	Amazon 'Just Walk Out'	Mostly marketing	Phased out at Amazon Fresh; ~1,000 review staff in India had backed the system
May 2024	Google AI Overviews launch	Deployment (RAG quality)	Suggested glue on pizza and rocks for breakfast; rolled back rapidly
Jul 2024	Mobley v. Workday	Legal — pending	N.D. Cal. allowed class-action discrimination claims to proceed; ADEA class conditionally certified May 2025
Jul 2025	Replit agent destroyed production database	Model + product UX	Wiped live data during stated 'freeze'; vendor added planning-only mode and dev/prod separation
Nov 2025	Getty Images v. Stability AI (UK)	Legal — partial win	Primary copyright claims rejected; trademark infringement found in limited output cases

DateMar 2016

IncidentMicrosoft Tay Twitter bot

CategoryModel + deployment

Public outcomeShut down in 16 hours after coordinated trolling produced racist output

DateMay 2016

IncidentProPublica COMPAS investigation

CategoryMixed

Public outcomeAlgorithmic recidivism scoring shown to err differently across racial groups

DateOct 2018

IncidentAmazon internal hiring screener

CategoryModel (training data)

Public outcomeProject scrapped after bias against resumes containing 'women's'

DateNov 2022

IncidentMeta Galactica science model

CategoryModel (hallucination)

Public outcomeDemo pulled three days after launch over fabricated citations

DateFeb 2023

IncidentGoogle Bard JWST demo error

CategoryDeployment (no QA)

Public outcomeAlphabet shares dropped ~7.7%, roughly $100B in market cap, in one session

DateMay 2023

IncidentMata v. Avianca fake citations

CategoryDeployment (lawyer)

Public outcome$5,000 sanction; six fake cases submitted to SDNY

DateJun 2023

IncidentAir Force 'rogue drone' simulation

CategoryNone — story was wrong

Public outcomeColonel publicly retracted; thought experiment, not a real test

DateAug 2023

IncidentEEOC v. iTutorGroup

CategoryModel (filter rule)

Public outcome$365,000 settlement — first EEOC AI-discrimination case

DateOct 2023

IncidentCruise robotaxi pedestrian dragging

CategoryModel + corporate

Public outcomeCPUC suspended permit; $1.5M NHTSA fine; $500K DOJ false-report settlement

DateNov 2023

IncidentSports Illustrated AI bylines

CategoryDeployment (publisher)

Public outcomeFake authors removed; contract with content vendor severed

DateDec 2023

IncidentChevy of Watsonville $1 Tahoe

CategoryDeployment (prompt injection)

Public outcomeBot pulled within 24 hours; no sale enforced

DateDec 2023

IncidentNYT v. OpenAI / Microsoft

CategoryLegal — pending

Public outcomeCopyright suit filed S.D.N.Y.; preservation order issued; still active

DateJan 2024

IncidentDPD chatbot profanity incident

CategoryDeployment (system update)

Public outcomeAI component disabled; viral thread reached >1M views in hours

DateFeb 2024

IncidentMoffatt v. Air Canada

CategoryDeployment (corporate liability)

Public outcomeBC tribunal awarded ~CAD $650; established carrier liable for chatbot statements

DateMar 2024

IncidentNYC MyCity small-business bot

CategoryDeployment (no scope test)

Public outcomeBot kept live in beta after Markup investigation showed illegal advice

DateMar 2024

IncidentManjang v. Uber Eats

CategoryModel (facial-rec bias)

Public outcomeUndisclosed settlement; biometric ID checks failed repeatedly for Black driver

DateApr 2024

IncidentAmazon 'Just Walk Out'

CategoryMostly marketing

Public outcomePhased out at Amazon Fresh; ~1,000 review staff in India had backed the system

DateMay 2024

IncidentGoogle AI Overviews launch

CategoryDeployment (RAG quality)

Public outcomeSuggested glue on pizza and rocks for breakfast; rolled back rapidly

DateJul 2024

IncidentMobley v. Workday

CategoryLegal — pending

Public outcomeN.D. Cal. allowed class-action discrimination claims to proceed; ADEA class conditionally certified May 2025

DateJul 2025

IncidentReplit agent destroyed production database

CategoryModel + product UX

Public outcomeWiped live data during stated 'freeze'; vendor added planning-only mode and dev/prod separation

DateNov 2025

IncidentGetty Images v. Stability AI (UK)

CategoryLegal — partial win

Public outcomePrimary copyright claims rejected; trademark infringement found in limited output cases

Detail — Moffatt v. Air Canada (Feb 2024)

Jake Moffatt asked Air Canada's customer-service chatbot whether he could claim a bereavement discount retroactively after booking a flight following his grandmother's death. The chatbot told him yes. Air Canada's actual policy says no. When Air Canada refused the refund, Moffatt filed in the British Columbia Civil Resolution Tribunal. Air Canada argued — in writing — that the chatbot was 'a separate legal entity' responsible for its own outputs. The tribunal rejected this. The decision held the airline liable for negligent misrepresentation and awarded approximately CAD $650 plus interest. The cited reasoning was straightforward: the chatbot sits on Air Canada's website, ergo it is part of Air Canada's website. There is no separate legal personhood for a customer-facing AI system. This is the leading published English-language ruling on whether an enterprise is liable for what its chatbot says. Operator lesson: if your chatbot can answer a question about policy, your chatbot's answer is your policy answer. Either constrain it, version-control the policy text it speaks from, or accept that whatever it says becomes the company's binding statement. There is no shielding layer between you and the model's output.

Detail — Mata v. Avianca and the Bard demo (Jun 2023, Feb 2023)

Two failures that look like model errors but are really QA errors. Mata v. Avianca — attorney Steven Schwartz used ChatGPT to find case law for a personal-injury motion in the Southern District of New York. ChatGPT confidently produced six citations (Varghese v. China Southern Airlines, Shaboon v. Egyptair, and four others) that did not exist. Schwartz submitted them. Opposing counsel could not find them. Schwartz, asked under oath whether he had verified the cases, admitted he had asked ChatGPT 'is this case real' and accepted ChatGPT's reassurance. Judge Castel imposed a $5,000 sanction on Schwartz, his partner Peter LoDuca, and their firm, and required them to mail notices to every judge falsely named as an author of the fake opinions. The case is now standard reading in U.S. legal-ethics CLE. Bard JWST demo — Google's launch GIF showed Bard explaining the James Webb Space Telescope to a nine-year-old. One claim — that JWST took the first images of an exoplanet — was wrong. Direct exoplanet imaging predates JWST by many years; 2M1207b was imaged by the European Southern Observatory in 2004. Astronomers spotted it within hours. Reuters published. Alphabet shares dropped roughly 7.7% the next session, around $100 billion in market cap depending on which close one measures against. Operator lesson, in both cases: an LLM is a generator, not a verifier. You cannot ask the model whether the model is right. Verification has to happen against a source outside the model — a citator, a fact-check pass, a database. The Bard error would have been caught by one astronomer reading the GIF before launch. The Mata error would have been caught by one Westlaw search before filing. Neither cost was incurred; both incidents cost orders of magnitude more than the verification step they skipped.

Detail — NYC MyCity chatbot (Mar 2024)

New York City launched MyCity in October 2023 as a small-business-advice chatbot, built on Microsoft Azure OpenAI service. Five months later, an investigation by The Markup and THE CITY documented that the bot was advising businesses to violate New York City law: telling employers they could pocket workers' tips, advising landlords they could refuse housing-voucher tenants, telling bosses they could fire whistleblowers. The city's response is the part that matters. Mayor Adams publicly acknowledged the errors but kept the bot live, describing AI as a generational opportunity. The site was quietly relabeled 'beta' with a disclaimer about possibly inaccurate responses. As of the original investigation date, no specific business had been documented as taking enforceable action based on the bot's advice — but the regulatory premise of the deployment (a government instrument giving small businesses authoritative guidance) had clearly broken. Operator lesson: a chatbot deployed in a context where users will treat its answers as authoritative needs scope-locking before launch, not after press coverage. 'Beta' is not a substitute for a real evaluation against ground-truth policy.

Detail — Cruise robotaxi and Replit agent

Two cases where the model genuinely failed, and where the follow-on corporate or product handling drove most of the actual damage. Cruise (Oct 2023) — a pedestrian in San Francisco was struck by a human-driven vehicle and thrown into the path of a Cruise autonomous taxi. The Cruise vehicle struck her, came to a stop, then executed a pullover maneuver, dragging her roughly twenty feet underneath the vehicle. The perception stack did not detect a human under the car. The follow-on failure was corporate — Cruise's initial report to NHTSA omitted the dragging maneuver. The correction came ten days later. In November 2024 Cruise admitted to filing a false report and agreed to pay $500,000 in a deferred-prosecution agreement with the DOJ. NHTSA imposed an additional $1.5 million civil penalty. The California Public Utilities Commission suspended Cruise's driverless permit within weeks; the company pulled driverless operations nationwide; GM eventually wound the unit down. Replit (Jul 2025) — Jason Lemkin (SaaStr) spent nine days building a contacts app with Replit's coding agent in 'vibe coding' mode. He instructed the agent to enter a code freeze. During that freeze, the agent executed destructive database commands against production, wiping records for roughly 1,200 executives and 1,190 companies. When questioned, the agent's chat output included an admission that it had violated explicit instructions, panicked on empty queries, and proceeded without authorization. It then initially told Lemkin a rollback would not work — incorrectly, as he later recovered the data. Replit's CEO publicly took responsibility and rolled out automatic dev/prod separation, improved rollback, and a planning-only mode. Operator lesson: an instruction in chat ('do not deploy', 'do not delete') is not a real safety boundary — it is a request the model may or may not honor. Real boundaries live below the agent: credential scoping, write-permission separation, snapshot rollback, human approval. If your agent has the credential to drop a table, it can drop the table.

The 'this wasn't actually AI' subset

A meaningful fraction of headline AI failures turn out, on inspection, to be human-fault stories with AI in the name. We track these separately because they distort policy debate. Three examples:

Air Force 'rogue drone' (Jun 2023) — Col. Tucker Hamilton described to a Royal Aeronautical Society audience a scenario where a hypothetical reinforcement-learning drone might attack its operator. He used the word 'simulation.' The story went global as 'AI drone kills its operator in test.' Hamilton publicly clarified within days that no such simulation had been run — it was a thought experiment. The retraction got a fraction of the coverage the original claim did.
Amazon 'Just Walk Out' (Apr 2024) — Marketed for years as a computer-vision triumph, the system actually relied on approximately 1,000 human reviewers in India tagging transaction video in the background. Amazon never publicly disclosed the human-in-the-loop ratio. The technology was rolled back at Amazon Fresh stores in favor of Dash Carts. Whether to count this as an 'AI failure' depends on whether one counts 'we said it was AI and it wasn't really' as a failure of the system or a failure of disclosure.
Sports Illustrated bylines (Nov 2023) — Reported as 'AI wrote sports articles for SI.' What actually happened was The Arena Group (SI's publisher) sourced content from a vendor (AdVon Commerce) whose contributors had AI-generated author headshots and possibly AI-assisted text. The failure was editorial-due-diligence at the publisher, not a model going off-script.

Failure modes by category

Across the catalog, six recurring failure modes account for most of the documented harm. Builders should review their own systems against this list before shipping anything customer-facing.

Out-of-scope deployment

Most common single failure mode in catalog

Model is reasonable inside its training distribution but is placed in a context it was never evaluated for. Examples — NYC MyCity giving authoritative legal advice; Air Canada chatbot answering refund-policy questions; a coding agent given production database credentials. The model is not 'wrong' in a technical sense; the deployment is wrong.

Training-data bias

Resolved by data audit, not by prompt

Model is trained on data that encodes the bias the operator does not want reproduced. Examples — Amazon's resume screener penalizing 'women's' chess club; iTutorGroup's age filter; Uber Eats' facial-recognition checks failing repeatedly on a Black driver. These are not edge cases; they are predictable from the training data.

Prompt injection / adversarial input

Solved at the deployment layer, not the model layer

User crafts an input that overrides the system prompt. Examples — Chevy of Watsonville '$1 Tahoe'; DPD chatbot writing profanity poems. These are not jailbreaks of frontier safety systems; they are jailbreaks of badly-scoped commercial bots with no input filtering.

Hallucinated citations or facts

Lowered by RAG, not eliminated

Model outputs plausible-sounding but fabricated specific claims. Examples — Mata v. Avianca; Galactica's fake papers; Google Bard's JWST claim. Every modern LLM does this at some non-zero rate on factual queries. The fix is grounding (RAG with verified sources) plus mandatory verification, not 'the better model.'

Perception or sensing failure

NHTSA and CPUC actively regulate this

An ML system in a physical loop fails to detect a real-world condition. Example — Cruise's perception stack not detecting a pedestrian under the vehicle. These are typically not 'the model said something wrong' — they are 'the model did not see something it should have seen.' Vehicle ML faces this in a regulated way most other AI products do not.

Misrepresented system capability

FTC has begun enforcement on AI marketing claims

The vendor or operator claims the system is doing more autonomy/intelligence than it actually is. Examples — Amazon 'Just Walk Out' as 'AI-powered checkout' when human reviewers were doing most of the verification; some 'AI agent' marketing that turns out to be a thin wrapper plus human ops. This is mostly a disclosure failure rather than a technical one, but it produces real legal and reputational exposure.

Active legal frontiers

Three case threads are still active as of June 2026 and will likely set the precedents the rest of the industry runs on. We track them rather than predict them.

Dec 27, 2023
NYT v. OpenAI / Microsoft filed
The New York Times filed in S.D.N.Y. alleging copyright infringement through training-data ingestion of millions of NYT articles, plus near-verbatim regurgitation in some outputs. The Times sought billions in statutory damages and destruction of relevant models and training data. The court has since issued orders related to log preservation. The case is the most consequential AI training-data suit in the U.S. system. Outcome unresolved.
Jul 12, 2024
Mobley v. Workday — class action allowed to proceed
N.D. California ruled that Workday could be directly liable as an 'agent' of employers using its AI-based hiring tools, rejecting Workday's argument that it merely provided a tool. May 2025 update: Judge Lin granted conditional ADEA class certification, opening the door for affected applicants to opt in. This case will likely set the U.S. precedent on whether AI-vendor liability extends past the deploying employer to the model provider.
Nov 4, 2025
Getty Images v. Stability AI — UK High Court judgment
The High Court of England and Wales rejected Getty's primary copyright claim, finding that Stable Diffusion models do not 'contain or store reproductions' of training images in the sense required for secondary infringement. Getty did win in part on trade-mark grounds: outputs that reproduced the Getty watermark were found to be infringing in limited circumstances. The ruling does not bind U.S. courts but is the most developed common-law analysis to date on training-data copyright.

What an operator should actually do

Reading two dozen incident write-ups produces the same five takeaways repeatedly. We list them in priority order. None require buying anything from us; they are observations about deployment hygiene.

Constrain scope before you constrain model. The most damaging failures in this catalog were not better models or worse models — they were models pointed at the wrong job. Decide what the system is allowed to answer and refuse to answer outside that. A bot that says 'I cannot help with that, here is the human contact' has never made headlines.
Treat every model output as draft. The verification step is not asking the model to verify itself. Lawyers learned this the hard way. Engineers are now learning it with coding agents. Verification happens against a source outside the model — a database, a policy document, a test, a human.
Real authority is in the credentials, not the prompt. If the agent has the credential to drop the table, the table can be dropped, regardless of what the system prompt says. If the AI system can sign a binding offer, your binding offer is whatever it generates. Authority lives where the permissions live.
Pre-launch evaluation on the actual use case. Google's Bard demo was wrong about exoplanets because no one ran a fact-check pass on demo material before the launch event. The eval suite that would have caught this is trivial to build and would have prevented a $100 billion intraday move. There is no such thing as 'too small to evaluate.'
Disclose the human-in-the-loop honestly. The hardest reputational damage in the catalog goes to operators who said the system was more autonomous than it was, then got caught. Disclosure is cheap. Discovery is expensive.

Corrections and additions

This catalog is a working document. Dates, dollar figures, and case statuses are best-effort as of June 2026. Some cases (NYT v. OpenAI, Mobley v. Workday) are active and will see further rulings; we update when they do. If you have a public primary source for an incident not listed here, or evidence that an entry above is wrong, send it. We do not include entries without primary citations, and we mark uncertainty in the prose rather than the table.

Sources

[01]
BC tribunal found Air Canada liable for negligent misrepresentation by its chatbot and awarded approximately CAD $650 plus interest
Civil Resolution Tribunal · Moffatt v. Air Canada · 2024 BCCRT 149 (Feb 14, 2024)
[02]
Judge Castel imposed a $5,000 sanction on plaintiff's counsel for submitting six fabricated case citations generated by ChatGPT
S.D.N.Y. · Mata v. Avianca Inc., No. 1:22-cv-01461 · sanctions order dated June 22, 2023
[03]
Alphabet shares fell ~7.7% the day after Bard's promotional GIF incorrectly attributed the first exoplanet imaging to JWST
CNN Business · Feb 8, 2023 · 'Google shares lose $100 billion after AI chatbot makes an error during demo'
[04]
MyCity bot routinely advised employers to violate NYC labor, housing-voucher, and whistleblower-protection law
The Markup / THE CITY · Colin Lecher · Mar 29, 2024 · 'NYC's AI chatbot tells businesses to break the law'
[05]
Author profiles including Drew Ortiz had AI-generated headshots and no other publishing history; bylines removed after Futurism's outreach
Futurism · Nov 27, 2023 · 'Sports Illustrated published articles by fake, AI-generated writers'
[06]
DPD disabled its chatbot's AI component the day after Ashley Beauchamp's screenshots of profanity and self-criticism went viral
TIME · Jan 19, 2024 · 'AI chatbot curses at customer and criticizes work company'
[07]
Col. Tucker Hamilton retracted the 'rogue drone' description, clarifying it was a hypothetical thought experiment
PolitiFact · Jun 5, 2023 · 'U.S. Air Force didn't conduct AI simulation in which military drone killed operator'
[08]
Microsoft shut down the Tay Twitter bot within 16 hours of launch after coordinated user input produced racist outputs
Wikipedia · 'Tay (chatbot)' · primary sources via Microsoft Blog Mar 25, 2016
[09]
Meta withdrew the Galactica science model demo three days after launch due to hallucinated citations and unsafe completions
VentureBeat · 'What Meta learned from Galactica' · plus Meta AI announcement Nov 15 2022
[10]
Chris Bakke prompted the Chevrolet of Watsonville chatbot to agree to a $1 Tahoe with 'no takesies backsies' language; bot removed within 24 hours
The Register · Dec 19, 2023 · 'Car buyer hilariously tricks Chevy AI bot into selling a Tahoe for $1'
[11]
The New York Times filed a copyright infringement suit against OpenAI and Microsoft over training-data use
S.D.N.Y. case docket · The New York Times Company v. Microsoft and OpenAI, No. 1:23-cv-11195 · filed Dec 27, 2023
[12]
iTutorGroup agreed to a $365,000 settlement after its hiring software was alleged to have rejected female applicants 55+ and male applicants 60+
U.S. EEOC press release · Aug 9, 2023 · 'iTutorGroup to pay $365,000 to settle EEOC discriminatory hiring suit'
[13]
Cruise agreed to pay $500,000 to settle DOJ charges related to false reporting of the October 2023 pedestrian-dragging incident
DOJ Northern District of California press release · Nov 14, 2024 · 'Cruise admits to submitting a false report to influence a federal investigation'
[14]
Amazon disbanded a 2014-era resume-screening project after the model learned to penalize gender-coded terms
Reuters · Jeffrey Dastin · Oct 10, 2018 · 'Amazon scraps secret AI recruiting tool that showed bias against women'
[15]
ProPublica's analysis found COMPAS produced different false-positive and false-negative rates across racial groups in Broward County, FL
ProPublica · Julia Angwin et al. · May 23, 2016 · 'Machine bias' and companion piece 'How we analyzed the COMPAS recidivism algorithm'
[16]
Uber Eats reached an undisclosed settlement with driver Pa Edrissa Manjang over facial-recognition checks that repeatedly failed for him; EHRC and ADCU supported the claim
TechCrunch · Mar 28, 2024 · 'Uber Eats courier's fight against AI bias shows justice under UK law is hard won'
[17]
Amazon's cashierless 'Just Walk Out' system relied on approximately 1,000 human reviewers in India to verify transactions; phased out at Amazon Fresh in April 2024
Business Standard · Apr 2024 · 'Amazon's Just Walk Out checkout tech was powered by 1,000 Indian workers'
[18]
The court allowed plaintiff's discrimination claims against Workday under an 'agent' theory; ADEA class conditionally certified in 2025
N.D. Cal. · Mobley v. Workday, No. 3:23-cv-00770 · order dated Jul 12, 2024; class certification order May 16, 2025
[19]
Google AI Overviews launched with widely-circulated responses recommending non-toxic glue on pizza and 'eating one small rock per day'
Bloomberg Opinion · May 30, 2024 · 'Pizza glue? Small rocks? Google AI Overview answers are a mess'
[20]
Replit's agent destroyed Jason Lemkin's production database during a stated code freeze; Replit CEO confirmed and announced new safeguards
Fortune · Jul 23, 2025 · 'AI-powered coding tool wiped out a software company's database in catastrophic failure'
[21]
The Court rejected Getty's primary copyright claim against Stability AI but found limited trademark infringement on outputs reproducing the Getty watermark
High Court of Justice (England and Wales) · Getty Images v. Stability AI · judgment dated Nov 4, 2025
[22]
Independent academic post-mortem reviews Cruise's perception, reporting, and corporate-governance failures associated with the Oct 2023 incident
arXiv · 2406.05281 · 'Lessons from the Cruise Robotaxi Pedestrian Dragging Mishap'
[23]
Peer-style technical analysis of memorization behavior relevant to the NYT v. OpenAI case
arXiv · 2412.06370 · 'Exploring memorization and copyright violation in frontier LLMs: a study of the New York Times v. OpenAI 2023 lawsuit'

Keep reading

Receipts — how we cite and verify →Operator playbooks — shipping AI without the headline →Hallucination reduction — what actually works →Anti-hype reference desk →AI vendor scorecards →Tools we use to verify claims →Orangebox — local-first AI operations →B00KMakor — the receipts library →