Menu
<- Back to Cases

Legal Contract Risk Review Agent

ClientConfidential Global Top-20 Law Firm (China Practice)
IndustryLegal Services (Global Top-20 Law Firm)
Core TechClaude Extended Thinking + Citations
ROIReview time -50% | Risk miss rate -80% | Multilingual consistency 95%+
BoundaryAnonymized implementation brief
01 /Context

The Senior Lawyer Bottleneck.

Our client, the China practice of a global top-20 law firm, reviews over 5,000 commercial contracts annually across M&A, real estate, IP licensing, and cross-border trade. The review process is heavily dependent on senior associates and partners — only they have the experience to spot non-obvious risk clauses hidden in dense legal language.

This created a severe throughput bottleneck. Junior lawyers could handle routine reviews, but subtle risks — indemnification clauses buried in schedules, jurisdictional conflicts between governing law and dispute resolution, or seemingly benign force majeure provisions that actually shift disproportionate risk — required senior eyes. The result: contracts queued for weeks, billable hours consumed on repetitive review, and the occasional risk clause that slipped through because the reviewing lawyer was on their 40th contract that day.

02 /Friction

Why Summarization Isn't Review.

The firm had evaluated multiple AI contract tools. Each failed on the same fundamental misunderstanding of what contract review actually requires.

  • 1.The Shallow Extraction Problem: Standard LLM tools extracted key terms (parties, dates, amounts) and produced readable summaries. But risk review isn't about what's in the contract — it's about what's missing, what's ambiguous, and what's asymmetrically unfavorable. These tools couldn't reason about the interplay between clauses.
  • 2.The Unverifiable Conclusion Problem: When a tool flagged a clause as 'high risk,' the reviewing lawyer had no way to verify why. The model's reasoning was opaque. In a profession where every conclusion must be defensible, an unverifiable risk flag is worse than no flag at all.
  • 3.The Multilingual Inconsistency Problem: The firm reviews contracts in Chinese, English, and Japanese. Different lawyers applied different review standards across languages, creating inconsistent risk assessments for substantially similar contract terms.
03 /Solution

From Summarizer to Reasoning Review System.

Contract set
Reasoning chain
Clause citations

Extended reasoning and clause-level evidence make risk review explainable enough for senior legal judgment.

I. Extended Thinking Risk Reasoning Agent

Claude 3 Opus with Extended Thinking enabled reads each clause in context of the entire agreement, identifies inter-clause dependencies (e.g., 'the indemnification in Section 8 is limited by the cap in Schedule 3, which is triggered by the force majeure definition in Section 11'), and produces a risk assessment with a full reasoning chain the reviewing lawyer can follow step by step.

II. Citations Agent with Clause-Level Traceability

Every risk finding includes a Citation pointing to the exact paragraph, clause, or schedule in the source contract. The reviewing lawyer clicks the citation and sees the original text highlighted in context. No more 'the model thinks Section 4 is risky' without knowing which sentence and why.

III. Multilingual Unified Review Agent

A dedicated agent enforces a single review standard across Chinese, English, and Japanese contracts. The same risk taxonomy, severity classification, and recommendation framework are applied regardless of contract language — eliminating the inconsistency that arose when different lawyers applied different standards.

IV. RAG Legal Knowledge Base + Token101

The system maintains a RAG knowledge base of regulatory frameworks, industry-standard templates, and the firm's own precedent library. Token101 routes all contract reviews to Opus (reasoning requirements justify the cost), with Sonnet handling routine clause extraction. Zero Data Retention — client contract text never persists in prompt logs.

Senior Lawyer Leverage, Not Replacement.

-50%Contract review time reduction, with senior lawyers focusing on nuanced judgment calls.
-80%Risk clause miss rate reduction, with Extended Thinking catching inter-clause dependencies.
95%+Cross-language review consistency, eliminating variance across Chinese, English, and Japanese.
ZeroData retention — client contract text never persists in prompt logs.

"We don't just flag risks; we build AI that shows its reasoning so lawyers can verify it."

Related implementation briefs.

Get this architecture for your industry.