Book a demo

PatSnap PatentBench Research:
Benchmarking AI Tools for Patent Tasks

PatentBench is the first comprehensive benchmark built specifically for patent-focused AI,
based on real-world patent scenarios.

Overview

The PatSnap PatentBench is a benchmark for patent tasks in real-world scenarios. It evaluates the performance of AI tools against curated test samples each consisting of a "test question" and a "standard answer" that closely represent the ideal references used in actual patent work.

Each benchmark is based on hundreds of carefully selected real-world cases, covering different patent tasks, jurisdictions, and industries. The PatSnap PatentBench currently covers three patent workflows:

Novelty Search - a key patent task that involves systematically identifying prior art worldwide to determine whether a technical solution is new and inventive under patent law.

Design FTO Search - a process of searching globally registered design patents to determine whether a product's appearance infringes on existing design patent rights.

FTO Search - an analysis to determine whether a product or process potentially infringes valid claims of existing utility patents in target markets.

Benchmark 1

Novelty Search

81%
X Detection Rate
Top 100
36%
X Recall Rate
Top 100
340
Test Samples
4
Patent Offices
US · CN · EP · WO

Performance Comparison

X Detection Rate (Top 100)
PatSnap
81%
ChatGPT-o3
32%
DeepSeek-R1
9%
X Recall Rate (Top 100)
PatSnap
36%
ChatGPT-o3
11%
DeepSeek-R1
3%
Benchmark 2
New

Design FTO Search

77%
Hit Rate
Top 200
0.7
PRES Score
261
Test Samples
3
Jurisdictions
US · CN · EU

Performance Comparison

High-Risk Patent Hit Rate (Top 200)
PatSnap
77%
ChatGPT 5.4
0.93%
Gemini 3.1 Pro
3.71%
PRES Score
PatSnap
0.7
ChatGPT 5.4
0.015
Gemini 3.1 Pro
0.040
Benchmark 3
New

FTO Search

Benchmark in progress
FTO search is a systematic process of identifying all potentially blocking patents in a target jurisdiction, to assess
whether a technology can be freely practiced without infringing others' patent rights.