AIDB Daily Papers

特許の新規性を細粒度で予測：パッセージ検索に基づくアプローチ

原題: Is It Novel and Why? Fine-Grained Patent Novelty Prediction Based on Passage Retrieval

著者: Valentin Knappich, Anna Hätty, Simon Razniewski, Annemarie Friedrich

公開日: 2026-05-04 | 分野: LLM NLP AI 情報検索自然言語処理人工知能 cs.CL cs.AI cs.IR

※ 日本語タイトル・ポイントはAIによる自動生成です。正確な内容は原論文をご確認ください。

ポイント

特許の新規性評価を、請求項レベルの二値分類から特徴レベルでのパッセージ検索と抽象的推論の複合タスクへと再定義しました。
新規のデータセットFiNE-Patentsを構築し、LLMを用いた特徴分解、先行技術との照合、新規性予測を行うワークフローを提案しました。
提案手法は、パッセージ検索と新規特徴特定において既存手法を上回り、LLMは請求項レベルの分類タスクにおける偽相関に頑健であることを示しました。

Abstract

Novelty assessment is a critical yet complex task in the examination process for patent acceptance, requiring examiners to determine whether an invention is disclosed in a prior art document. The process involves intricate matching between specific features of a patent claim and passages in the prior art. While prior work has approached novelty prediction primarily as a binary classification task at the claim level, we argue that this formulation is susceptible to spurious correlations and lacks the granularity required for practical application. In this work, we introduce FiNE-Patents (Fine-grained Novelty Examination of Patents), a novel dataset comprising 3,658 first patent claims annotated with fine-grained, feature-level prior art references extracted from European Search Opinion (ESOP) documents. We propose shifting the evaluation paradigm from simple binary classification to a joint retrieval and abstract reasoning task at the feature level, requiring models to identify specific passages from a prior art document that disclose individual claim features, and to identify which features of a claim make it novel. We implement and evaluate LLM-based workflows that decompose claims into features, analyze each feature against prior art, and finally derive a claim-level novelty prediction. Our experiments demonstrate that these workflows outperform embedding-based baselines on passage retrieval and novel feature identification. Furthermore, we show that unlike trained classifiers, LLMs are robust against spurious correlations present in the claim-level novelty classification task. We release the dataset and code to foster further research into transparent and granular patent analysis.

Paper AI Chat

この論文のPDF全文を対象にAIに質問できます。

質問の例:

AIチャット機能を利用するには、ログインまたは会員登録（無料）が必要です。

会員登録 / ログイン

arXivで読む PDFを開く

メタ情報

arXiv ID: 2605.02392
カテゴリ: cs.CL, cs.AI, cs.IR

ポイント

Abstract

Paper AI Chat

関連するAIDB記事

メタ情報