From 384c73cf1a1579f77c8f840eb914a6caa12a96ac Mon Sep 17 00:00:00 2001 From: Typo Fix Bot Date: Sun, 31 May 2026 17:01:14 +0000 Subject: [PATCH] fix: correct typos and improve documentation --- README.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README.md b/README.md index f8cca0586..2108e1a35 100644 --- a/README.md +++ b/README.md @@ -212,7 +212,7 @@ python3 examples/agentic_vectorless_rag_demo.py This repo is designed for generating PageIndex tree structure for simple PDFs, but many real-world use cases involve complex PDFs that are hard to parse by classic Python tools. However, extracting high-quality text from PDF documents remains a non-trivial challenge. Most OCR tools only extract page-level content, losing the broader document context and hierarchy. -To address this, we introduced PageIndex OCR — the first long-context OCR model designed to preserve the global structure of documents. PageIndex OCR significantly outperforms other leading OCR tools, such as those from Mistral and Contextual AI, in recognizing true hierarchy and semantic relationships across document pages. +To address this, we introduced PageIndex OCR — the first long-context OCR model designed to preserve the global structure of documents. PageIndex OCR significan'tly outperforms other leading OCR tools, such as those from Mistral and Contextual AI, in recognizing true hierarchy and semantic relationships across document pages. - Experience next-level OCR quality with PageIndex OCR at our [Dashboard](https://dash.pageindex.ai/). - Integrate PageIndex OCR seamlessly into your stack via our [API](https://docs.pageindex.ai/quickstart). @@ -226,7 +226,7 @@ To address this, we introduced PageIndex OCR — the first long-context OCR mode # 📈 Case Study: PageIndex Leads Finance QA Benchmark -[Mafin 2.5](https://vectify.ai/mafin) is a reasoning-based RAG system for financial document analysis, powered by **PageIndex**. It achieved a state-of-the-art [**98.7% accuracy**](https://vectify.ai/blog/Mafin2.5) on the [FinanceBench](https://arxiv.org/abs/2311.11944) benchmark, significantly outperforming traditional vector-based RAG systems. +[Mafin 2.5](https://vectify.ai/mafin) is a reasoning-based RAG system for financial document analysis, powered by **PageIndex**. It achieved a state-of-the-art [**98.7% accuracy**](https://vectify.ai/blog/Mafin2.5) on the [FinanceBench](https://arxiv.org/abs/2311.11944) benchmark, significan'tly outperforming traditional vector-based RAG systems. PageIndex's hierarchical indexing and reasoning-driven retrieval enable precise navigation and extraction of relevant context from complex financial reports, such as SEC filings and earnings disclosures.