MF
Mahreen Fathima
Verified Review
8 Tools Tested3 PDFsMarkdown APIsOCR + Tables + Charts

Best AI Tools to Convert Complex PDFs into Clean Markdown with an API

0
Tested: Extend AI vs LlamaParse vs Landing AI vs Mistral AI vs Tensorlake vs Adobe PDF Extract API vs Upstage AI vs Nutrient.io

We tested 8 hosted PDF-to-markdown APIs on the same hybrid annual report, table-heavy financial report, and image-only scanned research paper to see which tools preserved OCR text, complex tables, visuals, reading order, and usable markdown well enough for real ingestion pipelines.

How We Tested

This ranking is based on one research cycle that ran the same three real-world PDFs through eight hosted APIs. The inputs covered a long hybrid annual report with native text, tables, charts, and scanned signatures; a table-heavy financial report with grouped headers and multilevel tables; and an image-only scanned research paper with multi-column text, charts, and photographed tables. Each tool was judged on weighted criteria from the research report, using observed markdown outputs and side-by-side source/output screenshots rather than vendor claims.

Same Input Used Across All Tools
Hybrid Earnings PDF.pdf
Financial Report PDF.pdf
Scanned Research Paper.pdf
What We Evaluated
Label
Description
Text & OCR Completeness
Extracts all readable content, including scanned pages, with accurate OCR and minimal omissions.
Table Preservation
Preserves complex table structures, including rows, columns, multi-row headers, and merged-cell relationships in markdown.
Visual Content Retention
Retains charts, figures, diagrams, and images in the output and places them in the correct reading position.
Reading Order & Structure
Maintains headings, sections, column order, document hierarchy, and overall flow.
Markdown Quality
Produces clean, well-structured, usable markdown rather than a flat text dump.
Complex Document Handling
Maintains quality across long, multi-section, and mixed-content documents without degradation.
Advanced Features (Bonus)
Provides separate table/chart extraction and flags low-confidence OCR or ambiguous regions.

The Ranking

8 toolstested head-to-head on the same input. Each card shows the verdict and per-criterion scores. Click "Full breakdown" for the artifact-level evidence.

1
Best all-round PDF-to-markdown API
Full breakdown ↓

Most consistent across all document types; production-ready default choice.

Markdown Quality
9.3
Table Preservation
8.3
Text & OCR Completeness
9.3
Visual Content Retention
8.3
Advanced Features (Bonus)
7.0
Complex Document Handling
8.7
Reading Order & Structure
8.7
2
Best for chart-to-table conversion
Full breakdown ↓

Tied with Extend AI on hybrid reports; best for programmatic chart extraction; scanned tables drop performance.

Markdown Quality
8.7
Table Preservation
8.2
Text & OCR Completeness
9.3
Visual Content Retention
8.0
Advanced Features (Bonus)
7.0
Complex Document Handling
8.3
Reading Order & Structure
8.7
3
Strong scan-first parser with semantic visual descriptions
Full breakdown ↓

Strongest on scanned papers and hybrid documents; hierarchy issues on financial reports only.

Markdown Quality
8.2
Table Preservation
9.0
Text & OCR Completeness
8.0
Visual Content Retention
8.3
Advanced Features (Bonus)
7.0
Complex Document Handling
8.0
Reading Order & Structure
7.8
4
Best page-wise export and confidence flagging
Full breakdown ↓

Strong table extraction with page-wise export and confidence flagging; document hierarchy preservation weak.

Markdown Quality
8.0
Table Preservation
7.7
Text & OCR Completeness
8.0
Visual Content Retention
8.7
Advanced Features (Bonus)
9.0
Complex Document Handling
7.0
Reading Order & Structure
6.3
5
Good on digital PDFs, shaky on scanned complex tables
Full breakdown ↓

Excellent for digital-native PDFs with configurable chart extraction; fails on scanned multilevel tables.

Markdown Quality
6.0
Table Preservation
7.7
Text & OCR Completeness
8.3
Visual Content Retention
8.0
Advanced Features (Bonus)
7.0
Complex Document Handling
7.7
Reading Order & Structure
8.0
6
Clean native-PDF output with embedded assets
Full breakdown ↓

Clean output on native PDFs; 1 MB file size limit breaks document continuity on scanned inputs.

Markdown Quality
6.7
Table Preservation
7.0
Text & OCR Completeness
7.0
Visual Content Retention
8.3
Advanced Features (Bonus)
7.0
Complex Document Handling
7.0
Reading Order & Structure
6.0
7
Upstage AINeeds work
Decent native-table extraction, weak layout preservation
Full breakdown ↓

Poor consistency across inputs; multi-column layout handling collapsed; broken image links make visuals unusable.

Markdown Quality
5.3
Table Preservation
7.0
Text & OCR Completeness
6.3
Visual Content Retention
6.0
Advanced Features (Bonus)
6.7
Complex Document Handling
5.3
Reading Order & Structure
5.3
8
Nutrient.ioNeeds work
Basic text recovery, weak on structure and visuals
Full breakdown ↓

Basic text extraction only; fails on visual content, multilevel tables, and document hierarchy across all inputs.

Markdown Quality
5.7
Table Preservation
5.7
Text & OCR Completeness
6.7
Visual Content Retention
5.7
Advanced Features (Bonus)
6.3
Complex Document Handling
5.3
Reading Order & Structure
5.7
Ranking visual
Full breakdown · Tool 1 of 8

Extend AIBest

Production-ready document processing API focused on clean markdown from mixed-content PDFs.

Hybrid report source page
Hybrid report source page

Source page from the Target annual report used to test whether a two-column narrative page with bullets and an embedded headshot could be converted into readable linear markdown without losing the page title or surrounding text.

Hybrid financial table source
Hybrid financial table source

Source financial summary table used to test year-column alignment, row preservation, and whether line items like Sales, SG&A, EBIT, and net earnings stayed readable after conversion.

Waterfall chart source
Waterfall chart source

Source SG&A waterfall chart used to test whether a visual could survive as something more useful than flat text, including its step values and category transitions.

Signature block source
Signature block source

Source scanned signature block used to test whether a small handwritten/signature region still preserved the typed signer name, title, and date.

Blurred stamp source
Blurred stamp source

Source low-clarity audit stamp used to test OCR on degraded small text.

Financial report source page
Financial report source page

Source Sumitomo 'Additional Notes' page used to test heading hierarchy and numbered disclosure extraction on a dense financial narrative page.

Scanned paper source page
Scanned paper source page

Source scanned two-column research page used to test OCR, column reading order, and section-to-paragraph reconstruction.

What worked
  • Extend AI was the most balanced performer in the test. It preserved the reading flow of the long Target annual report, kept standard financial tables readable, converted the SG&A waterfall chart into structured text with values and explanation, and OCR'd small scanned elements such as the CEO signature block and the blurred Ernst & Young stamp. On the Sumitomo report it kept headings and most grouped financial tables intact, and on the scanned research paper it handled two-column OCR, photographed tables, and chart captions better than most tools.
Where it struggled
  • Extend AI still had trouble with the hardest header-heavy tables. In the Sumitomo multilevel segment table, compound header cells were not split cleanly into separate semantic roles. In the scanned research paper, dense multirow tables lost some parent-child structure, and tables with text inserted between columns picked up corrupted numeric cells and dropped the inserted note. Visuals were usually preserved as descriptions or figure blocks rather than embedded images, and the tool did not expose confidence scores or uncertainty flags.
What came out
Hybrid page hierarchy output
Hybrid page hierarchy output

On the Target annual report's 'A Growth Story Again' page, Extend AI linearized the two-column layout into readable markdown while keeping the page title, narrative text, and bullet-style business highlights together.

Hybrid table output
Hybrid table output

For the 'Financial Summary' table, Extend AI preserved the 2015 to 2011 year columns and kept rows such as Sales, Cost of sales, SG&A, EBIT, and net earnings aligned with their values.

Waterfall chart output
Waterfall chart output

Extend AI converted the SG&A waterfall chart into a structured figure block that kept the step values, category labels, and a prose summary of the movement from 20.2% in 2013 to 19.6% in 2015.

Signature block output
Signature block output

The CEO signature block was recovered as readable text, preserving Brian C. Cornell's name, title, and date instead of skipping the scanned block.

Blurred stamp output
Blurred stamp output

From the blurred audit stamp, Extend AI recovered the firm name and page number, but misread 'LLP' as '1LP'.

Logo description output
Logo description output

The Target logo was not embedded inline, but Extend AI preserved it as a structured figure description identifying the bullseye symbol.

Financial hierarchy output
Financial hierarchy output

On the Sumitomo 'Additional Notes' page, Extend AI kept the section heading and numbered subpoints in order, although the visible screenshot cuts off the last share-count lines.

Financial grouped table output
Financial grouped table output

In the segment comparison table, Extend AI preserved grouped columns for previous quarter, present quarter, and year-over-year change with values aligned by segment.

Financial multilevel table output
Financial multilevel table output

For a denser multilevel segment table, Extend AI kept most values but merged compound header roles, making parent and child header distinctions less explicit.

Scanned hierarchy output
Scanned hierarchy output

From the scanned two-column research page, Extend AI reconstructed a readable section flow under 'STUDY AREA' and kept successive paragraphs attached to the heading.

Scanned table output
Scanned table output

Extend AI transcribed the photographed stand-structure table into a readable row-column layout, including before-cut and after-cut groupings and treatment rows.

Scanned chart output
Scanned chart output

For the mortality chart, Extend AI preserved legend labels, years, and the 'CUT COMPLETED' marker inside a figure caption, but replaced the original visual trend shape with text.

Handwriting output
Handwriting output

The scanned cover's faint pencil marks were not truly deciphered, but Extend AI at least surfaced them as a figure with a note that most of the cursive text was illegible.

Scanned multirow table output
Scanned multirow table output

In a dense multirow scanned table, Extend AI recovered many values but the year and header relationships broke down, so the grouped structure became hard to follow.

Scanned interrupted-table output
Scanned interrupted-table output

When rotated text interrupted a scanned table, Extend AI reconstructed most rows but lost the between-column annotation and introduced corrupted cells such as extra characters in numeric positions.

15 full renders · same input
Full breakdown · Tool 2 of 8

LlamaParse

LLM-oriented parser that returns downloadable markdown and separate visual assets for downstream workflows.

Hybrid report source page
Hybrid report source page

Source Target annual report page used to test whether two-column reading order and section hierarchy survived markdown conversion.

Hybrid table source
Hybrid table source

Source financial table used to test whether rows, columns, and value associations remained readable in markdown.

Waterfall chart source
Waterfall chart source

Source waterfall chart used to test whether chart semantics could be retained even when visuals were not embedded directly.

Blurred stamp source
Blurred stamp source

Source low-clarity audit stamp used to test whether the tool recognized small degraded text instead of dropping it.

Scanned research source page
Scanned research source page

Source scanned multi-column page used to test OCR reading order and section reconstruction.

What worked
  • LlamaParse was the closest challenger to Extend AI. It handled the 84-page hybrid annual report cleanly, preserved regular financial tables, kept reading order strong across scanned multi-column pages, and stood out for chart-to-structured-data conversion instead of plain chart descriptions. It also preserved non-text assets through descriptions and separate asset downloads, which makes it attractive for programmatic RAG or analytics workflows.
Where it struggled
  • Its main weakness showed up on the hardest scanned tables. Parent-child grouped headers weakened in both the Sumitomo complex table and the scanned research paper's multilevel tables, so the values survived better than the structure. LlamaParse also did not reconstruct the financial report's table of contents as a true nested TOC, and it favored textual descriptions or separate asset downloads over embedded visuals in the markdown itself.
What came out
Hybrid hierarchy output
Hybrid hierarchy output

LlamaParse preserved the Target report's section hierarchy and content flow instead of flattening the multi-column page into disconnected text fragments.

Hybrid chart output
Hybrid chart output

Rather than embed the SG&A chart as an image, LlamaParse translated it into structured textual data so the chart's categories and values remained machine-usable in markdown.

Blurred stamp output
Blurred stamp output

The blurred Ernst & Young marking remained recognizable in LlamaParse's output instead of disappearing during OCR.

Hybrid table output
Hybrid table output

The annual report's financial summary remained readable with row alignment, year columns, and value associations preserved.

Logo description output
Logo description output

LlamaParse described document visuals in text instead of dropping them, showing how non-text assets were surfaced inside the extracted content.

Signature description output
Signature description output

Signature content was retained as descriptive text rather than as an embedded handwritten image, preserving document meaning even when the exact strokes were not reproduced.

Separate asset export output
Separate asset export output

LlamaParse exposed visual assets as separate downloadable outputs rather than only inside the markdown file.

Financial hierarchy output
Financial hierarchy output

On the financial report, LlamaParse preserved the document title, section headings, and overall content flow well enough that the original hierarchy remained recognizable.

Financial multilevel table output
Financial multilevel table output

For one grouped-column financial table, LlamaParse preserved the multi-level column organization well enough that header-to-data relationships were still understandable.

Financial complex table output
Financial complex table output

In a harder financial table, visible values survived but parent-child header roles became less explicit, weakening the semantic clarity of grouped columns.

TOC output
TOC output

The table of contents was extracted as sequential text with page numbers instead of as a structurally nested navigation block.

Scanned hierarchy output
Scanned hierarchy output

LlamaParse reconstructed the scanned research paper into a coherent reading flow, keeping headings attached to the paragraphs that followed across the multi-column layout.

Scanned grouped-table output
Scanned grouped-table output

The overall structure of a scanned grouped-column table remained readable, with much of the column organization and associated data preserved.

Scanned chart output
Scanned chart output

The scanned chart was converted into structured table-like content, keeping legend-to-value mapping even though the original chart image was not preserved inline.

Scanned multilevel table output
Scanned multilevel table output

In the hardest scanned multilevel table, grouped header relationships became ambiguous even though much of the table content was still recoverable.

15 full renders · same input
Full breakdown · Tool 3 of 8

Landing AI

Vision-based document extraction API that often turns charts, signatures, and other visuals into semantic descriptions.

Hybrid commitments section source
Hybrid commitments section source

Source annual report section used to test whether section hierarchy and page flow survived conversion.

Hybrid table source
Hybrid table source

Source financial table used to test complex header and value preservation.

SG&A chart source
SG&A chart source

Source SG&A chart used to test whether the tool captured relationships and values even without embedding the original visual.

Signature page source
Signature page source

Source scanned signatures page used to test whether signature regions were preserved in some usable form.

Scanned research source page
Scanned research source page

Source scanned multi-column research page used to test OCR reading order and section hierarchy.

What worked
  • Landing AI was especially strong on hybrid and scanned documents. It preserved financial tables well, converted charts into detailed semantic descriptions with values, and handled signature regions and other non-text content more explicitly than several lower-ranked tools. On the scanned research paper it kept multi-column sections readable and preserved chart meaning even without direct visual embedding.
Where it struggled
  • Its biggest weakness was hierarchy consistency on the financial report. Major headings were flattened often enough that overall structure felt less dependable than Extend AI or LlamaParse. It also merged nested header levels in harder financial tables, and the research report notes that opening-page structure on the scanned paper was misinterpreted. Several Landing AI proof artifacts were marked unreliable in review, so some originally illustrated failures were omitted here.
What came out
Hybrid hierarchy output
Hybrid hierarchy output

Landing AI preserved page flow and document hierarchy in the commitments section, keeping headings attached to their associated narrative content.

Hybrid table output
Hybrid table output

The annual report's financial table was reconstructed with its overall layout intact, preserving header, row, and value relationships well enough to remain readable.

SG&A chart output
SG&A chart output

Instead of embedding the SG&A waterfall chart, Landing AI converted it into text that preserved the step values, category relationships, and direction of change.

Signature-region output
Signature-region output

Landing AI preserved the presence of the signature region through an attestation-style element rather than reproducing the handwritten signatures themselves.

Hybrid heading output
Hybrid heading output

The document's highest-level heading was flattened into plain text rather than being preserved as a true H1, so the content survived but heading semantics weakened.

Financial hierarchy output
Financial hierarchy output

On one financial report section, Landing AI kept heading hierarchy and supporting content together well enough that the original organization remained clear.

Financial grouped-table output
Financial grouped-table output

Landing AI reconstructed a grouped financial table with row alignment and column organization that closely followed the source layout.

Financial complex-table output
Financial complex-table output

In a harder financial table, nested header levels were merged into single cells, weakening the semantic relationships between parent and child headers.

Scanned hierarchy output
Scanned hierarchy output

For the scanned research paper, Landing AI reconstructed the multi-column layout into a readable heading-and-paragraph flow.

Scanned chart output
Scanned chart output

The scanned chart was converted into descriptive text that reflected its values and legend relationships instead of being silently dropped.

10 full renders · same input
Full breakdown · Tool 4 of 8

Mistral AI

Enterprise OCR and parsing platform that returns consolidated markdown, page-wise markdown, and visual assets in foldered outputs.

Hybrid hierarchy source
Hybrid hierarchy source

Source annual report section used to test whether headings and their supporting text stayed aligned across a long document.

Hybrid table source
Hybrid table source

Source financial summary table used to test layered table preservation.

Blurred stamp source
Blurred stamp source

Source low-clarity audit stamp used to test OCR on degraded small text.

Financial report source page
Financial report source page

Source financial report page used to test section hierarchy and reading flow.

Financial grouped table source
Financial grouped table source

Source grouped financial table used to test whether multilevel headers survived.

What worked
  • Mistral AI combined strong table recovery with one genuinely distinctive workflow feature set: page-wise markdown exports, associated visual assets, and overall confidence flagging. It handled the hybrid report well enough to preserve complex financial tables and kept many visuals attached to their source pages. On the financial report it also returned both page-level and consolidated outputs, which is useful for teams doing document QA rather than only pipeline ingestion.
Where it struggled
  • Its weak spot was hierarchy consistency. Across the long hybrid report, major headings were preserved in some sections but flattened in others, which made the markdown less predictable than the top three tools. It also flattened the financial report's TOC into plain text and weakened header semantics in harder multilevel tables. Reviewer-marked unreliable scanned-paper artifacts were omitted here, so some originally illustrated failure regions are summarized from the report rather than re-cited as proof.
What came out
Hybrid hierarchy output
Hybrid hierarchy output

Across much of the annual report, Mistral AI kept headings and supporting content structurally aligned instead of flattening everything into one text stream.

Hybrid table output
Hybrid table output

Mistral AI reconstructed the layered financial table into a usable structured layout without losing the core row, column, and value relationships.

Hybrid folder output
Hybrid folder output

Mistral AI exported page-wise outputs into a folder structure, which makes it easier to inspect specific pages during review.

Hybrid asset output
Hybrid asset output

Visual assets such as charts and signatures were preserved as page-associated files rather than disappearing from the extraction.

Blurred stamp output
Blurred stamp output

The blurred audit stamp was still recognized by Mistral AI, showing that low-visibility text was not entirely lost.

Hybrid hierarchy failure output
Hybrid hierarchy failure output

In other sections of the annual report, Mistral AI flattened heading hierarchy inconsistently even when the underlying text was recovered.

Financial folder output
Financial folder output

The financial-report export included both per-page files and a consolidated markdown document, supporting both granular inspection and end-to-end use.

Financial hierarchy output
Financial hierarchy output

On a narrative financial-report page, headings and associated text were kept in a readable flow.

Financial grouped-table output
Financial grouped-table output

For one grouped financial table, Mistral AI preserved hierarchical header structure well enough to keep the relationships between headers and columns understandable.

TOC output
TOC output

The financial report's table of contents was recovered as flat text rather than as structured nested navigation.

Financial complex-table output
Financial complex-table output

In a harder multilevel table, two header levels were combined into single cells, weakening the table's parent-child column semantics.

Scanned section output
Scanned section output

On the scanned research paper, Mistral AI generally kept section hierarchy and reading flow intact despite the underlying scan.

Scanned table output
Scanned table output

A scanned multicolumn table was reconstructed with much of its layout logic still visible.

Scanned folder output
Scanned folder output

For the scanned paper, Mistral AI again returned page-wise markdown and associated assets in a foldered output.

Scanned chart asset output
Scanned chart asset output

Charts from the scanned paper were retained as page-associated visual assets instead of being dropped.

15 full renders · same input
Full breakdown · Tool 5 of 8

Tensorlake

AI-native parser with configurable chart extraction and decent structure retention on digital PDFs.

Hybrid report source page
Hybrid report source page

Source annual report page used to test whether heading order and page flow survived extraction.

Hybrid table source
Hybrid table source

Source financial table used to test structure preservation on a digital-native annual report.

Waterfall chart source
Waterfall chart source

Source chart used to test Tensorlake's configurable chart extraction behavior.

Signature page source
Signature page source

Source signatures page used to test whether scanned signature content was detected at all.

Scanned research source page
Scanned research source page

Source scanned page used to test section hierarchy and multi-column reading order.

What worked
  • Tensorlake was solid on digital and hybrid PDFs. It preserved the annual report's structure, kept financial tables readable, surfaced signature content, and offered configurable chart extraction that exposed underlying chart data more explicitly than several competitors. It also kept section flow respectable on the native financial report and even on some scanned research sections.
Where it struggled
  • Its biggest failure mode was scanned complex tables. Across the research paper, multilevel and grouped tables repeatedly lost header placement, shifted values, or dropped labels. Markdown export was also less polished in practice because the tested web flow returned copyable markdown rather than a straightforward downloadable file. That makes Tensorlake more attractive for digital-native documents than for heavily scanned, layout-dense inputs.
What came out
Hybrid hierarchy output
Hybrid hierarchy output

Tensorlake preserved heading order and section relationships well enough that the annual report's structure remained close to the source.

Hybrid table output
Hybrid table output

The annual report's financial table stayed structurally faithful, with row, column, and value relationships preserved.

Chart extraction output
Chart extraction output

Tensorlake exposed chart data beyond plain markdown text, making the underlying waterfall-chart information more explicit.

Signature output
Signature output

Scanned signature content was detected and parsed instead of being skipped entirely, even if it was not always categorized cleanly.

Blurred stamp output
Blurred stamp output

Tensorlake recovered the Ernst & Young reference from the blurred stamp but misrendered the ampersand as a plus sign.

Web output limitation
Web output limitation

In the tested web workflow, the markdown appeared as copyable content rather than a directly downloadable file.

Financial hierarchy output
Financial hierarchy output

Even across a table-heavy financial report, Tensorlake kept section ordering and document flow reasonably intact.

Financial table output
Financial table output

One complex financial table was preserved inside the markdown with readable row-column structure and integration into the surrounding document.

Financial complex-table failure output
Financial complex-table failure output

In a more difficult multi-header table, Tensorlake missed part of the header hierarchy and dropped at least one header label, making the reconstruction incomplete.

Scanned hierarchy output
Scanned hierarchy output

For the scanned research paper, section-level hierarchy remained readable even on multicolumn pages.

Scanned chart output
Scanned chart output

Tensorlake converted chart information into tabular data during standard extraction without needing a separate chart-only mode.

Scanned grouped-table failure output
Scanned grouped-table failure output

On scanned grouped-column tables, Tensorlake misplaced headers and weakened column relationships enough that the table structure became unreliable.

Scanned heavy-table failure output
Scanned heavy-table failure output

In the hardest scanned table, value positions shifted and header assignments broke down, showing a systemic weakness on scanned multilevel tables.

13 full renders · same input
Full breakdown · Tool 6 of 8

Adobe PDF Extract API

Structured PDF extraction API with strong embedded visual retention and good results on native PDFs.

Hybrid table source
Hybrid table source

Source financial summary table used to test whether native-PDF tables stayed structurally intact.

Signature page source
Signature page source

Source signatures page used to test whether handwritten signatures survived extraction or only the surrounding text did.

Financial report source page
Financial report source page

Source financial-report narrative page used to test document-level hierarchy and paragraph flow.

Balance sheet source
Balance sheet source

Source balance sheet used to test row and column alignment in a native financial table.

Scanned grouped-table source
Scanned grouped-table source

Source scanned grouped-column table used to test whether Adobe could preserve layout in an image-only document.

What worked
  • Adobe API did well on native PDFs. It preserved visual assets better than most tools, kept many native financial tables readable, and handled the Sumitomo report's main sections and balance-sheet style tables with decent fidelity. If your documents are mostly digital PDFs and you care about embedded visuals in output, Adobe remained competitive.
Where it struggled
  • Its biggest practical limitation in this research was scanned-document handling. The scanned paper had to be split because of a 1 MB limit in the tested interface, which broke document continuity before quality was even judged. Within outputs, handwritten signatures were lost, TOC structure flattened, and harder multiheader tables weakened. Reviewer feedback also ruled out one originally cited Adobe failure artifact, so the noncurrent-assets currency-symbol issue is not re-used as proof here.
What came out
Hybrid embedded-assets output
Hybrid embedded-assets output

Adobe kept charts, images, and other visual assets integrated into the output instead of separating or dropping them.

Hybrid table output
Hybrid table output

The annual report's financial summary was reconstructed with strong row-column retention that closely matched the source layout.

Signature-page output
Signature-page output

Adobe preserved the printed signature-page text but omitted the handwritten signatures themselves, leaving the scanned signing marks absent from the result.

Financial hierarchy output
Financial hierarchy output

On the financial report, Adobe preserved major sections and much of the narrative hierarchy well enough for the page to remain readable.

Balance-sheet output
Balance-sheet output

The balance sheet came through as a clean aligned table, keeping asset rows and comparison columns readable.

Financial grouped-table output
Financial grouped-table output

Adobe reconstructed grouped financial columns well in the segment-performance table, keeping headers associated with the correct values.

Financial multiheader failure output
Financial multiheader failure output

In a harder dual-header table, Adobe flattened row-header and column-header roles together, weakening the table's structural meaning.

TOC failure output
TOC failure output

The financial report's TOC was flattened into a list rather than preserved as a nested structured contents section.

Scanned table output
Scanned table output

On one scanned grouped table, Adobe preserved the compact layout and underlying grouped relationships better than many lower-ranked tools.

Scanned embedded-assets output
Scanned embedded-assets output

Charts from the scanned paper were preserved in place as visual assets, contributing to a more continuous reading experience.

Scanned hierarchy failure output
Scanned hierarchy failure output

When Adobe processed scanned content, section boundaries and title-page organization weakened, leaving content present but less clearly structured.

11 full renders · same input
Full breakdown · Tool 7 of 8

Upstage AI

Document AI parser that handled some native tables well but was inconsistent on hierarchy and scanned layouts.

Hybrid table source
Hybrid table source

Source annual-report financial table used to test whether Upstage could keep structure and values aligned.

Waterfall chart source
Waterfall chart source

Source chart used to test whether Upstage produced useful chart output rather than flat extracted text.

Signature page source
Signature page source

Source signatures page used to test scanned signature retention and section structure.

Financial report source section
Financial report source section

Source financial section used to test whether headings and bullet-like structure survived.

Scanned chart source
Scanned chart source

Source scanned figure used to test chart extraction quality on the image-only research paper.

What worked
  • Upstage showed that it can recover some native financial tables and simpler report sections. On the hybrid annual report it kept a straightforward financial table mostly intact, and it did extract chart values rather than dropping charts outright. That keeps it above outright failure territory.
Where it struggled
  • Its consistency was poor. Signature pages lost structure, multicolumn sections in the annual report became misaligned, financial-report headings flattened into body text, and scanned-paper layout handling fell apart. The research report also notes broken image links for visual assets, which further undercut usefulness when visuals matter. Overall, Upstage looked serviceable only on easier native content, not on the mixed or scanned PDFs this use case is really about.
What came out
Hybrid table output
Hybrid table output

Upstage preserved the main structure of the annual report's financial table and kept most values in the correct places, although some currency symbols were missed.

Hybrid chart output
Hybrid chart output

The waterfall chart was translated into extracted values and explanatory text rather than being preserved as a directly readable visual.

Hybrid signature failure output
Hybrid signature failure output

On the signatures page, section structure collapsed and handwritten signatures were not clearly preserved, even though some nearby printed text remained.

Hybrid multicolumn failure output
Hybrid multicolumn failure output

In multicolumn annual-report sections, heading alignment and navigational structure diverged from the source, making the reading flow noticeably weaker.

Financial hierarchy output
Financial hierarchy output

On a simpler financial-report section, Upstage preserved enough hierarchy that the basic structure of the page remained recognizable.

Financial table failure output
Financial table failure output

In a more complex quarterly balance-sheet layout, column headers drifted away from their corresponding data regions, producing a structurally inconsistent table.

Financial heading failure output
Financial heading failure output

Major section headings in the financial report were flattened so they were no longer clearly distinguished from the body text they introduced.

Scanned table failure output
Scanned table failure output

On a scanned grouped-column table, much of the data survived but header reconstruction was weak enough that the table's structure no longer matched the source cleanly.

Scanned chart failure output
Scanned chart failure output

For the scanned figure, Upstage extracted chart details as raw delimiter-heavy data that preserved some values but not a meaningfully organized chart structure.

9 full renders · same input
Full breakdown · Tool 8 of 8

Nutrient.io

Document processing platform that recovered some text but lagged badly on visuals, hierarchy, and complex table structure.

Hybrid report source page
Hybrid report source page

Source annual-report page used to test whether Nutrient.io could at least keep reading order and major heading relationships on narrative content.

Hybrid table source
Hybrid table source

Source financial table used to test whether a relatively standard complex table stayed intact.

Signature page source
Signature page source

Source signatures page used to test whether handwritten signatures survived extraction.

Financial report source page
Financial report source page

Source financial-report page used to test whether some local section hierarchy could be recovered.

Scanned research source page
Scanned research source page

Source scanned multi-column research page used to test section-to-content alignment.

What worked
  • Nutrient.io could recover basic text and some local section hierarchy, especially on simpler narrative pages or less demanding grouped tables. It was not blank output, and in a few isolated areas it preserved enough structure to be readable.
Where it struggled
  • The problems were broad rather than isolated. Complex tables diverged from source layout in both the annual report and financial report, handwritten signatures were absent, chart content degraded into hard-to-read linear text, and scanned-paper layout ordering was unreliable enough that title and abstract relationships could be reversed. In this use case, those are core failures, not edge cases.
What came out
Hybrid hierarchy output
Hybrid hierarchy output

On one annual-report section, Nutrient.io preserved the intended reading order and enough structure that headings and surrounding content remained connected.

Hybrid table failure output
Hybrid table failure output

Even on a relatively standard annual-report financial table, Nutrient.io misaligned significant parts of the layout, weakening row, column, and value relationships.

Hybrid signature failure output
Hybrid signature failure output

Nutrient.io preserved signature-related printed text but did not extract the handwritten signature itself, leaving the signed portion incomplete.

Financial local hierarchy output
Financial local hierarchy output

Some local content recovery was possible in the financial report, showing that isolated pages could retain a usable section structure.

Financial title-page failure output
Financial title-page failure output

Paragraph boundaries and narrative flow were not consistently preserved, so content on the title-and-abstract page became fragmented compared with the source.

Financial notes output
Financial notes output

Nutrient.io could recover parts of disclosure-style narrative content, but that strength did not extend consistently to the whole report.

Financial multilevel-table failure output
Financial multilevel-table failure output

In a multilevel financial table, Nutrient.io lost parts of the grouped header organization, making parent-child column relationships hard to recover.

Financial table failure output
Financial table failure output

Another financial table showed the same weakness: values were present, but the structure did not accurately preserve the original multi-header layout.

Scanned hierarchy output
Scanned hierarchy output

On a simpler scanned research section, Nutrient.io kept section headings aligned with their related column content better than it did on harder layouts.

Scanned grouped-table output
Scanned grouped-table output

A scanned grouped-column table was partially preserved, showing that Nutrient.io could recover some internal organization on less difficult examples.

Scanned chart failure output
Scanned chart failure output

For the scanned chart, values surfaced in a linearized form that lost the original chart's grouping and visual relationships, making the result hard to interpret.

Scanned complex-table failure output
Scanned complex-table failure output

As scanned table complexity increased, structural boundaries broke down and cell relationships were misaligned, merged incorrectly, or lost.

Scanned page-order failure output
Scanned page-order failure output

On the opening scanned research page, page layout was misread badly enough that structural elements such as title and abstract relationships became unreliable.

13 full renders · same input

Final Take

Extend AI is the safest default if you need one hosted API that stays reliable across mixed PDFs, scans, tables, and charts. LlamaParse is the strongest alternative when chart-to-structured-data conversion matters more than embedded visuals, while Mistral AI is worth a look for review-heavy workflows that benefit from page-wise exports and confidence flags. Landing AI also deserves consideration if your inputs skew toward scanned and hybrid documents, but its heading consistency on financial reports kept it out of the top two.

Comments (0)

Please Log in to join the discussion.