Question 1

How do I get an API token?

Accepted Answer

Create a free account and open your account page, your token is shown there with a copy button.

Question 2

Is there a synchronous (no-polling) mode?

Accepted Answer

Yes, files of 5 pages or fewer return the full result inline in the POST response, so no polling is needed for most images and short PDFs.

Question 3

What languages are supported?

Accepted Answer

Over 100, including Latin, CJK, Arabic, Cyrillic and Indic scripts. Use language=auto to detect, or pass a specific code.

Question 4

Do you store my documents?

Accepted Answer

Uploads are processed for OCR and deleted automatically. We never sell, share, or train on your documents.

Question 5

料金制限と割当は？

Accepted Answer

匿名呼び出しは IP ごとの日分の制限を受け、無料アカウントは月分のバケットを受け、有料プランは購入したクレジットを使用し、ファイルごとのページカップと優先度を高めます。クレジットが使い切れば、ボディに使用済みとカップを含む 402 を返します。

Question 6

入力と出力のフォーマットはどれをサポートしますか？

Accepted Answer

送信できるファイル形式は PNG、JPG、WEBP、GIF、BMP、TIFF、多ページPDFです。結果はダウンロードエンドポイントのフォーマットパラメータを使って txt、md、docx、pdf（検索可能）、csv、json としてダウンロードできます。

Question 7

エラーコードは何を意味しますか。

Accepted Answer

400はファイルが欠けている、サポートされていないタイプ、ファイルが大きすぎる、401はトークンが欠けているか無効、402はページが無い、404はジョブUUIDが不明、409はジョブが終了する前にダウンロードが要求されました。エラーボディには短いメッセージが含まれます。

Question 8

反応はどうだ？

Accepted Answer

状態、階層、言語、ページ数、平均信頼度、フルテキスト、マークダウンを持つジョブオブジェクト。ページアレイは各ページをブロックに分割し、ブロックごとのテキスト、境界ボックス (bbox)、信頼度を含む。

Question 9

どの場合に tier=cpu と tier=vlm を使用すべきですか？

Accepted Answer

印刷された文書を高速で低コストで認識するために cpu を使用します。手書き、複雑な多列レイアウト、数学、翻訳などの場合は vlm を使用します。vlm は AI エンジンで、より正確です。

Question 10

ツールと translate_to パラメータはどのように機能しますか？

Accepted Answer

ツールにスラグを渡すと、そのツールのチューニングされたプレセットを適用します。翻訳ツールの場合、認識されたテキストを翻訳するには、ターゲット言語コードとともに translate_to を渡します。

Question 11

送信できるファイルのサイズはどれくらいで、大きなジョブはどう扱われますか？

Accepted Answer

5ページ以下のファイルは POST 応答でインラインで返されます。より大きなファイルは待ち受けまたは処理中として直ちに返されます。GET /api/v1/ocr/ をポールします。<uuid>ファイルのページ数を増やすには、 ファイルのページ数を増やす必要があります。

Question 12

公式のSDKはありますか？

Accepted Answer

これは、HTTPクライアントを使って、HTTPサーバにアクセスするためのアプリケーションです。APIは、HTTPS上の単純なRESTであり、HTTPクライアントを持つ言語からどれでも動作します。Python、Node.js、cURLの例を参照してください。インストールするSDKはありません。標準HTTPコードの数行が必要です。

Field	Type	Description
`file`	file	Required. The image or PDF to process.
`tier`	string	`cpu` (default, fast/printed) or `vlm` (premium AI: handwriting, layout, math).
`language`	string	`auto` (default) or a language code (`en`, `ch`, `ja`, `ar`, …).
`tool`	string	Optional tool slug (e.g. `extract-tables`, `handwriting-to-text`) to apply that tool's preset.
`translate_to`	string	For the translate tool, target language code.

Code	Meaning
`400`	No file, unsupported type, or file too large.
`401`	Missing or invalid API token.
`402`	Out of pages, daily/monthly free limit reached, or no credits. The body includes `used`/`cap`.
`404`	Job UUID not found.
`409`	Download requested before the job finished.

API OCR.chat

概要

認証

文書を提出

結果を取得

フォーマットをダウンロード

文書とチャット

コード例

パラメータ

エラーと制限

よくある質問