Processor list (original) (raw)

Skip to main content

Stay organized with collections Save and categorize content based on your preferences.

This page contains detailed information on all processors offered by Document AI. You can see a list of all processors by solution type.

All Document AI processors adhere to theData Processing and Security Terms.

Refer to the Managing processor versionsdocumentation for more details. Also, specific processor limits apply in addition to overall product quotas and limits.

Digitize text

Enterprise Document OCR (Optical Character Recognition)

Description Identify and extract text in different types of documents. This processor allows you to identify and extract text, including handwritten text, from documents in more than 200 languages. The processor also uses machine learning to perform a quality assessment of a document based on the readability of its content.
Category Digitize
Functions OCR, Quality Analysis
Release stage General availability
Access status Public
Type in API OCR_PROCESSOR
Supported languages Full list of languages Language Name BCP 47 Tag Script Handwriting supported Afrikaans af Latn Albanian sq Latn Arabic ar Arab Armenian hy Armn Belarusian be Cyrl Bangla bn Beng Bengali bn Beng Bulgarian bg Cyrl Catalan ca Latn Chinese zh Hani Croatian hr Latn Czech cs Latn Danish da Latn Dutch nl Latn English en Latn Estonian et Latn Filipino fil Latn Finnish fi Latn French fr Latn German de Latn Greek el Grek Gujarati gu Gujr Hebrew iw Hebr Hindi hi Deva Hungarian hu Latn Icelandic is Latn Indonesian id Latn Italian it Latn Japanese ja Jpan Kannada kn Knda Khmer km Khmr Korean ko Kore Lao lo Laoo Latvian lv Latn Lithuanian lt Latn Macedonian mk Cyrl Malay ms Latn Malayalam ml Mlym Marathi mr Deva Nepali ne Deva Norwegian no Latn Persian fa Arab Polish pl Latn Portuguese (Portugal & Brazil) pt Latn Punjabi pa Guru Romanian ro Latn Russian ru Cyrl Serbian sr Cyrl Slovak sk Latn Slovenian sl Latn Spanish es Latn Swedish sv Latn Tagalog tl Latn Tamil ta Taml Telugu te Telu Thai th Thai Turkish tr Latn Ukrainian uk Cyrl Vietnamese vi Latn Yiddish yi Hebr
Processor versions Version ID Release Channel Additional fields detected Additional languages supported Description pretrained-ocr-v1.2-2022-11-10 Stable None None Frozen model version of v1.0: Model files, configurations, and binaries of a version snapshot frozen in a container image for up to 18 months. pretrained-ocr-v2.0-2023-06-02 Stable None None Production-ready model specialized for document use cases. Includes access to all OCR add-ons. pretrained-ocr-v2.1-2024-08-07 Stable None None The main areas of improvement for v2.1 are: better printed text recognition, more precise checkbox detection and more accurate reading order. pretrained-ocr-v2.1.1-2025-01-31 Release candidate None None v2.1.1 is similar to V2.1, and is available in all regions except: US, EU, and asia-southeast1. For more information, see Managing processor versions.
Quotas and limits Maximum pages (online/synchronous requests): 15 Maximum pages (batch/offline/asynchronous requests): 500 Maximum pages (imageless mode online/synchronous requests): 30
Uptraining
Human-in-the-Loop
Sample Input File Open in new window.
Sample Output Open in new window.
Supported regions asia-south1 asia-southeast1 australia-southeast1 eu europe-west2 europe-west3 northamerica-northeast1 us
More information Enterprise Document OCR

Refer to Sample datasetsfor sample labeled and unlabeled datasets to use for training.

Custom Extractor

Description Extract fields from documents using generative AI or custom models; fine tune models to accurately extract data from your documents.
Category Extract
Functions OCR, Entity Extraction
Release stage General availability
Access status Public
Type in API CUSTOM_EXTRACTION_PROCESSOR
Notes If using generative AI for extraction, then: Only the English language is officially supported.Region availability is in the US, EU, northamerica-northeast1 and asia-southeast1.
Supported languages Full list of languages Language Name BCP 47 Tag Script Handwriting supported Afrikaans af Latn Arabic ar Arab Azerbaijani az Latn Azerbaijani (Cyrillic) az-Cyrl Cyrl Belarusian be Cyrl Bulgarian bg Cyrl Bosnian bs Latn Catalan ca Latn Cebuano ceb Latn Czech cs Latn Welsh cy Latn Danish da Latn German de Latn Greek el Grek English en Latn Esperanto eo Latn Spanish es Latn Estonian et Latn Basque eu Latn Persian fa Arab Finnish fi Latn Filipino fil Latn French fr Latn Irish ga Latn Galician gl Latn Hindi hi Deva Croatian hr Latn Haitian Creole ht Latn Hungarian hu Latn Indonesian id Latn Icelandic is Latn Italian it Latn Hebrew iw Hebr Japanese ja Jpan Javanese jv Latn Kazakh kk Cyrl Korean ko Kore Kyrgyz ky Cyrl Latin la Latn Lithuanian lt Latn Latvian lv Latn Macedonian mk Cyrl Mongolian mn Cyrl Marathi mr Deva Malay ms Latn Maltese mt Latn Nepali ne Deva Dutch nl Latn Norwegian no Latn Polish pl Latn Pashto ps Arab Portuguese (Portugal & Brazil) pt Latn Romanian ro Latn Russian ru Cyrl Russian (Petrine Orthography) ru-PETR1708 Cyrl Sanskrit sa Deva Slovak sk Latn Slovenian sl Latn Albanian sq Latn Serbian sr Cyrl Swedish sv Latn Swahili sw Latn Tagalog tl Latn Turkish tr Latn Ukrainian uk Cyrl Urdu ur Arab Uzbek uz Latn Uzbek (Cyrillic) uz-Cyrl Cyrl Vietnamese vi Latn Yiddish yi Hebr Chinese simplified zh-Hans Hani Chinese traditional zh-Hant Hani Zulu zu Latn
Processor versions Version ID Release Channel Additional fields detected Additional languages supported Description pretrained-foundation-model-v1.2-2024-05-10 Stable None None Production-ready candidate powered by Gemini 1.5 Pro LLM and newly developed technologies, including specialized language and vision models. Also includes advanced OCR features such as checkbox detection. Recommended for those who want to use the largest supported token limits or experiment with newer models. pretrained-foundation-model-v1.3-2024-08-31 Stable None None Production-ready candidate powered by the Gemini 1.5 Flash LLM. Also includes advanced OCR features such as checkbox detection. Recommended for those who want the lowest latency. pretrained-foundation-model-v1.4-2025-02-05 Stable None None Production-ready candidate powered by Gemini 2.0 Flash LLM. Production-ready candidate powered by the Gemini 2.0 Flash LLM. Also includes advanced OCR features such as checkbox detection. pretrained-foundation-model-v1.5-2025-05-05 Preview None None Production-ready candidate powered by Gemini 2.5 Flash LLM. Production-ready candidate powered by the Gemini 2.5 Flash LLM. Recommended for those who want to experiment with newer models. For more information, see Managing processor versions.
Quotas and limits Maximum pages (online/synchronous requests): 15 Maximum pages (batch/offline/asynchronous requests): 200 Maximum pages (imageless mode online/synchronous requests): 30
Normalized data types You can find more information in theEnrichment & normalization, andCreate dataset pages. Full list of normalized data types dateTime as STRING currency as STRING money as google.type.Money number as FLOAT or INTEGER
Uptraining
Human-in-the-Loop
Sample Input File Open in new window.
Sample Output Open in new window.
Supported regions asia-south1 asia-southeast1 australia-southeast1 eu europe-west2 europe-west3 northamerica-northeast1 us
More information Custom Extractor

Form Parser

Description Extract general key-value pairs (entity and checkbox), tables, and generic entities from documents in addition to OCR text. This processor applies advanced machine learning technologies to extract key-value pairs, checkboxes, and tables from documents more than 200 languages. This processor also leverages deep learning models to extract 11 generic entities that are common in various document types.
Category Extract
Functions OCR, Form Parsing, Entity Extraction
Release stage General availability
Access status Public
Type in API FORM_PARSER_PROCESSOR
Supported languages Full list of languages Language Name BCP 47 Tag Script Handwriting supported Afrikaans af Latn Albanian sq Latn Arabic ar Arab Belarusian be Cyrl Catalan ca Latn Chinese zh Hani Croatian hr Latn Czech cs Latn Danish da Latn Dutch nl Latn English en Latn Estonian et Latn Filipino fil Latn Finnish fi Latn French fr Latn German de Latn Hebrew iw Hebr Hindi hi Deva Hungarian hu Latn Icelandic is Latn Indonesian id Latn Italian it Latn Japanese ja Jpan Korean ko Kore Latvian lv Latn Lithuanian lt Latn Macedonian mk Cyrl Malay ms Latn Marathi mr Deva Nepali ne Deva Norwegian no Latn Persian fa Arab Polish pl Latn Portuguese (Portugal & Brazil) pt Latn Romanian ro Latn Russian ru Cyrl Serbian sr Cyrl Slovak sk Latn Slovenian sl Latn Spanish es Latn Swedish sv Latn Tagalog tl Latn Turkish tr Latn Ukrainian uk Cyrl Vietnamese vi Latn Yiddish yi Hebr
Processor versions Version ID Release Channel Additional fields detected Additional languages supported Description pretrained-form-parser-v1.0-2020-09-23 Stable None None Legacy version. For best quality and full feature set, use the Form Parser v2.0. pretrained-form-parser-v2.0-2022-11-10 Stable Show fields email phone url date_time address person organization quantity price id page_number None Recommended version. Supports generic entities and includes upgraded table, KVP, and checkbox model, as well as more than 200 languages. pretrained-form-parser-v2.1-2023-06-26 Release Candidate None None Public Preview version. Same model as v2.0 with native text extraction from digital PDF files enabled. For more information, see Managing processor versions.
Quotas and limits Maximum pages (online/synchronous requests): 15 Maximum pages (batch/offline/asynchronous requests): 100 Maximum pages (imageless mode online/synchronous requests): 30
Uptraining
Human-in-the-Loop
Sample Input File Open in new window.
Sample Output Open in new window.
Supported regions asia-south1 asia-southeast1 australia-southeast1 eu europe-west2 europe-west3 northamerica-northeast1 us
More information Form Parser

Layout Parser

Description Extracts document content elements (text, tables, and lists) and creates context-aware chunks. Layout Parser extracts document content elements like text, tables, and lists, and creates context-aware chunks that facilitate information retrieval in generative AI and discovery applications.
Category Extract
Functions Layout Parsing, Document Chunking
Release stage General availability
Access status Public
Type in API LAYOUT_PARSER_PROCESSOR
Notes This parser supports PDF, HTML and DOCX files.
Supported languages Full list of languages Language Name BCP 47 Tag Script Handwriting supported Afrikaans af Latn Albanian sq Latn Arabic ar Arab Armenian hy Armn Belarusian be Cyrl Bangla bn Beng Bengali bn Beng Bulgarian bg Cyrl Catalan ca Latn Chinese zh Hani Croatian hr Latn Czech cs Latn Danish da Latn Dutch nl Latn English en Latn Estonian et Latn Filipino fil Latn Finnish fi Latn French fr Latn German de Latn Greek el Grek Gujarati gu Gujr Hebrew iw Hebr Hindi hi Deva Hungarian hu Latn Icelandic is Latn Indonesian id Latn Italian it Latn Japanese ja Jpan Kannada kn Knda Khmer km Khmr Korean ko Kore Lao lo Laoo Latvian lv Latn Lithuanian lt Latn Macedonian mk Cyrl Malay ms Latn Malayalam ml Mlym Marathi mr Deva Nepali ne Deva Norwegian no Latn Persian fa Arab Polish pl Latn Portuguese (Portugal & Brazil) pt Latn Punjabi pa Guru Romanian ro Latn Russian ru Cyrl Serbian sr Cyrl Slovak sk Latn Slovenian sl Latn Spanish es Latn Swedish sv Latn Tagalog tl Latn Tamil ta Taml Telugu te Telu Thai th Thai Turkish tr Latn Ukrainian uk Cyrl Vietnamese vi Latn Yiddish yi Hebr
Processor versions Version ID Release Channel Additional fields detected Additional languages supported Description pretrained-layout-parser-v1.0-2024-06-03 Stable None None Recommended version. For more information, see Managing processor versions.
Quotas and limits Maximum pages (online/synchronous requests): 15 Maximum pages (batch/offline/asynchronous requests): 500 Maximum pages (imageless mode online/synchronous requests): 30
Uptraining
Human-in-the-Loop
Sample Input File Open in new window.
Sample Output Open in new window.
Supported regions eu us
More information Layout Parser

Explore pretrained processors

Bank Statement Parser

Description Extract from bank statements including name, account, transactions, etc.
Category Pretrained
Functions OCR, Entity Extraction
Release stage General availability
Access status Public
Type in API BANK_STATEMENT_PROCESSOR
Notes If a page of a multi-page input file is the correct document type and one of the supported versions, the processor performs entity extraction on the first supported document. If the processor doesn't find any applicable documents in the input file, the processor returns an error message.
Supported languages Language Name BCP 47 Tag Script Handwriting supported English en Latn
Processor versions Version ID Release Channel Additional fields detected Additional languages supported Description pretrained-bankstatement-v1.0-2021-08-08 Stable None None pretrained-bankstatement-v1.1-2021-08-13 Stable None None pretrained-bankstatement-v2.0-2021-12-10 Stable None None pretrained-bankstatement-v3.0-2022-05-16 Stable None None This version assumes that the input file contains a single bank statement. Unlike the default version, this version does not check the input file for bank statements and will not return an error if no bank statements are found. pretrained-bankstatement-v4.0-2023-07-31 Release Candidate None None pretrained-bankstatement-v5.0-2023-12-06 Release Candidate None None For more information, see Managing processor versions.
Quotas and limits Maximum pages (online/synchronous requests): 15 Maximum pages (batch/offline/asynchronous requests): 30 Maximum pages (imageless mode online/synchronous requests): 30
Fields detected in the earliest version You can also find this information in theField detected page. Full list of fields account_number account_type bank_address bank_name client_address client_name ending_balance starting_balance statement_date statement_end_date statement_start_date table_item table_item/transaction_deposit table_item/transaction_deposit_date table_item/transaction_deposit_description table_item/transaction_withdrawal table_item/transaction_withdrawal_date table_item/transaction_withdrawal_description
Enriched fields You can find more information in theEnrichment & normalization page. Full list of enriched fields bank_address bank_name
Normalized fields You can find more information in theEnrichment & normalization page. Full list of normalized fields ending_balance starting_balance statement_date statement_end_date statement_start_date table_item/transaction_deposit table_item/transaction_deposit_date table_item/transaction_withdrawal table_item/transaction_withdrawal_date
Uptraining
Human-in-the-Loop
Labeling Instructions Open in new window.
Sample Input File Open in new window.
Sample Output Open in new window.
Supported regions eu us

W2 Parser

Description Extract from Form W2, including employee, employer, wages, etc.
Category Pretrained
Functions OCR, Entity Extraction
Release stage General availability
Access status Public
Type in API FORM_W2_PROCESSOR
Notes If a page of a multi-page input file is the correct document type and one of the supported versions, the processor performs entity extraction on the first supported document. If the processor doesn't find any applicable documents in the input file, the processor returns an error message.
Supported languages Language Name BCP 47 Tag Script Handwriting supported English en Latn
Supported form/versions 2020 (standard and customized versions) 2019 (standard and customized versions) 2018 (standard and customized versions)
Processor versions Version ID Release Channel Additional fields detected Additional languages supported Description pretrained-w2-v1.0-2020-10-01 Stable None None pretrained-w2-v1.1-2022-01-27 Stable None None pretrained-w2-v1.2-2022-01-28 Stable Show fields AllocatedTips ControlNumber DependentCareBenefits EIN EmployeeAddress EmployeeName EmployerNameAndAddress EmployerStateIdNumber_Line1 FederalIncomeTaxWithheld FormYear LocalIncomeTax_Line1 LocalityName_Line1 LocalWagesTipsEtc_Line1 MedicareTaxWithheld MedicareWagesAndTips NonqualifiedPlans SocialSecurityTaxWithheld SocialSecurityTips SocialSecurityWages SSN State_Line1 StateIncomeTax_Line1 StateWagesTipsEtc_Line1 WagesTipsOtherCompensation None Quality improvements and supporting new fields; does not include splitter. pretrained-w2-v2.0-2022-03-30 Release Candidate Show fields AllocatedTips ControlNumber DependentCareBenefits EIN EmployeeAddress_AdditionalStreetAddressOrPostalBox EmployeeAddress_City EmployeeAddress_State EmployeeAddress_StreetAddressOrPostalBox EmployeeAddress_Zip EmployeeName_FirstName EmployeeName_LastName EmployeeName_MiddleNameOrInitial EmployerAddress_AdditionalStreetAddressOrPostalBox EmployerAddress_City EmployerAddress_State EmployerAddress_StreetAddressOrPostalBox EmployerAddress_Zip EmployerName EmployerStateIdNumber_Line1 FederalIncomeTaxWithheld FormYear LocalIncomeTax_Line1 LocalWagesTipsEtc_Line1 LocalityName_Line1 MedicareTaxWithheld MedicareWagesAndTips NonqualifiedPlans SSN SocialSecurityTaxWithheld SocialSecurityTips SocialSecurityWages StateIncomeTax_Line1 StateWagesTipsEtc_Line1 State_Line1 WagesTipsOtherCompensation a_Code a_Value b_Code b_Value c_Code c_Value d_Code d_Value None Quality improvements and support for box 12 fields and fine-grained predictions of EmployeeName, EmployeeAddress, and EmployerNameAndAddress, all of which are no longer part of the output and are replaced with additional fields. pretrained-w2-v2.1-2022-06-08 Stable Show fields AllocatedTips ControlNumber DependentCareBenefits EIN EmployeeAddress_AdditionalStreetAddressOrPostalBox EmployeeAddress_City EmployeeAddress_State EmployeeAddress_StreetAddressOrPostalBox EmployeeAddress_Zip EmployeeName_FirstName EmployeeName_LastName EmployeeName_MiddleNameOrInitial EmployeeName_Suffix EmployerAddress_AdditionalStreetAddressOrPostalBox EmployerAddress_City EmployerAddress_State EmployerAddress_StreetAddressOrPostalBox EmployerAddress_Zip EmployerName EmployerStateIdNumber_Line1 FederalIncomeTaxWithheld FormYear LocalIncomeTax_Line1 LocalWagesTipsEtc_Line1 LocalityName_Line1 MedicareTaxWithheld MedicareWagesAndTips NonqualifiedPlans SSN SocialSecurityTaxWithheld SocialSecurityTips SocialSecurityWages StateIncomeTax_Line1 StateWagesTipsEtc_Line1 State_Line1 WagesTipsOtherCompensation a_Code a_Value b_Code b_Value c_Code c_Value d_Code d_Value None Similar to version pretrained-w2-v2.0-2022-03-30 with further quality enhancements and introducing one more entity EmployeeName_Suffix. For more information, see Managing processor versions.
Quotas and limits Maximum pages (online/synchronous requests): 15 Maximum pages (batch/offline/asynchronous requests): 15 Maximum pages (imageless mode online/synchronous requests): 15
Fields detected in the earliest version You can also find this information in theField detected page. Full list of fields ControlNumber EIN EmployeeAddress EmployeeName EmployerNameAndAddress FederalIncomeTaxWithheld MedicareTaxWithheld MedicareWagesAndTips SSN SocialSecurityTaxWithheld SocialSecurityWages WagesTipsOtherCompensation
Enriched fields You can find more information in theEnrichment & normalization page. Full list of enriched fields EmployerNameAndAddress EIN
Uptraining
Human-in-the-Loop
Sample Input File Open in new window.
Sample Output Open in new window.
Supported regions eu us

US Passport Parser

Description Extract fields such as names, document ID, date of birth, etc.
Category Pretrained
Functions OCR, Entity Extraction
Release stage General availability
Access status Public
Type in API US_PASSPORT_PROCESSOR
Supported languages Language Name BCP 47 Tag Script Handwriting supported English en Latn
Processor versions Version ID Release Channel Additional fields detected Additional languages supported Description pretrained-us-passport-v1.0-2021-06-14 Stable None None For more information, see Managing processor versions.
Quotas and limits Maximum pages (online/synchronous requests): 2 Maximum pages (batch/offline/asynchronous requests): 2 Maximum pages (imageless mode online/synchronous requests): 2
Fields detected in the earliest version You can also find this information in theField detected page. Full list of fields Family Name Given Names Document Id Expiration Date Date Of Birth Issue Date MRZ Code Portrait
Normalized fields You can find more information in theEnrichment & normalization page. Full list of normalized fields Date Of Birth Expiration Date Issue Date
Uptraining
Human-in-the-Loop
Sample Input File Open in new window.
Sample Output Open in new window.
Supported regions eu us

Utility Parser

Description Extract text and values from utility bills such as supplier name and previous paid amount.
Category Pretrained
Functions OCR, Entity Extraction
Release stage General availability
Access status Limited[*]
Type in API UTILITY_PROCESSOR
Supported languages Language Name BCP 47 Tag Script Handwriting supported English en Latn
Processor versions Version ID Release Channel Additional fields detected Additional languages supported Description pretrained-utility-v1.1-2021-04-09 Stable None None pretrained-utility-v1.2-2022-12-15 Release Candidate None None For more information, see Managing processor versions.
Quotas and limits Maximum pages (online/synchronous requests): 10 Maximum pages (batch/offline/asynchronous requests): 200 Maximum pages (imageless mode online/synchronous requests): 30
Fields detected in the earliest version You can also find this information in theField detected page. Full list of fields adjusted_amount amount_due balance_transfer_amount carrier currency currency_exchange_rate delivery_date deposit_credited_amount due_date freight_amount invoice_date invoice_id late_fee_amount line_item line_item/amount line_item/description line_item/frequency line_item/product_code line_item/purchase_order line_item/quantity line_item/service_address line_item/service_end_date line_item/service_id_1 line_item/service_id_2 line_item/service_start_date line_item/supplier_account_number line_item/tax_amount line_item/unit_number line_item/unit_of_measure line_item/unit_price line_item/usage net_amount payment_terms prior_amount_due prior_paid_amount purchase_order receiver_address receiver_email receiver_name receiver_phone receiver_tax_id receiver_website reclaimed_water remit_to_address remit_to_name service service/service_end_date service/service_id service/service_start_date service/unit_of_measure service/usage service_address service_end_date service_id service_start_date ship_from_address ship_from_name ship_to_address ship_to_name supplier_account_number supplier_address supplier_email supplier_iban supplier_name supplier_payment_ref supplier_phone supplier_registration supplier_tax_id supplier_website tampering total_amount total_tax_amount usage vat vat/amount vat/category_code vat/tax_amount vat/tax_rate
Normalized fields You can find more information in theEnrichment & normalization page. Full list of normalized fields adjusted_amount amount_due balance_transfer_amount currency currency_exchange_rate delivery_date due_date invoice_date late_fee_amount line_item/amount line_item/quantity line_item/tax_amount line_item/unit_price net_amount prior_amount_due prior_paid_amount total_amount total_tax_amount
Uptraining
Human-in-the-Loop
Labeling Instructions Open in new window.
Sample Input File Open in new window.
Sample Output Open in new window.
Supported regions eu us

Identity Document Proofing Parser

Description Predict the validity of ID documents using multiple signals. Identity Document Proofing Processor is designed to help predict the validity of ID documents with four different signals. The processor currently returns information from the following signals: fraud_signals_is_identity_document detection: Predicts whether an image contains a recognized identity document. fraud_signals_suspicious_words detection: Predicts whether words are present that aren't typical on IDs. fraud_signals_image_manipulation detection: Predicts whether the image was altered or tampered with an image editing tool. fraud_signals_online_duplicate detection: Predicts whether the image can be found online (US only).
Category Pretrained
Functions OCR, Quality Analysis
Release stage General availability
Access status Public
Type in API ID_PROOFING_PROCESSOR
Notes The Online Duplicate Detection feature is currently processed in US data centers. Regional and multi-regional support is unavailable for this feature outside of the US. This processor is supported by algorithms that are updated more frequently than new processor versions are released. For this reason, the processor might return different outputs over time even when using the same processor version. For example, the Online Duplicate Detection system monitors images present on the web. The system's behavior can then change more quickly than can be tracked in processor versions. Refer to notes on Responsible AI[†] and Human review.[‡]
Supported languages Language Name BCP 47 Tag Script Handwriting supported English en Latn
Supported form/versions Support for US passports, passcards and driver's licenses.
Processor versions Version ID Release Channel Additional fields detected Additional languages supported Description pretrained-id-proofing-v1.0-2022-10-03 Stable None None pretrained-id-proofing-v1.1-2023-05-18 Release Candidate Show fields fraud_signals_photocopy_detection None Additional photocopy detection signal pretrained-id-proofing-v1.2-2023-10-04 Release Candidate Show fields fraud_signals_photocopy_detection None For more information, see Managing processor versions.
Quotas and limits Maximum pages (online/synchronous requests): 2 Maximum pages (batch/offline/asynchronous requests): 2 Maximum pages (imageless mode online/synchronous requests): 2
Fields detected in the earliest version You can also find this information in theField detected page. Full list of fields fraud_signals_is_identity_document fraud_signals_suspicious_words evidence_suspicious_word evidence_inconclusive_suspicious_word fraud_signals_image_manipulation fraud_signals_online_duplicate (US only) fraud_signals_photocopy_detection evidence_hostname (US only) evidence_thumbnail_url (US only)
Normalized fields You can find more information in theEnrichment & normalization page. Full list of normalized fields fraud_signals_image_manipulation fraud_signals_online_duplicate (US only) fraud_signals_is_identity_document fraud_signals_suspicious_words
Uptraining
Human-in-the-Loop
Sample Input File Open in new window.
Sample Output Open in new window.
Supported regions eu us

Pay Slip Parser

Description Extract from pay slips, including name, business, amounts, etc.
Category Pretrained
Functions OCR, Entity Extraction
Release stage General availability
Access status Public
Type in API PAYSTUB_PROCESSOR
Notes If the multi-page input document contains more than one valid pay slips, the processor extracts entities from only the first valid pay slip. If no pay slips are found in the input file, the processor returns an error message.
Supported languages Language Name BCP 47 Tag Script Handwriting supported English en Latn
Processor versions Version ID Release Channel Additional fields detected Additional languages supported Description pretrained-paystub-v1.0-2021-03-19 Stable None None pretrained-paystub-v1.1-2021-08-13 Stable Show fields net_pay net_pay_ytd employee_account_number None Quality improvement and new fields support; pretrained-paystub-v1.2-2021-12-10 Stable None None pretrained-paystub-v2.0-2022-05-17 Release Candidate Show fields deduction_item deduction_item/deduction_type deduction_item/deduction_this_period deduction_item/deduction_ytd direct_deposit_item direct_deposit_item/direct_deposit direct_deposit_item/employee_account_number earning_item earning_item/earning_type earning_item/earning_rate earning_item/earning_hours earning_item/earning_this_period earning_item/earning_ytd page_number tax_item tax_item/tax_type tax_item/tax_this_period tax_item/tax_ytd federal_additional_tax federal_allowance federal_marital_status state_additional_tax state_allowance state_marital_status None This version assumes that the input file contains a single pay slip. Unlike the default version, this version does not check the input file for pay slips and will not return an error if no pay slips are found. Quality improvement, new fields support and new schema. Bonus, Commissions, Holiday, Overtime, Regular Pay and Vacation are now part of earning_item/earning_this_period, and their year-to-date versions are in earning_item/earning_ytd. Direct Deposit and Employee Account Number are now nested under direct_deposit_item.Async page limit is 10. pretrained-paystub-v2.0-2022-07-22 Stable None None Quality improvement and uptraining enhancements. pretrained-paystub-v3.0-2023-12-06 Release Candidate None None For more information, see Managing processor versions.
Quotas and limits Maximum pages (online/synchronous requests): 15 Maximum pages (batch/offline/asynchronous requests): 50 Maximum pages (imageless mode online/synchronous requests): 30
Fields detected in the earliest version You can also find this information in theField detected page. Full list of fields bonus bonus_ytd commissions commissions_ytd direct_deposit employee_account_number (Added in "pretrained-paystub-v1.1-2021-08-13") employee_address employee_name employer_address employer_name end_date gross_earnings gross_earnings_ytd holiday holiday_ytd net_pay (Added in "pretrained-paystub-v1.1-2021-08-13") net_pay_ytd (Added in "pretrained-paystub-v1.1-2021-08-13") overtime overtime_ytd pay_date regular_pay regular_pay_ytd ssn start_date vacation vacation_ytd
Enriched fields You can find more information in theEnrichment & normalization page. Full list of enriched fields employer_address employer_name
Normalized fields You can find more information in theEnrichment & normalization page. Full list of normalized fields bonus bonus_ytd commissions commissions_ytd direct_deposit end_date gross_earnings gross_earnings_ytd holiday holiday_ytd net_pay net_pay_ytd overtime overtime_ytd pay_date regular_pay regular_pay_ytd start_date vacation vacation_ytd
Uptraining
Human-in-the-Loop
Labeling Instructions Open in new window.
Supported regions eu us

US Driver License Parser

Description Extract fields such as names, document ID, date of birth, etc.
Category Pretrained
Functions OCR, Entity Extraction
Release stage General availability
Access status Public
Type in API US_DRIVER_LICENSE_PROCESSOR
Supported languages Language Name BCP 47 Tag Script Handwriting supported English en Latn
Supported form/versions Supports all 50 States and D.C.
Processor versions Version ID Release Channel Additional fields detected Additional languages supported Description pretrained-us-driver-license-v1.0-2021-06-14 Stable None None For more information, see Managing processor versions.
Quotas and limits Maximum pages (online/synchronous requests): 2 Maximum pages (batch/offline/asynchronous requests): 2 Maximum pages (imageless mode online/synchronous requests): 2
Fields detected in the earliest version You can also find this information in theField detected page. Full list of fields Family Name Given Names Document Id Expiration Date Date Of Birth Issue Date Address Portrait
Normalized fields You can find more information in theEnrichment & normalization page. Full list of normalized fields Date Of Birth Expiration Date Issue Date
Uptraining
Human-in-the-Loop
Sample Input File Open in new window.
Sample Output Open in new window.
Supported regions eu us

Expense Parser

Description Extract text and values from expense documents such as expense date, supplier name, total amount, and currency.
Category Pretrained
Functions OCR, Entity Extraction
Release stage General availability
Access status Public
Type in API EXPENSE_PROCESSOR
Supported languages Full list of languages Language Name BCP 47 Tag Script Handwriting supported German de Latn English en Latn Spanish es Latn French fr Latn Japanese ja Jpan Dutch nl Latn
Processor versions Version ID Release Channel Additional fields detected Additional languages supported Description pretrained-expense-v1.1-2021-04-09 Stable None None Launched in April 2021. pretrained-expense-v1.3.2-2024-09-11 Release Candidate Show fields credit_card_last_four_digits line_item/quantity payment_type ja: Japanese An upgrade to v1.3 with an enhanced underlying vision model. pretrained-expense-v1.4-2022-11-18 Release Candidate Show fields traveler_name reservation_id line_item/transaction_date ja: Japanese it: Italian pt: Portuguese (Portugal & Brazil) Performance improvements and support for uptraining. Maximum pages (online/synchronous requests) limit has been increased to 15. pretrained-expense-v1.4.2-2024-09-12 Release Candidate Show fields traveler_name reservation_id line_item/transaction_date ja: Japanese it: Italian pt: Portuguese (Portugal & Brazil) An upgrade to v1.4 with an enhanced underlying vision model. For more information, see Managing processor versions.
Quotas and limits Maximum pages (online/synchronous requests): 10 Maximum pages (batch/offline/asynchronous requests): 10 Maximum pages (imageless mode online/synchronous requests): 10
Fields detected in the earliest version You can also find this information in theField detected page. Full list of fields credit_card_last_four_digits currency end_date net_amount payment_type purchase_time receipt_date start_date supplier_address supplier_city supplier_name tip_amount total_amount total_tax_amount line_item line_item/amount line_item/description line_item/product_code
Enriched fields You can find more information in theEnrichment & normalization page. Full list of enriched fields supplier_address supplier_name supplier_phone
Normalized fields You can find more information in theEnrichment & normalization page. Full list of normalized fields currency total_amount total_tax_amount net_amount receipt_date purchase_time start_date end_date line_item/amount line_item/payment_date line_item/payment_amount
Uptraining
Human-in-the-Loop
Labeling Instructions Open in new window.
Sample Input File Open in new window.
Sample Output Open in new window.
Supported regions asia-southeast1 australia-southeast1 eu northamerica-northeast1 us

Invoice Parser

Description Extract text and values from invoices such as invoice number, supplier name, invoice amount, tax amount, invoice date, due date. The invoice Parser extracts both header and line item fields, such as invoice number, supplier name, invoice amount, tax amount, invoice date, due date, and line item amounts.
Category Pretrained
Functions OCR, Entity Extraction
Release stage General availability
Access status Public
Type in API INVOICE_PROCESSOR
Supported languages Full list of languages Language Name BCP 47 Tag Script Handwriting supported German de Latn English en Latn Spanish es Latn Estonian et Latn French fr Latn Italian it Latn Latvian lv Latn Lithuanian lt Latn Dutch nl Latn Portuguese (Portugal & Brazil) pt Latn Romanian ro Latn Swedish sv Latn
Processor versions Version ID Release Channel Additional fields detected Additional languages supported Description pretrained-invoice-v1.1-2021-04-09 Stable None None pretrained-invoice-v1.2-2022-02-18 Stable None None Deprecation is planned soon. pretrained-invoice-v1.3-2022-07-15 Stable None it: Italian pt: Portuguese (Portugal & Brazil) ro: Romanian sv: Swedish et: Estonian lv: Latvian lt: Lithuanian Uptrainable processor version. Maximum pages (online/synchronous requests) has been increased to 15. pretrained-invoice-v1.4-2022-10-21 Release Candidate None None Uptrainable processor version. Maximum pages (online/synchronous requests) has been increased to 15. pretrained-invoice-v1.5-2023-09-15 Release Candidate None None pretrained-invoice-v2.0-2023-12-06 Release Candidate None None For more information, see Managing processor versions.
Quotas and limits Maximum pages (online/synchronous requests): 15 Maximum pages (batch/offline/asynchronous requests): 200 Maximum pages (imageless mode online/synchronous requests): 30
Fields detected in the earliest version You can also find this information in theField detected page. Full list of fields amount_paid_since_last_invoice carrier currency currency_exchange_rate delivery_date due_date freight_amount invoice_date invoice_id line_item line_item/amount line_item/description line_item/product_code line_item/purchase_order line_item/quantity line_item/unit line_item/unit_price net_amount payment_terms purchase_order receiver_address receiver_email receiver_name receiver_phone receiver_tax_id receiver_website remit_to_address remit_to_name ship_from_address ship_from_name ship_to_address ship_to_name supplier_address supplier_email supplier_iban supplier_name supplier_payment_ref supplier_phone supplier_registration supplier_tax_id supplier_website total_amount total_tax_amount vat vat/amount vat/category_code vat/tax_amount vat/tax_rate
Enriched fields You can find more information in theEnrichment & normalization page. Full list of enriched fields supplier_address supplier_name supplier_phone
Normalized fields You can find more information in theEnrichment & normalization page. Full list of normalized fields amount_paid_since_last_invoice currency currency_exchange_rate delivery_date due_date freight_amount invoice_date net_amount total_amount total_tax_amount line_item/amount line_item/quantity line_item/unit_price vat/amount vat/tax_amount vat/tax_rate
Uptraining
Human-in-the-Loop
Labeling Instructions Open in new window.
Sample Input File Open in new window.
Sample Output Open in new window.
Supported regions asia-south1 asia-southeast1 australia-southeast1 eu northamerica-northeast1 us

Classify documents

Custom Classifier

Description Train a model to classify a document type from a set of classes.
Category Classify
Functions OCR, Classification
Release stage General availability
Access status Public
Type in API CUSTOM_CLASSIFICATION_PROCESSOR
Supported languages Language Name BCP 47 Tag Script Handwriting supported English en Latn
Quotas and limits Maximum pages (online/synchronous requests): 15 Maximum pages (batch/offline/asynchronous requests): 200 Maximum pages (imageless mode online/synchronous requests): 30
Uptraining
Human-in-the-Loop
Sample Input File Open in new window.
Sample Output Open in new window.
Supported regions asia-south1 asia-southeast1 australia-southeast1 eu europe-west2 europe-west3 northamerica-northeast1 us
More information Create a custom classification processor

Custom Splitter

Description Train a model to split a file containing multiple documents into individual, classified documents.
Category Classify
Functions OCR, Classification, Splitting
Release stage General availability
Access status Public
Type in API CUSTOM_SPLITTING_PROCESSOR
Notes i18n can be supported through custom training options only.
Supported languages Language Name BCP 47 Tag Script Handwriting supported English en Latn
Quotas and limits Maximum pages (online/synchronous requests): 15 Maximum pages (batch/offline/asynchronous requests): 1000 Maximum pages (imageless mode online/synchronous requests): 30
Uptraining
Human-in-the-Loop
Sample Input File Open in new window.
Sample Output Open in new window.
Supported regions asia-south1 asia-southeast1 australia-southeast1 eu europe-west2 europe-west3 northamerica-northeast1 us
More information Create a custom splitter processor

Summarize documents

Summarizer

Description Get abstract and bullet point summaries for short and long documents.
Category pretrained
Functions Summarize
Release stage Preview
Access status Public
Type in API SUMMARY_PROCESSOR
Supported languages Language Name BCP 47 Tag Script Handwriting supported English en Latn
Processor versions Version ID Release Channel Additional fields detected Additional languages supported Description pretrained-foundation-model-v1.0-2023-08-22 Stable None None Google foundation model For more information, see Managing processor versions.
Quotas and limits Maximum pages (online/synchronous requests): 15 Maximum pages (batch/offline/asynchronous requests): 250 Maximum pages (imageless mode online/synchronous requests): 30
Uptraining
Human-in-the-Loop
Sample Input File Open in new window.
Sample Output Open in new window.
Supported regions us
More information Custom summarizer

Except as otherwise noted, the content of this page is licensed under the Creative Commons Attribution 4.0 License, and code samples are licensed under the Apache 2.0 License. For details, see the Google Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2025-06-06 UTC.