Atlas automatically classifies and extracts structured data from a wide range of documents used in financial workflows. You do not need to declare a document type at upload time. Atlas detects the type from the image content and returns the fields for that type in the corresponding ocr_data entry. Each document type returns a distinct set of extracted fields, and every field follows the same { "value": ..., "confidence_score": ... } structure. For the full list of fields per document type, see Supported Documents.
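Because every extracted field shares the same value/confidence_score shape, one helper can unwrap any payload regardless of document type. The following is a minimal Python sketch, not part of Atlas: the function name split_by_confidence, the 0.9 threshold, and the sample values are all assumptions made for illustration.

```python
from typing import Any


def split_by_confidence(data: dict[str, Any], threshold: float = 0.9):
    """Split extracted fields into accepted values and low-confidence values.

    The threshold is a hypothetical choice; tune it to your own risk policy.
    """
    accepted, needs_review = {}, {}
    for name, field in data.items():
        # Skip entries that are not {value, confidence_score} objects,
        # such as the plain-string document_type.
        if not isinstance(field, dict) or "value" not in field:
            continue
        score = field.get("confidence_score")
        if score is not None and score >= threshold:
            accepted[name] = field["value"]
        else:
            needs_review[name] = field["value"]
    return accepted, needs_review


# Illustrative payload; the scores here are made up for the example.
sample = {
    "document_type": "PAN",
    "pan_number": {"value": "PLZPM5601F", "confidence_score": 0.99},
    "dob": {"value": "16/07/1986", "confidence_score": 0.85},
}
accepted, needs_review = split_by_confidence(sample)
print(accepted)
print(needs_review)
```

Fields with a missing score are routed to review rather than accepted, which is the conservative choice for a lending workflow.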

Document categories

These documents establish individual identity and are used for KYC verification in lending applications.
| Document type | Description |
| --- | --- |
| AADHAAR | Indian national identity card. Extracts name, address, date of birth, and Aadhaar number. Includes a redact_url for a version of the document with the Aadhaar number masked. Use this when storing or transmitting copies to comply with UIDAI guidelines. |
| PAN | Permanent Account Number card. Extracts name, father’s name, date of birth, and PAN number. Also returns document_quality_flag to identify whether the document is an original, xerox_copy, or photo_of_photo. |
| DRIVING_LICENSE | State-issued driving licence. |
| VOTER_ID | Election Commission voter identity card. |
| PASSPORT | Indian passport. |
The document_quality_flag field (values: original, xerox_copy, photo_of_photo) is currently returned for PAN cards. Use it to flag low-quality copies for manual review before making credit decisions.
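One way to act on this flag is a small check that routes copies to manual review. The sketch below is an assumption about how a client might do this, not Atlas behaviour: the helper name needs_manual_review and the decision to treat only original as clean are hypothetical.

```python
# Documented flag values that indicate a copy rather than the original card.
# Treating both as review-worthy is an assumption of this sketch.
REVIEW_FLAGS = {"xerox_copy", "photo_of_photo"}


def needs_manual_review(data: dict) -> bool:
    """Return True when the payload is flagged as a copy of the document."""
    flag = data.get("document_quality_flag")
    if not flag:
        # Flag absent (for example, a non-PAN document): no review is forced here.
        return False
    return flag.get("value") in REVIEW_FLAGS


pan = {
    "document_type": "PAN",
    "document_quality_flag": {"value": "xerox_copy", "confidence_score": 0.97},
}
print(needs_manual_review(pan))
```

Because the flag is currently returned only for PAN cards, the helper treats a missing flag as "no opinion" instead of failing.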
These documents verify the legal registration and operational status of a business entity.
| Document type | Description |
| --- | --- |
| GST_CERT | GST registration certificate. Extracts GST number, legal name, address, constitution of business, and validity dates. |
| UDYAM_CERT | Udyam registration certificate for MSMEs. Extracts registration number, enterprise name, owner name, address, type of enterprise, and NIC codes. |
| SHOP_ACT_LICENSE | Maharashtra Shops and Establishments Act licence. Extracts establishment name, employer name, address, nature of business, and total employees. |
| FSSAI | Food Safety and Standards Authority of India licence. Extracts registration number, business name, address, kind of business, and validity dates. |
| DRUG_LICENSE | Drug licence issued under the Drugs and Cosmetics Act. |
| PARTNERSHIP_DEED | Partnership agreement between partners. Extracts partner names and firm details. |
| Memorandum of Association | Memorandum of Association of a company. |
| Articles of Association | Articles of Association of a company. |
| Certificate of Incorporation | Certificate of Incorporation issued by MCA. Extracts company name, CIN, and incorporation date. |
| BOARD_RESOLUTION | Board resolution authorizing specific actions (e.g., loan acceptance). |
These documents capture transaction and financing details for loan underwriting.
| Document type | Description |
| --- | --- |
| EBILL | Electricity bill used for address verification. Extracts consumer name, address, and billing details. |
| RTO_FORM_26 | Vehicle ownership certificate (Form 26). |
| RTO_FORM_29 | Notice of transfer of ownership (Form 29). |
| RTO_FORM_30 | Application for transfer of ownership (Form 30). |
| RTO_FORM_34 | Hypothecation agreement form. |
| RTO_FORM_35 | Termination of hypothecation form. |
| RC_BOOK | Vehicle registration certificate. Extracts owner name, registration number, engine and chassis details. |
| INVOICE | Dealer invoice for a product purchase. Extracts customer name, dealer details, product information, invoice amount, and hypothecation status. |
| DELIVERY_ORDER | Dealer delivery order. Extracts dealer name and address, customer name and delivery address, product details, serial/IMEI numbers, and EMI amount. |
| BANK_STATEMENT | Bank account statement. Extracts account number, IFSC code, bank name, account holder name, and statement period. |
| MARGIN_MONEY_RECEIPT | Receipt for margin/down payment. Extracts customer name, dealer name, product name, MMR amount, and hypothecation status. |
| DEBIT_NOTE | Dealer debit note for ancillary charges. Extracts customer name, insurance charges, RTO charges, accessory charges, and hypothecation charges. |
| ACH_FORM | Automated Clearing House mandate form. |
| PURCHASE_ORDER | Purchase order between buyer and seller. Extracts buyer/seller names, GSTINs, and total invoice amount. |
| TAX_INVOICE | GST tax invoice. |
| INSURANCE | Vehicle insurance policy. Extracts insured name, insurer, policy number, engine and chassis numbers, IDV, gross premium, tenure, and hypothecation status. |
| ITR | Income Tax Return document. Extracts PAN, total income, and filing details. |
| SALARY_SLIP | Salary slip issued by employer. Extracts employee name, employer name, net salary, and pay period. |
These documents are used in property loan and mortgage underwriting. Many are Maharashtra-specific revenue and registration records.
| Document type | Description |
| --- | --- |
| SALE_DEED | Registered sale deed for property transfer. |
| INDEX_2 | Index II extract from the Sub-Registrar’s office. Extracts owner names, mortgagor name, loan amount, property valuation, survey numbers, property area, and mortgage dates. |
| SATBARA | 7/12 extract (Satbara Utara) from Maharashtra land records. |
| RENT_AGREEMENT | Rental or leave-and-licence agreement. |
| GIFT_DEED | Registered gift deed. |
| NAMUNA_8 | Namuna 8 form from Gujarat revenue records. |
| CHATUSIMA | Chatusima boundary document for land parcels. |
| GAVTHAN_CERTIFICATE | Gavthan certificate for village settlement land. |
| PROPERTY_TAX_RECEIPT | Municipal property tax payment receipt. |
| PROPERTY_CARD | Property card from urban land records. |
| GUNTHEWARI_CERTIFICATE | Gunthewari regularisation certificate. |
| MEMORANDUM_OF_DEPOSIT_OF_TITLE_DEEDS | Memorandum of deposit of title deeds (MODT) for equitable mortgage. |
These document types cover photographs rather than scanned paperwork, and are used to verify physical asset condition and customer identity at the point of delivery.
| Document type | Description |
| --- | --- |
| IMEI_CLOSEUP_IMAGE | Close-up photo of a device’s IMEI/serial sticker. Extracts model, RAM, ROM, IMEI numbers, product SKU, price, and serial number. |
| ASSET_OPEN_BOX_IMAGE | Photo of an open-box asset. Detects whether the customer is present (customer_present) and counts the number of visible faces (number_of_faces_visible). |
| AADHAR_EXTRACT_IMAGE | Cropped Aadhaar photo extract. Detects whether a photo is present (photo_present). Used to verify that the physical Aadhaar card contains a photograph. |
| PRODUCT_STICKER | Product label or sticker image. |
| GOLD_PLEDGE_CARDS | Gold pledge card issued by lender. Extracts customer name, pledged asset details, and loan amount. |
| GAS_BILL | Gas utility bill used for address verification. |
| PHONE_BILL | Telephone bill used for address verification. |
| WIFI_BILL | Internet service bill used for address verification. |

Type-specific OCR fields

Each document_type in the ocr_data array returns a different field schema. For example, an AADHAAR document returns name, address, aadhaar_number, and date_of_birth, while an INVOICE returns customer_name, dealer_address, total_invoice_amount, and so on. Your processing logic should branch on document_type to read the correct fields.
```json
{
  "ocr_data": [
    {
      "document_id": "doc-001",
      "error_code": "",
      "error_reason": "",
      "data": {
        "document_type": "PAN",
        "name_as_per_pan": { "value": "D MANIKANDAN DURAISAMY", "confidence_score": 0.95 },
        "pan_number": { "value": "PLZPM5601F", "confidence_score": 0.99 },
        "dob": { "value": "16/07/1986", "confidence_score": 0.98 },
        "document_quality_flag": { "value": "original", "confidence_score": 0.97 }
      }
    }
  ]
}
```
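Branching on document_type can be kept tidy with a lookup table of per-type readers. The following Python sketch illustrates the idea under stated assumptions: the reader functions, the HANDLERS table, and the output keys name and id_number are hypothetical, while the input field names come from the PAN sample and the AADHAAR fields listed above.

```python
def read_pan(data: dict) -> dict:
    # Field names taken from the PAN sample response.
    return {
        "name": data["name_as_per_pan"]["value"],
        "id_number": data["pan_number"]["value"],
    }


def read_aadhaar(data: dict) -> dict:
    # Field names taken from the AADHAAR fields mentioned in the text.
    return {
        "name": data["name"]["value"],
        "id_number": data["aadhaar_number"]["value"],
    }


# Hypothetical dispatch table: one reader per document_type you support.
HANDLERS = {"PAN": read_pan, "AADHAAR": read_aadhaar}


def read_document(entry: dict):
    """Pick the reader matching the entry's document_type, or return None."""
    data = entry.get("data") or {}
    handler = HANDLERS.get(data.get("document_type"))
    return handler(data) if handler else None


entry = {
    "document_id": "doc-001",
    "error_code": "",
    "error_reason": "",
    "data": {
        "document_type": "PAN",
        "name_as_per_pan": {"value": "D MANIKANDAN DURAISAMY", "confidence_score": 0.95},
        "pan_number": {"value": "PLZPM5601F", "confidence_score": 0.99},
    },
}
print(read_document(entry))
```

Returning None for an unknown type lets the caller decide whether to skip the document or send it for manual handling.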

Extraction failures

If Atlas cannot extract data from a document, the data object will be empty or partially populated, and the error_code and error_reason fields will contain a machine-readable code and a human-readable explanation.
```json
{
  "document_id": "doc-002",
  "error_code": "UNRECOGNISED",
  "error_reason": "document not in list",
  "data": {}
}
```
Common error codes include UNRECOGNISED (document type not supported) and INVALID_DOC (document could not be processed, for example due to image quality or multiple subjects in frame). For details on every extracted field per document type, see Supported Documents.
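A client will usually want to separate failed extractions from usable ones before reading any fields. The sketch below shows one possible approach in Python. It assumes that failed documents appear as entries in the same ocr_data array and that an empty error_code means success, as the samples suggest; the function name partition_results is hypothetical.

```python
def partition_results(response: dict):
    """Split ocr_data entries into successful ones and (id, code, reason) failures."""
    ok, failed = [], []
    for entry in response.get("ocr_data", []):
        code = entry.get("error_code")
        if code:
            # Non-empty error_code: record the failure for retry or manual handling.
            failed.append((entry.get("document_id"), code, entry.get("error_reason", "")))
        else:
            ok.append(entry)
    return ok, failed


response = {
    "ocr_data": [
        {"document_id": "doc-001", "error_code": "", "error_reason": "",
         "data": {"document_type": "PAN"}},
        {"document_id": "doc-002", "error_code": "UNRECOGNISED",
         "error_reason": "document not in list", "data": {}},
    ]
}
ok, failed = partition_results(response)
print([e["document_id"] for e in ok])
print(failed)
```

Since data may be partially populated on failure, this sketch keys on error_code alone and does not inspect data to decide whether an extraction succeeded.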