Skip to content
This repository has been archived by the owner on Aug 28, 2023. It is now read-only.

A centralized repository for AI Platform notebooks using the Google Cloud Document AI API.

License

Notifications You must be signed in to change notification settings

GoogleCloudPlatform/documentai-notebooks

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

63 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Document AI Notebooks

This repository contains several Jupyter notebooks to be used with the Cloud Document AI Platform. Use the general notebooks to process any form type or the specialized notebooks for any of the solutions such as Procurement DocAI or Lending DocAI. These notebooks help you get started with extracting data from your documents whether you're bring your own form types or using one of our specialized parsers for invoices, receipts, tax forms and more.

gif

Prerequisites

You must have your own GCP project with billing enabled and have working knowledge of the following products:

Set Up Steps

  1. Set up your GCP project for Document AI following the Setup Guide.
  2. Enable the 'Document AI API' in your project in the Document AI Platform.
  3. Create or use an existing instance of AI Platform Notebook with Python 3 using the default configurations.
  4. In the notebook, go to Git > Clone a Repository and paste the repository URL.
  5. Install the required libraries in the notebook terminal python -m pip install -r requirements.txt

Please note Colab and Jupyter notebooks are also work with these samples. However, additional authentication will be required for service accounts.

Instructions

  1. Identify which form type or utility you would like to run through a processor.
  2. Create your processor using the instructions.
  3. Copy your processor id. processorId
  4. Update the PROCESSOR_ID, PROJECT_ID and REGION variables in the notebook.
PROJECT_ID = "YOUR_PROJECT_ID_HERE"
LOCATION = "LOCATION"  # Format is 'us' or 'eu'
PROCESSOR_ID = "PROCESSOR_ID"  # Create processor in Cloud Console

Please note, the location must match the one assigned to the processor.

  1. Run the notebook.

About

A centralized repository for AI Platform notebooks using the Google Cloud Document AI API.

Topics

Resources

License

Stars

Watchers

Forks