I have PDF files from different hospitals for different patients. I want to extract data from these files, whether it appears in tables or in regular text. I have attached a sample PDF file here. The files will be multipage PDFs stored in an AWS S3 bucket; I have already done that part. I need the data extracted from the PDFs in a proper, usable format using Node.js and AWS Textract. For basic details and functionality, please go to the link below.
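As a rough illustration of what a "usable format" could look like, here is a minimal Node.js sketch that turns the `Blocks` array of a Textract forms analysis into a plain key/value object. It assumes the blocks have already been fetched; for multipage PDFs in S3 that would typically mean the asynchronous `StartDocumentAnalysis` and `GetDocumentAnalysis` operations with the `FORMS` feature. The function names `blockText` and `extractKeyValues` and the sample data are hypothetical, not part of any library, and tables would need separate handling.

```javascript
// Hypothetical helper: join the text of a block's CHILD blocks (words or checkboxes).
function blockText(block, blocksById) {
  const parts = [];
  for (const rel of block.Relationships || []) {
    if (rel.Type !== "CHILD") continue;
    for (const id of rel.Ids) {
      const child = blocksById.get(id);
      if (!child) continue;
      if (child.BlockType === "WORD") parts.push(child.Text);
      else if (child.BlockType === "SELECTION_ELEMENT") parts.push(child.SelectionStatus);
    }
  }
  return parts.join(" ");
}

// Hypothetical helper: map every KEY block to the text of its linked VALUE block(s).
function extractKeyValues(blocks) {
  const blocksById = new Map(blocks.map((b) => [b.Id, b]));
  const result = {};
  for (const block of blocks) {
    const isKey =
      block.BlockType === "KEY_VALUE_SET" &&
      (block.EntityTypes || []).includes("KEY");
    if (!isKey) continue;
    const key = blockText(block, blocksById);
    const value = (block.Relationships || [])
      .filter((rel) => rel.Type === "VALUE")
      .flatMap((rel) => rel.Ids)
      .map((id) => blocksById.get(id))
      .filter(Boolean)
      .map((valueBlock) => blockText(valueBlock, blocksById))
      .join(" ");
    if (key) result[key] = value;
  }
  return result;
}

// Made-up sample in the shape of a Textract response, for illustration only.
const sampleBlocks = [
  { Id: "k1", BlockType: "KEY_VALUE_SET", EntityTypes: ["KEY"],
    Relationships: [{ Type: "VALUE", Ids: ["v1"] }, { Type: "CHILD", Ids: ["w1", "w2"] }] },
  { Id: "v1", BlockType: "KEY_VALUE_SET", EntityTypes: ["VALUE"],
    Relationships: [{ Type: "CHILD", Ids: ["w3", "w4"] }] },
  { Id: "w1", BlockType: "WORD", Text: "Patient" },
  { Id: "w2", BlockType: "WORD", Text: "Name:" },
  { Id: "w3", BlockType: "WORD", Text: "Jane" },
  { Id: "w4", BlockType: "WORD", Text: "Doe" },
];

// Logs an object that maps "Patient Name:" to "Jane Doe".
console.log(extractKeyValues(sampleBlocks));
```

Keeping the parsing separate from the AWS call means the same function can be run against saved responses while the extraction rules for each hospital's layout are being tuned.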
Hello, We have more than 2 million invoices for which we need to structure the data with the following information: 1- Supplier name 2- Invoice number 3- Date of invoice 4- Amount without taxes 5- Total amount. I would like to develop a system with Google Vision or AWS Textract to automate the collection of the information listed above from past invoices and also from future invoices. Some invoices are in PDF format and some are scanned. They come from several hundred different suppliers, but we need the same information from all of them. Are you able to do this kind of work?
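One possible starting point, if AWS Textract is chosen, is its `AnalyzeExpense` operation, which returns normalized summary fields per invoice. The sketch below assumes one entry of `ExpenseDocuments` from such a response is already in hand and maps it to the five requested values. The name `summarizeInvoice`, the output property names, the 80% confidence cutoff, and the sample data are all hypothetical; treating `SUBTOTAL` as "amount without taxes" is also an assumption that would need checking against real invoices.

```javascript
// Assumed mapping from Textract expense field types to the five requested values.
const FIELD_MAP = {
  VENDOR_NAME: "supplierName",
  INVOICE_RECEIPT_ID: "invoiceNumber",
  INVOICE_RECEIPT_DATE: "invoiceDate",
  SUBTOTAL: "amountWithoutTaxes", // assumption: subtotal is the pre-tax amount
  TOTAL: "totalAmount",
};

// Hypothetical helper: keep the most confident detection for each requested field.
function summarizeInvoice(expenseDocument, minConfidence = 80) {
  const invoice = {
    supplierName: null,
    invoiceNumber: null,
    invoiceDate: null,
    amountWithoutTaxes: null,
    totalAmount: null,
  };
  const bestConfidence = {};
  for (const field of expenseDocument.SummaryFields || []) {
    const target = FIELD_MAP[field.Type && field.Type.Text];
    const detection = field.ValueDetection;
    if (!target || !detection || !detection.Text) continue;
    const confidence = detection.Confidence ?? 0;
    if (confidence < minConfidence) continue; // leave the field null for manual review
    if (bestConfidence[target] === undefined || confidence > bestConfidence[target]) {
      bestConfidence[target] = confidence;
      invoice[target] = detection.Text.trim();
    }
  }
  return invoice;
}

// Made-up sample in the shape of one ExpenseDocuments entry, for illustration only.
const sampleExpenseDocument = {
  SummaryFields: [
    { Type: { Text: "VENDOR_NAME" }, ValueDetection: { Text: "Acme Supplies", Confidence: 99 } },
    { Type: { Text: "INVOICE_RECEIPT_ID" }, ValueDetection: { Text: "INV-001", Confidence: 98 } },
    { Type: { Text: "INVOICE_RECEIPT_DATE" }, ValueDetection: { Text: "2023-01-15", Confidence: 97 } },
    { Type: { Text: "SUBTOTAL" }, ValueDetection: { Text: "100.00", Confidence: 95 } },
    { Type: { Text: "TOTAL" }, ValueDetection: { Text: "120.00", Confidence: 96 } },
  ],
};

console.log(summarizeInvoice(sampleExpenseDocument));
```

Fields that fall below the cutoff stay `null`, which gives a simple way to route uncertain invoices to manual review, something likely to matter with several hundred supplier layouts.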