78

Python Developer Needed for Invoice PDF to Excel Automation

UpworkIENot specifiedintermediate
PythonMicrosoft ExcelAutomationData ScrapingAPI
Title: Python Tool to Extract Hours from Invoice PDFs and Update Excel Project Description I am looking for a developer to build a small automation tool that reads multiple invoice PDFs and extracts specific data into an Excel spreadsheet. All invoices have the same format and layout, so the extraction logic can be consistent. Data to Extract from Each PDF From a table in the invoice the tool should extract: CW Name (employee name) WE Date (Week Ending date) Unit (hours worked) In some cases there may be multiple rows for the same employee and week, so the tool must sum the hours for the same week. Excel Output The extracted data should be written into one existing Excel file with the following structure: Column A: Employee Name Following columns: Week Ending dates Cells: Total hours worked for that employee in that week Example: Name 01-Feb-26 08-Feb-26 15-Feb-26 Megan Redmond 41.5 25 63.75 Important Requirement The tool must update the same Excel file every month rather than creating a new file. Each time I run the program: I place new invoice PDFs into a folder. The program reads all new PDFs. It adds the new weeks and hours into the existing Excel file. It should not overwrite existing data. Preferred Features Python-based solution Simple user interface (select PDF folder and Excel file) Ability to process multiple PDFs at once Error logging if a PDF cannot be read Files Provided I will provide: Sample invoice PDF Example Excel template showing the expected format Goal A simple tool that allows me to: Put invoice PDFs in a folder Run the program Automatically update the Excel timesheet Budget: Open to proposals Timeline: Flexible (small project)
View Original Listing
Unlock AI intelligence, score breakdowns, and real-time alerts
Upgrade to Pro — $29.99/mo