78
Python Developer Needed for Invoice PDF to Excel Automation
UpworkIENot specifiedintermediate
PythonMicrosoft ExcelAutomationData ScrapingAPI
Title: Python Tool to Extract Hours from Invoice PDFs and Update Excel
Project Description
I am looking for a developer to build a small automation tool that reads multiple invoice PDFs and extracts specific data into an Excel spreadsheet.
All invoices have the same format and layout, so the extraction logic can be consistent.
Data to Extract from Each PDF
From a table in the invoice the tool should extract:
CW Name (employee name) WE Date (Week Ending date) Unit (hours worked)
In some cases there may be multiple rows for the same employee and week, so the tool must sum the hours for the same week.
Excel Output
The extracted data should be written into one existing Excel file with the following structure:
Column A: Employee Name Following columns: Week Ending dates Cells: Total hours worked for that employee in that week
Example:
Name 01-Feb-26 08-Feb-26 15-Feb-26 Megan Redmond 41.5 25 63.75
Important Requirement
The tool must update the same Excel file every month rather than creating a new file.
Each time I run the program:
I place new invoice PDFs into a folder. The program reads all new PDFs. It adds the new weeks and hours into the existing Excel file.
It should not overwrite existing data.
Preferred Features
Python-based solution Simple user interface (select PDF folder and Excel file) Ability to process multiple PDFs at once Error logging if a PDF cannot be read
Files Provided
I will provide:
Sample invoice PDF Example Excel template showing the expected format
Goal
A simple tool that allows me to:
Put invoice PDFs in a folder Run the program Automatically update the Excel timesheet
Budget: Open to proposals
Timeline: Flexible (small project)
Unlock AI intelligence, score breakdowns, and real-time alerts
Upgrade to Pro — $29.99/mo