PDF Data Extraction Specialist – Magazine Archive Project
UpworkCANot specifiedintermediateScore: 45
PDF ConversionData ExtractionPDF
We are looking for a technical freelancer to help convert magazine PDFs into a structured, searchable database.
We have 100+ magazine issues in PDF format. We need someone who can:
• Extract all text content (articles, titles, authors, page numbers)
• Extract images and link them to the correct articles
• Capture issue information (issue number, date, categories, etc.)
• Organize everything into a clean, structured database (CSV, SQL, Airtable, etc.)
• Create a repeatable process we can scale across all issues
This is not manual copy-and-paste. We are looking for someone experienced in PDF parsing, OCR, and database structuring.
Requirements:
• Experience with PDF extraction tools (Python, OCR, etc.)
• Experience structuring data into databases
• Ability to handle large document sets
Project Plan:
Start with a small pilot (3–5 issues).
If successful, expand to the full archive.
To Apply:
Please include:
• Examples of similar work
• Tools you would use
• Timeline for the pilot
• Estimated cost
Unlock AI Intelligence, score breakdowns, and real-time alerts
Upgrade to Pro — $29.99/moClient
Spent: $1,265Rating: 5.0Verified