62

Pyhton Supplier Data Matching & Product Identification System

UpworkNLNot specifiedintermediate
API IntegrationData ExtractionPythonData AnalysisMicrosoft Azure SQL Database
We are looking for an experienced backend developer to design and build a scalable, data driven product matching system for our e-commerce operations. The goal is to create a system that automatically processes supplier product lists in different formats and matches them to our internal product catalog with high accuracy. This system must be configurable, maintainable, and future-proof, so that new suppliers and new product categories can be added without changing core code. Project Overview We work with many suppliers who provide product lists in (Excel/CSV). These lists need to be: - Matched to our internal catalog - Stored with full traceability - Prepared for internal pricing and stock updates You will build a backend system that handles this end-to-end process. Key Responsibilities - Design and develop a Python based matching engine - Build a flexible supplier ingestion system driven by configuration (not hardcoded rules) - Integrate with Python back end of our ingestion web application - Implement intelligent matching logic structured title parsing - Build category-based matching profiles (phones, watches, accessories, etc.) - Implement controlled AI usage for specific cases (e.g., color interpretation) - Ensure full logging, confidence scoring, and explainability of matches - Create a system that supports manual overrides and future improvements Core Requirements 1. Universal & Configurable Architecture The system must be designed so that: - New suppliers can be added via configuration - Column mappings can be adjusted without code changes - New product categories (e.g. toothbrushes, microwaves, or other appliances) can be added through configuration and database profiles - Core matching logic remains unchanged - Adding new products or categories must be mainly a configuration task, not a development task. 2. Supplier File Processing - Support Excel and CSV files - Store supplier configurations in the database - Handle frequent format changes without redeployment 3. Matching Engine - Use structured parsing of titles and attributes - Extract features such as brand, model, storage, size, color, etc. - Score multiple candidates and select the best match - Provide confidence scores and explanations 4. Database & Infrastructure - Azure SQL Server - Central storage for: - Product catalog - Supplier configurations - Matching profiles - Processing runs - Results and audit logs Technical Stack (Preferred) - Backend: Python - Database: Azure SQL Server - APIs: REST-based architecture - AI: Azure OpenAI (limited, controlled usage) - File Processing: Pandas / OpenPyXL / similar Required Skills - Python backend development - Experience with data processing pipelines - SQL (preferably SQL Server / Azure SQL) - Experience with system architecture and scalability - API design - Data normalization and matching logic Nice to Have - Experience with AI-assisted workflows - Knowledge of product catalog systems - Experience building configurable tools What We Value Most We are not looking for a quick script. We are looking for a developer who can: - Think in systems - Design for long-term scalability - Build maintainable architecture - Document decisions - Deliver production-ready software This project is intended to be a long-term foundation for our operations. Project Scope - Initial system design - Core backend development - Database schema design - Matching engine implementation - Configuration framework - Documentation - Support during testing phase Potential long-term collaboration for maintenance and extensions. Additional Information - The ZIP file includes documentation and explanations of our current database structure and data model. Access to live environments and production systems will be provided in a later project phase, once the collaboration is established. - The ZIP file also contains an early prototype of the matching engine. This prototype is incomplete and not production-ready. It was created during an exploratory phase to better understand the problem and define the required scope. It is provided for context only and should not be considered a technical foundation for the final solution.
View Original Listing
Unlock AI intelligence, score breakdowns, and real-time alerts
Upgrade to Pro — $29.99/mo

Client

Spent: $3,901.02Rating: 5.0Verified