AI Audio Monitoring & Daily Summary System
Upwork | AE | Not specified | expert | Score: 63
AI Development, AI Agent Development, AI App Development, Automatic Speech Recognition, Deep Learning, Edge AI, Artificial Intelligence, Machine Learning, Python
We are looking for an experienced AI developer to build a system that continuously listens to conversations inside a specific room and generates a structured summary at the end of each day.
This is not a basic transcription task. The system must:
* Record and process audio throughout the day
* Transcribe conversations accurately
* Identify and distinguish between different speakers
* Match voices to predefined names (speaker recognition)
* Generate a clean daily summary of what was discussed
* Optionally highlight key topics, decisions, conflicts, or repeated issues
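The voice-to-name matching step in the list above could be sketched roughly as follows. This is only an illustration under stated assumptions: each diarized segment is assumed to have already been converted into a fixed-length embedding vector by some speaker model (the posting does not name one), and `enrolled`, `identify_speaker`, and the `0.75` threshold are hypothetical names and values, not part of the requirements.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def identify_speaker(embedding, enrolled, threshold=0.75):
    """Return the enrolled name whose reference embedding is most similar.

    Falls back to "unknown" when no enrolled voice reaches the threshold.
    The threshold value is an assumption and would need tuning on real data.
    """
    best_name, best_score = "unknown", threshold
    for name, reference in enrolled.items():
        score = cosine_similarity(embedding, reference)
        if score >= best_score:
            best_name, best_score = name, score
    return best_name


# Hypothetical enrolled profiles: one reference embedding per known person.
enrolled = {
    "Alice": [0.9, 0.1, 0.0],
    "Bob": [0.1, 0.9, 0.2],
}

print(identify_speaker([0.85, 0.15, 0.05], enrolled))
print(identify_speaker([0.0, 0.0, 1.0], enrolled))
```

Returning "unknown" below the threshold, rather than always picking the nearest enrolled voice, keeps visitors or unenrolled speakers from being mislabeled in the daily summary.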
Core Requirements:
* Strong experience with speech-to-text systems
* Speaker diarization and voice identification
* Ability to train the system to recognize specific individuals by voice
* Automated daily report generation
* Privacy-aware architecture and secure storage
* Ability to deploy locally or on a secure server
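For the automated daily report requirement, one possible shape of the report-assembly step is sketched below. It assumes transcription, diarization, and speaker identification have already produced speaker-labeled segments; the `Segment` type and `build_daily_report` function are hypothetical, and any LLM-based summarization of topics or decisions is deliberately left out.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Segment:
    speaker: str   # name from speaker identification, or "unknown"
    start: float   # seconds from the start of the day's recording
    end: float
    text: str      # transcribed speech for this segment


def build_daily_report(date, segments):
    """Assemble a plain-text report: talk time per speaker, then the transcript."""
    talk_time = defaultdict(float)
    for seg in segments:
        talk_time[seg.speaker] += seg.end - seg.start

    lines = [f"Daily summary for {date}", "", "Talk time by speaker:"]
    # Longest talk time first.
    for speaker, seconds in sorted(talk_time.items(), key=lambda kv: -kv[1]):
        lines.append(f"* {speaker}: {seconds:.0f} s")

    lines += ["", "Transcript:"]
    for seg in sorted(segments, key=lambda s: s.start):
        lines.append(f"[{seg.start:.1f}s] {seg.speaker}: {seg.text}")
    return "\n".join(lines)


# Hypothetical example input.
segments = [
    Segment("Bob", 13.0, 20.0, "Agreed, let's ship on Friday."),
    Segment("Alice", 0.0, 13.0, "The release is ready for review."),
]
print(build_daily_report("2024-01-01", segments))
```

A scheduler such as cron could run this once per day over the stored segments; the resulting text could then be passed to a summarization model if key topics or decisions are wanted.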
Nice to Have:
* Experience with Whisper, Pyannote, or similar speech models
* Experience with LLM-based summarization
* Real-time processing capability
* Dashboard interface for reviewing transcripts and summaries
Deliverables:
* Fully working AI audio monitoring system
* Speaker recognition trained on specific individuals
* Automated end-of-day summary report
* Documentation for setup and scaling
Please include:
* Similar projects you’ve built
* What tech stack you would use
* Estimated timeline
* Fixed price
Client
Spent: $10,774.28 | Rating: 4.1 | Verified