Job Description
Voice Recognition Engineer(Bangalore, Hyderabad, Mumbai, Pune, Chennai)
Position Description
Company Profile:
At CGI, were a team of builders. We call our employees members because all who join CGI are building their own companyone that has grown to thousands of professionals located in countries around the world. Founded in 1976, CGI is a leading IT and business process services firm committed to helping clients succeed. We have the global resources, expertise, stability, and dedicated professionals needed to achieve results for our clientsand for our members.
Role Summary
We are seeking a highly skilled Voice Recognition Engineer with 57 years of experience in speech recognition technologies, AI/ML model optimization, and web-based voice applications. The ideal candidate will be responsible for developing, enhancing, and maintaining robust speech recognition systems, ensuring high accuracy, low latency, and seamless cross-browser compatibility for production-grade applications.
Your Future Duties and Responsibilities
- Develop and maintain browser-based speech recognition solutions using technologies such as Web Speech API.
- Design, optimize, and fine-tune speech recognition models to improve transcription accuracy and performance.
- Ensure compatibility and consistent behavior across major browsers including Chrome, Safari, Edge, and Firefox.
- Troubleshoot and resolve cross-browser voice recognition issues in production environments.
- Build scalable backend services using Python and FastAPI to support speech processing workflows.
- Implement and optimize AI/ML models using PyTorch and related frameworks.
- Monitor application performance and drive improvements in latency, reliability, and user experience.
- Collaborate with cross-functional teams, architects, and clients for deployment, testing, and continuous enhancements.
- Conduct validation, testing, and maintenance activities as browser technologies evolve.
- Support production environments and participate in issue resolution and performance tuning.
Required Qualifications to be Successful in This Role
- 57 years of experience in Voice Recognition, Speech Processing, or AI/ML Engineering.
- Strong expertise in Web Speech API and browser-based speech recognition technologies.
- Hands-on experience with Python development.
- Strong experience in FastAPI and PyTorch.
- Experience in speech recognition model training, fine-tuning, and optimization.
- Solid understanding of cross-browser compatibility challenges and solutions.
- Experience supporting production-grade web applications.
- Knowledge of REST APIs and microservices architecture.
- Strong debugging, analytical, and problem-solving skills.
- Excellent communication and stakeholder management skills.
- B.E./B.Tech/MCA or equivalent qualification.
Preferred Skills
- Experience with Hugging Face speech models.
- Knowledge of CUDA-based model acceleration and optimization.
- Familiarity with Piper and speech processing frameworks.
- Experience with real-time voice processing applications.
- Knowledge of C++ for performance optimization.
- Exposure to cloud-based AI/ML deployment environments.
Technologies Required
- Python
- FastAPI
- PyTorch
- Web Speech API
- Speech Recognition Systems
- Hugging Face
- CUDA
- Piper
- REST APIs
- C++
- Real-Time Voice Processing
Job Classification
Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: Software Development
Role: Software Development - Other
Employement Type: Full time
Contact Details:
Company: CGI
Location(s): Pune
Keyskills:
Speech Recognition
Voice Recognition Systems
Web Speech API
FastAPI
Python
Real-Time Voice Processing
AI/ML Model Optimization
Browser-Based Voice Technologies
Speech Processing
PyTorch
Model Fine-Tuning
Production Support
Cross-Browser Compatibility
REST APIs