Job Description
Key Responsibilities:
- Develop and implement machine learning models, offline batch models as well as real time online and edge compute models
- Analyze complex datasets and extract meaningful insights to drive business decisions
- Collaborate with cross-functional teams to identify and solve business problems using data-driven approaches
- Communicate findings and recommendations to stakeholders effectively
Required Qualifications:
- Bachelor's or Master's degree in Computer Science, Data Science, Statistics, or a related field
- 3+ years of experience in a Data Scientist role
- Strong proficiency in Python and SQL
- Solid understanding of machine learning algorithms and statistical modeling techniques
- Knowledge of Natural Language Processing (NLP) and Computer Vision (CV) concepts and algorithms
- Hands-on experience implementing and deploying machine learning algorithms
- Experience with data visualization tools and techniques
- Strong analytical and problem-solving skills
- Excellent communication skills, both written and verbal
Preferred Qualifications:
- Experience with PySpark and other big data processing frameworks
- Knowledge of deep learning frameworks (e.g., TensorFlow, PyTorch)
Technical Skills:
- Programming Languages: Python (required), SQL (required), Java (basic knowledge preferred)
- Machine Learning: Strong foundation in traditional ML algorithms, and a working knowledge of NLP and Computer Vision
- Big Data: Deep knowledge of PySpark
- Data Storage and Retrieval: Familiarity with databases/mlflow preferred
- Mathematics: Strong background in statistics, linear algebra, and probability theory
- Version Control: Git
Soft Skills:
- Excellent communication skills to facilitate interactions with stakeholders
- Ability to explain complex technical concepts to non-technical audiences
- Strong problem-solving and analytical thinking
- Self-motivated and able to work independently as well as in a team environment
- Curiosity and eagerness to learn new technologies and methodologies