Accountabilities:
- Collaborate with the Commercial Multi-functional team to find opportunities for using internal and external data to enhance business solutions.
- Work closely with business and advanced data science teams on cross-functional projects, delivering complex data science solutions that contribute to the Commercial Organization.
- Manage platforms and processes for complex projects using a wide range of data engineering techniques in advanced analytics.
- Prioritize business and information needs with management; translate business logic into technical requirements, such as creating queries, stored procedures, and scripts.
- Interpret data, process it, analyze results, present findings, and provide ongoing reports.
- Develop and implement databases, data collection systems, data analytics, and strategies that optimize data efficiency and quality.
- Acquire data from primary or secondary sources and maintain databases/data systems.
- Identify and define new process improvement opportunities.
- Manage and support data solutions in BAU scenarios, including data profiling, designing data flow, creating business alerts for fields, and query optimization for ML models.
Essential Skills/Experience:
- BS/MS in a quantitative field (Computer Science, Data Science, Engineering, Information Systems, Economics)
- 5+ years of work experience with DB skills like Python, SQL, Snowflake, Amazon Redshift, MongoDB, Apache Spark, Apache Airflow, AWS cloud and Amazon S3 experience, Oracle, Teradata
- Good experience in Apache Spark or Talend Administration Center or AWS Lambda, MongoDB, Informatica, SQL Server Integration Services
- Experience in building ETL pipeline and data integration
- Build efficient Data Management (Extract, consolidate and store large datasets with improved data quality and consistency)
- Streamlined data transformation: Convert raw data into usable formats at scale, automate tasks, and apply business rules
- Good written and verbal skills to communicate complex methods and results to diverse audiences; willing to work in a cross-cultural environment
- Analytical mind with problem-solving inclination; proficiency in data manipulation, cleansing, and interpretation
- Experience in support and maintenance projects, including ticket handling and process improvement
- Setting up Workflow Orchestration (Schedule and manage data pipelines for smooth flow and automation)
- Importance of Scalability and Performance (handling large data volumes with optimized processing capabilities)
- Experience with Git
Desirable Skills/Experience:
- Knowledge of distributed computing and Big Data Technologies like Hive, Spark, Scala, HDFS; use these technologies along with statistical tools like Python/R
- Experience working with HTTP requests/responses and API REST services
- Familiarity with data visualization tools like Tableau, Qlik, Power BI, Excel charts/reports
- Working knowledge of Salesforce/Veeva CRM, Data governance, and Data mining algorithms
- Hands-on experience with EHR, administrative claims, and laboratory data (e.g., Prognos, IQVIA, Komodo, Symphony claims data)
- Good experience in consulting, healthcare, or biopharmaceuticals