Who you are
BE/M.Sc./PhD in Computer Science or related field and 8+ years of work experience in a relevant capacity
Experience in working with cloud environments such as, Hadoop, AWS, GCP, and Azure.
Experience with enforcing security controls and best practices to protect sensitive data within AWS data pipelines, including encryption, access controls, and auditing mechanisms.
Agile mindset, a spirit of initiative, and desire to work hands-on together with your team
Interest in solving challenging technical problems and developing the future data architecture that will enable the implementation of innovative data analytics use-cases
Experience in leading small to medium-sized teams
Experience in creating architectures for ETL processes for batch as well as streaming ingestion
Knowledge of designing and validating software stacks for GxP relevant contexts as well as working with PII data
Familiarity with the data domains covering the Pharma value-chain (e.g. research, clinical, regulatory, manufacturing, supply chain, and commercial)
Strong, hands-on experience in working with Python, Pyspark & R codebases, proficiency in additional programming languages (e.g. C/C++, Rust, Typescript, Java, …) is expected
Knowledge of database technologies for OLTP and OLAP workloads and a firm grasp of SQL
Experience working with Apache Spark and the Hadoop ecosystem
Working with heterogenous compute environments and multi-platform setups
Experience in working with cloud environments such as, Hadoop, AWS, GCP, and Azure
Basic knowledge of Statistics and Machine Learning algorithms is favorable
This is the respective role description:
The ability to easily find, access, and analyze data across an organization is key for every modern business to be able to efficiently make decisions, optimize processes, and to create new business models. The Data Architect plays a key role in unlocking this potential by defining and implementing a harmonized data architecture for Healthcare. Together with a team of Data Engineers, the Data Architect is responsible for the delivery and maintenance of reliable data pipelines for the ingestion of data from a heterogenous ecosystem as well as the creation of specialized, cloud-based data processing and analytics stacks. He/she will work hand in hand with our Data Governance team to ensure that all data assets are covered by our governance process. During the process the Data Architect will collaborate closely with our business, corporate functions, as well as out IT organization, in order to gather requirements and shape a global framework of engineering best practices.