CA

AI Data Engineer

Capgemini

8 months ago

7 - 10 years

Work From Office

Gurgaon, Haryana, Haryana, India

  • Design, develop, and maintain scalable data pipelines to support ML models.
  • Conduct exploratory data analysis (EDA) to gather insights and identify data patterns.
  • Create and manage datasets comprising text, image, audio, and video data for various ML applications.
  • Data pipelines

    Data Preprocessing

    PYTHON

    JAVA

    Scala

    Apache SPARK

    Hadoop

    SQL

    NoSQL

    Data Visualization

    EDA

    Machine Learning

    Artificial Intelligence (AI)

    Job description & requirements

    Your Role


    • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.
    • 5+ years of experience in data engineering, with a focus on building data pipelines and preprocessing data.
    • Strong proficiency in programming languages such as Python, Java, or Scala.
    • Hands-on experience with data processing frameworks and tools like Apache Spark, Hadoop, or similar.
    • Proficiency in SQL and experience with relational and NoSQL databases.
    • Experience with data visualization and EDA tools such as Pandas, Matplotlib, or Tableau.
    • Familiarity with ML and AI concepts, particularly in relation to data preparation and pipelines.
    • Experience with text, image, audio, and video data management, including labeling and cleansing.
    • Exposure to EdgeAI applications and their unique data processing requirements (preferred).
    • Strong problem-solving skills and the ability to work independently and collaboratively.

     

    Your Profile


    • Design, develop, and maintain scalable data pipelines to support ML models.
    • Perform data preprocessing, cleansing, and labeling to ensure high-quality data inputs for ML applications.
    • Conduct exploratory data analysis (EDA) to gather insights and identify data patterns.
    • Collaborate with data scientists and ML engineers to align data pipeline requirements with model development needs.
    • Create and manage datasets comprising text, image, audio, and video data for various ML applications.
    • Implement best practices for data management, ensuring data integrity, consistency, and security.
    • Optimize data workflows and processing pipelines for efficiency and performance.
    • Utilize cloud-based data storage and processing solutions as needed.
    • Stay current with industry trends and technologies to continuously improve data engineering processes.
    • Provide technical support and guidance to junior data engineers and other team members.

    Experience :

    7 - 10 years

    Job Domain/Function :

    Data Engineering

    Job Type :

    Work From Office

    Employment Type :

    Full Time

    Number Of Position(s) :

    1

    Educational Qualifications :

    Bachelor's Degree

    Location :

    Gurgaon, Haryana, India, Gurgaon, Haryana, India

    Create alert for similar jobs

    CA

    Capgemini

    Similar Jobs