MS

AI Platform Engineering Specialist

Morgan Stanley

4 months ago

5 - 7 years

Work From Office

Bengaluru, Karnataka, India

  • Develop Terraform modules and Cloud architecture to enable secure AI cloud service deployment and consumption at scale.
  • Have a platform mindset and build common, reusable solutions to scale Generative AI use cases using pre-trained models as well as fine-tuned models.
  • Leverage Kubernetes/OpenShift to develop modern containerized workloads.
  • PYTHON

    Sql

    NoSQL

    Big data

    Data Governance

    RESTful API design

    Azure

    Aws

    Google Cloud Platform (GCP)

    ci/cd

    Devops

    Agile Methodologies

    Job description & requirements

    What you’ll do in the role:


    1. Develop tooling and self-service capabilities for deploying AI solutions for the firm leveraging Kubernetes/OpenShift, Python, authentication solutions, APIs, REST framework, etc.
    2. Develop Terraform modules and Cloud architecture to enable secure AI cloud service deployment and consumption at scale.
    3. Have a platform mindset and build common, reusable solutions to scale Generative AI use cases using pre-trained models as well as fine-tuned models.
    4. Leverage Kubernetes/OpenShift to develop modern containerized workloads.
    5. Integrate with capabilities such as large-scale vector stores for embeddings.
    6. Author best practices on the Generative AI ecosystem, when to use which tools, available models such as GPT, Llama, Hugging Face etc. and libraries such as Langchain.
    7. Analyze, investigate, and implement GenAI solutions focusing on Agentic Orchestration and Agent Builder frameworks.
    8. Author and publish architecture decision records to capture major design decisions and product selection for building Generative AI solutions. Inclusive of app authentication, service communication, state externalization, container layering strategy and immutability.
    9. Ensure AI platform are reliable, scalable, and operational; (e.g. blueprints for upgrade/release strategies (E.g. Blue/Green); logging/monitoring/metrics; automation of system management tasks)
    10. Participate in all team’s Agile/ Scrum ceremonies.
    11. Participate in team’s on call rotation in build/run team model


    What you’ll bring to the role:

    1. At least 4 years’ relevant experience would generally be expected to find the skills required for this role
    2. Bachelor’s or Master’s degree in Computer Science or related field, or equivalent job experience
    3. 4 years of experience in software engineering, design and development
    4. Strong hands-on Application Development background in at least one prominent programming language, preferably Python Flask or FAST Api.
    5. Broad understanding of data engineering (SQL, NoSQL, Big Data, Kafka, Redis), data governance, data privacy and security.
    6. Experience in development, management, and deployment of Kubernetes workloads, preferably on OpenShift.
    7. Experience with designing, developing, and managing RESTful services for large-scale enterprise solutions.
    8. Experience deploying applications on Azure, AWS, and/or GCP using IaC (Terraform)
    9. Hands-on experience with multiprocessing, multithreading, asynchronous I/O, performance profiling in at least one prominent programming language, preferably python.
    10. Ability to articulate technical concepts effectively to diverse audiences.
    11. Excellent communication skills.
    12. Demonstrated ability to work effectively and collaboratively in a global organization, across time zones, and across organizations
    13. Demonstrated experience in DevOps, understanding of CI/CD (Jenkins) and GitOps.
    14. Knowledge of DevOps and Agile practices.
    15. Nice to have
    16. Practitioner of unit testing, performance testing and BDD/acceptance testing.
    17. Understanding of OAuth 2.0 protocol for secure authorization.
    18. Proficiency with Open Telemetry tools including Grafana, Loki, Prometheus, and Cortex.
    19. Good knowledge of Microservice based architecture, industry standards, for both public and private cloud.
    20. Good understanding of modern Application configuration techniques.
    21. Hands on experience with Cloud Application Deployment patterns like Blue/Green.
    22. Good understanding of State sharing between scalable cloud components (Kafka, dynamic distributed caching).
    23. Good knowledge of various DB engines (SQL, Redis, Kafka, etc) for cloud app storage.
    24. Experience building AI applications, preferably Generative AI and LLM based apps.
    25. Deep understanding of AI agents, Agentic Orchestration, Multi-Agent Workflow Automation, along with hands-on experience in Agent Builder frameworks such Lang Chain and Lang Graph.
    26. Experience working with Generative AI development, embeddings, fine tuning of Generative AI models.
    27. Understanding of ModelOps/ ML Ops/ LLM Op.
    28. Understanding of SRE techniques.



    Experience :

    5 - 7 years

    Job Domain/Function :

    AI Engineering

    Job Type :

    Work From Office

    Employment Type :

    Full Time

    Number Of Position(s) :

    1

    Educational Qualifications :

    Bachelor's Degree

    Location 1 :

    Bengaluru, Karnataka, India, Bengaluru, Karnataka, India

    Location 2 :

    Mumbai, Maharashtra, India, Maharashtra, India

    Create alert for similar jobs

    MS

    Morgan Stanley

    Similar Jobs