This is a Lead AI Application Engineer role in which you build and maintain a shared AI Platform that supports the full ML lifecycle across cloud and on-premises environments.
Core responsibilities:
- Architect and maintain a multi-tenant AI Platform with a focus on high availability, low latency, and cost-efficiency
- Implement LLMOps/MLOps best practices with automated deployment pipelines
- Develop an AI Services Catalogue (Inference-as-a-Service, Embeddings-as-a-Service, RAG-as-a-Service)
- Manage AI Data Infrastructure, including Vector Databases and Feature Stores
- Orchestrate Model Hosting environments using Kubernetes and GPU resources
- Create a Developer Self-Service Portal/CLI for product squads
- Run internal workshops and maintain documentation to empower teams