# Platform Engineer – AI & Analytics Platform
## Role Overview
We are seeking an experienced Platform Engineer to join our AI and Analytics Platform team, which builds and maintains scalable, cloud-native infrastructure for advanced analytics, machine learning, and Agentic AI solutions.
The successful candidate will play a key role in designing, developing, and operating modern platform capabilities that enable data scientists, engineers, and AI practitioners to deliver enterprise-scale AI and analytics solutions.
## Key Responsibilities
### Platform Engineering
- Design, build, and maintain scalable platform infrastructure for AI, analytics, and data-driven applications
- Develop and operate cloud-native platform services supporting enterprise AI workloads
- Ensure platform reliability, scalability, security, and performance
- Implement platform automation and standardization across environments
### Cloud Infrastructure
- Build and manage solutions across AWS and Microsoft Azure
- Design resilient and highly available cloud architectures
- Optimize cloud resources for performance and cost efficiency
### Containerization & Orchestration
- Deploy and manage containerized workloads using Kubernetes and Docker
- Support Kubernetes cluster operations, upgrades, and troubleshooting
- Develop deployment strategies and platform automation for containerized applications
### AI & Agentic Platform Development
- Contribute to the development and evolution of modern AI and Agentic AI platforms
- Support infrastructure requirements for AI, machine learning, and LLM-based solutions
- Enable scalable deployment and operation of AI applications and services
- Collaborate with AI engineers, data scientists, and analytics teams
### Infrastructure as Code & Automation
- Implement and maintain Infrastructure as Code (IaC) solutions using Terraform
- Automate infrastructure provisioning, configuration, and deployment processes
- Develop reusable platform components and deployment templates
- Support CI/CD and DevOps best practices
### Observability & Monitoring
- Implement monitoring, logging, and observability solutions
- Work with frameworks such as OpenTelemetry and cloud-native monitoring solutions
- Develop dashboards, alerts, and operational insights
- Improve platform reliability through proactive monitoring and incident prevention
### Open Source & AI Frameworks
- Support and integrate modern open-source AI technologies
- Work with frameworks such as LangChain and LangGraph
- Evaluate and adopt emerging technologies that enhance platform capabilities
### Collaboration & Continuous Improvement
- Work closely with platform engineers, developers, data scientists, architects, and stakeholders
- Drive continuous improvements in platform usability, performance, and security
- Contribute to architecture discussions, technical documentation, and operational best practices
- Participate in Agile development processes and ceremonies