Key Responsibilities
Strategic Leadership
Define and own the end-to-end DevOps vision, strategy, and roadmap for e-commerce and AI-powered commerce solutions.
Partner with engineering, QA, and product leaders to align DevOps goals with organizational objectives.
Champion automation-first, GitOps, and continuous delivery principles.
Anticipate infrastructure and deployment needs of AI/LLM workloads (e.g., GPU scheduling, inference scaling, model versioning, RAG pipelines).
Infrastructure & Automation
Design, implement, and scale CI/CD pipelines (GitLab CI/GitHub Actions) for microservices, APIs, and AI components.
Drive infrastructure-as-code adoption (Terraform, Helm, Ansible) across multi-cloud environments.
Oversee container orchestration (Kubernetes, OpenShift) and optimize clusters for both stateless commerce services and GPU-intensive AI workloads.
Establish automated monitoring, alerting, and self-healing systems to ensure high availability and low-latency experiences.
AI/ML Platform Integration
Collaborate with AI/ML engineers to enable model deployment and serving pipelines.
Define best practices for model registry, versioning, and rollbacks.
Implement observability for AI-specific metrics (latency, hallucination/error rates, cost tracking).
Support safe experimentation with multi-agent systems, ensuring infrastructure resilience and guardrails.
Security, Compliance & Governance
Implement secrets management and enforce DevSecOps practices.
Ensure compliance with SecNum, PCI DSS, GDPR, and data governance for both customer data and AI training data.
Proactively manage risks related to AI integrations (e.g., prompt injection, data leakage, inference abuse).
People & Stakeholder Management
Mentor and guide a distributed DevOps/SRE team, fostering a high-performance, automation-driven culture.
Partner with engineering, QA, product, and AI platform teams to ensure smooth end-to-end delivery.
Present operational health, risk assessments, and platform readiness to senior leadership and C-level stakeholders.
Continuous Improvement
Drive optimization of cost, performance, and developer productivity across the delivery lifecycle.
Stay current with trends in DevOps, AI infrastructure, and cloud-native commerce architectures.
Explore the role of generative AI in DevOps (e.g., intelligent runbooks, automated incident triage, anomaly detection).
Requirements
Bachelor’s/Master’s degree in Computer Science, Engineering, or related field.
14+ years of progressive experience in DevOps/SRE/Platform Engineering, with at least 3 years in a leadership capacity.
Proven track record supporting e-commerce, marketplace, or large-scale digital platforms.
Proven expertise with Azure cloud services:
Compute: AKS, Azure App Service, Azure Container Apps
Data/AI: Azure ML, Azure OpenAI, Cognitive Search, Azure Database (Postgres/pgvector)
Infra: Azure DevOps, Terraform/Bicep, Azure Policy, Key Vault, Monitor, Log Analytic
Deep expertise in CI/CD, Kubernetes, containers, Terraform, and cloud-native infrastructure.
Strong knowledge of monitoring and observability (Prometheus, Grafana, ELK/EFK, OpenTelemetry, etc).
Experience supporting AI/ML deployment pipelines (model serving, GPUs, inference scaling).
Familiarity with event-driven and microservices architectures.
Excellent analytical, communication, and stakeholder management skills.
Preferred Qualifications
Certifications: Microsoft Certified: Azure DevOps Engineer Expert, Microsoft Certified: Azure Solutions Architect Expert, AWS DevOps Professional, CKA/CKAD, or equivalent.
Hands-on with Azure MLOps stack (Azure Machine Learning pipelines, Azure ML Registry, Azure Cognitive Search with vector support, Azure OpenAI) as well as broader MLOps tools (MLflow, Kubeflow, Arize, LangSmith) and vector databases (Pinecone, Weaviate, Milvus, pgvector).
Exposure to AI-powered DevOps practices (e.g., predictive monitoring, AIOps).
Experience in start-up or high-growth environments with global distributed teams.
Why Join Us?
Own the DevOps vision for a next-gen digital commerce ecosystem impacting millions of users.
Work at the intersection of e-commerce and AI, designing platforms that support both traditional and next-gen AI-driven capabilities.
Collaborate with world-class engineers, QA leaders, and AI specialists.
Competitive salary, benefits, and opportunities for career advancement.
Play a critical role in shaping how AI-ready, secure, and scalable DevOps is done at enterprise scale.