Are you passionate about building secure, scalable cloud infrastructure and supporting cutting-edge AI workloads? We’re looking for a Cloud Engineer & Infrastructure Security professional to architect, deploy, and secure hybrid cloud and on-prem infrastructure supporting mission-critical applications and LLM environments.
This role combines cloud engineering, DevSecOps, infrastructure security, Kubernetes administration, and observability to ensure reliable, secure, and high-performing systems.
Key Responsibilities
Infrastructure & Cloud Management
- Deploy and manage Kubernetes clusters across cloud and on-prem environments
- Build infrastructure using Terraform and Helm
- Design and secure cloud networking, including VPCs, VPNs, and firewalls
Infrastructure Security
- Implement Zero Trust security models
- Manage IAM, least-privilege access controls, and secrets management
- Enforce security policies and micro-segmentation
DevSecOps & CI/CD
- Integrate security scanning and compliance checks into CI/CD pipelines
- Implement SBOM generation and policy-as-code frameworks
- Automate security validation throughout the deployment lifecycle
LLM & Hybrid Infrastructure
- Build and maintain infrastructure for AI and LLM workloads using technologies such as vLLM and KServe
- Support secure hybrid cloud and on-prem deployments
Monitoring & Observability
- Implement monitoring and alerting solutions using Grafana, Prometheus, and Azure Monitor
- Maintain dashboards, SLIs, SLOs, and performance metrics
Requirements
- Strong Kubernetes administration experience
- Expertise with Terraform and Helm
- Deep understanding of infrastructure security and Zero Trust principles
- Experience with DevSecOps practices and CI/CD security
- Knowledge of cloud networking, VPNs, and firewalls
- Linux administration, scripting, and automation skills
- Experience with monitoring and observability platforms
- Familiarity with AI/LLM infrastructure environments
Nice to Have
- CKA, CKAD, Terraform Associate, CISSP, or similar certifications
- GitOps experience (Argo CD, Flux)
- Knowledge of OPA, Gatekeeper, Kyverno, SPIFFE, or SPIRE
- GPU infrastructure experience for AI workloads
- SRE and incident response experience
- Open-source contributions in cloud or AI infrastructure projects
Benefits
- Competitive salary and performance bonuses
- Fully remote work environment
- Company-provided laptop and hardware
- AI and automation training
- Global startup exposure
- Continuous learning and career growth opportunities
Location: South Africa (Remote)
Department: Technical Assistants
