Open to senior platform & SRE roles

I'm Faisal Bin Basha, AI Platform, DevSecOps and DevOps Engineer

Principal-level DevOps and AI Infrastructure Engineer designing, scaling, and securing mission-critical distributed systems across cloud and on-prem environments. I build resilient platforms on Kubernetes and AWS EKS, with deep expertise in observability, CI/CD automation, and DevSecOps.

Faisal Bin Basha · AI Platform Engineer
16+ Years in Tech
18 Certifications
8 Companies
4 Cloud Platforms

The toolkit behind resilient platforms

A deep technical stack assembled across 16+ years — spanning cloud architecture, container orchestration, observability, automation, security, and machine learning infrastructure.

Cloud & Infrastructure

Designing scalable, resilient, cost-efficient cloud architectures across hyperscalers and on-prem.

AWS EKS · AWS RDS · AWS S3 · AWS Lambda · Azure AKS · Oracle Cloud · Kubernetes · VMware

DevOps & CI/CD

Automation-first pipelines that make deployments rapid, reliable, and repeatable.

Jenkins · Ansible · Docker · JFrog Artifactory · SonarQube · GitLab · BitBucket · Groovy

Observability & Data

Real-time visibility into distributed systems — metrics, logs, and databases at scale.

Prometheus · Grafana · cAdvisor · Elasticsearch · Logstash · Kibana · MySQL 8.2 Cluster

AI / ML & Languages

Building the infrastructure that powers next-generation AI/ML workloads.

Python · TensorFlow · Deep Learning · Kubeflow · DVC · C++ · R · JavaScript

DevSecOps & Security

Secure-by-design principles applied across the full platform lifecycle.

Aqua Security · AWS Certificate Manager · SSL/TLS · CyberArk PAM · Vulnerability Scanning · CEH · eJPT

Leadership & Languages

Leading complex incident response, mentoring teams, and communicating across cultures.

English · Arabic · Hindi · Tamil · Malayalam

Certifications & Credentials

  • AWS
    Certified Security Specialty · AWS · 2025
  • AWS
    Certified Machine Learning Specialty · AWS · 2024
  • AWS
    Certified Solution Architect Associate · AWS · 2020
  • AWS
    Certified Cloud Practitioner · AWS
  • AZ
    Azure 400 DevOps Certified · Microsoft · 2021
  • AZ
    Azure Solution Architect Professional · Microsoft · 2021
  • OCI
    OCI Observability Professional · Oracle · 2025
  • OCI
    OCI Data Science Professional · Oracle · 2025
  • OCI
    OCI Generative AI Professional · Oracle · 2024
  • OCI
    OCI AI Foundations · Oracle · 2025
  • OCI
    OCI Foundations Associate · Oracle · 2025
  • K8s
    Certified Kubernetes Administrator · Linux Foundation / CNCF
  • LF
    Linux Foundation Certified SysAdmin · LFCS · 2024
  • DL
    Deep Learning Specialization · Coursera · Andrew Ng
  • DS
    Data Science Specialization · Johns Hopkins · Coursera
  • EC
    Certified Ethical Hacker · EC-Council
  • INE
    eJPT Penetration Tester · INE
  • SFC
    Scrum Fundamentals Certified · SCRUM

The journey so far

16+ years across cloud architecture, DevOps, SRE and AI infrastructure — layered on top of graduate study in Computer Science, Artificial Intelligence and Cyber Law.

2021 – Present · Dubai, UAE

AI Platform Engineer / Senior SRE, DevOps & DevSecOps

ANUVU — connectivity & entertainment for aviation & maritime
  • Architected containerized infrastructure across AWS EKS and on-prem Kubernetes clusters for mission-critical workloads.
  • Engineered observability with Prometheus, Grafana and cAdvisor; defined SLIs/SLOs and cut incident response time.
  • Operated ELK logging pipelines, resolving indexing pressure (HTTP 429), circuit breaker exceptions and shard allocation failures.
  • Administered MySQL 8.2 clustered environments with Group Replication — fixed replication conflicts and storage bottlenecks.
  • Built Jenkins CI/CD pipelines and scheduled automation for health checks, log rotation and disk management.
  • Hardened platform security via AWS Certificate Manager, Aqua Security container scanning and SSL/TLS lifecycle management.

05/2021 – 05/2022 · Dubai, UAE

Senior DevOps Engineer

Saab — healthcare innovation
  • Deployed Snipe-IT with Docker-Compose and automated SSL certificate deployment via NGINX + Ansible.
  • Configured Jenkins with Artifactory, SonarQube and Docker plugins for multi-language CI/CD (Python, C/C++, NodeJS, Vue).
  • Integrated GitLab/BitBucket with LDAP across Jenkins, Jira and Confluence for unified authentication.
  • Managed binary artifacts in JFrog Artifactory; authored pipeline scripts, shell scripts, Dockerfiles and Ansible playbooks.

07/2020 – 05/2021

Senior DevOps & Cloud Engineer

OSN — Middle East & North Africa satellite TV
  • Boosted data mining and automation by 45% with a scalable classifier integrating supervised and unsupervised learning.
  • Delivered cloud deployments on schedule, meeting 100% of deadlines under strict security standards.
  • Provisioned multiple on-prem Kubernetes clusters via kubeadm on Ubuntu 20.04.

04/2018 – 06/2020 · Dubai, UAE

Azure Cloud Solution Architect

Este — web, mobile & ML applications
  • Installed and configured Jenkins on AKS with internal Azure load balancing and SCM-triggered pipelines.
  • Managed Azure Active Directory access, backup policies, and Linux administration on CentOS/RHEL.
  • Built an ML recommendation engine using purchase history and browsing behavior to drive product visibility.

03/2016 – 03/2018 · Dubai, UAE

Meteorological Reporting System

Sharjah International Airport
  • Established Git branching and naming conventions and developed Groovy scripts integrating SCM with Chef.
  • Automated CI/CD in Jenkins using Docker, Python, Groovy, PowerShell and Unreal Engine build systems.
  • Performed OS installations, upgrades and server patching on RHEL 5.x–7.x using PXE, DHCP, Kickstart and Jumpstart.
  • Built AWS CloudFormation templates for custom-sized VPCs, subnets and NAT; wrote Python integrations with AWS APIs.

07/2014 – 02/2016 · Dubai, UAE

Web / Mobile Application Developer

N M Informatics — web & mobile app startup
  • Increased delivery speed by 20% by integrating five new technical tools.
  • Automated iOS application archiving and App Store deployment using fastlane.

06/2013 – 06/2014 · Chennai, India

Instructor — iOS / Android

VELS University
  • Taught iOS / Objective-C to master's students — syntax, semantics, memory management.
  • Covered UI, Core Data, Protocols, Delegates and method swizzling.

03/2009 – 12/2012 · Chennai, India

iOS Game Developer

TrueTech Solutions
  • Built automated unit tests using open-source frameworks and TDD.
  • Worked with AWS CloudFormation, VPCs, subnets, NAT and Python + Amazon API integrations.
  • Created Jenkins jobs to build AWS infrastructure from GitHub repos containing Ansible playbooks.
  • Installed and configured DHCP, DNS (BIND), web (Apache/IIS), mail and file servers on Linux.

09/2025 · Atlanta, USA

MS in Computer Science

Georgia Institute of Technology

01/2023 – 06/2024 · London, UK

MSc in Artificial Intelligence

University of West London

2021 – 2022 · Bangalore, India

Post Graduate Degree in Cyber Law & Forensic Law

National Law School of India University

2010 – 2014 · Chennai, India

Bachelor of Computer Applications

University of Madras

1994 – 2000 · Dubai, UAE

High School

Our Own English High School — 80% aggregate · Maths & Physics Olympiad distinctions

How I can help

From greenfield platform design to hardening what you already run in production — here are the engagements I take on.

01 · Cloud & Platform Architecture

Multi-cluster Kubernetes on AWS EKS, Azure AKS, OCI or on-prem. Designed for availability, resilience and cost efficiency.

  • EKS / AKS cluster design
  • Hybrid & on-prem Kubernetes
  • VPC, networking & IAM
  • Capacity planning & cost review

02 · Site Reliability & Observability

End-to-end telemetry stacks that surface the right signal — alerting that respects SLIs/SLOs instead of paging on noise.

  • Prometheus, Grafana, cAdvisor
  • ELK / OpenSearch logging
  • Alertmanager & SLO design
  • Incident response & RCA

03 · CI/CD & Automation

Jenkins pipelines, Ansible playbooks and infrastructure-as-code that make shipping and recovery boring in the best way.

  • Jenkins pipeline engineering
  • Ansible configuration management
  • Docker & container registries
  • Release & rollback automation

04 · DevSecOps & Hardening

Security as a first-class citizen — SSL/TLS lifecycle, container scanning, secrets management and continuous vulnerability posture.

  • Aqua Security & image scanning
  • AWS Certificate Manager
  • CyberArk PAM
  • Penetration testing (eJPT, CEH)

05 · AI / ML Platform Engineering

Scalable, secure infrastructure for AI/ML workloads — from training pipelines to model serving and data versioning.

  • Kubeflow on EKS/AKS
  • DVC data versioning
  • GPU scheduling & autoscaling
  • Recommendation engines

06 · Database Reliability

High-throughput MySQL with Group Replication — tuning, replication health and recovery from transaction storms.

  • MySQL 8.x clustered environments
  • Replication conflict resolution
  • AWS RDS design & operation
  • Performance tuning

Work I'm proud of

A selection of the most impactful platforms and systems I've built, scaled and hardened.

Kubernetes · AWS EKS · ANUVU

Multi-Cluster Kubernetes Platform

Architected and ran containerized infrastructure across AWS EKS and on-prem clusters for mission-critical aviation and maritime connectivity systems: the platform that in-flight Wi-Fi and in-flight entertainment (IFE) ride on.

Prometheus · Grafana · ELK

End-to-End Observability Stack

Designed and operationalised a full observability platform — Prometheus for metrics, Grafana for dashboards, cAdvisor for container telemetry, ELK for logs. Alerting that mapped to real user impact, not noise.

MySQL 8.2 · Group Replication · HA

MySQL 8.2 Clustered Environment

Administered a production MySQL 8.2 cluster with Group Replication. Resolved replication conflicts, rolled-back transactions and storage bottlenecks while keeping write throughput high and consistency intact.

Jenkins · Ansible · CI/CD

Jenkins CI/CD Pipeline Ecosystem

End-to-end Jenkins pipelines with Artifactory, SonarQube and Docker integrations across Python, C/C++, NodeJS and Vue codebases. Scheduled automation for health checks, log rotation and disk management. Ansible at the config layer.

DevSecOps · Aqua Security · SSL/TLS

DevSecOps Hardening Programme

Rolled out secure-by-design principles across the platform — SSL/TLS lifecycle via AWS Certificate Manager, container vulnerability scanning with Aqua Security, and NGINX-delivered certificate automation through Ansible.

Machine Learning · Python · Azure

ML Recommendation Engine

Built a recommendation engine that drove product visibility using purchase history, cart activity, brand preference and browsing behaviour — on-site suggestions and email campaigns. Boosted data mining and automation by 45%.

Let's build something reliable together

Open to Senior SRE, Platform Engineering, DevOps/DevSecOps leadership and AI Infrastructure roles — remote, hybrid, or on-site in the UAE. Also available for consulting engagements. I typically reply within 24 hours.