Vivek Kushwah
Designing, testing, and implementing resilient infrastructure. Building autonomous systems that scale.

01 // Identity
DevOps Engineer with a background designing, testing and implementing infrastructure and applications. Purpose-driven professional with capacity to be strong team player plus work effectively independently.
I don't just maintain servers; I build autonomous systems that scale with business needs.
02 // Capabilities
Container Orchestration
Infrastructure as Code
CI/CD & GitOps
Monitoring & Observability
Databases & Caching
SRE & Platform Engineering
Data & Security
03 // Run Logs
Engineer
- Led the end-to-end migration from Azure API Management (APIM) to AWS API Gateway, involving both legacy and active APIs across multiple teams. Implemented request/response transformation, improved connection security, and optimized gateway configurations — resulting in better latency, simplified management, and seamless integration with backend services.
- Designed and implemented reusable Terraform modules to standardize infrastructure provisioning across AWS, Azure, and GCP. Reduced setup time for new environments. Maintained production-grade Kubernetes clusters across AWS.
- Achieved a 40% reduction in cloud spending through architectural refactoring, instance rightsizing, decommissioning unused resources, and creating proper tagging. Also enabled cluster autoscaling and workload rightsizing, created organization-wide cost monitoring dashboards and audit reports.
Engineer
- Designed and provisioned scalable cloud infrastructure using EC2, ElasticSearch, MongoDB with autoscaling, and core AWS services including RDS, S3, and CloudFront—boosting uptime and responsiveness across critical production workloads.
- Owned Kubernetes manifest authoring and cluster operations, deploying microservices with optimized resource limits, readiness/liveness probes, and autoscaling strategies—resulting in 20% better resource utilization and reduced pod evictions.
- Built and maintained Dockerized application stacks, enforcing image hygiene and minimizing attack surface via multi-stage builds and base image validation across development and production environments.
DevOps Engineer
- Provisioned and automated infrastructure from scratch using Terraform and Atlantis for PR-based workflows; optimized Docker images to improve performance, reduce size, and accelerate deployments. Built logging and alerting using ELK stack with Elastalert, Heartbeat, and Status Page, cutting downtime by 25%.
- Engineered and refined Azure Pipelines, slashing deployment time by 50% and boosting development velocity. Managed Databricks ETL workflows, cluster provisioning, and alerting with Azure Data Factory and ML Workspaces for seamless data operations.
- Deployed and configured AKS clusters with complete network and DNS integration in Azure, ensuring high availability and consistent application delivery at scale.
DevOps Engineer
- Managed and optimized AWS infrastructure using services such as Lambda, RDS, EKS, DMS, EC2, CloudFront, Route 53, and more; implemented IaC using Terraform with Atlantis and Terraform Drift, extended to Snowflake infrastructure, and built disaster recovery and backup strategies for high availability.
- Designed and enhanced CI/CD pipelines on CircleCI and GitHub Actions, leveraging caching, parallel jobs, and smart branching strategies to reduce runtime by 30%; contributed to unit and integration test coverage, and API development in Django and Airflow Pipelines.
- Built and maintained Airflow infrastructure with custom Docker images and Kubernetes cluster setup; developed and optimized DAGs, implemented DMS task auto-scaling, cutting costs by 40%; managed Snowflake ingestion workflows and warehouse optimization, achieving 35–40% cost reduction.
04 // Deployments
Infrastructure Automation with Terraform and Azure
Successfully designed and implemented an infrastructure automation project using Terraform and Azure. Created Terraform modules and Git/GitHub best practices to improve reusability, scalability, and code management. Integrated CI/CD pipelines with Azure YAML for the Drift and Atlantis in AKS for automated deployments on pull requests and continuous integration, resulting in a reduction in resource provisioning.
Monitoring and Observability with Prometheus and Grafana
Implemented a robust monitoring and observability solution utilizing Prometheus for metric collection from various services with custom exporters and alerting rules, along with Grafana for creating interactive dashboards enabling real-time monitoring, integrated with PagerDuty and Slack for efficient incident management and performance optimization.