~/cloud-engineer Online

Vithushanth Chandrakumar

$ |

AWS Certified Solutions Architect with hands-on experience building serverless automation, secure infrastructure, and monitoring solutions. Combining 4+ years of IT operations with cloud engineering expertise to deliver reliable, scalable systems.

AWS SAA-C03
CompTIA A+
Scroll Down

$ About me

I'm a Junior Cloud Engineer with a proven foundation in IT operations and a passion for cloud automation and infrastructure.

After 4+ years supporting 500+ users across healthcare and financial sectors, I've developed strong troubleshooting skills and a deep understanding of system reliability. My journey from service desk to infrastructure engineering taught me that the best solutions prevent problems before they occur—a principle I now apply through automated AWS architectures.

I earned my AWS Solutions Architect Associate certification and built 6 hands-on projects covering serverless automation, VPC networking, event-driven processing, and monitoring. My background in physical infrastructure (fiber optics) gives me a unique perspective on how cloud networking actually works under the hood.

4+ Years

IT Operations & Infrastructure

500+ Users

Technical Support Experience

6 Projects

AWS Hands-on Portfolio

Education

Bachelor of Computer Science

Queen Mary University of London

2017 - 2019

$ Projects

01

Static Website with Custom Domain & HTTPS

S3 CloudFront Route 53 ACM

Production-grade static website hosting with CDN, custom domain, and SSL/TLS security

Problem Solved

Many developers struggle with deploying beyond localhost. This project demonstrates production-ready cloud hosting with proper security (HTTPS) and custom domain management, proving I can deploy real-world infrastructure.

Architecture

User
Route 53
CloudFront
S3 Bucket
  • S3 bucket configured for static website hosting with versioning
  • CloudFront distribution for global CDN and HTTPS enforcement
  • Route 53 for DNS management with health checks
  • ACM certificate for free SSL/TLS (auto-renewal)
  • Bucket policies restricting access to CloudFront only

Key Learnings

  • CloudFront cache behaviors and invalidation strategies
  • Difference between S3 bucket policies and IAM policies
  • SSL/TLS certificate validation using DNS (vs email)
  • Cost optimization through caching and compression

Real-World Relevance

This exact architecture is used by companies for marketing sites, documentation, and SPA frontends. Understanding CloudFront caching and security policies is critical for any cloud engineer supporting web applications.

02

Automated EC2/EBS Snapshots with Retention

Lambda EventBridge Python SNS

Serverless backup automation with intelligent retention policy and notifications

Problem Solved

Manual backups are error-prone and unsustainable at scale. This project automates EC2/EBS snapshots with intelligent retention policies, balancing data protection with storage costs—a critical requirement in production environments.

Architecture

EventBridge
Lambda
EC2/EBS API
SNS + CloudWatch
  • EventBridge cron rule triggers Lambda daily at 2 AM UTC
  • Python function using boto3 SDK for snapshot operations
  • Tag-based volume identification for selective backups
  • 7-day retention policy to manage storage costs
  • SNS notifications for success/failure alerts
  • CloudWatch Logs for audit trail and debugging

Technical Implementation

# Lambda function logic (simplified)
import boto3
from datetime import datetime, timedelta

ec2 = boto3.client('ec2')
sns = boto3.client('sns')

def lambda_handler(event, context):
    # Create snapshots for tagged volumes
    volumes = ec2.describe_volumes(
        Filters=[{'Name': 'tag:Backup', 'Values': ['true']}]
    )
    
    # Delete snapshots older than 7 days
    retention_date = datetime.now() - timedelta(days=7)
    # ... cleanup logic
    
    # Send SNS notification
    sns.publish(TopicArn=TOPIC_ARN, Message=status)

Key Learnings

  • Boto3 SDK for AWS service automation
  • IAM role design with least-privilege permissions
  • EventBridge cron expressions vs rate expressions
  • Importance of tagging strategies for resource management
  • Error handling and retry logic in serverless functions

Real-World Relevance

Backup automation is mandatory in production. This demonstrates understanding of disaster recovery, cost optimization, and serverless patterns used by DevOps teams daily. The retention policy shows awareness of balancing compliance with costs.

03

Custom VPC with Public & Private Subnets

VPC Subnets NAT Gateway Security Groups

Production-grade network architecture with multi-AZ deployment and defense-in-depth security

Problem Solved

Default VPCs lack proper network segmentation. This project implements production-grade network isolation, separating public-facing resources from sensitive backend systems— essential for security compliance and least-privilege access.

Architecture

VPC (10.0.0.0/16)
AZ-1
Public Subnet
10.0.1.0/24
Private Subnet
10.0.11.0/24
AZ-2
Public Subnet
10.0.2.0/24
Private Subnet
10.0.12.0/24
  • Custom VPC with 10.0.0.0/16 CIDR block
  • Multi-AZ deployment for high availability
  • Internet Gateway for public subnet internet access
  • NAT Gateway for private subnet outbound traffic
  • Route tables controlling traffic flow
  • Security Groups and NACLs for defense-in-depth

Key Learnings

  • CIDR block planning and subnet sizing calculations
  • Difference between Internet Gateway and NAT Gateway
  • Route table precedence and routing decisions
  • Security Groups (stateful) vs NACLs (stateless)
  • Multi-AZ architecture patterns for resilience
  • Cost implications of NAT Gateways ($0.045/hour + data transfer)

Real-World Relevance

This is the foundation of ALL enterprise AWS architectures. Every production application uses this network pattern. Understanding VPC design is essential for supporting application deployments, troubleshooting connectivity, and implementing security controls.

04

Serverless CRUD API

API Gateway Lambda DynamoDB Python

Fully serverless RESTful API with auto-scaling and zero server management

Problem Solved

Traditional server-based APIs require constant maintenance, patching, and scaling management. This serverless API eliminates infrastructure overhead while providing automatic scaling, high availability, and pay-per-request pricing.

Architecture

Client
API Gateway
Lambda
DynamoDB
  • API Gateway RESTful endpoints with request validation
  • 5 Lambda functions for CRUD operations (Create, Read, Update, Delete, List)
  • DynamoDB table with on-demand billing for cost optimization
  • CORS configuration for browser-based access
  • IAM roles for secure Lambda-to-DynamoDB communication
  • CloudWatch for API metrics and Lambda logs

API Endpoints

POST   /items          # Create new item
GET    /items          # List all items
GET    /items/{id}     # Get single item
PUT    /items/{id}     # Update item
DELETE /items/{id}     # Delete item

Key Learnings

  • API Gateway request/response transformations
  • Lambda function design patterns and error handling
  • DynamoDB single-table design and partition key selection
  • Cold start mitigation strategies
  • CORS configuration for cross-origin requests

Real-World Relevance

Serverless APIs power modern microservices architectures. This demonstrates backend development skills, event-driven patterns, and cost-effective scaling— critical for cloud-native applications.

05

S3 Event-Driven Processing

S3 Lambda Python Event Notifications

Automated file processing pipeline triggered by S3 upload events

Problem Solved

Manual file processing is slow, error-prone, and doesn't scale. This event-driven architecture automatically processes files uploaded to S3, enabling scalable data pipelines without polling or manual intervention.

Architecture

User Upload
S3 Source Bucket
Event Notification
Lambda Function
S3 Destination Bucket
  • Source S3 bucket with event notifications enabled
  • S3 PUT event triggers Lambda function automatically
  • Processing logic (e.g., image resizing, format conversion)
  • Output written to destination bucket
  • Error handling for unsupported file types
  • CloudWatch metrics for processing time and failures

Key Learnings

  • S3 event notification configuration and filtering
  • Binary file handling in Lambda (Pillow for images)
  • S3 SDK operations: getObject, putObject with metadata
  • Idempotency in event-driven systems
  • Lambda environment variables for configuration

Real-World Relevance

Event-driven processing is fundamental to data pipelines, media workflows, and ETL operations. This pattern is used for log analysis, image/video processing, document conversion, and real-time data transformation across all industries.

06

CloudWatch Monitoring Dashboard & Alarms

CloudWatch SNS Metrics Alarms

Comprehensive observability setup with proactive alerting and centralized dashboards

Problem Solved

Without proper monitoring, issues go unnoticed until users report them—leading to longer downtime and poor user experience. This monitoring setup enables proactive incident management with automated alerts before problems impact users.

Architecture

AWS Resources
CloudWatch Metrics
Dashboard + Alarms
SNS Notifications
  • CloudWatch Dashboard with widgets for key metrics
  • Alarms for EC2 CPU, Lambda errors, API latency
  • SNS topic for email/SMS notifications
  • Log Insights queries for troubleshooting
  • Custom metrics from application code

Configured Alarms

  • EC2 CPU utilization > 80% for 5 minutes
  • Lambda error rate > 5% over 10 minutes
  • API Gateway 5XX errors > 10 per minute
  • DynamoDB read/write capacity threshold

Key Learnings

  • CloudWatch metric namespaces and dimensions
  • Alarm threshold tuning to avoid alert fatigue
  • Dashboard design for quick incident triage
  • Log aggregation, filtering, and pattern matching
  • Cost optimization for metric storage and alarms

Real-World Relevance

Monitoring is critical for maintaining production systems. This demonstrates understanding of SRE principles, proactive operations, and the ability to maintain service reliability through observability—a core responsibility of cloud engineers.

$ Skills

AWS Cloud Services

EC2 S3 VPC Lambda CloudFront Route 53 DynamoDB API Gateway CloudWatch SNS EventBridge IAM ACM

Automation & Scripting

Python Bash Boto3 SDK AWS CLI Serverless Functions

Infrastructure & Networking

VPC Design Subnets & Routing Security Groups NACLs NAT Gateway Internet Gateway Network Troubleshooting

DevOps & Tools

Git GitHub CI/CD Concepts IaC Awareness Documentation

IT Operations

ServiceNow Jira Active Directory Office 365 Remote Support Incident Management Knowledge Base

$ Experience

Junior Infrastructure Engineer

Chrome Eiendom AS, Norway 2023 - 2025
  • Installed and maintained fiber optic infrastructure, gaining hands-on networking experience directly transferable to AWS VPC, subnets, and routing
  • Diagnosed technical issues in real-time for clients and teams, sharpening incident response skills essential for cloud support roles
  • Collaborated with technical teams to ensure reliable service delivery and uptime, demonstrating clear communication in high-pressure environments
Cloud-Relevant Skills: Physical network fundamentals, real-time troubleshooting, high-pressure communication

Service Desk Analyst

St John Ambulance (NHS), UK 2021 - 2022
  • Managed 150+ monthly incident tickets across hardware, software, networking, and access control using ServiceNow and Jira
  • Created knowledge base articles and delivered team training, reducing repeat tickets by 20% and improving self-service capabilities
  • Delivered fast remote support in time-critical healthcare situations while consistently meeting SLAs
Cloud-Relevant Skills: Documentation, automation thinking, incident management, SLA adherence

Technical Support Analyst

Metro Bank PLC, UK 2018 - 2021
  • Delivered technical support to 500+ users, achieving 98% first-contact resolution on hardware, software, and networking issues
  • Managed user accounts and permissions in Office 365 and Active Directory, applying least-privilege principles directly transferable to AWS IAM
  • Identified recurring problems, documented solutions, and implemented proactive fixes that reduced overall ticket volume
Cloud-Relevant Skills: IAM concepts, least-privilege access, proactive problem-solving, scale operations

$ Certifications

AWS Certified Solutions Architect

Associate

SAA-C03

Compute Storage Database Networking Security Cost Optimization

CompTIA A+

Foundational

Hardware Networking Troubleshooting Security

$ Connect with me

I'm currently seeking Junior Cloud Engineer opportunities where I can apply my AWS knowledge and IT operations experience to build reliable, scalable systems.

Whether you're looking for someone to join your cloud team or just want to discuss AWS architecture, feel free to reach out!