Skip to content

Field Manual // First Edition

Open to DevOps/SRE roles & OSS collaboration

MD. YUNUS

$ whoamiSRE who builds observable, self-healing cloud platforms

The story of an electronics student who fell in love with infrastructure. Framework-level code merged into Cloudflare andDocker. Oracle ACE Associate. I make platforms observable, incidents short, and deploys boring.

9+
PRs merged into Cloudflare & Docker
281
automated tests shipped to production
45%
MTTR reduction via AI incident agent
12+
OCI reference architectures published

01Origins

// from circuits to clusters

Motivated B.Tech (Electronics & Communication) graduate (expected 2027) with proven open-source contributions at Cloudflare and Docker, Oracle ACE Associate, and hands-on experience in DevOps, MLOps, and cloud infrastructure. I've delivered framework-level fixes, reduced MTTR by 45% via an AI-driven incident-root-cause agent, and shipped 281 automated tests across production systems. Passionate about building reliable, observable, and secure platforms at scale.

My journey began in electronics and communication engineering, but I've gravitated toward cloud-native technologies through self-directed learning and hands-on contributions. What drives me is the challenge of building systems that are not just functional, but resilient, observable, and secure by design—especially in distributed, high-scale environments.

// CHANGELOGFILE: origins.md
  • [v0.1]ECE foundation initialized — B.Tech Electronics & Communication.
  • [v0.4]First containers and CI pipelines shipped — DevOps internship (2024).
  • [v0.7]ML infrastructure at scale — 2.5M+ record forecasting pipelines at Shell (2025).
  • [v0.9]The open source arc — framework PRs merged at Cloudflare & Docker (2026).
  • [v1.0]Oracle ACE Associate — reference architectures running in the community (2026).
KubernetesDockerTerraformOCIAWSPrometheusGoPythonTypeScriptCI/CD

02The Open Source Arc

// cloudflare/vinext · docker/agent — 9 merged PRs

I didn't learn open source from a course. I learned it by reading framework internals and shipping fixes that other developers now build on.

03Chapters of Work

// from campus to cloud platforms

  1. 2026

    Cloudflare & Docker Inc. (Open Source Contributor) — Core Framework Contributor

    Remote

    • ->Exported `ServerInsertedHTMLContext` to unblock CSS-in-JS SSR for Apollo Client, styled-components, and Emotion users, adding 5 integration tests and 238 lines of clean code (PR #151)
    • ->Authored documentation that eliminated `Cannot find module` errors for ESM validator packages across RSC and SSR environments, adding a test fixture and 24 integration-test lines (PR #193)
    • ->Fixed `Dynamic require of .mjs is not supported` on Node 22 by integrating `ensureESModule`/`renameCJSConfigs` in the CLI, adding 9 unit tests and maintaining 169 passing tests (PR #197)
    • ->Wired `RouterContext.Provider` in the Pages Router, fixing `useRouter()` returning null and providing a single source-of-truth router value (PR #249)
    • ->Extracted shared request-handling utilities into `request-pipeline.ts`, removing 1,627 duplicated lines, fixing 6 CodeQL security issues, and adding 37 unit tests (PR #258)
    • ->Added snapshot tests for entry template generators, raising coverage from 0% to ~100% for those modules (PR #345)
    • ->Prevented `useActionState` from becoming undefined after `redirect()` by adding explicit navigation handling and an E2E test (PR #620)
    • ->Docker Agent: Implemented proactive token compaction at 90% capacity and auto-recovery for overflow errors across 14 LLM provider patterns, adding 33 integration tests (PR #1973)
    • ->Docker Agent: Corrected table-text copy mismatches by implementing a visual-to-rune mapping algorithm, improving developer UX (PR #2175)
  2. Jan 2026 – Present● CURRENT

    Oracle Corporation — Oracle ACE Associate

    Remote

    • ->Published 12+ OCI reference architectures covering secure VCN topologies, IAM design patterns, and DevOps practices, adopted by the Oracle community
    • ->Produced comparative analyses of managed vs. self-managed cloud services (OKE vs. Kubernetes, managed databases vs. self-hosted)
    • ->Earned four OCI certifications: Architect Associate, DevOps Professional, Developer Professional, and Foundations
    • ->Actively mentors peers in the Oracle ACE community and shares cloud-native best practices via technical blogs and talks
  3. Oct 2025 – Nov 2025

    Shell India Market Private Limited — ML Infrastructure Intern

    Remote

    • ->Engineered forecasting pipelines processing 2.5M+ aviation records using Prophet (MAE < 5%) and automated anomaly detection with Isolation Forest
    • ->Containerized FastAPI inference services with Docker and automated deployments via GitHub Actions, cutting operational overhead
    • ->Achieved 100% test pass rate for the end-to-end ML system
  4. Jun 2024 – Jul 2024

    Stellanova GlobalTech Pvt. Ltd. — DevOps Intern

    Remote

    • ->Containerized five micro-services using Docker and optimized Jenkins & GitHub Actions pipelines, reducing deployment times by 30% and increasing CI/CD throughput
    • ->Implemented zero-trust connectivity patterns using Tailscale, Ngrok, and SSH tunneling, eliminating direct server exposure

04Builds

// problem -> fix -> measurable impact

incident-root-cause-investigator-pro.sh

Incident Root-Cause Investigator Pro

PythonCoral MCPFastAPIDockerGitHub Actions

problem -> During incidents, engineers burn critical minutes manually correlating logs, metrics, and traces across disconnected tools while users stay impacted.

built -> Architected an enterprise AI agent orchestrating multi-source telemetry via a Coral MCP query layer. ML models analyze logs, metrics, and traces to surface actionable insights during incident response.

  • 45% faster MTTD (simulated)
  • 281 automated tests
  • failure-injection coverage
production-observability-platform.sh

Production Observability Platform

JavaSpring BootPrometheusGrafanaGitHub Actions

problem -> Services shipping without instrumentation mean failures get reported by users instead of alerts — the most expensive way to find out.

built -> Built a 4-stage CI/CD pipeline (build, test, security scan, deploy) and instrumented a Spring Boot service with Prometheus metrics and Grafana dashboards, with automated synthetic testing and alert routing.

  • 4-stage CI/CD pipeline
  • real-time dashboards
  • multi-channel alerting
cloudflare-vinext-contributions.sh

Cloudflare Vinext Contributions

TypeScriptReactNext.jsNode.js

problem -> SSR gaps in a production framework blocked CSS-in-JS users (Apollo, styled-components, Emotion), and duplicated request-handling code hid security issues.

built -> Shipped framework-level fixes across SSR context, routing, module resolution, and test coverage — merged into a framework used in production at the edge.

  • 7 PRs merged
  • 1,627 duplicate lines removed
  • 6 CodeQL issues fixed
  • 0% → 100% module coverage
$ git log --merged[expand]
  • PR #151: Exported `ServerInsertedHTMLContext` to unblock CSS-in-JS SSR for Apollo Client, styled-components, and Emotion users
  • PR #193: Documented ESM resolution for validator packages
  • PR #197: Fixed Dynamic require of .mjs on Node 22
  • PR #249: Wired `RouterContext.Provider` in Pages Router fixing `useRouter()` returning null
  • PR #258: Extracted shared request-handling utilities, removing 1,627 duplicated lines and fixing 6 CodeQL security issues
  • PR #345: Added snapshot tests for entry template generators (0% → 100% coverage)
  • PR #620: Prevented `useActionState` from becoming undefined after `redirect()`
docker-agent-contributions.sh

Docker Agent Contributions

GoprotobufgRPC

problem -> LLM context-window overflows crashed agent sessions mid-task, and table text copied from the TUI came out garbled.

built -> Implemented proactive token compaction at 90% capacity with auto-recovery across LLM providers, and a visual-to-rune mapping algorithm for accurate text selection.

  • 14 LLM provider patterns handled
  • 33 integration tests
  • zero-crash overflow recovery
$ git log --merged[expand]
  • PR #1973: Implemented proactive token compaction at 90% capacity and auto-recovery for overflow errors across 14 LLM provider patterns
  • PR #2175: Corrected table-text copy mismatches by implementing visual-to-rune mapping algorithm

05The Toolbelt

// engineered precision for the modern cloud landscape

[ 05.1 Cloud & Infrastructure ]

Cloud Platforms
Oracle Cloud Infrastructure (OCI)Amazon Web Services (AWS)Google Cloud Platform (GCP)
Infrastructure as Code
TerraformGitOps (ArgoCD/Flux)AWS CloudFormation
Networking
OCI VCNAWS VPCZero-Trust Architecture (Tailscale, SSH tunneling)

[ 05.2 Containerization & Orchestration ]

Containerization
DockerOCI containers
Orchestration
Kubernetes (OKE/EKS/GKE)HelmKustomize

[ 05.3 CI/CD & Automation ]

CI/CD Platforms
GitHub ActionsJenkinsGitLab CI
Automation
BashPythonGo scripting
Pipeline Stages
Buildtestsecurity scandeployrollback

[ 05.4 Observability & Monitoring ]

Metrics
PrometheusGrafanaAlertmanager
Logging
ELK StackLokiFluentd
Tracing
JaegerTempoOpenTelemetry
APM
Custom instrumentationhealth checks

[ 05.5 Programming Languages ]

Primary
PythonJavaGoTypeScript/JavaScript
Scripting
BashPowerShell
Query
SQLPromQLLogQL

[ 05.6 DevOps Practices ]

Infrastructure
Immutable infrastructureblue/green deploymentscanary releases
Reliability
Chaos engineeringfault tolerancecircuit breakers
Security
Secrets managementimage scanningpolicy as code (OPA)
Testing
UnitintegrationsnapshotE2Eperformancechaos testing

▣ Certifications_Shelf

Oracle · 2026

Oracle Cloud Infrastructure Architect Associate

Oracle · 2026

Oracle Cloud Infrastructure DevOps Professional

Oracle · 2026

Oracle Cloud Infrastructure Developer Professional

Oracle · 2026

Oracle Cloud Infrastructure Foundations

Oracle · Jan 2026 - Present

Oracle ACE Associate

McKinsey & Company · 2026

McKinsey Forward Program Graduate

06Signal to Noise

// ~/chapters/06_field_notes.sh

Thoughts, post-mortems, and architectural notes scaled for public consumption — published onyunus25jmi.medium.com.

OCI reference architecturesKubernetes at scaledistributed systemsopen-source fixesobservability strategiessupply chain security
Apr 2026

Building a Netflix-Style IaC Platform on OCI: Resource Manager Stacks as Product APIs

Md Yunus

READ ->
Mar 2026

Zero-Downtime Migration to Oracle Autonomous Database

Md Yunus

READ ->
Mar 2026

How I Fixed Vite’s “Dynamic require of .mjs is not supported” Error in Cloudflare’s vinext

Level Up Coding

READ ->
Mar 2026

How I Fixed Apollo Client SSR in Cloudflare’s vinext by Wiring a Missing React Context

Level Up Coding

READ ->
Mar 2026

Building a Lightweight, Scalable Database Stack with Oracle Container Engine

Md Yunus

READ ->
Mar 2026

How I Fixed ESM Resolution in Cloudflare’s vinext

Md Yunus

READ ->
Mar 2026

When to NOT Use Kafka on OCI: A Cost and Complexity Analysis for Event Streaming Architecture

Level Up Coding

READ ->
Mar 2026

The Cold Start Problem Nobody Talks About (It’s Not Just Serverless)

Md Yunus

READ ->
Feb 2026

Beyond NetworkPolicies: Infrastructure-Native Microsegmentation with Oracle Kubernetes Engine

Md Yunus

READ ->
Feb 2026

Designing a 1 Million RPS Kubernetes Ingress Layer: From TCP Handshake to Kernel Tuning

Md Yunus

READ ->
Feb 2026

Control Plane High Availability on OKE: What Oracle Actually Manages (And Why It Matters)

Md Yunus

READ ->
Feb 2026

Scaling The Software Architecture: Beyond Linear Growth to Phase Transitions

Startup Stash

READ ->
Feb 2026

Multi-Tenancy in Kubernetes: Soft vs. Hard Isolation

Md Yunus

READ ->
Feb 2026

Observability Economics: Cost vs. Fidelity in Distributed Tracing

Md Yunus

READ ->
Feb 2026

Securing the Software Supply Chain: Beyond CVE Scanning to Cryptographic Verification

Md Yunus

READ ->
Jan 2026

The Split-Brain Dilemma: Architecting Consistency in Distributed Cloud Systems

Md Yunus

READ ->
Oct 2025

The Day the Internet Stopped: Inside the AWS Outage of October 20, 2025

Md Yunus

READ ->
Sep 2025

The Micro-Startup Stack: Engineering Velocity Without the Overhead

Md Yunus

READ ->
Sep 2025

System Design Deep Dive: How to Build a URL Shortener That Won’t Crash at Scale

Md Yunus

READ ->

07Write the Next Chapter

// ~/chapters/07_contact.sh

"Every good story needs collaborators."