My Technical Stack

A deep dive into the tools, languages, and platforms I work with on a daily basis.

Core Languages & Runtimes

  • Python: Primary language for ML and backend services. Knowledge scientific computing (NumPy, SciPy), Type Hinting (Pydantinc) and in AsyncIO.
  • SQL: Expert-level proficiency in complex window functions, CTEs, and query optimization for high-volume data.
  • Bash: Shell scripting for automation, system administration, and CI/CD entry points.

Machine Learning & Data Stack

  • PyTorch : Deep learning framework of choice. Experience building and deploying models, fine-tuning, distributed training, and scalable pipelines.
  • Transformers : Working with HuggingFace for NLP, Retrieval-Augmented Generation (RAG), and fine-tuning LLMs.
  • Model Optimization: Proficiency in ONNX, TensorRT, or quantization to reduce inference latency in production.
  • Vector Databases: Implementing and managing ChromeDB, and Pinecone for high-dimensional similarity search.
  • MLflow: Model tracking, experiment versioning, and registry management.

Platform & Infrastructure

  • Kubernetes (K8s) : Container orchestration, deployments, services, and monitoring. Familiar with Kubeflow for specialized ML workflows.
  • Docker : Containerization for reproducible environments and microservices.
  • Cloud (AWS/GCP):
    • AWS : SageMaer (Pipelines/Endpoints),S3, EC2, Lambda, Athena.
    • GCP : Vertex AI, BigQuery Cloud Run, GCS.
  • Terraform : Infrastructure as Code (IaC) for modular and reproducible resource managment.

Data Engineering

  • Apache Spark : Distributed data processing at scale.
  • Apache Kafka : Real-time event streaming and data pipelines.
  • Pandas / Polars : Advanced data manipulation, utilizing Polars for performance-critical, multi-threaded operations.
  • MLflow : Model tracking, versioning, registry, and serving.
  • Airflow : Workflow orchestration and data pipeline scheduling.

Databases & Orchestration

  • Snowflake/BigQuery : Cloud data warehouse for ELT pipelines, large scale analytics, and feature stores.
  • NoSQL : Redis for real-time features. DynamoDB for metadata.

DevOps & CI/CD

  • Testing: Comprehensive unit and integration testing using Pytest.
  • CI/CD: Automation via GitHub Actions or GitLab CI, including automated model testing and deployment.
  • Prometheus & Grafana : Monitoring and observability.

Development Tools

  • Code Quality: Strict adherence to Ruff, Black, and Mypy for maintainable, production-ready codebases
  • Git : Version control, branching strategies, and collaborative workflows.

Last updated: 2026. Always learning and exploring new tools that solve real problems.