Hands-on builder shipping production agentic AI end-to-end — agents, MCP tools, RAG, and the platforms that govern them. 13+ years across data & AI; currently Principal Software Engineer & Tech Lead at Autodesk, Greater Montreal.
I'm an AI Engineer and Data Architect with 13+ years across 11 companies and 2 continents. Today I lead a central team at Autodesk building an agentic-AI platform — a BYO-agent / BYO-MCP model with its own discovery, orchestration, and harness/standards layers — and I've shipped 3 production agents, 5 MCPs, and 2 IDE plugins used across the company. One flagship: a proactive agent that modernized 1,500+ internal apps, migrating legacy, vulnerability-prone APIs to GraphQL across five languages at >85% success — with strong ROI.
I'm a hands-on builder who takes ideas from MVP and POC to production-grade systems — agents, MCP tools, and RAG — on top of a decade of data-platform and big-data architecture. Along the way I've handled ~2 PB of data and ~200K events/sec, built Databook's first semantic layer from scratch, and architected platforms at Cloudera, Sitecore, Intact, Ericsson, and CGI.
I care about systems that are reliable as well as smart — data integrity, observability, and evals matter as much as embeddings and prompts. I've led teams of engineers, because great platforms are built by great teams.
Local-first income & expense tracker driven by a Claude tool-calling agent — chat it, WhatsApp it, or snap a receipt. SQLite is the single source of truth; OCR and Google sync are opt-in.
Leading a BYO-agent/MCP platform — agent & MCP catalog, discovery, orchestration, and worker standards. Shipped 3 agents, 5 MCPs & 2 IDE plugins used across Autodesk.
A proactive agent that migrates legacy, vulnerability-prone APIs to GraphQL across 1,500+ internal apps — direct & dependency-based usage in Python, Go, Java, Rails & Perl — at >85% success by code complexity, with a UI dashboard. High ROI.
Built Databook's data platform from scratch and its first semantic layer (early 2023) powering Databook AI — with agentic workflows in LangGraph & Google ADK.
A GPT product for strategic sales with multi-query & step-back retrieval, reranking, and LLM-as-judge to cut hallucinations and lift engagement.
Led the core Search team handling ~200K events/sec; architected a Data Lake/Warehouse for a global retailer, cutting storage costs.
A Scala rule engine that validates streaming and batch data against rules defined in YAML — data-quality enforcement for pipelines.
View on GitHub →More on github.com/samsandeepmalik
Professional Services expert onboarding customers to Cloudera Data Platform (public/private/hybrid); designed on-prem → AWS migrations and trained engineering teams.
Led a team of data engineers; built on-prem → AWS historical-data migration and refactored pipelines onto Databricks.
A decade of data & software engineering — 5G optimization, real-time trade-settlement pipelines, an in-memory data fabric, batch + real-time big-data over ~2 PB (Dubai), and enterprise platforms.
Master of Science in Computer Science — Manav Bharti University (2013).
Snowflake SnowPro Core · AWS Solutions Architect – Associate · AWS Cloud Practitioner · Databricks Spark 3.0 · CCA-175 · Microsoft AZ-900.
Building the harness around LLMs — agent runtimes, MCP, orchestration, evals, and guardrails for production-grade agentic systems.
Open to Principal / Staff roles in agentic AI. The fastest way to reach me:
CV in two formats — designed PDF for reading, or an ATS-friendly version for application systems.