I'm a Data Scientist and AI Consultant with 4+ years of experience building scalable machine learning systems, intelligent data pipelines, and LLM-powered applications. I specialize in end-to-end GenAI solutions, from real-time web crawling and feature engineering to deploying LLM-based tools such as resume builders, Q&A bots, and document intelligence systems.
I've worked across industries including finance, education, and media tech, combining my skills in Python, cloud platforms (GCP, AWS), LangChain, Kafka, and ML frameworks to deliver production-ready tools. Passionate about solving real-world problems with AI, I focus on building solutions that are scalable, smart, and user-friendly.
Self-employed
June 2023 - Present
Designed and delivered AI-powered tools and applications using LLMs, LangChain, and cloud technologies. Focused on creating scalable, real-time NLP systems for document intelligence, resume automation, and financial analysis.
Admazes Limited, Hong Kong
December 2021 - Present
Led the development of scalable data pipelines, automated data ingestion systems, and ML classification tools across high-volume data sources. Focused on building reliable infrastructure and delivering actionable business insights through robust engineering and modeling practices.
Codemarket, California
December 2020 - February 2021
Contributed to backend development and cloud-based deployments of modern web applications, supporting real-time API workflows and data integrations.
Built an LLM-powered Retrieval-Augmented Generation (RAG) system that allows users to ask natural language questions about 100+ universities. Integrated data from Quora and Reddit using scalable crawlers. Designed chunking, metadata tagging, and vector search using LangChain and OpenAI to return accurate, context-aware responses.
Engineered a data pipeline to process 100M+ monthly Google SERP records and forecast keyword trends using clustering and regression models. Handled large-scale data ingestion, transformation, and analytics for business insights.
Created an end-to-end machine learning pipeline to classify search queries into relevant marketing categories. Implemented text preprocessing, model training using fine-tuned BERT, and deployed real-time prediction APIs on Google Cloud. Used for search analytics and content tagging.