Follow me on YouTube • X • LinkedIn • Subscribe to my Newsletter
The AI Engineering Roadmap categorizes the journey into 3 stages:
-
Beginner
- Build basic application to learn to consume LLM APIs, carefully engineering prompts for your applications and working with open source LLMs. -
Intermediate
- Go deep into building more context aware advanced applications using Retreival Augmented Generation (RAG). Learn about vector databases and how to work with one. Learn to build agents with LLMs and tools. -
Advanced
- After mastering application building, learn to deploy, optimise and manage LLM-powered apps in production with LLMOps. Learn to fine-tune pre-trained models to adapt to downstream applications efficiently and at low cost.
I'd suggest you go deep in each stage, build projects, POCs or better yet, functional products and then move to the next stage.
The following table enlists learning resources and references that I found helpful and plan to use myself to build.
https://www.youtube.com/playlist?list=PLIkXejH7XPT8x9iUGvlsYt44aPx6ns8BV
Skills | Resources |
---|---|
Beginner | |
Working with LLMs and APIs | - Commonly used LLM APIs: OpenAI, Anthropic(Claude), Hugging Face |
Prompt Engineering | - DeepLearning.AI Course on ChatGPT Prompt Engineering for Developers - Prompt Engineering Guide: A detailed resource encapsulating the latest papers, advanced prompting techniques, learning guides, model-specific prompting guides, lectures, references, new LLM capabilities, and tools related to prompt engineering. - Awesome ChatGPT Prompts: A compilation of great prompts to be used with ChatGPT models. |
Running and working with Open Source LLMs | - Deeplearning.AI course on Open Source Models with Hugging Face - Open Source LLMs can be accessed via Hugging Face Hub and you can play with a few of them in Hugging Face Spaces - OpenRouter Docs - Run LLMs on your local machine using LM Studio |
Chain of Operations - LangChain | - Quickstart guide on how to build an application with LangChain. - Deeplearning.AI course on LangChain for LLM Application Development |
Code / Image / Audio Generation | - This is covered in parts in the Open Source Models with HF course on DeepLearning.AI - Building Generative AI Applications with Gradio - Code Gen: Check out these resources on code generation - gpt-engineer, Tabby, gpt-migrate to migrate your codebase from one framework to another or one language to another. - Audio Gen: text to speech by openAI, resemble.ai, elevenlabs API - Image Gen: Image generation by Open AI, creating images using Stable Diffusion API |
Intermediate | |
Working with Vector Databases | - Text chunking and splitting by LangChain - Course on vector databases: Learn what are embeddings and how to store them. Build applications. - Another course on building applications with vector databases using Pinecone - Learn to compute sentence, text, and image embeddings using Framework like SentenceTransformers. - Check out top embedding models here. |
Building RAG Applications | - RAG applications are all about building connections between tools, databases, context lengths, embeddings, memories, etc. You need frameworks like LangChain, LlamaIndex, FastRAG to build these. - Step-by-step tutorial to build a Q&A RAG pipeline by LangChain - LangChain’s RAG from Scratch playlist on YouTube is pretty detailed and amazing. |
Advanced RAGs | - Check out Jerrry Liu’s course on Building and evaluating Advanced RAG Application on DeepLearning.AI for best practices and improving your RAG pipeline’s performance. - Cheat Sheet and some recipes for building Advanced RAG - A comprehensive guide on building RAG-based LLM application by AnyScale |
Evaluating RAGs | - Hugging Face Cookbook on How to evaluate RAG system - Evaluating all components of your RAG pipeline - RAGAS framework to evaluate RAG pipelines. |
Building Agents | - Quickstart guide by LangChain to build agents to have a sequence of actions taken to do a job or multiple jobs. - Course on Functions Tools and Agents with LangChain by Harrison Chase on DeepLearning.AI - Autogen: Framework to develop LLM applications using multiple agents that can converse with each other to solve tasks. - Crew AI: AI Agents for real use cases. |
Deploying Apps | - Local deployment: Running open source LLMs on local machines (LM Studio, Ollama, oobabooga, kobold.cpp, etc.) - Building POCs and demo applications using frameworks like Gradio and Streamlit - Deploying LLMs at scale on cloud technologies like vLLM and SkyPilot. - Deploying LangChain applications (runnables and chains) as a REST API. |
Advanced | |
Fine-tuning for specific use cases | - DeepLearning.AI course on finetuning LLMs - A Beginner’s Guide to LLM Fine-Tuning is a detailed guide on finetuning LLMs - A very detailed and simplified read on how to fine-tune LLMs with Hugging Face by Philipp Schmid - 4-part blog series by Anyscale is a comprehensive guide on fine tuning and serving LLMs. - Fine-Tune Your Own Llama 2 Model in a Colab Notebook - LLM Datasets for finetuning |
LLMOps | - Deeplearning.AI Course on LLMOPs is a good starting place for advanced practitioners. - GPU Inference optimization techniques like FlashAttention and FlashAttention-2 - LLMOps guide by DataBricks - Efficiently serving LLMs course on DeepLearning.AI. |
Multi-modal applications | - Building hybrid search apps with vector databases like Pinecone - Cookbook for multimodal RAG pipelines. |
Model Evals and benchmarking | - How to evaluate LLMs with Hugging Face Lighteval - Course on Automated Testing for LLMOps: Learn to test and evaluate LLM application using an evaluation LLM. |
AI Security | - Red Teaming LLM Applications - learn to identify and evaluate vulnerabilities in LLM apps. - Planning red teaming for large language models (LLMs) and their applications - A detailed list of resources on LLM security highlighting all potential risks and vulnerabilities in AI applications. |
Here's the content of your HTML table converted into a well-formatted markdown table:
S.No. | Project Idea | Skills required / to be developed |
---|---|---|
Beginner | ||
1. | Building a chatGPT / Midjourney -like bot for a niche persona / use-case and integrate it with WhatsApp or Slack or Discord or build an app using Streamlit / Gradio. | LLMs , Prompt Engineering , open-source LLMs , Working with APIs , Python |
2. | Chrome extension to summarise / ideate / extract takeaways / research with web pages. | LLM APIs , Prompt Engineering , open-source LLMs , Working with API , JS , HTML ,CSS |
3. | Create a news aggregator for a targeted persona (PMs, AI Engg., ) | Web Parsing , Prompt Engineering , LLMs |
4. | Multi-modal generation via Discord. | All of above and Multi-modal generation |
Intermediate | ||
5. | Q&A bot for specific persona / department - Doctor / Lawyer / Engineer, Marketer, Sales etc. |
text splitting , chunking , indexing , working with vector databases , building RAG pipelines , deploying apps |
6. | StackOverflow-like search engine that provides response from multiple data sources - Advanced RAG pipeline to answer complex query using multiple data sources. |
All above and vector retrieval , summary retrieval , sub-question query engine , deploying apps |
7. | AI Interview Buddy - an AI agent that helps in preparing for specific roles in an interactive manner. | Building agents , agentic design , text-to-speech , speech-to-text , deployment |
Advanced | ||
8. | Personal assistant fine-tuned on domain-specific data. Personal Healthcare, Country’s Law, Finance, etc. | Dataset curation , Data engineering , model evaluation , benchmarking , PEFT , LoRA , QLoRA |
9. | An e2e e-commerce product assistant. | Advanced RAG pipeline , LLMOps - Data Engineering , CI/CD , Scaling , Monitoring , Observability |
10. | InvestGPT - fine-tuned LLM to classify personal investments based on technical (algorithmic) and fundamental (LLMs) analysis. | fine tuning , evals and benchmarking , AI Security |
You can extrapolate each of these ideas into full-fledged products / MVPs for different domains, industries and verticals.
If you liked this roadmap, support the work by starring the repo.
You can follow me on YouTube, Twitter, LinkedIn, Substack, GitHub.