/ai-engineer-roadmap

A roadmap describing the required skills, learning resources and sample tools to become an AI Engineer

Apache License 2.0Apache-2.0

AI Engineer Roadmap & Resources 🤖

Follow me on YouTube • X • LinkedIn • Subscribe to my Newsletter


The AI Engineering Roadmap categorizes the journey into 3 stages:

  1. Beginner - Build basic application to learn to consume LLM APIs, carefully engineering prompts for your applications and working with open source LLMs.

  2. Intermediate - Go deep into building more context aware advanced applications using Retreival Augmented Generation (RAG). Learn about vector databases and how to work with one. Learn to build agents with LLMs and tools.

  3. Advanced - After mastering application building, learn to deploy, optimise and manage LLM-powered apps in production with LLMOps. Learn to fine-tune pre-trained models to adapt to downstream applications efficiently and at low cost.

I'd suggest you go deep in each stage, build projects, POCs or better yet, functional products and then move to the next stage.

AI Engineer Roadmap Cover

Learning Resources & References

The following table enlists learning resources and references that I found helpful and plan to use myself to build.

Follow me as I execute this roadmap end-to-end on YouTube:

https://www.youtube.com/playlist?list=PLIkXejH7XPT8x9iUGvlsYt44aPx6ns8BV

Skills Resources
Beginner
Working with LLMs and APIs - Commonly used LLM APIs: OpenAI, Anthropic(Claude), Hugging Face
Prompt Engineering - DeepLearning.AI Course on ChatGPT Prompt Engineering for Developers
- Prompt Engineering Guide: A detailed resource encapsulating the latest papers, advanced prompting techniques, learning guides, model-specific prompting guides, lectures, references, new LLM capabilities, and tools related to prompt engineering.
- Awesome ChatGPT Prompts: A compilation of great prompts to be used with ChatGPT models.
Running and working with Open Source LLMs - Deeplearning.AI course on Open Source Models with Hugging Face
- Open Source LLMs can be accessed via Hugging Face Hub and you can play with a few of them in Hugging Face Spaces
- OpenRouter Docs
- Run LLMs on your local machine using LM Studio
Chain of Operations - LangChain - Quickstart guide on how to build an application with LangChain.
- Deeplearning.AI course on LangChain for LLM Application Development
Code / Image / Audio Generation - This is covered in parts in the Open Source Models with HF course on DeepLearning.AI
- Building Generative AI Applications with Gradio
- Code Gen: Check out these resources on code generation - gpt-engineer, Tabby, gpt-migrate to migrate your codebase from one framework to another or one language to another.
- Audio Gen: text to speech by openAI, resemble.ai, elevenlabs API
- Image Gen: Image generation by Open AI, creating images using Stable Diffusion API
Intermediate
Working with Vector Databases - Text chunking and splitting by LangChain
- Course on vector databases: Learn what are embeddings and how to store them. Build applications.
- Another course on building applications with vector databases using Pinecone
- Learn to compute sentence, text, and image embeddings using Framework like SentenceTransformers.
- Check out top embedding models here.
Building RAG Applications - RAG applications are all about building connections between tools, databases, context lengths, embeddings, memories, etc. You need frameworks like LangChainLlamaIndexFastRAG to build these.
- Step-by-step tutorial to build a Q&A RAG pipeline by LangChain
- LangChain’s RAG from Scratch playlist on YouTube is pretty detailed and amazing.
Advanced RAGs - Check out Jerrry Liu’s course on Building and evaluating Advanced RAG Application on DeepLearning.AI for best practices and improving your RAG pipeline’s performance.
- Cheat Sheet and some recipes for building Advanced RAG
- A comprehensive guide on building RAG-based LLM application by AnyScale
Evaluating RAGs - Hugging Face Cookbook on How to evaluate RAG system
- Evaluating all components of your RAG pipeline
- RAGAS framework to evaluate RAG pipelines.
Building Agents - Quickstart guide by LangChain to build agents to have a sequence of actions taken to do a job or multiple jobs.
- Course on Functions Tools and Agents with LangChain by Harrison Chase on DeepLearning.AI
- Autogen: Framework to develop LLM applications using multiple agents that can converse with each other to solve tasks.
- Crew AI: AI Agents for real use cases.
Deploying Apps - Local deployment: Running open source LLMs on local machines (LM StudioOllamaoobaboogakobold.cpp, etc.)
- Building POCs and demo applications using frameworks like Gradio and Streamlit
- Deploying LLMs at scale on cloud technologies like vLLM and SkyPilot.
- Deploying LangChain applications (runnables and chains) as a REST API.
Advanced
Fine-tuning for specific use cases - DeepLearning.AI course on finetuning LLMs
- A Beginner’s Guide to LLM Fine-Tuning is a detailed guide on finetuning LLMs
- A very detailed and simplified read on how to fine-tune LLMs with Hugging Face by Philipp Schmid
- 4-part blog series by Anyscale is a comprehensive guide on fine tuning and serving LLMs.
- Fine-Tune Your Own Llama 2 Model in a Colab Notebook
- LLM Datasets for finetuning
LLMOps - Deeplearning.AI Course on LLMOPs is a good starting place for advanced practitioners.
- GPU Inference optimization techniques like FlashAttention and FlashAttention-2
- LLMOps guide by DataBricks
- Efficiently serving LLMs course on DeepLearning.AI.
Multi-modal applications - Building hybrid search apps with vector databases like Pinecone
- Cookbook for multimodal RAG pipelines.
Model Evals and benchmarking - How to evaluate LLMs with Hugging Face Lighteval
- Course on Automated Testing for LLMOps: Learn to test and evaluate LLM application using an evaluation LLM.
AI Security - Red Teaming LLM Applications - learn to identify and evaluate vulnerabilities in LLM apps.
- Planning red teaming for large language models (LLMs) and their applications
- A detailed list of resources on LLM security highlighting all potential risks and vulnerabilities in AI applications.

Your trajectory

Image Trajectory

Functional Product / Project Ideas to build the above skills

Here's the content of your HTML table converted into a well-formatted markdown table:

S.No. Project Idea Skills required / to be developed
Beginner
1. Building a chatGPT / Midjourney -like bot for a niche persona / use-case and integrate it with WhatsApp or Slack or Discord or build an app using Streamlit / Gradio. LLMs, Prompt Engineering, open-source LLMs, Working with APIs, Python
2. Chrome extension to summarise / ideate / extract takeaways / research with web pages. LLM APIs, Prompt Engineering, open-source LLMs, Working with API, JS, HTML,CSS
3. Create a news aggregator for a targeted persona (PMs, AI Engg., ) Web Parsing, Prompt Engineering, LLMs
4. Multi-modal generation via Discord. All of above and Multi-modal generation
Intermediate
5. Q&A bot for specific persona / department - Doctor / Lawyer / Engineer, Marketer, Sales etc. text splitting, chunking, indexing, working with vector databases, building RAG pipelines, deploying apps
6. StackOverflow-like search engine that provides response from multiple data sources - Advanced RAG pipeline to answer complex query using multiple data sources. All above and vector retrieval, summary retrieval, sub-question query engine, deploying apps
7. AI Interview Buddy - an AI agent that helps in preparing for specific roles in an interactive manner. Building agents, agentic design, text-to-speech, speech-to-text, deployment
Advanced
8. Personal assistant fine-tuned on domain-specific data. Personal Healthcare, Country’s Law, Finance, etc. Dataset curation, Data engineering, model evaluation, benchmarking, PEFT, LoRA, QLoRA
9. An e2e e-commerce product assistant. Advanced RAG pipeline, LLMOps - Data Engineering, CI/CD, Scaling, Monitoring, Observability
10. InvestGPT - fine-tuned LLM to classify personal investments based on technical (algorithmic) and fundamental (LLMs) analysis. fine tuning, evals and benchmarking, AI Security

You can extrapolate each of these ideas into full-fledged products / MVPs for different domains, industries and verticals.

If you liked this roadmap, support the work by starring the repo.

You can follow me on YouTube, Twitter, LinkedIn, Substack, GitHub.