(Go: >> BACK << -|- >> HOME <<)

SlideShare a Scribd company logo
Building LLM
Applications
A 50-hour roadmap
A comprehensive roadmap to building large language model
applications in ~50 hours
Types of generative AI models
Text-based models
Image-based models
Foundation models & LLMs
Encoder decoder
Attention mechanism
Transformers model and BERT model
Intro to Image Generation
Image captioning models
Diffusion models
Generative AI applications
ChatGPT & Bard
DALL-E & Midjourney
Quick overview of generative AI, LLMs, and foundation models. Learn more about how
transformers and attention mechanism works behind the text and image-based models:
Introduction to Generative AI
Roadmap to become an LLM applications developer
Data Science for Everyone
https://datasciencedojo.com
Large language models and foundation models
Vector databases, embeddings, and LLM cache
Prompts and prompt engineering
Context window and token limits
Embeddings and vector databases
Build custom LLM applications by:
Training a new model from scratch
Fine-tuning foundation LLMs
In-context learning
Canonical architecture for an end-to-end LLM application
Understand the common use cases of large language models and the fundamental building
blocks of such applications. Learners will be introduced to the following topics at a very high
level without going into the technical details:
Emerging Architectures
Roadmap to become an LLM applications developer
Data Science for Everyone
https://datasciencedojo.com
Review of classical techniques
Review of binary/one-hot, count-based, and TF-IDF techniques for vectorization
Capturing local context with n-grams and challenges
Semantic encoding techniques
Overview of Word2Vec and dense word embeddings
Application of Word2Vec in text analytics and NLP tasks
Hands-on exercise
Creating a TF-IDF and semantic embeddings on a document corpus
In this module, we will be reviewing how embeddings have evolved from the simplest one-hot
encoding approach to more recent semantic embedding approaches. The module will go over the
following topics:
Embeddings
Roadmap to become an LLM applications developer
Data Science for Everyone
https://datasciencedojo.com
Text embeddings
Word and sentence embeddings
Multilingual sentence embeddings
Text similarity measures
Dot product, cosine similarity, inner product
Hands-on exercise
Calculating similarity between sentences using cosine similarity and dot product
Attention mechanism and transformer models
Neural machine translation (NMT) and sequence-to-sequence models
Attention mechanism components
Self-attention and multi-head attention
Transformer networks: Tokenization, embedding, positional encoding, and transformers block
Hands-on exercise
Understanding attention mechanisms: Self-attention for contextual word analysis
Dive into the world of large language models, discovering the potent mix of text embeddings, attention mechanisms, and the
game-changing transformer model architecture. This module consists of:
Roadmap to become an LLM applications developer
Attention Mechanism and Transformers
Data Science for Everyone
https://datasciencedojo.com
Overview
The rationale for vector databases
Importance of vector databases in LLMs
Popular vector databases
Indexing techniques
Product quantization (PQ), Locality sensitive hashing (LSH), and Hierarchical
navigable small world (HNSW)
Retrieval techniques
Cosine similarity
Nearest neighbor search
Hands-on exercise
Creating a vector store using HNSW
Creating, storing, and retrieving embeddings using cosine similarity and nearest
neighbors
Learn about efficient vector storage and retrieval with vector databases, indexing techniques,
retrieval methods, and hands-on exercises:
Vector Databases
Roadmap to become an LLM applications developer
Data Science for Everyone
https://datasciencedojo.com
Understanding and implementing semantic search
Introduction and importance of semantic search
Distinguishing semantic search from the lexical search
Semantic search using text embeddings
Exploring advanced concepts and techniques in semantic search
Multilingual search
Limitations of embeddings and similarity in semantic search
Improving semantic search beyond embeddings and similarity
Hands-on exercise
Building a simple semantic search engine with multilingual capability
Understand how semantic search overcomes the fundamental limitation in lexical search i.e. lack
of semantics. Learn how to use embeddings and similarity in order to build a semantic search
model:
Semantic Search
Data Science for Everyone
https://datasciencedojo.com
Roadmap to become an LLM applications developer
Prompt design and engineering
Prompting by instruction
Prompting by example
Controlling the model output
When to stop
Being creative vs. predictable
Saving and sharing your prompts
Use case Ideation
Utilizing goal, task, and domain for perfect prompt
Example use cases
Summarizing (summarizing a technical report)
Inferring (sentiment classification, topic extraction)
Transforming text (translation, spelling, and grammar correction)
Expanding (automatically writing emails)
Unleash your creativity and efficiency with prompt engineering. Seamlessly prompt models, control
outputs, and generate captivating content across various domains and tasks. This module includes:
Prompt Engineering
Roadmap to become an LLM applications developer
Data Science for Everyone
https://datasciencedojo.com
Fine-tuning foundation LLMs
Rationale for fine-tuning
Limitations of fine-tuning
Parameter efficient fine-tuning
Hands-on exercise
Fine-tuning and deploying the OpenAI GPT model on Azure
Discover the ins and outs of fine-tuning foundation language models (LLMs) through theory
discussions, exploring rationale, limitations, and parameter efficient fine-tuning (PEFT):
Fine-Tuning Foundation Models
Roadmap to become an LLM applications developer
Data Science for Everyone
https://datasciencedojo.com
Why are Orchestration Frameworks (OF) needed?
Eliminate the need for foundation model retraining
Overcoming token limits
Connecters for data sources
Explore the necessity of orchestration frameworks, tackling issues like foundation model retraining,
token limits, data source connectivity, and boilerplate code. Discover popular frameworks, their
creators, and open-source availability:
Orchestration Frameworks
Roadmap to become an LLM applications developer
Data Science for Everyone
https://datasciencedojo.com
Introduction toLangChain
Schema, models, and prompts
Memory and chains
Loading, transforming, indexing, and retrieving data
Document loader
Text splitters
Retrievers
LangChain use cases
Summarization: Summarizing long documents
QnA using documents as context
Extraction: Getting structured data from unstructured text
Evaluation: Evaluating outputs generated from LLM models
Querying tabular data without using any extra code
Hands-on exercise
Using LangChain loader, splitter, and retrievals on a pdf document
Build LLM apps using LangChain. Learn about LangChain's key components such as models,
prompts, parsers, memory, chains, and QnA. Get hands-on evaluation experience:
LangChain
Roadmap to become an LLM applications developer
Data Science for Everyone
https://datasciencedojo.com
Agents and tools
Agent types
Conversational agents
OpenAI functions agents
ReAct agents
Plan and execute agents
Hands-on exercise: Create and execute some of the following agents
Excel agent
JSON agent
Python Pandas agent
Document comparison agent
Power BI agent
Use LLMs to make decisions about what to do next. Enable these decisions with tools. We’ll learn
what they are, how they work, and how to use them within the LangChain library to superpower our
LLMs. In this module, we’ll talk about:
Autonomous Agents
Roadmap to become an LLM applications developer
Data Science for Everyone
https://datasciencedojo.com
Ethics, bias, fairness
Sources of bias in acquisition/annotation of training data, model building
Precautions against safeguarding the model from bias
Review some of the regulations/legislation
Principles of responsible AI
Fairness and eliminating bias
Reliability and safety
Privacy and data protection
Transparency and explainability
Accountability and governance
Inclusivity and accessibility
Review some of the tools available to assess the following in a large language model
application
Correctness and security
Bias, fairness, and explainability of the model
Bias can creep in at any stage of the lifecycle of a model. While large language models offer
tremendous business value, humans are involved in all stages of the lifecycle of an LLM from
acquisition of data to interpretation of insights. In this module, we will learn about the following:
Bias, Fairness and Explainablity
Roadmap to become an LLM applications developer
Data Science for Everyone
https://datasciencedojo.com
Virtual assistant: A dynamic customer service agent designed for the car manufacturing industry.
Content generation (Marketing co-pilot): Enhancing your marketing strategies with an intelligent
co-pilot.
Conversational agent (Legal and compliance assistant): Assisting with legal and compliance
matters through interactive conversations.
QnA (IRS tax bot): An intelligent bot designed to answer your questions about IRS tax-related
topics.
Content personalizer: Tailoring content specifically to your preferences and needs.
YouTube virtual assistant: Engage in interactive conversations with your favorite YouTube
channels and playlists.
Recommended Projects
Roadmap to become an LLM applications developer
Data Science for Everyone
https://datasciencedojo.com
Learn to Build
LLM Applications
New York
December 4-8, 2023
Roadmap to become an LLM applications developer
Data Science for Everyone
https://datasciencedojo.com
Join this 5-day | 40-hour bootcamp to get started with building
large language model applications on your enterprise data
Seattle
September 18-22, 2023
Washington, D.C.
October 16-20, 2023
Austin
November 6-10, 2023
Singapore
January, 2024

More Related Content

What's hot

An Introduction to Generative AI
An Introduction  to Generative AIAn Introduction  to Generative AI
An Introduction to Generative AI
Cori Faklaris
 
Generative-AI-in-enterprise-20230615.pdf
Generative-AI-in-enterprise-20230615.pdfGenerative-AI-in-enterprise-20230615.pdf
Generative-AI-in-enterprise-20230615.pdf
Liming Zhu
 
Exploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdfExploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdf
Dung Hoang
 
Transformers, LLMs, and the Possibility of AGI
Transformers, LLMs, and the Possibility of AGITransformers, LLMs, and the Possibility of AGI
Transformers, LLMs, and the Possibility of AGI
SynaptonIncorporated
 
Using the power of Generative AI at scale
Using the power of Generative AI at scaleUsing the power of Generative AI at scale
Using the power of Generative AI at scale
Maxim Salnikov
 
Generative AI
Generative AIGenerative AI
Generative AI
Carlos J. Costa
 
A Comprehensive Review of Large Language Models for.pptx
A Comprehensive Review of Large Language Models for.pptxA Comprehensive Review of Large Language Models for.pptx
A Comprehensive Review of Large Language Models for.pptx
SaiPragnaKancheti
 
Generative AI, WiDS 2023.pptx
Generative AI, WiDS 2023.pptxGenerative AI, WiDS 2023.pptx
Generative AI, WiDS 2023.pptx
Colleen Farrelly
 
The current state of generative AI
The current state of generative AIThe current state of generative AI
The current state of generative AI
Benjaminlapid1
 
Journey of Generative AI
Journey of Generative AIJourney of Generative AI
Journey of Generative AI
thomasjvarghese49
 
Introduction to LLMs
Introduction to LLMsIntroduction to LLMs
Introduction to LLMs
Loic Merckel
 
Leveraging Generative AI & Best practices
Leveraging Generative AI & Best practicesLeveraging Generative AI & Best practices
Leveraging Generative AI & Best practices
DianaGray10
 
Build an LLM-powered application using LangChain.pdf
Build an LLM-powered application using LangChain.pdfBuild an LLM-powered application using LangChain.pdf
Build an LLM-powered application using LangChain.pdf
AnastasiaSteele10
 
LLMs Bootcamp
LLMs BootcampLLMs Bootcamp
LLMs Bootcamp
Fiza987241
 
LanGCHAIN Framework
LanGCHAIN FrameworkLanGCHAIN Framework
LanGCHAIN Framework
Keymate.AI
 
What Is GPT-3 And Why Is It Revolutionizing Artificial Intelligence?
What Is GPT-3 And Why Is It Revolutionizing Artificial Intelligence?What Is GPT-3 And Why Is It Revolutionizing Artificial Intelligence?
What Is GPT-3 And Why Is It Revolutionizing Artificial Intelligence?
Bernard Marr
 
Unlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdfUnlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdf
PremNaraindas1
 
Customizing LLMs
Customizing LLMsCustomizing LLMs
Customizing LLMs
Jim Steele
 
And then there were ... Large Language Models
And then there were ... Large Language ModelsAnd then there were ... Large Language Models
And then there were ... Large Language Models
Leon Dohmen
 
ChatGPT, Foundation Models and Web3.pptx
ChatGPT, Foundation Models and Web3.pptxChatGPT, Foundation Models and Web3.pptx
ChatGPT, Foundation Models and Web3.pptx
Jesus Rodriguez
 

What's hot (20)

An Introduction to Generative AI
An Introduction  to Generative AIAn Introduction  to Generative AI
An Introduction to Generative AI
 
Generative-AI-in-enterprise-20230615.pdf
Generative-AI-in-enterprise-20230615.pdfGenerative-AI-in-enterprise-20230615.pdf
Generative-AI-in-enterprise-20230615.pdf
 
Exploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdfExploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdf
 
Transformers, LLMs, and the Possibility of AGI
Transformers, LLMs, and the Possibility of AGITransformers, LLMs, and the Possibility of AGI
Transformers, LLMs, and the Possibility of AGI
 
Using the power of Generative AI at scale
Using the power of Generative AI at scaleUsing the power of Generative AI at scale
Using the power of Generative AI at scale
 
Generative AI
Generative AIGenerative AI
Generative AI
 
A Comprehensive Review of Large Language Models for.pptx
A Comprehensive Review of Large Language Models for.pptxA Comprehensive Review of Large Language Models for.pptx
A Comprehensive Review of Large Language Models for.pptx
 
Generative AI, WiDS 2023.pptx
Generative AI, WiDS 2023.pptxGenerative AI, WiDS 2023.pptx
Generative AI, WiDS 2023.pptx
 
The current state of generative AI
The current state of generative AIThe current state of generative AI
The current state of generative AI
 
Journey of Generative AI
Journey of Generative AIJourney of Generative AI
Journey of Generative AI
 
Introduction to LLMs
Introduction to LLMsIntroduction to LLMs
Introduction to LLMs
 
Leveraging Generative AI & Best practices
Leveraging Generative AI & Best practicesLeveraging Generative AI & Best practices
Leveraging Generative AI & Best practices
 
Build an LLM-powered application using LangChain.pdf
Build an LLM-powered application using LangChain.pdfBuild an LLM-powered application using LangChain.pdf
Build an LLM-powered application using LangChain.pdf
 
LLMs Bootcamp
LLMs BootcampLLMs Bootcamp
LLMs Bootcamp
 
LanGCHAIN Framework
LanGCHAIN FrameworkLanGCHAIN Framework
LanGCHAIN Framework
 
What Is GPT-3 And Why Is It Revolutionizing Artificial Intelligence?
What Is GPT-3 And Why Is It Revolutionizing Artificial Intelligence?What Is GPT-3 And Why Is It Revolutionizing Artificial Intelligence?
What Is GPT-3 And Why Is It Revolutionizing Artificial Intelligence?
 
Unlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdfUnlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdf
 
Customizing LLMs
Customizing LLMsCustomizing LLMs
Customizing LLMs
 
And then there were ... Large Language Models
And then there were ... Large Language ModelsAnd then there were ... Large Language Models
And then there were ... Large Language Models
 
ChatGPT, Foundation Models and Web3.pptx
ChatGPT, Foundation Models and Web3.pptxChatGPT, Foundation Models and Web3.pptx
ChatGPT, Foundation Models and Web3.pptx
 

Similar to Large Language Models Bootcamp

Introduction to MDE
Introduction to MDEIntroduction to MDE
Introduction to MDE
Fáber D. Giraldo
 
Build an LLM-powered application using LangChain.pdf
Build an LLM-powered application using LangChain.pdfBuild an LLM-powered application using LangChain.pdf
Build an LLM-powered application using LangChain.pdf
StephenAmell4
 
TechDayPakistan-Slides RAG with Cosmos DB.pptx
TechDayPakistan-Slides RAG with Cosmos DB.pptxTechDayPakistan-Slides RAG with Cosmos DB.pptx
TechDayPakistan-Slides RAG with Cosmos DB.pptx
Usama Wahab Khan Cloud, Data and AI
 
Sioux Hot-or-Not: Model Driven Software Development (Markus Voelter)
Sioux Hot-or-Not: Model Driven Software Development (Markus Voelter)Sioux Hot-or-Not: Model Driven Software Development (Markus Voelter)
Sioux Hot-or-Not: Model Driven Software Development (Markus Voelter)
siouxhotornot
 
ROS 2 AI Integration Working Group 1: ALMA, SustainML & ROS 2 use case
ROS 2 AI Integration Working Group 1: ALMA, SustainML & ROS 2 use case ROS 2 AI Integration Working Group 1: ALMA, SustainML & ROS 2 use case
ROS 2 AI Integration Working Group 1: ALMA, SustainML & ROS 2 use case
eProsima
 
MOND Semantics Integration
MOND Semantics IntegrationMOND Semantics Integration
MOND Semantics Integration
Sales Emea
 
CASE tools and their effects on software quality
CASE tools and their effects on software qualityCASE tools and their effects on software quality
CASE tools and their effects on software quality
Utkarsh Agarwal
 
Newsletter Infographics (8).pdf
Newsletter Infographics (8).pdfNewsletter Infographics (8).pdf
Newsletter Infographics (8).pdf
Fiza987241
 
Final Jspring2009 Mda Slimmer Ontwikkelen Van Java Ee Applicaties
Final Jspring2009 Mda Slimmer Ontwikkelen Van Java Ee ApplicatiesFinal Jspring2009 Mda Slimmer Ontwikkelen Van Java Ee Applicaties
Final Jspring2009 Mda Slimmer Ontwikkelen Van Java Ee Applicaties
Ministry of Foreign Affairs, Netherlands
 
Bootcamp_AIAppsUCSD.pptx
Bootcamp_AIAppsUCSD.pptxBootcamp_AIAppsUCSD.pptx
Bootcamp_AIAppsUCSD.pptx
Object Automation
 
Bootcamp_AIApps.pdf
Bootcamp_AIApps.pdfBootcamp_AIApps.pdf
Bootcamp_AIApps.pdf
Object Automation
 
Bootcamp_AIApps.pdf
Bootcamp_AIApps.pdfBootcamp_AIApps.pdf
Bootcamp_AIApps.pdf
Object Automation
 
Integrating Machine Learning Capabilities into your team
Integrating Machine Learning Capabilities into your teamIntegrating Machine Learning Capabilities into your team
Integrating Machine Learning Capabilities into your team
Cameron Vetter
 
[2015/2016] Software systems engineering PRINCIPLES
[2015/2016] Software systems engineering PRINCIPLES[2015/2016] Software systems engineering PRINCIPLES
[2015/2016] Software systems engineering PRINCIPLES
Ivano Malavolta
 
Smart modeling of smart software
Smart modeling of smart softwareSmart modeling of smart software
Smart modeling of smart software
Jordi Cabot
 
Full-Stack Development or Data Science, Which is the more advantageous Career...
Full-Stack Development or Data Science, Which is the more advantageous Career...Full-Stack Development or Data Science, Which is the more advantageous Career...
Full-Stack Development or Data Science, Which is the more advantageous Career...
Uncodemy
 
Build an LLM-powered application using LangChain.pdf
Build an LLM-powered application using LangChain.pdfBuild an LLM-powered application using LangChain.pdf
Build an LLM-powered application using LangChain.pdf
MatthewHaws4
 
Generative AI in CSharp with Semantic Kernel.pptx
Generative AI in CSharp with Semantic Kernel.pptxGenerative AI in CSharp with Semantic Kernel.pptx
Generative AI in CSharp with Semantic Kernel.pptx
Alon Fliess
 
MLOps Virtual Event | Building Machine Learning Platforms for the Full Lifecycle
MLOps Virtual Event | Building Machine Learning Platforms for the Full LifecycleMLOps Virtual Event | Building Machine Learning Platforms for the Full Lifecycle
MLOps Virtual Event | Building Machine Learning Platforms for the Full Lifecycle
Databricks
 
Doors Analyst
Doors AnalystDoors Analyst
Doors Analyst
Bill Duncan
 

Similar to Large Language Models Bootcamp (20)

Introduction to MDE
Introduction to MDEIntroduction to MDE
Introduction to MDE
 
Build an LLM-powered application using LangChain.pdf
Build an LLM-powered application using LangChain.pdfBuild an LLM-powered application using LangChain.pdf
Build an LLM-powered application using LangChain.pdf
 
TechDayPakistan-Slides RAG with Cosmos DB.pptx
TechDayPakistan-Slides RAG with Cosmos DB.pptxTechDayPakistan-Slides RAG with Cosmos DB.pptx
TechDayPakistan-Slides RAG with Cosmos DB.pptx
 
Sioux Hot-or-Not: Model Driven Software Development (Markus Voelter)
Sioux Hot-or-Not: Model Driven Software Development (Markus Voelter)Sioux Hot-or-Not: Model Driven Software Development (Markus Voelter)
Sioux Hot-or-Not: Model Driven Software Development (Markus Voelter)
 
ROS 2 AI Integration Working Group 1: ALMA, SustainML & ROS 2 use case
ROS 2 AI Integration Working Group 1: ALMA, SustainML & ROS 2 use case ROS 2 AI Integration Working Group 1: ALMA, SustainML & ROS 2 use case
ROS 2 AI Integration Working Group 1: ALMA, SustainML & ROS 2 use case
 
MOND Semantics Integration
MOND Semantics IntegrationMOND Semantics Integration
MOND Semantics Integration
 
CASE tools and their effects on software quality
CASE tools and their effects on software qualityCASE tools and their effects on software quality
CASE tools and their effects on software quality
 
Newsletter Infographics (8).pdf
Newsletter Infographics (8).pdfNewsletter Infographics (8).pdf
Newsletter Infographics (8).pdf
 
Final Jspring2009 Mda Slimmer Ontwikkelen Van Java Ee Applicaties
Final Jspring2009 Mda Slimmer Ontwikkelen Van Java Ee ApplicatiesFinal Jspring2009 Mda Slimmer Ontwikkelen Van Java Ee Applicaties
Final Jspring2009 Mda Slimmer Ontwikkelen Van Java Ee Applicaties
 
Bootcamp_AIAppsUCSD.pptx
Bootcamp_AIAppsUCSD.pptxBootcamp_AIAppsUCSD.pptx
Bootcamp_AIAppsUCSD.pptx
 
Bootcamp_AIApps.pdf
Bootcamp_AIApps.pdfBootcamp_AIApps.pdf
Bootcamp_AIApps.pdf
 
Bootcamp_AIApps.pdf
Bootcamp_AIApps.pdfBootcamp_AIApps.pdf
Bootcamp_AIApps.pdf
 
Integrating Machine Learning Capabilities into your team
Integrating Machine Learning Capabilities into your teamIntegrating Machine Learning Capabilities into your team
Integrating Machine Learning Capabilities into your team
 
[2015/2016] Software systems engineering PRINCIPLES
[2015/2016] Software systems engineering PRINCIPLES[2015/2016] Software systems engineering PRINCIPLES
[2015/2016] Software systems engineering PRINCIPLES
 
Smart modeling of smart software
Smart modeling of smart softwareSmart modeling of smart software
Smart modeling of smart software
 
Full-Stack Development or Data Science, Which is the more advantageous Career...
Full-Stack Development or Data Science, Which is the more advantageous Career...Full-Stack Development or Data Science, Which is the more advantageous Career...
Full-Stack Development or Data Science, Which is the more advantageous Career...
 
Build an LLM-powered application using LangChain.pdf
Build an LLM-powered application using LangChain.pdfBuild an LLM-powered application using LangChain.pdf
Build an LLM-powered application using LangChain.pdf
 
Generative AI in CSharp with Semantic Kernel.pptx
Generative AI in CSharp with Semantic Kernel.pptxGenerative AI in CSharp with Semantic Kernel.pptx
Generative AI in CSharp with Semantic Kernel.pptx
 
MLOps Virtual Event | Building Machine Learning Platforms for the Full Lifecycle
MLOps Virtual Event | Building Machine Learning Platforms for the Full LifecycleMLOps Virtual Event | Building Machine Learning Platforms for the Full Lifecycle
MLOps Virtual Event | Building Machine Learning Platforms for the Full Lifecycle
 
Doors Analyst
Doors AnalystDoors Analyst
Doors Analyst
 

Recently uploaded

01 - Motagua 3.0 - 16x9 - Light - [MAIN].pptx
01 - Motagua 3.0 - 16x9 - Light - [MAIN].pptx01 - Motagua 3.0 - 16x9 - Light - [MAIN].pptx
01 - Motagua 3.0 - 16x9 - Light - [MAIN].pptx
CindyBanurea3
 
Simon Fraser University degree offer diploma Transcript
Simon Fraser University  degree offer diploma TranscriptSimon Fraser University  degree offer diploma Transcript
Simon Fraser University degree offer diploma Transcript
taqyea
 
bcme welcome and ground rule required for bcme course (1).pptx
bcme welcome and ground rule required for bcme course (1).pptxbcme welcome and ground rule required for bcme course (1).pptx
bcme welcome and ground rule required for bcme course (1).pptx
BINITADASH3
 
[D3T1S03] Amazon DynamoDB design puzzlers
[D3T1S03] Amazon DynamoDB design puzzlers[D3T1S03] Amazon DynamoDB design puzzlers
[D3T1S03] Amazon DynamoDB design puzzlers
Amazon Web Services Korea
 
@Call @Girls Bandra phone 9920874524 You Are Serach A Beautyfull Dolle come here
@Call @Girls Bandra phone 9920874524 You Are Serach A Beautyfull Dolle come here@Call @Girls Bandra phone 9920874524 You Are Serach A Beautyfull Dolle come here
@Call @Girls Bandra phone 9920874524 You Are Serach A Beautyfull Dolle come here
SARITA PANDEY
 
buku report tentang analisis TIMSS 2023.pdf
buku report tentang analisis TIMSS 2023.pdfbuku report tentang analisis TIMSS 2023.pdf
buku report tentang analisis TIMSS 2023.pdf
ABDULKALAM847167
 
Daryaganj @ℂall @Girls ꧁❤ 9873777170 ❤꧂VIP Yogita Mehra Top Model Safe
Daryaganj @ℂall @Girls ꧁❤ 9873777170 ❤꧂VIP Yogita Mehra Top Model SafeDaryaganj @ℂall @Girls ꧁❤ 9873777170 ❤꧂VIP Yogita Mehra Top Model Safe
Daryaganj @ℂall @Girls ꧁❤ 9873777170 ❤꧂VIP Yogita Mehra Top Model Safe
nehadubay1
 
Streamlining Legacy Complexity Through Modernization
Streamlining Legacy Complexity Through ModernizationStreamlining Legacy Complexity Through Modernization
Streamlining Legacy Complexity Through Modernization
sanjay singh
 
Karol Bagh @ℂall @Girls ꧁❤ 9873777170 ❤꧂VIP Jya Khan Top Model Safe
Karol Bagh @ℂall @Girls ꧁❤ 9873777170 ❤꧂VIP Jya Khan Top Model SafeKarol Bagh @ℂall @Girls ꧁❤ 9873777170 ❤꧂VIP Jya Khan Top Model Safe
Karol Bagh @ℂall @Girls ꧁❤ 9873777170 ❤꧂VIP Jya Khan Top Model Safe
bookmybebe1
 
11th-CS system overview ppt chapter-01.pdf
11th-CS system overview ppt chapter-01.pdf11th-CS system overview ppt chapter-01.pdf
11th-CS system overview ppt chapter-01.pdf
ravimeera74
 
*Call *Girls in Hyderabad 🤣 8826483818 🤣 Pooja Sharma Best High Class Hyderab...
*Call *Girls in Hyderabad 🤣 8826483818 🤣 Pooja Sharma Best High Class Hyderab...*Call *Girls in Hyderabad 🤣 8826483818 🤣 Pooja Sharma Best High Class Hyderab...
*Call *Girls in Hyderabad 🤣 8826483818 🤣 Pooja Sharma Best High Class Hyderab...
roobykhan02154
 
❻❸❼⓿❽❻❷⓿⓿❼ SATTA MATKA DPBOSS KALYAN FAST RESULTS CHART KALYAN MATKA MATKA RE...
❻❸❼⓿❽❻❷⓿⓿❼ SATTA MATKA DPBOSS KALYAN FAST RESULTS CHART KALYAN MATKA MATKA RE...❻❸❼⓿❽❻❷⓿⓿❼ SATTA MATKA DPBOSS KALYAN FAST RESULTS CHART KALYAN MATKA MATKA RE...
❻❸❼⓿❽❻❷⓿⓿❼ SATTA MATKA DPBOSS KALYAN FAST RESULTS CHART KALYAN MATKA MATKA RE...
#kalyanmatkaresult #dpboss #kalyanmatka #satta #matka #sattamatka
 
@Call @Girls Kolkata 0000000000 Shivani Beautiful Girl any Time
@Call @Girls Kolkata 0000000000 Shivani Beautiful Girl any Time@Call @Girls Kolkata 0000000000 Shivani Beautiful Girl any Time
@Call @Girls Kolkata 0000000000 Shivani Beautiful Girl any Time
manjukaushik328
 
[D2T2S04] SageMaker를 활용한 Generative AI Foundation Model Training and Tuning
[D2T2S04] SageMaker를 활용한 Generative AI Foundation Model Training and Tuning[D2T2S04] SageMaker를 활용한 Generative AI Foundation Model Training and Tuning
[D2T2S04] SageMaker를 활용한 Generative AI Foundation Model Training and Tuning
Donghwan Lee
 
[D3T2S03] Data&AI Roadshow 2024 - Amazon DocumentDB 실습
[D3T2S03] Data&AI Roadshow 2024 - Amazon DocumentDB 실습[D3T2S03] Data&AI Roadshow 2024 - Amazon DocumentDB 실습
[D3T2S03] Data&AI Roadshow 2024 - Amazon DocumentDB 실습
Amazon Web Services Korea
 
SAP ANalytics Cloud -SAP SAC planning 22
SAP ANalytics Cloud -SAP SAC planning 22SAP ANalytics Cloud -SAP SAC planning 22
SAP ANalytics Cloud -SAP SAC planning 22
ramana4bw
 
❻❸❼⓿❽❻❷⓿⓿❼ SATTA MATKA DPBOSS KALYAN FAST RESULTS CHART KALYAN MATKA MATKA RE...
❻❸❼⓿❽❻❷⓿⓿❼ SATTA MATKA DPBOSS KALYAN FAST RESULTS CHART KALYAN MATKA MATKA RE...❻❸❼⓿❽❻❷⓿⓿❼ SATTA MATKA DPBOSS KALYAN FAST RESULTS CHART KALYAN MATKA MATKA RE...
❻❸❼⓿❽❻❷⓿⓿❼ SATTA MATKA DPBOSS KALYAN FAST RESULTS CHART KALYAN MATKA MATKA RE...
#kalyanmatkaresult #dpboss #kalyanmatka #satta #matka #sattamatka
 
2024 June - Orange County (CA) Tableau User Group Meeting
2024 June - Orange County (CA) Tableau User Group Meeting2024 June - Orange County (CA) Tableau User Group Meeting
2024 June - Orange County (CA) Tableau User Group Meeting
Alison Pitt
 
How We Added Replication to QuestDB - JonTheBeach
How We Added Replication to QuestDB - JonTheBeachHow We Added Replication to QuestDB - JonTheBeach
How We Added Replication to QuestDB - JonTheBeach
javier ramirez
 
Delhi @ℂall @Girls ꧁❤ 9711199012 ❤꧂Glamorous sonam Mehra Top Model Safe
Delhi @ℂall @Girls ꧁❤ 9711199012 ❤꧂Glamorous sonam Mehra Top Model SafeDelhi @ℂall @Girls ꧁❤ 9711199012 ❤꧂Glamorous sonam Mehra Top Model Safe
Delhi @ℂall @Girls ꧁❤ 9711199012 ❤꧂Glamorous sonam Mehra Top Model Safe
dipti singh$A17
 

Recently uploaded (20)

01 - Motagua 3.0 - 16x9 - Light - [MAIN].pptx
01 - Motagua 3.0 - 16x9 - Light - [MAIN].pptx01 - Motagua 3.0 - 16x9 - Light - [MAIN].pptx
01 - Motagua 3.0 - 16x9 - Light - [MAIN].pptx
 
Simon Fraser University degree offer diploma Transcript
Simon Fraser University  degree offer diploma TranscriptSimon Fraser University  degree offer diploma Transcript
Simon Fraser University degree offer diploma Transcript
 
bcme welcome and ground rule required for bcme course (1).pptx
bcme welcome and ground rule required for bcme course (1).pptxbcme welcome and ground rule required for bcme course (1).pptx
bcme welcome and ground rule required for bcme course (1).pptx
 
[D3T1S03] Amazon DynamoDB design puzzlers
[D3T1S03] Amazon DynamoDB design puzzlers[D3T1S03] Amazon DynamoDB design puzzlers
[D3T1S03] Amazon DynamoDB design puzzlers
 
@Call @Girls Bandra phone 9920874524 You Are Serach A Beautyfull Dolle come here
@Call @Girls Bandra phone 9920874524 You Are Serach A Beautyfull Dolle come here@Call @Girls Bandra phone 9920874524 You Are Serach A Beautyfull Dolle come here
@Call @Girls Bandra phone 9920874524 You Are Serach A Beautyfull Dolle come here
 
buku report tentang analisis TIMSS 2023.pdf
buku report tentang analisis TIMSS 2023.pdfbuku report tentang analisis TIMSS 2023.pdf
buku report tentang analisis TIMSS 2023.pdf
 
Daryaganj @ℂall @Girls ꧁❤ 9873777170 ❤꧂VIP Yogita Mehra Top Model Safe
Daryaganj @ℂall @Girls ꧁❤ 9873777170 ❤꧂VIP Yogita Mehra Top Model SafeDaryaganj @ℂall @Girls ꧁❤ 9873777170 ❤꧂VIP Yogita Mehra Top Model Safe
Daryaganj @ℂall @Girls ꧁❤ 9873777170 ❤꧂VIP Yogita Mehra Top Model Safe
 
Streamlining Legacy Complexity Through Modernization
Streamlining Legacy Complexity Through ModernizationStreamlining Legacy Complexity Through Modernization
Streamlining Legacy Complexity Through Modernization
 
Karol Bagh @ℂall @Girls ꧁❤ 9873777170 ❤꧂VIP Jya Khan Top Model Safe
Karol Bagh @ℂall @Girls ꧁❤ 9873777170 ❤꧂VIP Jya Khan Top Model SafeKarol Bagh @ℂall @Girls ꧁❤ 9873777170 ❤꧂VIP Jya Khan Top Model Safe
Karol Bagh @ℂall @Girls ꧁❤ 9873777170 ❤꧂VIP Jya Khan Top Model Safe
 
11th-CS system overview ppt chapter-01.pdf
11th-CS system overview ppt chapter-01.pdf11th-CS system overview ppt chapter-01.pdf
11th-CS system overview ppt chapter-01.pdf
 
*Call *Girls in Hyderabad 🤣 8826483818 🤣 Pooja Sharma Best High Class Hyderab...
*Call *Girls in Hyderabad 🤣 8826483818 🤣 Pooja Sharma Best High Class Hyderab...*Call *Girls in Hyderabad 🤣 8826483818 🤣 Pooja Sharma Best High Class Hyderab...
*Call *Girls in Hyderabad 🤣 8826483818 🤣 Pooja Sharma Best High Class Hyderab...
 
❻❸❼⓿❽❻❷⓿⓿❼ SATTA MATKA DPBOSS KALYAN FAST RESULTS CHART KALYAN MATKA MATKA RE...
❻❸❼⓿❽❻❷⓿⓿❼ SATTA MATKA DPBOSS KALYAN FAST RESULTS CHART KALYAN MATKA MATKA RE...❻❸❼⓿❽❻❷⓿⓿❼ SATTA MATKA DPBOSS KALYAN FAST RESULTS CHART KALYAN MATKA MATKA RE...
❻❸❼⓿❽❻❷⓿⓿❼ SATTA MATKA DPBOSS KALYAN FAST RESULTS CHART KALYAN MATKA MATKA RE...
 
@Call @Girls Kolkata 0000000000 Shivani Beautiful Girl any Time
@Call @Girls Kolkata 0000000000 Shivani Beautiful Girl any Time@Call @Girls Kolkata 0000000000 Shivani Beautiful Girl any Time
@Call @Girls Kolkata 0000000000 Shivani Beautiful Girl any Time
 
[D2T2S04] SageMaker를 활용한 Generative AI Foundation Model Training and Tuning
[D2T2S04] SageMaker를 활용한 Generative AI Foundation Model Training and Tuning[D2T2S04] SageMaker를 활용한 Generative AI Foundation Model Training and Tuning
[D2T2S04] SageMaker를 활용한 Generative AI Foundation Model Training and Tuning
 
[D3T2S03] Data&AI Roadshow 2024 - Amazon DocumentDB 실습
[D3T2S03] Data&AI Roadshow 2024 - Amazon DocumentDB 실습[D3T2S03] Data&AI Roadshow 2024 - Amazon DocumentDB 실습
[D3T2S03] Data&AI Roadshow 2024 - Amazon DocumentDB 실습
 
SAP ANalytics Cloud -SAP SAC planning 22
SAP ANalytics Cloud -SAP SAC planning 22SAP ANalytics Cloud -SAP SAC planning 22
SAP ANalytics Cloud -SAP SAC planning 22
 
❻❸❼⓿❽❻❷⓿⓿❼ SATTA MATKA DPBOSS KALYAN FAST RESULTS CHART KALYAN MATKA MATKA RE...
❻❸❼⓿❽❻❷⓿⓿❼ SATTA MATKA DPBOSS KALYAN FAST RESULTS CHART KALYAN MATKA MATKA RE...❻❸❼⓿❽❻❷⓿⓿❼ SATTA MATKA DPBOSS KALYAN FAST RESULTS CHART KALYAN MATKA MATKA RE...
❻❸❼⓿❽❻❷⓿⓿❼ SATTA MATKA DPBOSS KALYAN FAST RESULTS CHART KALYAN MATKA MATKA RE...
 
2024 June - Orange County (CA) Tableau User Group Meeting
2024 June - Orange County (CA) Tableau User Group Meeting2024 June - Orange County (CA) Tableau User Group Meeting
2024 June - Orange County (CA) Tableau User Group Meeting
 
How We Added Replication to QuestDB - JonTheBeach
How We Added Replication to QuestDB - JonTheBeachHow We Added Replication to QuestDB - JonTheBeach
How We Added Replication to QuestDB - JonTheBeach
 
Delhi @ℂall @Girls ꧁❤ 9711199012 ❤꧂Glamorous sonam Mehra Top Model Safe
Delhi @ℂall @Girls ꧁❤ 9711199012 ❤꧂Glamorous sonam Mehra Top Model SafeDelhi @ℂall @Girls ꧁❤ 9711199012 ❤꧂Glamorous sonam Mehra Top Model Safe
Delhi @ℂall @Girls ꧁❤ 9711199012 ❤꧂Glamorous sonam Mehra Top Model Safe
 

Large Language Models Bootcamp

  • 1. Building LLM Applications A 50-hour roadmap A comprehensive roadmap to building large language model applications in ~50 hours
  • 2. Types of generative AI models Text-based models Image-based models Foundation models & LLMs Encoder decoder Attention mechanism Transformers model and BERT model Intro to Image Generation Image captioning models Diffusion models Generative AI applications ChatGPT & Bard DALL-E & Midjourney Quick overview of generative AI, LLMs, and foundation models. Learn more about how transformers and attention mechanism works behind the text and image-based models: Introduction to Generative AI Roadmap to become an LLM applications developer Data Science for Everyone https://datasciencedojo.com
  • 3. Large language models and foundation models Vector databases, embeddings, and LLM cache Prompts and prompt engineering Context window and token limits Embeddings and vector databases Build custom LLM applications by: Training a new model from scratch Fine-tuning foundation LLMs In-context learning Canonical architecture for an end-to-end LLM application Understand the common use cases of large language models and the fundamental building blocks of such applications. Learners will be introduced to the following topics at a very high level without going into the technical details: Emerging Architectures Roadmap to become an LLM applications developer Data Science for Everyone https://datasciencedojo.com
  • 4. Review of classical techniques Review of binary/one-hot, count-based, and TF-IDF techniques for vectorization Capturing local context with n-grams and challenges Semantic encoding techniques Overview of Word2Vec and dense word embeddings Application of Word2Vec in text analytics and NLP tasks Hands-on exercise Creating a TF-IDF and semantic embeddings on a document corpus In this module, we will be reviewing how embeddings have evolved from the simplest one-hot encoding approach to more recent semantic embedding approaches. The module will go over the following topics: Embeddings Roadmap to become an LLM applications developer Data Science for Everyone https://datasciencedojo.com
  • 5. Text embeddings Word and sentence embeddings Multilingual sentence embeddings Text similarity measures Dot product, cosine similarity, inner product Hands-on exercise Calculating similarity between sentences using cosine similarity and dot product Attention mechanism and transformer models Neural machine translation (NMT) and sequence-to-sequence models Attention mechanism components Self-attention and multi-head attention Transformer networks: Tokenization, embedding, positional encoding, and transformers block Hands-on exercise Understanding attention mechanisms: Self-attention for contextual word analysis Dive into the world of large language models, discovering the potent mix of text embeddings, attention mechanisms, and the game-changing transformer model architecture. This module consists of: Roadmap to become an LLM applications developer Attention Mechanism and Transformers Data Science for Everyone https://datasciencedojo.com
  • 6. Overview The rationale for vector databases Importance of vector databases in LLMs Popular vector databases Indexing techniques Product quantization (PQ), Locality sensitive hashing (LSH), and Hierarchical navigable small world (HNSW) Retrieval techniques Cosine similarity Nearest neighbor search Hands-on exercise Creating a vector store using HNSW Creating, storing, and retrieving embeddings using cosine similarity and nearest neighbors Learn about efficient vector storage and retrieval with vector databases, indexing techniques, retrieval methods, and hands-on exercises: Vector Databases Roadmap to become an LLM applications developer Data Science for Everyone https://datasciencedojo.com
  • 7. Understanding and implementing semantic search Introduction and importance of semantic search Distinguishing semantic search from the lexical search Semantic search using text embeddings Exploring advanced concepts and techniques in semantic search Multilingual search Limitations of embeddings and similarity in semantic search Improving semantic search beyond embeddings and similarity Hands-on exercise Building a simple semantic search engine with multilingual capability Understand how semantic search overcomes the fundamental limitation in lexical search i.e. lack of semantics. Learn how to use embeddings and similarity in order to build a semantic search model: Semantic Search Data Science for Everyone https://datasciencedojo.com Roadmap to become an LLM applications developer
  • 8. Prompt design and engineering Prompting by instruction Prompting by example Controlling the model output When to stop Being creative vs. predictable Saving and sharing your prompts Use case Ideation Utilizing goal, task, and domain for perfect prompt Example use cases Summarizing (summarizing a technical report) Inferring (sentiment classification, topic extraction) Transforming text (translation, spelling, and grammar correction) Expanding (automatically writing emails) Unleash your creativity and efficiency with prompt engineering. Seamlessly prompt models, control outputs, and generate captivating content across various domains and tasks. This module includes: Prompt Engineering Roadmap to become an LLM applications developer Data Science for Everyone https://datasciencedojo.com
  • 9. Fine-tuning foundation LLMs Rationale for fine-tuning Limitations of fine-tuning Parameter efficient fine-tuning Hands-on exercise Fine-tuning and deploying the OpenAI GPT model on Azure Discover the ins and outs of fine-tuning foundation language models (LLMs) through theory discussions, exploring rationale, limitations, and parameter efficient fine-tuning (PEFT): Fine-Tuning Foundation Models Roadmap to become an LLM applications developer Data Science for Everyone https://datasciencedojo.com
  • 10. Why are Orchestration Frameworks (OF) needed? Eliminate the need for foundation model retraining Overcoming token limits Connecters for data sources Explore the necessity of orchestration frameworks, tackling issues like foundation model retraining, token limits, data source connectivity, and boilerplate code. Discover popular frameworks, their creators, and open-source availability: Orchestration Frameworks Roadmap to become an LLM applications developer Data Science for Everyone https://datasciencedojo.com
  • 11. Introduction toLangChain Schema, models, and prompts Memory and chains Loading, transforming, indexing, and retrieving data Document loader Text splitters Retrievers LangChain use cases Summarization: Summarizing long documents QnA using documents as context Extraction: Getting structured data from unstructured text Evaluation: Evaluating outputs generated from LLM models Querying tabular data without using any extra code Hands-on exercise Using LangChain loader, splitter, and retrievals on a pdf document Build LLM apps using LangChain. Learn about LangChain's key components such as models, prompts, parsers, memory, chains, and QnA. Get hands-on evaluation experience: LangChain Roadmap to become an LLM applications developer Data Science for Everyone https://datasciencedojo.com
  • 12. Agents and tools Agent types Conversational agents OpenAI functions agents ReAct agents Plan and execute agents Hands-on exercise: Create and execute some of the following agents Excel agent JSON agent Python Pandas agent Document comparison agent Power BI agent Use LLMs to make decisions about what to do next. Enable these decisions with tools. We’ll learn what they are, how they work, and how to use them within the LangChain library to superpower our LLMs. In this module, we’ll talk about: Autonomous Agents Roadmap to become an LLM applications developer Data Science for Everyone https://datasciencedojo.com
  • 13. Ethics, bias, fairness Sources of bias in acquisition/annotation of training data, model building Precautions against safeguarding the model from bias Review some of the regulations/legislation Principles of responsible AI Fairness and eliminating bias Reliability and safety Privacy and data protection Transparency and explainability Accountability and governance Inclusivity and accessibility Review some of the tools available to assess the following in a large language model application Correctness and security Bias, fairness, and explainability of the model Bias can creep in at any stage of the lifecycle of a model. While large language models offer tremendous business value, humans are involved in all stages of the lifecycle of an LLM from acquisition of data to interpretation of insights. In this module, we will learn about the following: Bias, Fairness and Explainablity Roadmap to become an LLM applications developer Data Science for Everyone https://datasciencedojo.com
  • 14. Virtual assistant: A dynamic customer service agent designed for the car manufacturing industry. Content generation (Marketing co-pilot): Enhancing your marketing strategies with an intelligent co-pilot. Conversational agent (Legal and compliance assistant): Assisting with legal and compliance matters through interactive conversations. QnA (IRS tax bot): An intelligent bot designed to answer your questions about IRS tax-related topics. Content personalizer: Tailoring content specifically to your preferences and needs. YouTube virtual assistant: Engage in interactive conversations with your favorite YouTube channels and playlists. Recommended Projects Roadmap to become an LLM applications developer Data Science for Everyone https://datasciencedojo.com
  • 15. Learn to Build LLM Applications New York December 4-8, 2023 Roadmap to become an LLM applications developer Data Science for Everyone https://datasciencedojo.com Join this 5-day | 40-hour bootcamp to get started with building large language model applications on your enterprise data Seattle September 18-22, 2023 Washington, D.C. October 16-20, 2023 Austin November 6-10, 2023 Singapore January, 2024