Get in Touch

Course Outline

Introduction to Large Language Models (LLMs)

  • Overview of LLMs
  • Definition and significance
  • Applications in AI today

Transformer Architecture

  • Understanding transformers and their operation
  • Key components and features
  • Embedding and positional encoding
  • Multi-head attention
  • Feed-forward neural network
  • Normalization and residual connections

    Transformer Models

    • Self-attention mechanism
    • Encoder-decoder architecture
    • Positional embeddings
    • BERT (Bidirectional Encoder Representations from Transformers)
    • GPT (Generative Pretrained Transformer)

    Performance Optimisation and Pitfalls

    • Context length
    • Mamba and state-space models
    • Flash attention
    • Sparse transformers
    • Vision transformers
    • The importance of quantisation

    Enhancing Transformers

    • Retrieval augmented text generation
    • Mixture of models
    • Tree of thoughts

    Fine-Tuning

    • Theory of low-rank adaptation
    • Fine-Tuning with QLora

    Scaling Laws and Optimisation in LLMs

    • The importance of scaling laws for LLMs
    • Data and model size scaling
    • Computational scaling
    • Parameter efficiency scaling

    Optimisation

    • The relationship between model size, data size, compute budget, and inference requirements
    • Optimising performance and efficiency of LLMs
    • Best practices and tools for training and fine-tuning LLMs

    Training and Fine-Tuning LLMs

    • Steps and challenges of training LLMs from scratch
    • Data acquisition and maintenance
    • Large-scale data, CPU, and memory requirements
    • Optimisation challenges
    • Landscape of open-source LLMs

    Fundamentals of Reinforcement Learning (RL)

    • Introduction to Reinforcement Learning
    • Learning through positive reinforcement
    • Definition and core concepts
    • Markov Decision Process (MDP)
    • Dynamic programming
    • Monte Carlo methods
    • Temporal Difference Learning

    Deep Reinforcement Learning

    • Deep Q-Networks (DQN)
    • Proximal Policy Optimization (PPO)
    • Elements of Reinforcement Learning

    Integration of LLMs and Reinforcement Learning

    • Combining LLMs with Reinforcement Learning
    • How RL is used in LLMs
    • Reinforcement Learning with Human Feedback (RLHF)
    • Alternatives to RLHF

    Case Studies and Applications

    • Real-world applications
    • Success stories and challenges

    Advanced Topics

    • Advanced techniques
    • Advanced optimisation methods
    • Cutting-edge research and developments

    Summary and Next Steps

Requirements

  • Foundational understanding of Machine Learning

Audience

  • Data scientists
  • Software engineers
 21 Hours

Upcoming Courses

Related Categories