FileMood

Download Evaluating Large Language Models (LLMs)

Evaluating Large Language Models LLMs

Name

Evaluating Large Language Models (LLMs)

  DOWNLOAD Copy Link

Trouble downloading? see How To

Total Size

2.3 GB

Total Files

72

Hash

0C091C11453E0D5D0104120606E2382E42F066B0

/Lesson 2 Evaluating Generative Tasks/

004. 2.3 Evaluating Free Text Response Tasks, Part 2.mp4

197.2 MB

001. Learning objectives.en.srt

0.7 KB

001. Learning objectives.mp4

5.3 MB

002. 2.1 Evaluating Multiple-Choice Tasks.en.srt

18.5 KB

002. 2.1 Evaluating Multiple-Choice Tasks.mp4

54.1 MB

003. 2.2 Evaluating Free Text Response Tasks, Part 1.en.srt

31.6 KB

003. 2.2 Evaluating Free Text Response Tasks, Part 1.mp4

105.7 MB

004. 2.3 Evaluating Free Text Response Tasks, Part 2.en.srt

38.4 KB

005. 2.3 AIs Supervising AIs LLM as a Judge.mp4

51.3 MB

005. 2.4 AIs Supervising AIs LLM as a Judge.en.srt

17.7 KB

/Introduction/

001. Evaluating Large Language Models (LLMs) Introduction.mp4

18.3 MB

001. Evaluating Large Language Models (LLMs) Introduction.en.srt

2.6 KB

/Lesson 1 Foundations of LLM Evaluation/

001. Learning objectives.en.srt

0.8 KB

001. Learning objectives.mp4

5.0 MB

002. 1.1 Introduction to Evaluation Why It Matters.en.srt

18.5 KB

002. 1.1 Introduction to Evaluation Why It Matters.mp4

53.5 MB

003. 1.2 Generative versus Understanding Tasks.en.srt

14.9 KB

003. 1.2 Generative versus Understanding Tasks.mp4

42.7 MB

004. 1.3 Key Metrics for Common Tasks.en.srt

26.4 KB

004. 1.3 Key Metrics for Common Tasks.mp4

90.2 MB

/Lesson 3 Evaluating Understanding Tasks/

001. Learning objectives.en.srt

0.9 KB

001. Learning objectives.mp4

7.1 MB

002. 3.1 Evaluating Embedding Tasks.en.srt

21.0 KB

002. 3.1 Evaluating Embedding Tasks.mp4

65.4 MB

003. 3.2 Evaluating Classification Tasks.en.srt

28.3 KB

003. 3.2 Evaluating Classification Tasks.mp4

86.4 MB

004. 3.3 Building an LLM Classifier with BERT and GPT.en.srt

31.6 KB

004. 3.3 Building an LLM Classifier with BERT and GPT.mp4

92.9 MB

/Lesson 4 Using Benchmarks Effectively/

001. Learning objectives.en.srt

0.9 KB

001. Learning objectives.mp4

6.7 MB

002. 4.1 The Role of Benchmarks.en.srt

11.1 KB

002. 4.1 The Role of Benchmarks.mp4

33.4 MB

003. 4.2 Interrogating Common Benchmarks.en.srt

30.6 KB

003. 4.2 Interrogating Common Benchmarks.mp4

94.5 MB

004. 4.3 Evaluating LLMs with Benchmarks.en.srt

31.9 KB

004. 4.3 Evaluating LLMs with Benchmarks.mp4

136.9 MB

/Lesson 5 Probing LLMs for a World Model/

001. Learning objectives.en.srt

0.7 KB

001. Learning objectives.mp4

5.3 MB

002. 5.1 Probing LLMs for Knowledge.en.srt

25.8 KB

002. 5.1 Probing LLMs for Knowledge.mp4

77.4 MB

003. 5.2 Probing LLMs to Play Games.en.srt

34.7 KB

003. 5.2 Probing LLMs to Play Games.mp4

155.1 MB

/Lesson 6 Evaluating LLM Fine-Tuning/

001. Learning objectives.en.srt

0.7 KB

001. Learning objectives.mp4

5.2 MB

002. 6.1 Fine-Tuning Objectives.en.srt

14.1 KB

002. 6.1 Fine-Tuning Objectives.mp4

34.5 MB

003. 6.2 Metrics for Fine-Tuning Success.en.srt

14.5 KB

003. 6.2 Metrics for Fine-Tuning Success.mp4

43.7 MB

004. 6.3 Practical Demonstration Evaluating Fine-Tuning.en.srt

37.6 KB

004. 6.3 Practical Demonstration Evaluating Fine-Tuning.mp4

106.9 MB

005. 6.4 Evaluating and Cleaning Data.en.srt

46.1 KB

005. 6.4 Evaluating and Cleaning Data.mp4

174.7 MB

/Lesson 7 Case Studies/

001. Learning objectives.en.srt

0.8 KB

001. Learning objectives.mp4

6.5 MB

002. 7.1 Evaluating AI Agents Task Automation and Tool Integration.en.srt

24.9 KB

002. 7.1 Evaluating AI Agents Task Automation and Tool Integration.mp4

81.5 MB

003. 7.2 Measuring Retrieval-Augmented Generation (RAG) Systems.en.srt

15.4 KB

003. 7.2 Measuring Retrieval-Augmented Generation (RAG) Systems.mp4

50.2 MB

004. 7.3 Building and Evaluating a Recommendation Engine Using LLMs.en.srt

24.9 KB

004. 7.3 Building and Evaluating a Recommendation Engine Using LLMs.mp4

91.6 MB

005. 7.4 Using Evaluation to Combat AI Drift.en.srt

29.2 KB

005. 7.4 Using Evaluation to Combat AI Drift.mp4

123.5 MB

006. 7.5 Time-Series Regression.en.srt

24.7 KB

006. 7.5 Time-Series Regression.mp4

111.9 MB

/Lesson 8 Summary of Evaluation and Looking Ahead/

001. Learning objectives.en.srt

0.8 KB

001. Learning objectives.mp4

5.7 MB

002. 8.1 When and How to Evaluate.en.srt

14.7 KB

002. 8.1 When and How to Evaluate.mp4

48.8 MB

003. 8.2 Looking Ahead Trends in LLM Evaluation.en.srt

8.9 KB

003. 8.2 Looking Ahead Trends in LLM Evaluation.mp4

27.8 MB

/Summary/

001. Evaluating Large Language Models (LLMs) Summary.en.srt

1.2 KB

001. Evaluating Large Language Models (LLMs) Summary.mp4

8.3 MB

 

Total files 72


Copyright © 2025 FileMood.com