Evaluating Large Language Models LLMs | 0C091C11453E0D5D0104120606E2382E42F066B0

Evaluating Large Language Models LLMs

Name	Evaluating Large Language Models (LLMs)	DOWNLOAD Copy Link Trouble downloading? see How To
Total Size	2.3 GB
Total Files	72
Last Seen	2025-08-01 00:23
Hash	0C091C11453E0D5D0104120606E2382E42F066B0

/Lesson 2 Evaluating Generative Tasks/
004. 2.3 Evaluating Free Text Response Tasks, Part 2.mp4	197.2 MB
001. Learning objectives.en.srt	0.7 KB
001. Learning objectives.mp4	5.3 MB
002. 2.1 Evaluating Multiple-Choice Tasks.en.srt	18.5 KB
002. 2.1 Evaluating Multiple-Choice Tasks.mp4	54.1 MB
003. 2.2 Evaluating Free Text Response Tasks, Part 1.en.srt	31.6 KB
003. 2.2 Evaluating Free Text Response Tasks, Part 1.mp4	105.7 MB
004. 2.3 Evaluating Free Text Response Tasks, Part 2.en.srt	38.4 KB
005. 2.3 AIs Supervising AIs LLM as a Judge.mp4	51.3 MB
005. 2.4 AIs Supervising AIs LLM as a Judge.en.srt	17.7 KB
/Introduction/
001. Evaluating Large Language Models (LLMs) Introduction.mp4	18.3 MB
001. Evaluating Large Language Models (LLMs) Introduction.en.srt	2.6 KB
/Lesson 1 Foundations of LLM Evaluation/
001. Learning objectives.en.srt	0.8 KB
001. Learning objectives.mp4	5.0 MB
002. 1.1 Introduction to Evaluation Why It Matters.en.srt	18.5 KB
002. 1.1 Introduction to Evaluation Why It Matters.mp4	53.5 MB
003. 1.2 Generative versus Understanding Tasks.en.srt	14.9 KB
003. 1.2 Generative versus Understanding Tasks.mp4	42.7 MB
004. 1.3 Key Metrics for Common Tasks.en.srt	26.4 KB
004. 1.3 Key Metrics for Common Tasks.mp4	90.2 MB
/Lesson 3 Evaluating Understanding Tasks/
001. Learning objectives.en.srt	0.9 KB
001. Learning objectives.mp4	7.1 MB
002. 3.1 Evaluating Embedding Tasks.en.srt	21.0 KB
002. 3.1 Evaluating Embedding Tasks.mp4	65.4 MB
003. 3.2 Evaluating Classification Tasks.en.srt	28.3 KB
003. 3.2 Evaluating Classification Tasks.mp4	86.4 MB
004. 3.3 Building an LLM Classifier with BERT and GPT.en.srt	31.6 KB
004. 3.3 Building an LLM Classifier with BERT and GPT.mp4	92.9 MB
/Lesson 4 Using Benchmarks Effectively/
001. Learning objectives.en.srt	0.9 KB
001. Learning objectives.mp4	6.7 MB
002. 4.1 The Role of Benchmarks.en.srt	11.1 KB
002. 4.1 The Role of Benchmarks.mp4	33.4 MB
003. 4.2 Interrogating Common Benchmarks.en.srt	30.6 KB
003. 4.2 Interrogating Common Benchmarks.mp4	94.5 MB
004. 4.3 Evaluating LLMs with Benchmarks.en.srt	31.9 KB
004. 4.3 Evaluating LLMs with Benchmarks.mp4	136.9 MB
/Lesson 5 Probing LLMs for a World Model/
001. Learning objectives.en.srt	0.7 KB
001. Learning objectives.mp4	5.3 MB
002. 5.1 Probing LLMs for Knowledge.en.srt	25.8 KB
002. 5.1 Probing LLMs for Knowledge.mp4	77.4 MB
003. 5.2 Probing LLMs to Play Games.en.srt	34.7 KB
003. 5.2 Probing LLMs to Play Games.mp4	155.1 MB
/Lesson 6 Evaluating LLM Fine-Tuning/
001. Learning objectives.en.srt	0.7 KB
001. Learning objectives.mp4	5.2 MB
002. 6.1 Fine-Tuning Objectives.en.srt	14.1 KB
002. 6.1 Fine-Tuning Objectives.mp4	34.5 MB
003. 6.2 Metrics for Fine-Tuning Success.en.srt	14.5 KB
003. 6.2 Metrics for Fine-Tuning Success.mp4	43.7 MB
004. 6.3 Practical Demonstration Evaluating Fine-Tuning.en.srt	37.6 KB
004. 6.3 Practical Demonstration Evaluating Fine-Tuning.mp4	106.9 MB
005. 6.4 Evaluating and Cleaning Data.en.srt	46.1 KB
005. 6.4 Evaluating and Cleaning Data.mp4	174.7 MB
/Lesson 7 Case Studies/
001. Learning objectives.en.srt	0.8 KB
001. Learning objectives.mp4	6.5 MB
002. 7.1 Evaluating AI Agents Task Automation and Tool Integration.en.srt	24.9 KB
002. 7.1 Evaluating AI Agents Task Automation and Tool Integration.mp4	81.5 MB
003. 7.2 Measuring Retrieval-Augmented Generation (RAG) Systems.en.srt	15.4 KB
003. 7.2 Measuring Retrieval-Augmented Generation (RAG) Systems.mp4	50.2 MB
004. 7.3 Building and Evaluating a Recommendation Engine Using LLMs.en.srt	24.9 KB
004. 7.3 Building and Evaluating a Recommendation Engine Using LLMs.mp4	91.6 MB
005. 7.4 Using Evaluation to Combat AI Drift.en.srt	29.2 KB
005. 7.4 Using Evaluation to Combat AI Drift.mp4	123.5 MB
006. 7.5 Time-Series Regression.en.srt	24.7 KB
006. 7.5 Time-Series Regression.mp4	111.9 MB
/Lesson 8 Summary of Evaluation and Looking Ahead/
001. Learning objectives.en.srt	0.8 KB
001. Learning objectives.mp4	5.7 MB
002. 8.1 When and How to Evaluate.en.srt	14.7 KB
002. 8.1 When and How to Evaluate.mp4	48.8 MB
003. 8.2 Looking Ahead Trends in LLM Evaluation.en.srt	8.9 KB
003. 8.2 Looking Ahead Trends in LLM Evaluation.mp4	27.8 MB
/Summary/
001. Evaluating Large Language Models (LLMs) Summary.en.srt	1.2 KB
001. Evaluating Large Language Models (LLMs) Summary.mp4	8.3 MB
Total files 72