Improving LLM Performance (with Evals)
1K views
11 months ago
substack.com
Understanding Framework Analysis: An Introductory Guide — Delve
Jun 24, 2021
delvetool.com
29:17
From BLEU to G-Eval: LLM-as-a-Judge Techniques & Limitations
3 weeks ago
YouTube
deepsense
3:22
DeepResearch Arena: Benchmarking LLM Research
27 views
3 months ago
YouTube
AI Research Roundup
39:13
AI Breakthrough - DeepSearch Trains Language Models 5.7X FAS…
6 views
2 months ago
YouTube
Data Magician
4:03
SpecEval: Auditing LLMs vs. Provider Specs
3 months ago
YouTube
AI Research Roundup
2:55
Inverse IFEval: Benchmarking LLM Cognitive Inertia
3 months ago
YouTube
AI Research Roundup
46:15
LLM Eval Tools Compared: LangSmith
1.5K views
2 months ago
YouTube
Hamel Husain
3:49
DITING: LLM Benchmark for Web Novel Translation
12 views
2 months ago
YouTube
AI Research Roundup
6:31
LLM Evaluation Explained: BLEU, ROUGE, BERTScore & the Full Pip…
3 weeks ago
YouTube
Peetha Academy
4:24
Measuring Belief Depth in LLM Knowledge Edits
40 views
2 months ago
YouTube
AI Research Roundup
0:53
Stop Guessing: What LLM Evals Actually Are | @LennysPodcast
925 views
2 months ago
YouTube
Hamel Husain
55:02
How to Systematically Improve LLM Applications
14.1K views
3 months ago
YouTube
Dave Ebbelaar
32:40
LLM Eval Tools Compared: Arize Phoenix
1.2K views
2 months ago
YouTube
Hamel Husain
30:03
Responsible AI with fmeval - an open source library to evaluate LL…
52 views
2 months ago
YouTube
PyData
41:42
LLM Eval Tools Compared: Braintrust
958 views
2 months ago
YouTube
Hamel Husain
8:41
Master LLMs: Top Strategies to Evaluate LLM Performance
8K views
Oct 29, 2023
YouTube
What's AI by Louis-François Bouchard
2017 LLVM Developers’ Meeting: “DLVM: A Compiler Framework fo…
2.7K views
Oct 31, 2017
YouTube
LLVM
8:40
Mastering LLM Evaluation: Metrics and Methodologies
371 views
Jun 6, 2024
YouTube
H2O.ai
Day 75/75 LLM Evaluation Metrics [Explained] using HELM Framewo…
635 views
May 5, 2024
YouTube
FreeBirds Crew - Data Science and GenAI
DeepEval for RAG: Let’s Test If Your LLM Really Works as expected! 🔥
8.6K views
9 months ago
YouTube
Execute Automation
15:09
Advancing AI - LLM Evaluation with MLFlow 2.4
3.4K views
Aug 11, 2023
YouTube
Advancing Analytics
LLM Evals and LLM as a Judge: Fundamentals
3.9K views
Jun 25, 2024
YouTube
Arize AI
4:11
LLaMA | New open foundation Large Language Model by Meta AI | Pape…
4.2K views
Feb 28, 2023
YouTube
The NLP Lab
11:24
Evaluating LLMs using Langchain
5.9K views
Oct 23, 2023
YouTube
Data Science in your pocket
9:35
The Kirkpatrick Model of Training Evaluation
123.9K views
Dec 28, 2020
YouTube
Devlin Peck
5:40
Dlubal RFEM 5 | Introduction to the FEA structural analysis software
3.4K views
Aug 13, 2020
YouTube
Dlubal Software EN
8:45
Evaluate LLMs - RAG
5.5K views
Oct 6, 2023
YouTube
Hands-on AI
34:23
LLM Evals - Part 1: Evaluating Performance
3.8K views
11 months ago
YouTube
Trelis Research
6:36
LLM Evaluation using DeepEval
4.1K views
Aug 12, 2024
YouTube
Data Science in your pocket