Deepval LLM Evaluation Framework - Search Videos

Improving LLM Performance (with Evals)

1K views · 11 months ago

Understanding Framework Analysis: An Introductory Guide — Delve

From BLEU to G-Eval: LLM-as-a-Judge Techniques & Limitations

YouTube · deepsense

DeepResearch Arena: Benchmarking LLM Research

27 views · 3 months ago

YouTube · AI Research Roundup

AI Breakthrough - DeepSearch Trains Language Models 5.7X FASTER to Master Math

6 views · 2 months ago

YouTube · Data Magician

SpecEval: Auditing LLMs vs. Provider Specs

YouTube · AI Research Roundup

Inverse IFEval: Benchmarking LLM Cognitive Inertia

YouTube · AI Research Roundup

LLM Eval Tools Compared: LangSmith

1.5K views · 2 months ago

YouTube · Hamel Husain

DITING: LLM Benchmark for Web Novel Translation

12 views · 2 months ago

YouTube · AI Research Roundup

LLM Evaluation Explained: BLEU, ROUGE, BERTScore & the Full Pip…

YouTube · Peetha Academy

Measuring Belief Depth in LLM Knowledge Edits

40 views · 2 months ago

YouTube · AI Research Roundup

Stop Guessing: What LLM Evals Actually Are | @LennysPodcast

925 views · 2 months ago

YouTube · Hamel Husain

How to Systematically Improve LLM Applications

14.1K views · 3 months ago

YouTube · Dave Ebbelaar

LLM Eval Tools Compared: Arize Phoenix

1.2K views · 2 months ago

YouTube · Hamel Husain

Responsible AI with fmeval - an open source library to evaluate LL…

52 views · 2 months ago

LLM Eval Tools Compared: Braintrust

958 views · 2 months ago

YouTube · Hamel Husain

Master LLMs: Top Strategies to Evaluate LLM Performance

8K views · Oct 29, 2023

YouTube · What's AI by Louis-François Bouchard

2017 LLVM Developers’ Meeting: “DLVM: A Compiler Framework fo…

2.7K views · Oct 31, 2017

Mastering LLM Evaluation: Metrics and Methodologies

371 views · Jun 6, 2024

Day 75/75 LLM Evaluation Metrics [Explained] using HELM Framewo…

635 views · May 5, 2024

YouTube · FreeBirds Crew - Data Science and GenAI

DeepEval for RAG: Let’s Test If Your LLM Really Works as expected! 🔥

8.6K views · 9 months ago

YouTube · Execute Automation

Advancing AI - LLM Evaluation with MLFlow 2.4

3.4K views · Aug 11, 2023

YouTube · Advancing Analytics

LLM Evals and LLM as a Judge: Fundamentals

3.9K views · Jun 25, 2024

YouTube · Arize AI

LLaMA | New open foundation Large Language Model by Meta AI | Pape…

4.2K views · Feb 28, 2023

YouTube · The NLP Lab

Evaluating LLMs using Langchain

5.9K views · Oct 23, 2023

YouTube · Data Science in your pocket

The Kirkpatrick Model of Training Evaluation

123.9K views · Dec 28, 2020

YouTube · Devlin Peck

Dlubal RFEM 5 | Introduction to the FEA structural analysis software

3.4K views · Aug 13, 2020

YouTube · Dlubal Software EN

Evaluate LLMs - RAG

5.5K views · Oct 6, 2023

YouTube · Hands-on AI

LLM Evals - Part 1: Evaluating Performance

3.8K views · 11 months ago

YouTube · Trelis Research

LLM Evaluation using DeepEval

4.1K views · Aug 12, 2024

YouTube · Data Science in your pocket