Evaluate and Track Your LLM Experiments: Introducing TruLens for LLMs

Y	Hacker News new \| ask \| show \| jobs

	Evaluate and Track Your LLM Experiments: Introducing TruLens for LLMs (github.com)
	25 points by shayaks 1129 days ago

7 comments

westurner 1129 days ago

https://news.ycombinator.com/item?id=35810320 :

> - [ ] dvc, GitHub Actions, GitLab CI, Gitea Actions,: how to add PROV RDF Linked Data metadata to workflows like DVC.org's & container+command-in-YAML approach

https://dvc.org/

https://news.ycombinator.com/item?id=34619424 https://westurner.github.io/hnlog/#comment-34619424 :

- XAI: Explainable AI: https://en.wikipedia.org/wiki/Explainable_artificial_intelli...

- > Right to explanation: https://en.wikipedia.org/wiki/Right_to_explanation

- > A more logged approach with IDK all previous queries in a notebook and their output over time would be more scientific-like and thus closer to "Engineering": https://en.wikipedia.org/wiki/Engineering

link

shayaks 1129 days ago

We open-sourced TruLens for LLMs to help evaluate and track your LLM experiments. We also built in special integration with langchain to capture the metadata around your entire chain stack to use with your evaluations. Give it a spin and send us your feedback! We're adding new functionality every day.

link

shayaks 1129 days ago

Here's a companion blog that explain it works under the hood: https://medium.com/trulens/evaluate-and-track-your-llm-exper...

link

warthog454 1129 days ago

Better LLM app testing is an urgent need when you see the stuff getting put out there.

link

arielmia 1129 days ago

This is a godsend exactly what i have been searching for

link

duncanid 1129 days ago

Smart toolkit to for quick sanity check why developing

link

laryb2k 1128 days ago

Amazing and very helpful package!!!

link