Evaluator Tool #114
base: develop
.deepeval/.deepeval_telemetry.txt
Outdated
@@ -0,0 +1,2 @@
+DEEPEVAL_ID=c2b8806f-4b6c-46bf-80f4-a300c8ed69e3
Do we need this DeepEval artifact in the git tree?
@NISH1001 I'm not sure, but I guess probably not.
@leothomas what do you think?
@NISH1001 I've removed .deepeval_telemetry.txt since it's automatically created every time DeepEval is run.
@TigranTigranTigran maybe we can just add this to .gitignore? I also see the .deepeval dir, which might have to be ignored as well. Can we remove .deepeval from the git tree and add it to .gitignore? Then we can move forward with this PR.
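For reference, a minimal sketch of the cleanup requested above. The .deepeval paths come from this thread; the throwaway repository, the placeholder DEEPEVAL_ID value, and the commit identity are assumptions added only so the sketch runs on its own. In the real repository, only the three commands in the middle section would be needed.

```shell
set -eu

# Throwaway repository (assumption, for illustration only).
demo=$(mktemp -d)
cd "$demo"
git init -q
mkdir .deepeval
echo "DEEPEVAL_ID=example" > .deepeval/.deepeval_telemetry.txt  # placeholder value
git add -A
git -c user.name=demo -c user.email=demo@example.com commit -q -m "Initial commit"

# The actual cleanup: stop tracking .deepeval but keep the files on disk,
# then ignore the directory from now on.
git rm -r -q --cached .deepeval
echo ".deepeval/" >> .gitignore
git add .gitignore

git -c user.name=demo -c user.email=demo@example.com commit -q -m "Stop tracking .deepeval"

# List the files that are still tracked.
git ls-files
```

Note that this ignores the whole directory, including the .deepeval/.deepeval config file mentioned in the PR description. If that file should stay in the repository, ignoring only .deepeval/.deepeval_telemetry.txt would be the narrower option.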
Summary 📝
Tool for evaluating LLM output or search result quality (continuing from PR #44)
Details
Usage
The LLMEvaluator uses a locally running model (IBM: granite3-dense:8b) but can easily be updated to use any model running locally on Ollama using the following config:
file: .deepeval/.deepeval
The following code demonstrates its usage with a search result and a direct input:
Checks