-
Notifications
You must be signed in to change notification settings - Fork 2
wip docs and edge cases #15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Draft
ericphanson
wants to merge
7
commits into
main
Choose a base branch
from
eph/docs
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Draft
Changes from 1 commit
Commits
Show all changes
7 commits
Select commit
Hold shift + click to select a range
885f0ad
wip docs and edge cases
ericphanson 219586c
some more docs pages
ericphanson 749bb66
add push previews and clean them up
ericphanson 30b6a7c
wip
ericphanson 311c780
more descriptive argument name
ericphanson 4022886
throw some argument errors and test them
ericphanson ae76a32
log the accuracy
ericphanson File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,28 @@ | ||
# Evaluation metrics | ||
|
||
Lighthouse automatically generates a suite of evaluation metrics. | ||
Here, we briefly describe these. This page uses terms defined in [Terminology](@ref), | ||
so see that page for any unfamiliar words. | ||
|
||
## Confusion matrices | ||
|
||
Lighthouse plots confusion matrices, which are simple tables | ||
showing the empirical distribution of predicted class (the rows) | ||
versus the elected class (the columns). These come in two variants: | ||
|
||
* row-normalized: this means each row has been normalized to sum to 1. Thus, the row-normalized confusion matrix shows the empirical distribution of elected classes for a given predicted class. E.g. the first row of the row-normalized confusion matrix shows the empirical probabilities of the elected classes for a sample which was predicted to be in the first class. | ||
* column-normalized: this means each column has been normalized to sum to 1. Thus, the column-normalized confusion matrix shows the empirical distribution of predicted classes for a given elected class. E.g. the first column of the column-normalized confusion matrix shows the empirical probabilities of the predicted classes for a sample which was elected to be in the first class. | ||
|
||
[insert example plot] | ||
|
||
## Inter-rater reliability | ||
|
||
## ROC curves | ||
|
||
## PR curves | ||
|
||
## PR-gain curves | ||
|
||
## Prediction-reliability calibration | ||
|
||
## |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,9 @@ | ||
# Terminology | ||
|
||
* _sample_: a piece of data to be classified by the model, or a labelled piece of training/test/validation data. | ||
* _classes_: the set of possible class labels which the model attempts to predict. | ||
* _voters_: the individual sources of labelled data, such as human labellers. Each voter may supply a "vote" for a class label for a sample. | ||
* _votes_: the matrix of votes corresponding to a set of data, whose rows correspond to the index of a sample in a set of data, whose columns correspond to voters, and whose values are the indices of class labels (i.e. numbers in `1:length(classes)`). E.g. if 2 voters have voted on ten samples, then `votes` is a 10 by 2 matrix of integers. If a voter has not voted on a particular sample, any value outside `1:length(classes)` may be supplied to indicate this. | ||
* _elected class_: the class elected by the voters. By default in [`learn!`](@ref), | ||
the elected class is chosen by a simple majority of the votes with ties broken randomly. | ||
* _predicted class_: the class predicted by the model for a given input. |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I wonder if these should be docstrings? The reason I didn't do so from the start is that I meant this page to be more pedagogical (and hopefully to have references), instead of just documenting the programmatic API.