
Conversation

garaud
Contributor

@garaud garaud commented Sep 5, 2019

Description of proposed changes

Fix the computation of correct/incorrect values from the confusion matrix when the L input data does not contain the ABSTAIN label. We slice the confusion matrix on [1:, 1:] only when there are ABSTAIN labels, i.e. -1 values. The full confusion matrix should be used when there are no -1 values.
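The described fix can be sketched as follows. This is a minimal illustration, not the actual snorkel source: the helper name `lf_correct_incorrect` is hypothetical, and the confusion matrix is built by hand rather than via snorkel's internals. The key point is that the [1:, 1:] slice (which drops the ABSTAIN row/column) is applied only when -1 votes actually occur.

```python
import numpy as np

def lf_correct_incorrect(L_col, Y):
    """Count correct/incorrect non-abstain votes of one labeling function.

    Hypothetical helper illustrating the fix: the confusion matrix is
    sliced with [1:, 1:] only when ABSTAIN (-1) votes are present.
    """
    # Sorting puts -1 (ABSTAIN) first, so it occupies row/column 0 if present.
    labels = sorted(set(L_col) | set(Y))
    index = {lab: i for i, lab in enumerate(labels)}
    cm = np.zeros((len(labels), len(labels)), dtype=int)
    for l, y in zip(L_col, Y):
        cm[index[l], index[y]] += 1
    if -1 in index:
        # Drop the ABSTAIN row/column only when ABSTAIN votes exist;
        # slicing unconditionally would discard a real label (the old bug).
        cm = cm[1:, 1:]
    correct = int(np.trace(cm))
    incorrect = int(cm.sum()) - correct
    return correct, incorrect
```

With ABSTAIN votes present, `lf_correct_incorrect([-1, 0, 1, 1], [0, 0, 1, 0])` counts only the three non-abstain votes; without any -1 values, the full matrix is kept, so label 0 is no longer silently dropped.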

Related issue(s)

Fixes #1446

Test plan

Checklist

Need help on these? Just ask!

  • I have read the CONTRIBUTING document.
  • I have updated the documentation accordingly.
  • I have added tests to cover my changes.
  • All new and existing tests passed.

I'm having some trouble with pyspark in the test suite, but I think it's unrelated to this fix.

Member

@henryre henryre left a comment


@garaud great find and great fix, thanks!! Looks like it's just a formatting issue. You can fix it by running `tox -e fix`, then verify by running `tox`.

Member

@bhancock8 bhancock8 left a comment


Thanks so much for posting the issue and the PR! I made one suggestion for simplifying the calculation. Then once we clarify the naming of lfa_bis in the unit test, looks great to me!

@codecov

codecov bot commented Sep 5, 2019

Codecov Report

❗ No coverage uploaded for pull request base (master@77f49b4).
The diff coverage is 100%.

@@            Coverage Diff            @@
##             master    #1447   +/-   ##
=========================================
  Coverage          ?   97.55%           
=========================================
  Files             ?       55           
  Lines             ?     2002           
  Branches          ?      328           
=========================================
  Hits              ?     1953           
  Misses            ?       22           
  Partials          ?       27
Impacted Files                 Coverage Δ
snorkel/labeling/analysis.py   100% <100%> (ø)

@garaud garaud force-pushed the fix-lf-summary-no-abstain-1446 branch from 75dbd02 to 8b2d3af Compare September 5, 2019 18:53
Member


Last nit, I promise: can we add a 3 to L_wo_abstain and a 4 to the Y used here? This will make sure that any future changes account for non-identical label sets between L and Y, which this change handles correctly.
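The request above could be sketched roughly like this. The data is hypothetical (the actual test matrices in the PR are not shown here): L votes include a label 3 that never appears in Y, and Y includes a label 4 that no labeling function ever votes, so the label sets differ without any ABSTAIN votes being involved.

```python
import numpy as np

# Hypothetical test data: label 3 appears only in L, label 4 only in Y.
L_wo_abstain = np.array([[0, 1], [1, 3], [0, 1]])
Y = np.array([0, 1, 4])

# The label sets are non-identical, so any analysis that assumes L and Y
# share the same labels (or implicitly contain ABSTAIN) would mis-slice
# its confusion matrix.
assert set(L_wo_abstain.ravel()) != set(Y.tolist())
assert -1 not in L_wo_abstain  # no ABSTAIN votes anywhere
```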

Contributor Author


Good idea :)
I updated the test part.

Damien Garaud added 2 commits September 5, 2019 22:09
…trix

when the L input data does not contain the ABSTAIN label. We only slice the confusion matrix
on [1:, 1:] when there are ABSTAIN labels, i.e. -1 values. The full confusion
matrix should be used when there are no -1 values.

fix snorkel-team#1446
@garaud garaud force-pushed the fix-lf-summary-no-abstain-1446 branch from 8b2d3af to 95ca47f Compare September 5, 2019 20:10
Member

@henryre henryre left a comment


@garaud awesome!! Let's ship it. Feel free to merge whenever you want.

@henryre
Member

henryre commented Sep 5, 2019

@garaud we usually leave it to authors to merge, but we're pushing a patch today so I'm going to go ahead and merge for you. Thanks again, this is great!!

@henryre henryre merged commit ff1074a into snorkel-team:master Sep 5, 2019
@garaud
Contributor Author

garaud commented Sep 6, 2019

we usually leave it to authors to merge, but we're pushing a patch today so I'm going to go ahead and merge for you.

Don't worry. Thank you for the review.

Weak supervision is quite new to me, and Snorkel is a great project! We use weak supervision to label massive 3D point clouds (LIDAR scans in natural environments such as rocks, cliffs, vegetation, etc.), and it's really helpful.

Successfully merging this pull request may close these issues:

  • Confusion matrix in LFAnalysis.lf_summary supposes you have at least one ABSTAIN label