Conversation
@ahn1340 ahn1340 commented Nov 21, 2018

This PR fixes a bug where the ensemble of classifiers could return prediction values larger than 1. The bug occurs because predict() in ensemble_selection.py sometimes receives only the predictions of models with non-zero weights (zero-weight models already excluded), and sometimes receives the predictions of all models, including the zero-weight ones. predict() now distinguishes these two cases so that each weight is applied to the correct model's predictions.
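To see why the mismatch produced values above 1, here is a toy NumPy reproduction (the weights and probabilities are made up for illustration, not taken from the PR):

```python
import numpy as np

# Toy setup: three models, the middle one has zero ensemble weight.
weights = np.array([0.5, 0.0, 0.5])
preds = np.array([
    [[0.8, 0.2]],   # model 0: class probabilities for one sample
    [[0.1, 0.9]],   # model 1 (weight 0.0)
    [[0.3, 0.7]],   # model 2
])

# Pre-fix behaviour when ALL models' predictions are passed in: the loop
# iterates only over the non-zero weights, so the first two arrays get
# scaled while model 2's probabilities are summed in completely unscaled.
buggy = preds.copy()
for i, weight in enumerate(w for w in weights if w > 0):
    buggy[i] *= weight
buggy_out = np.sum(buggy, axis=0)   # row sums exceed 1

# Pairing each weight with its own model keeps the rows summing to 1.
fixed_out = np.average(preds, axis=0, weights=weights)
```

With these numbers the buggy path yields a row that sums to 2.0, while the correct weighting keeps the probabilities normalized.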

Jin Woo Ahn and others added 30 commits May 3, 2018 18:10
* Fix minor printing error in sprint_statistics.
* .

* .

* AutoSklearnClassifier/Regressor's fit, refit, fit_ensemble now return self.

* Initial commit. Work in Progress.

* Fix minor printing error in sprint_statistics.

* Revert "Fix#460"

* Resolve rebase conflict

* combined unittests to reduce travis runtime

* .

* .

* .

* .

* .

@mfeurer mfeurer left a comment


Thanks a lot. Could you please also fix the conflict?

# train ensemble
ensemble = self.fit_ensemble(selected_keys=selected_models)


Please revert changes in this file as there are no changes to the actual code in this file.

def predict(self, predictions):
    non_null_weights = (weight for weight in self.weights_ if weight > 0)
    for i, weight in enumerate(non_null_weights):
        #non_null_weights = (weight for weight in self.weights_ if weight > 0)

Could you please remove these comments?

    # predictions[i] *= weight
    for i, weight in enumerate(self.weights_):
        predictions[i] *= weight
    return np.sum(predictions, axis=0)

Could this code maybe be simplified a lot more to return np.average(predictions, axis=0, weights=self.weights_)?
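The suggested one-liner is equivalent to the multiply-and-sum loop whenever the weights are normalized, since np.average divides by the weight sum (which is 1 for ensemble selection weights). A small sketch with made-up numbers:

```python
import numpy as np

# Made-up ensemble weights that sum to 1, as ensemble selection produces.
weights = np.array([0.4, 0.0, 0.6])
predictions = np.array([
    [[0.9, 0.1]],
    [[0.5, 0.5]],
    [[0.2, 0.8]],
])

# Loop form from the PR: scale each model's predictions, then sum.
scaled = predictions.copy()
for i, weight in enumerate(weights):
    scaled[i] *= weight
loop_result = np.sum(scaled, axis=0)

# Reviewer's one-liner: np.average normalizes by sum(weights), which is 1
# here, so both forms agree whenever the weights are normalized.
avg_result = np.average(predictions, axis=0, weights=weights)
```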

@ahn1340 ahn1340 changed the title [WIP] Fix classifier bug Fix classifier bug Nov 30, 2018
@ahn1340 ahn1340 changed the title Fix classifier bug [WIP]Fix classifier bug Nov 30, 2018
X, batch_size=batch_size, n_jobs=n_jobs)
assert np.allclose(np.sum(pred_proba, axis=1),
np.ones_like(pred_proba[:, 0])),\
"prediction probability does not sum up to 1!"

Could you please change the formatting to

assert (
    np.allclose(
        np.sum(pred_proba, axis=1),
        np.ones_like(pred_proba[:, 0]))
), "prediction probability does not sum up to 1!"

@ahn1340 ahn1340 (author) left a comment

Yep, I will make that modification. Currently some unittests fail due to this line. I'm working on the fix and I will remove [WIP] as soon as it is done.
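For intuition, a minimal sketch of a predict() that accepts both input shapes (a hypothetical standalone helper for illustration, not the code that was merged):

```python
import numpy as np

def ensemble_predict(predictions, weights):
    """Weighted-average model predictions, whether `predictions` holds all
    models or only those with non-zero weight (sketch, hypothetical helper)."""
    predictions = np.asarray(predictions)
    weights = np.asarray(weights)
    non_null = weights[weights > 0]
    if predictions.shape[0] == non_null.shape[0]:
        # Zero-weight models were already filtered out upstream.
        return np.average(predictions, axis=0, weights=non_null)
    if predictions.shape[0] == weights.shape[0]:
        # All models are present; zero weights contribute nothing.
        return np.average(predictions, axis=0, weights=weights)
    raise ValueError("number of prediction arrays does not match weights")

weights = [0.5, 0.0, 0.5]
all_preds = [[[0.8, 0.2]], [[0.1, 0.9]], [[0.3, 0.7]]]
filtered_preds = [[[0.8, 0.2]], [[0.3, 0.7]]]   # zero-weight model dropped
out_all = ensemble_predict(all_preds, weights)
out_filtered = ensemble_predict(filtered_preds, weights)
```

Both call paths give the same normalized probabilities, so the assertion on row sums holds in either case.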

@codecov-io

codecov-io commented Dec 5, 2018

Codecov Report

Merging #585 into development will decrease coverage by 0.03%.
The diff coverage is 92.85%.


@@               Coverage Diff               @@
##           development     #585      +/-   ##
===============================================
- Coverage        78.63%   78.59%   -0.04%     
===============================================
  Files              130      130              
  Lines            10119    10129      +10     
===============================================
+ Hits              7957     7961       +4     
- Misses            2162     2168       +6
Impacted Files Coverage Δ
autosklearn/ensembles/ensemble_selection.py 58.18% <100%> (+0.77%) ⬆️
autosklearn/estimators.py 90.65% <85.71%> (-0.44%) ⬇️
..._preprocessing/select_percentile_classification.py 82.75% <0%> (-6.9%) ⬇️
...e/components/feature_preprocessing/select_rates.py 84.61% <0%> (-1.54%) ⬇️
...ipeline/components/classification/decision_tree.py 93.75% <0%> (ø) ⬆️
...rn/pipeline/components/regression/decision_tree.py 94.64% <0%> (ø) ⬆️
autosklearn/evaluation/train_evaluator.py 93.77% <0%> (+0.02%) ⬆️

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 66d9f09...b6336e3.

@ahn1340 ahn1340 changed the title [WIP]Fix classifier bug Fix classifier bug Dec 6, 2018
@mfeurer mfeurer merged commit b53c7e1 into automl:development Dec 6, 2018
@ahn1340 ahn1340 deleted the classifier_bug branch December 6, 2018 15:46