
Conversation

charlesfu4 (Contributor) commented on Jul 8, 2020

When specifying a customized cross-validation object, the AutoML process tends to crash at an early stage.
The corresponding debug message is shown below:

```
[DEBUG] [2020-06-25 13:06:54,163:AutoMLSMBO(2176371645)::129e649232a63f02b1825a7f17c56883]
Return: Status: <StatusType.CRASHED: 3>, cost: 1.000000, time: 0.059507,
additional: {'traceback': 'Traceback (most recent call last):
  File "/home/charles/anaconda3/envs/autoskdev/lib/python3.7/site-packages/autosklearn/evaluation/__init__.py", line 29, in fit_predict_try_except_decorator
    return ta(queue=queue, **kwargs)
  File "/home/charles/anaconda3/envs/autoskdev/lib/python3.7/site-packages/autosklearn/evaluation/train_evaluator.py", line 1237, in eval_cv
    budget_type=budget_type,
  File "/home/charles/anaconda3/envs/autoskdev/lib/python3.7/site-packages/autosklearn/evaluation/train_evaluator.py", line 180, in __init__
    self.splitter = self.get_splitter(self.datamanager)
  File "/home/charles/anaconda3/envs/autoskdev/lib/python3.7/site-packages/autosklearn/evaluation/train_evaluator.py", line 952, in get_splitter
    cv = copy.deepcopy(self.resampling_strategy)(**init_dict)
TypeError: __init__() got an unexpected keyword argument 'cost_for_crash'',
 'error': 'TypeError("__init__() got an unexpected keyword argument \'cost_for_crash\'")',
 'configuration_origin': 'Random initial design.'}
```

The crash is basically caused by deep-copying resampling_strategy_args into init_dict in train_evaluator.py, which picks up extra parameters (such as cost_for_crash) that are not meant for initializing the cross-validation object. I therefore made a modification so that init_dict only contains the default arguments defined at the beginning of train_evaluator.py. Because of this change, PredefinedSplit also received default arguments to avoid a failure in the unit tests.
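
To make this concrete, here is a minimal sketch of the failure mode and of the idea behind the fix, using plain sklearn outside of auto-sklearn. The extra cost_for_crash key mimics what ended up in resampling_strategy_args at runtime, and allowed_keys is only an illustrative name, not the actual variable used in train_evaluator.py.

```python
# Minimal sketch (assumption: plain sklearn KFold as the custom resampling
# strategy; `allowed_keys` is illustrative, not the real variable name).
import copy
from sklearn.model_selection import KFold

resampling_strategy = KFold
resampling_strategy_args = {'shuffle': True, 'cost_for_crash': 1.0}

# Before the fix: every key in resampling_strategy_args was passed through.
init_dict = copy.deepcopy(resampling_strategy_args)
try:
    cv = copy.deepcopy(resampling_strategy)(**init_dict)
except TypeError as e:
    print(e)  # __init__() got an unexpected keyword argument 'cost_for_crash'

# Idea behind the fix: keep only arguments that belong to the splitter.
allowed_keys = {'n_splits', 'shuffle', 'random_state'}
init_dict = {k: v for k, v in resampling_strategy_args.items() if k in allowed_keys}
cv = copy.deepcopy(resampling_strategy)(**init_dict)
print(cv)  # e.g. KFold(n_splits=5, random_state=None, shuffle=True)
```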

One open question: the same resampling strategy specified in different ways, for example KFold versus 'cv', should give the same results with the same arguments and seeds. However, they produce quite different outcomes in cv_results_, the final pipeline, and the hyperparameters ('cv' apparently overfits while KFold does not). I am not sure what causes this and will do more research on it later this week.

charlesfu4 (Contributor, author) commented on Jul 14, 2020

Hi @mfeurer, could you do a small code review? I think the problem is now fixed.
Regarding the different results from the same folds of 'cv' and KFold mentioned above: this happened only because 'cv' without an explicit shuffle parameter defaults to shuffle=True in train_evaluator.py (code snippet below). I re-checked with the same folds and shuffle set to False on both sides and got the same cv results.

Thank you!

```python
y = D.data['Y_train']
shuffle = self.resampling_strategy_args.get('shuffle', True)
```
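
As a standalone illustration (plain sklearn, not auto-sklearn code) of why the earlier comparison differed: with shuffling enabled the fold indices are permuted, so the internal 'cv' strategy (shuffle defaulting to True) and a user-supplied KFold(shuffle=False) simply did not evaluate the same splits.

```python
# Standalone check: shuffled and unshuffled KFold yield different fold
# indices, so the two setups were not evaluating the same splits.
import numpy as np
from sklearn.model_selection import KFold

X = np.arange(20).reshape(10, 2)
shuffled = [test for _, test in KFold(n_splits=5, shuffle=True, random_state=1).split(X)]
unshuffled = [test for _, test in KFold(n_splits=5, shuffle=False).split(X)]
print(shuffled)    # permuted test indices
print(unshuffled)  # contiguous blocks: [0 1], [2 3], [4 5], [6 7], [8 9]
```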

mfeurer (Contributor) commented on Jul 17, 2020

Hey @charlesfu4, I'm currently busy attending the ICML conference. I'll do my best to have a look at this PR next week.

codecov-commenter commented on Jul 17, 2020

Codecov Report

Merging #897 into development will decrease coverage by 0.09%.
The diff coverage is 66.66%.


@@               Coverage Diff               @@
##           development     #897      +/-   ##
===============================================
- Coverage        84.98%   84.89%   -0.10%     
===============================================
  Files              128      128              
  Lines             9439     9442       +3     
===============================================
- Hits              8022     8016       -6     
- Misses            1417     1426       +9     
Impacted Files Coverage Δ
autosklearn/evaluation/train_evaluator.py 72.58% <66.66%> (-0.05%) ⬇️
...eline/components/feature_preprocessing/fast_ica.py 91.30% <0.00%> (-6.53%) ⬇️
...mponents/feature_preprocessing/nystroem_sampler.py 85.29% <0.00%> (-5.89%) ⬇️
...ine/components/classification/gradient_boosting.py 91.89% <0.00%> (-0.91%) ⬇️

Continue to review full report at Codecov.

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 2786d63...bf5b187.

charlesfu4 (Contributor, author) replied:

> Hey @charlesfu4, I'm currently busy attending the ICML conference. I'll do my best to have a look at this PR next week.

Thanks a lot, I hope you enjoy it! By the way, do you know why my Codecov build is stuck at two months ago? Is it a known bug, or just something I forgot to set up?

mfeurer (Contributor) left a review comment:

Hey @charlesfu4, thanks for looking into this. Your approach would most likely work, but I was wondering where these additional variables came from and found that parts of the code need to be updated for the latest SMAC version. I created a new PR, #908; could you please have a look at whether that fixes the original issue?

I believe your PR also adds extra code for handling multi-output regression, but that should be a different PR.

```diff
 'LeaveOneOut': {},
 'LeavePOut': {'p': 2},
-'PredefinedSplit': {},
+'PredefinedSplit': {'test_fold': [0, 1, 2, 3]},
```
mfeurer (Contributor) commented on the diff:

Could this also be an empty list?

charlesfu4 (Contributor, author) replied:

PredefinedSplit with an empty list yields no train/test indices after split, so I do not think it would work. A quick check is shown below.
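
A quick check of both cases, assuming plain sklearn rather than the actual defaults dictionary in train_evaluator.py:

```python
# Assumption: plain sklearn. An empty test_fold yields zero splits, while
# the default [0, 1, 2, 3] used in this PR yields four usable splits.
from sklearn.model_selection import PredefinedSplit

empty = PredefinedSplit(test_fold=[])
print(empty.get_n_splits())    # 0
print(list(empty.split()))     # [] -- nothing to iterate over

default = PredefinedSplit(test_fold=[0, 1, 2, 3])
print(default.get_n_splits())  # 4
for train_idx, test_idx in default.split():
    print(train_idx, test_idx)  # e.g. [1 2 3] [0], then [0 2 3] [1], ...
```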

```diff
 'for chosen CrossValidator.')
 try:
-    if self.resampling_strategy_args['groups'].shape != y.shape:
+    if self.resampling_strategy_args['groups'].shape[-1] != y.shape[-1]:
```
mfeurer (Contributor) commented on the diff:

Could you please let me know when this fails? Also, could you please update the error message?

charlesfu4 (Contributor, author) replied:

With the ravel above removed, shape[-1] represents the number of samples regardless of the target type. The old check fails for, e.g., a multi-output regression target with n_samples = 100 and n_targets = 3: if we keep the ravel in line 912 and keep line 925 comparing the full shape, the 'groups' array we have to supply must have shape (300,), even though there are only 100 samples, which leads to this error message:

```
[ERROR] [2020-07-22 22:27:55,917:AutoML(1):d4069f363ae29ee607f565032a35aa5f] Error creating dummy predictions:
{'traceback': 'Traceback (most recent call last):
  File "/home/charles/anaconda3/envs/autoskdev/lib/python3.7/site-packages/autosklearn/evaluation/__init__.py", line 29, in fit_predict_try_except_decorator
    return ta(queue=queue, **kwargs)
  File "/home/charles/anaconda3/envs/autoskdev/lib/python3.7/site-packages/autosklearn/evaluation/train_evaluator.py", line 1239, in eval_cv
    .fit_predict_and_loss(iterative=iterative)
  File "/home/charles/anaconda3/envs/autoskdev/lib/python3.7/site-packages/autosklearn/evaluation/train_evaluator.py", line 446, in fit_predict_and_loss
    groups=self.resampling_strategy_args.get('groups')
  File "/home/charles/anaconda3/envs/autoskdev/lib/python3.7/site-packages/sklearn/model_selection/_split.py", line 327, in split
    X, y, groups = indexable(X, y, groups)
  File "/home/charles/anaconda3/envs/autoskdev/lib/python3.7/site-packages/sklearn/utils/validation.py", line 248, in indexable
    check_consistent_length(*result)
  File "/home/charles/anaconda3/envs/autoskdev/lib/python3.7/site-packages/sklearn/utils/validation.py", line 212, in check_consistent_length
    " samples: %r" % [int(l) for l in lengths])
ValueError: Found input variables with inconsistent numbers of samples: [100, 100, 300]',
 'error': "ValueError('Found input variables with inconsistent numbers of samples: [100, 100, 300]')",
 'configuration_origin': 'DUMMY'}
```

I have also tried it manually; a clearer version is shown in the photo and in the reproduction sketch below.

[Image: groups_error]
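
For completeness, a minimal reproduction sketch of the error above, assuming GroupKFold as the custom splitter and a multi-output target of shape (100, 3); the (300,)-long groups array is what the old shape check against the ravelled y effectively forced the user to provide.

```python
# Reproduction sketch (assumption: GroupKFold as the custom splitter).
# The old check compared groups.shape against y.ravel().shape, so only a
# (300,)-long groups array passed it; sklearn then rejects that array at
# split time because X and y contain only 100 samples.
import numpy as np
from sklearn.model_selection import GroupKFold

rng = np.random.RandomState(0)
X = rng.rand(100, 5)                    # 100 samples, 5 features
y = rng.rand(100, 3)                    # multi-output regression target
groups = np.repeat(np.arange(10), 30)   # shape (300,): matches y.ravel(), not X

try:
    list(GroupKFold(n_splits=5).split(X, y, groups=groups))
except ValueError as e:
    print(e)  # Found input variables with inconsistent numbers of samples: [100, 100, 300]
```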

```diff
 else:
     if 'groups' in self.resampling_strategy_args:
-        if self.resampling_strategy_args['groups'].shape != y.shape:
+        if self.resampling_strategy_args['groups'].shape[-1] != y.shape[-1]:
```
mfeurer (Contributor) commented on the diff:

Same here.

mfeurer (Contributor) commented on Jul 24, 2020

Hey @charlesfu4, I just merged #908 to allow custom resampling strategies. As you said, this PR also allows using multi-output regression with custom resampling strategies. Would you like to create a new PR allowing custom resampling strategies for multi-output regression?

mfeurer closed this on Jul 24, 2020
charlesfu4 (Contributor, author) replied:

> Hey @charlesfu4, I just merged #908 to allow custom resampling strategies. As you said, this PR also allows using multi-output regression with custom resampling strategies. Would you like to create a new PR allowing custom resampling strategies for multi-output regression?

Sure, thank you! I will do it soon.

charlesfu4 added a commit to charlesfu4/auto-sklearn that referenced this pull request Jul 25, 2020
charlesfu4 added a commit to charlesfu4/auto-sklearn that referenced this pull request Sep 7, 2020
mfeurer pushed a commit that referenced this pull request Sep 15, 2020
* Fix #897 - allow groups strategy cv on multi-output regression

* Fix classification

* Fix coding length.

* fix single_reg missing ravel

* Revert "fix single_reg missing ravel"

This reverts commit 66f95c0.

* Fix single_reg ravel missing

* Fix flake8 warning

* Unittest updated for get_splitter of regressions

* Fix letters cast to lower-case

* Fix letters cased to lower-case 2

* Asserts functions updated.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants