AndreyML/RuCode-Payroll-2022

 
 


RuCode-Payroll-2022

Our top-2 solution to the RuCode 2022 AI Payroll task

[Image: public leaderboard and private leaderboard]

Our team:

[Image: team]


Contest description:

  • Our task was to predict the salary for given vacancies from trudvsem.ru
  • The data was provided as .csv files: approx. 1 million samples in train.csv (71 columns) and 40k samples in test.csv (68 columns) (public: 40%, private: 60%)
  • The training data included categorical, binary, continuous and text features
  • The target variable was mean_salary; the additional targets were base_salary_max and base_salary_mean
  • Many values were missing, e.g. in one of the features 95% of the values were NaN
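
To make the missing-value point concrete, here is a minimal sketch of how the share of NaN values per column could be measured with pandas. It is an illustration, not the code from our notebooks: the helper `missing_share`, the toy frame and the `region` and `bonus` columns are hypothetical; only `mean_salary` is a column name from the task.

```python
import numpy as np
import pandas as pd


def missing_share(df: pd.DataFrame) -> pd.Series:
    """Return the fraction of missing values per column, largest first."""
    return df.isna().mean().sort_values(ascending=False)


# Tiny synthetic frame standing in for train.csv (assumption: the real
# file would be loaded with pd.read_csv instead).
toy = pd.DataFrame({
    "mean_salary": [30000.0, 45000.0, 52000.0, 28000.0],
    "region": ["A", None, "B", "A"],             # hypothetical categorical feature
    "bonus": [np.nan, np.nan, np.nan, 5000.0],   # hypothetical sparse feature
})

shares = missing_share(toy)

# Columns where more than half of the values are missing are candidates
# for dropping or for a dedicated "is missing" indicator feature.
sparse_columns = shares[shares > 0.5].index.tolist()
```

A summary like `shares` helps decide, per column, whether to impute, add an indicator, or drop the feature.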

Solution description:

  1. Fill in the missing values
  2. Generate new features
  3. Select features
  4. Transform the target
  5. Cross-validate the model
  6. Stabilize the model

Languages

  • Jupyter Notebook 100.0%