Beyond feature engineering, data-centric AI can help in Kaggle. This notebook shows how easily cleanlab can improve the training dataset for an XGBoost model, producing 12% reduction in error without any change to the existing model+training+data-processing code:
We are looking for more contributors to cleanlab in 2023. Help shape the future of data-centric AI and ensure it remains free software, especially if you love Python and practical tools for real-world data science!