
Summaries like this, in your inbox every morning.
Sign up free →A new course titled 'Data-Centric AI' ran from January 16–26, 2024 at MIT, co-taught by Anish, Curtis, and Jonas. The course covers algorithms to find and fix common issues in ML data and to construct better datasets, concentrating on supervised learning tasks like classification.
Data-Centric AI treats dataset improvement as a systematic engineering discipline, in contrast to traditional machine learning classes that focus on producing effective models for a given dataset. The course emphasizes practical techniques not covered in most ML classes to address the 'garbage in, garbage out' problem in real-world ML applications.
Topics included label errors, confident learning, class imbalance, outliers, distribution shift, dataset creation and curation, data-centric evaluation of ML models, and data curation for LLMs. Each lecture included an accompanying hands-on lab assignment in Python / Jupyter Notebook.
No comments yet. Be the first to share your thoughts!
Log in to join the discussion




Get curated AI news from 200+ sources delivered daily to your inbox. Free to use.
Get Started Free5 minutes a day. The AI essentials.
200+ sources · Email / LINE / Slack