Data Intelligence Lab
Welcome to the Data Intelligence Lab!
Mission statement: to push the boundaries of data intelligence research and train the next leaders from KAIST
Software 2.0 is a fundamental shift in software engineering where machine learning is prevalent and data becomes a first-class citizen, on par with code. The goal of the Data Intelligence Lab is to pioneer the inevitable trend of Responsible/Trustworthy AI, Data-centric AI, and Big Data – AI Integration. More recently, we are extending these research directions to Large Language Models (LLMs) as well. We work closely with the industry (Google Research, Microsoft Research, NVIDIA Research, Samsung Electronics, SK Hynix, and SK Telecom). Check out our vision paper Responsible AI Challenges in End-to-end Machine Learning (IEEE Data Eng. Bull '21).
We are looking for experienced Post-docs and highly-motivated Masters and PhD students. If you are interested in joining the DI Lab, please read this first. Here is a list of recommended courses and a lab fair poster designed by my students.
Latest News
[2024/3] Seungjun Oh joined our lab. Welcome!
[2024/3] Promoted to Tenured Associate Professor
[2024/2] Ki Hyun Tae and Yuji Roh are the first Ph.D. graduates from our lab. Congrats and looking forward to a very bright future!
[2023/12] Falcon: Fair Active Learning using Multi-armed Bandits accepted to VLDB 2024 (Top Database conference). Congrats Ki Hyun Tae and Jaeyoung Park!
[2023/12] Serving as an Associate Editor for VLDB 2025 (PVLDB Volume 18; Top Database conference)
[2023/12] Quilt: Robust Data Segment Selection against Concept Drifts accepted to AAAI 2024 (Top AI conference). Congrats Minsu Kim and Seong-Hyeon Hwang!
[2023/11] Serving as an Associate Editor for the IEEE TKDE journal (Top Database/Data Mining journal; currently only editor from Korea)
[2023/11] The second NYU-KAIST Inclusive AI Center workshop was held at NYU.
[2023/10] Supported by a new Google Research Award for a year in collaboration with the TensorFlow Extended (TFX) team!
[2023/8] The NYU-KAIST Inclusive AI Center workshop was held at KAIST.
[2023/6] Yuji Roh is a research intern at Google DeepMind & Youtube during the summer.
[2023/4] Improving Fair Training under Correlation Shifts accepted to ICML 2023 (Top Machine Learning conference). Congrats Yuji Roh!
[2023/4] Dr-Fairness: Dynamic Data Ratio Adjustment for Fair Training on Real and Generated Data accepted to Transactions on Machine Learning Research (TMLR), a new Machine Learning journal. Congrats Yuji Roh!
[2023/3] Gave a tech talk on Responsible AI to the Google TensorFlow Extended (TFX) US and Korea teams.
[2022/3] Kahee Lim and Jio Oh joined our lab. Welcome!
[2022/12] Data Collection and Quality Challenges in Deep Learning: A Data-Centric AI Perspective accepted to the VLDB Journal (Top Database journal). Congrats Yuji Roh!
[2022/11] Gave a tech talk on Responsible AI to Microsoft Research Asia.
[2022/11] Two papers Redactor: A Data-centric and Individualized Defense Against Inference Attacks and XClusters: Explainability-first Clustering were accepted to AAAI 2023 (Top AI conference). Congrats Geon Heo and Hyunseung Hwang!
[2022/10] Yuji Roh is a recipient of the prestigious Microsoft Research PhD Fellowship 2022. She is among 36 students worldwide and the only recipient from universities in Korea. Congrats Yuji Roh! (KAIST article English & Korean, interview)
[2022/10] Received a KAIST EE Best Teaching Award for EE477 Database and Big Data Systems, Spring 2022 (# students: 64, course rating: 4.84/5, which is highest among all KAIST EE undergrad courses). Supported by a Google Cloud Platform (GCP) Education Grant.
[2022/8] iFlipper: Label Flipping for Individual Fairness accepted to ACM SIGMOD 2023 (Top Database conference). Congrats Ki Hyun Tae and Jaeyoung Park!