TabSketchFM: Sketch-based Tabular Representation Learning for Data Discovery over Data LakesAamod KhatiwadaHarsha Kokelet al.2025ICDE 2025
CodeGenWrangler: Data Wrangling task automation using Code-Generating ModelsAkella AshleshaAbhijit Manatkaret al.2025NAACL 2025
DELIFT: DATA EFFICIENT LANGUAGE MODEL INSTRUCTION FINE-TUNINGIshika AgarwalKrishnateja Killamsettyet al.2025ICLR 2025
Evolution of catalysis at IBM: From microelectronics to biomedicine to sustainability with AI-driven innovationJames HedrickTim Erdmannet al.2025ACS Spring 2025
Preparing Good Data for Generative AI: Challenges and Approaches (Good-Data)David VazquezLaure Berti-equilleet al.2025AAAI 2025
IBM Solution: Data FabricOur research is regularly developed into new features for Data Fabric in IBM Cloud Pak for Data. Learn more
Simplified and Performant Access to Data in the CloudReducing friction for scientific and foundation model workflows in Kubernetes.