Lise Getoor, professor of computer science and engineering, will discuss "Responsible Data Science" in the 53rd annual Faculty Research Lecture at UC Santa Cruz. The lecture, which is free and open to the public, will take place on Tuesday, February 26, at 7 p.m. in the Music Recital Hall on the UCSC campus.
Getoor's lecture will address the promises and challenges of data science, a field fast becoming one of the strongest forces shaping society today and an area of expertise at the Baskin School of Engineering at UC Santa Cruz.
Getoor is a data science legend. She’s given keynotes at major conferences all over the world before crowds of thousands of experts in areas as diverse as artificial intelligence, statistics, and database systems. Her lab created an open-source tool called Probabilistic Soft Logic, used for everything from energy disaggregation and hybrid recommender systems to the analysis of human trafficking.
“My research mixes tools from different areas,” Getoor said. “It makes use of logic and probability to model networks and takes into account context and structure.”
Data science aims to extract useful information from the huge amounts of data created by modern society. Context, as Getoor’s work has revealed, is crucial. Most approaches to analyzing data require extracting information from one database and placing it in another (such as a spreadsheet), a process that can flatten intricate structures within the databases that might otherwise have revealed important insights.
Getoor heads two major data science projects at UC Santa Cruz. In 2017, the National Science Foundation awarded a $1.5 million grant to Getoor and a group of other UCSC computer scientists, statisticians, and mathematicians as part of the Transdisciplinary Research in Principles of Data Science (TRIPODS) program, an effort to develop the theoretical principles of the field. Getoor’s group looks at the challenges of incompleteness, uncertainty, and bias in large, heterogeneous sets of interconnected data.
“For a long time, there’s been an active informal data science group on campus,” Getoor said. “Under TRIPODS there’s fascinating work being done.”
Getoor also directs the D3 Data Science Research Center at UC Santa Cruz, a collaboration between academia and industry designed to develop open-source tools for collecting data, discovering patterns, and making decisions. Collaboration across disciplines is also important, particularly with respect to ethical and social concerns.
“There’s been an explosion of interest in high-stakes decision-making [involving data science],” Getoor said. “Some common examples are recidivism prediction, where there is an urgent need to address fairness and bias, as well as other tasks like loans and automated hiring decisions. Perspective is important: we need to have conversations about privacy and ethics while we develop these powerful new algorithmic tools.”
As the risks and benefits of insights developed by data science spread through society, Getoor sees a growing need for data science literacy and recognition that there are serious limits to what algorithms and artificial intelligence can achieve.
“Algorithms aren’t a magic bullet for all society’s ills,” she said. “Doing good data science requires a collaborative and curious outlook so you’re collaborating with the people who will be using and affected by the system, and at the same time understand the powers and limitations of data science. And it’s important to communicate that at all levels.”
Getoor said she hopes her lecture will prompt a broader conversation about data science within society.
Parking for the event at the Performing Arts Complex is $4. For disability-related accommodations, contact senate@ucsc.edu.