The never-ending quest to predict crime using AI

The practice has a long history of skewing police toward communities of color. But that hasn’t stopped researchers from building crime-predicting tools.

July 15, 2022 at 7:00 a.m. EDT
Jeff Brantingham, anthropology professor at the University of California at Los Angeles, displays a computer-generated view of “predictive policing” zones at the L.A. Police Department Unified Command Post. (Damian Dovarganes/AP)

In the world of the 2002 movie “Minority Report,” crime is nearly nonexistent. Clairvoyants predict when murders are about to happen, allowing police to swoop in and arrest the soon-to-be felons.

Though Tom Cruise’s all-powerful police force is evidence of a dystopian society, researchers have long chased the tantalizing prospect of being able to predict crime before it happens.

And as the United States faces rising rates of violent crime, another research effort has emerged: A group of University of Chicago scientists unveiled an algorithm last month, boasting in a news release of its ability to predict crime with “90% accuracy.”

The algorithm identifies locations in major cities that it calculates have a high likelihood of crimes, like homicides and burglaries, occurring in the next week. The software can also evaluate how policing varies across neighborhoods in eight major cities in the United States, including Chicago, Los Angeles and Philadelphia.

But using artificial intelligence to direct law enforcement rings alarm bells for many social justice scholars and criminologists, who cite a long history of such technology unfairly suggesting increased policing of Black and Latino people. Even one of the study’s authors acknowledges that an algorithm’s ability to predict crime is limited.

“The past does not tell you anything about the future,” said Ishanu Chattopadhyay, a professor at the University of Chicago and the lead researcher on the algorithm. “The question is: To what degree does the past actually influence the future? And to what degree are the events spontaneous or truly random? … Our ability to predict is limited by that.”

Police have long used any tool available to predict crime. Before technological advances, cops would huddle in conference rooms and stick pins marking crime incidents on a map, hoping the clusters would help them figure out where to look next.

Over the past 15 years, the country’s largest police departments — such as New York, Los Angeles and Chicago — started thinking about ways of using artificial intelligence to not just analyze crime but predict it. They often turned to data analytics companies such as PredPol and Palantir, which create software that law enforcement can use to forecast crime.

Predictive policing tools are built by feeding data — such as crime reports, arrest records and license plate images — to an algorithm, which is trained to look for patterns to predict where and when a certain type of crime will occur in the future.
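
How such a tool might work mechanically is easier to see in a simplified sketch. The short Python program below trains a toy model on synthetic weekly incident counts per grid cell; the grid, the features and the model choice are illustrative assumptions, not any vendor’s actual system.

    # Hedged sketch, not any vendor's actual system: a toy model that flags
    # grid cells as "high risk" next week based on recent incident counts.
    # All data here is synthetic.
    import numpy as np
    from sklearn.linear_model import LogisticRegression

    rng = np.random.default_rng(0)

    n_cells, n_weeks = 200, 104                 # city grid cells, two years of weeks
    base_rate = rng.gamma(2.0, 1.0, n_cells)    # each cell's latent incident rate
    counts = rng.poisson(base_rate, size=(n_weeks, n_cells))  # recorded incidents

    # Feature: incidents in the previous four weeks for a cell.
    # Label: whether that cell records any incident the following week.
    X, y = [], []
    for t in range(4, n_weeks - 1):
        X.append(counts[t - 4:t].sum(axis=0))
        y.append((counts[t + 1] > 0).astype(int))
    X = np.concatenate(X)[:, None]
    y = np.concatenate(y)

    model = LogisticRegression().fit(X, y)

    # "Predict" next week's high-risk cells from the most recent four weeks.
    recent = counts[-4:].sum(axis=0)[:, None]
    risk = model.predict_proba(recent)[:, 1]
    print("cells flagged high-risk:", int((risk > 0.5).sum()), "of", n_cells)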

But algorithms are only as good as the data they are fed, which poses a particular problem in the United States, said Vincent Southerland, the co-faculty director of New York University’s Center on Race, Inequality and the Law.

Historically, police data in the United States has been biased, according to Southerland. Cops are more likely to arrest or charge someone with a crime in low-income neighborhoods dominated by people of color, a reality that reflects not necessarily where crime is happening, but where cops are spending their time.

That means most data sets of criminal activity overrepresent people of color and low-income neighborhoods. Feeding that data into an algorithm leads it to predict more criminal activity in those areas, creating a feedback loop that is racially and socioeconomically biased, Southerland added.

“You have data that is infected by, or tainted by, some bias — and that bias is going to appear on the other side of the analysis,” he said. “You get out of it, what you put into it.”
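
That feedback loop can be demonstrated in a few lines of code. The toy simulation below, built on invented assumptions rather than real data, gives two areas identical underlying offense rates but starts one with more patrols; because recorded offenses depend partly on patrol presence, the initial skew keeps reinforcing itself.

    # Toy simulation of the feedback loop (illustrative assumptions only):
    # areas A and B have the same true offense rate, but B starts with more
    # patrols, so more of B's offenses are recorded, which draws more patrols.
    import numpy as np

    rng = np.random.default_rng(1)
    true_rate = np.array([10.0, 10.0])   # identical underlying weekly offenses
    patrols = np.array([0.2, 0.8])       # patrol share starts skewed toward B

    for week in range(1, 11):
        offenses = rng.poisson(true_rate)
        # Assumption: the chance an offense is recorded rises with patrol presence.
        recorded = rng.binomial(offenses, 0.2 + 0.6 * patrols)
        # "Predictive" reallocation: next week's patrols follow recorded counts.
        patrols = recorded / max(recorded.sum(), 1)
        print(f"week {week:2d}  recorded A/B = {recorded}  patrol share in B = {patrols[1]:.2f}")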

In the real world, predictive policing software has caused significant problems.

In 2019, the Los Angeles Police Department suspended its crime prediction program, LASER, which used historical crime data to predict crime hot spots and Palantir software to assign people criminal risk scores, after an internal audit showed it led police to unfairly subject Black and Latino people to more surveillance.

In Chicago, the police used predictive policing software from the Illinois Institute of Technology to create a list of people most likely to be involved in a violent crime. A study from RAND and a subsequent investigation by the Chicago Sun-Times showed that the list included every single person arrested or fingerprinted in Chicago since 2013. The program was scrapped in 2020.

Predictive policing algorithms are “not a crystal ball,” said John S. Hollywood, a senior operations researcher at RAND, who helped audit the Chicago police department’s use of predictive algorithms. “It is better to look more holistically … what is happening in terms of specific things in my community that are leading to crimes right now.”

Chattopadhyay said his team built its software knowing the troubled history of such algorithms.

In making the algorithm, Chattopadhyay’s team segmented major cities into 1,000-square-foot city blocks and used city crime data from the last three to five years to train it. The algorithm spits out whether there is a high or low risk of crime happening in a segment at a certain time, up to one week into the future.

To limit bias, the team omitted crime data such as marijuana arrests, traffic stops and other low-level offenses, because research shows Black and Latino people are more often targeted for those types of offenses. Instead, they gave the algorithm data on homicides, assaults and batteries, along with property crimes like burglaries and motor vehicle thefts.
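
In code, that preprocessing amounts to dropping the excluded offense categories and binning what remains into grid cells and weekly windows. The sketch below uses invented column names, coordinates and a nominal cell size; it is not the researchers’ actual pipeline.

    # Hedged preprocessing sketch (hypothetical field names and toy data):
    # keep serious violent and property offenses, then count incidents per
    # grid cell and week.
    import pandas as pd

    KEEP = {"HOMICIDE", "ASSAULT", "BATTERY", "BURGLARY", "MOTOR VEHICLE THEFT"}
    CELL_FT = 1000  # cell size in feet, an illustrative assumption

    incidents = pd.DataFrame({
        "type": ["HOMICIDE", "NARCOTICS", "BURGLARY", "TRAFFIC STOP", "BATTERY"],
        "x_ft": [1200, 300, 4500, 900, 1300],   # planar coordinates in feet
        "y_ft": [800, 2500, 700, 100, 850],
        "date": pd.to_datetime(["2022-01-03", "2022-01-04", "2022-01-10",
                                "2022-01-11", "2022-01-12"]),
    })

    # Drop the offense categories the researchers excluded to limit bias.
    serious = incidents[incidents["type"].isin(KEEP)].copy()

    # Assign each remaining incident to a grid cell and a week, then count.
    serious["cell"] = list(zip(serious["x_ft"] // CELL_FT, serious["y_ft"] // CELL_FT))
    serious["week"] = serious["date"].dt.isocalendar().week
    print(serious.groupby(["cell", "week"]).size().rename("incidents"))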

But the main point of the study, he said, was to use the algorithm to interrogate how policing is biased. His team compared arrest data from neighborhoods of varying socioeconomic levels. They found that crime in wealthier areas led to more arrests, whereas crime in poorer neighborhoods did not always have the same effect, revealing a discrepancy in enforcement.

Chattopadhyay said these results help provide evidence to people who complain that law enforcement ignores poorer neighborhoods when there’s a spike in violent or property crime. “This allows you to quantify that,” he said. “To show the evidence.”
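
The comparison itself is straightforward to express: arrests per recorded incident, grouped by a neighborhood’s socioeconomic bracket. The figures in the sketch below are invented purely to show the shape of the calculation, not results from the study.

    # Hedged sketch of the enforcement comparison, with made-up numbers.
    import pandas as pd

    df = pd.DataFrame({
        "neighborhood": ["A", "B", "C", "D", "E", "F"],
        "income_bracket": ["high", "high", "middle", "middle", "low", "low"],
        "incidents": [120, 90, 150, 130, 210, 240],
        "arrests":   [60, 40, 55, 45, 50, 55],
    })

    # Arrests per recorded incident, averaged within each income bracket.
    df["arrests_per_incident"] = df["arrests"] / df["incidents"]
    print(df.groupby("income_bracket")["arrests_per_incident"].mean().round(2))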

Arvind Narayanan, a computer science professor at Princeton University, said the news release and news articles about the study did not focus enough on its attempt to investigate biases in police enforcement and overemphasized the algorithm’s accuracy claims.

“For predictive policing, a single accuracy … figure is totally insufficient to assess whether a tool is useful or just,” he said. “Crime is rare, so it’s probable that most predictions of crime are false positives.”
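
A short worked example shows why. When the event being predicted is rare, a model can report roughly 90 percent accuracy even though most of its positive predictions are wrong; the prevalence, sensitivity and specificity below are illustrative assumptions, not figures from the study.

    # Worked example of the base-rate problem (illustrative numbers only).
    prevalence = 0.01     # 1 percent of cell-weeks actually see a serious crime
    sensitivity = 0.90    # fraction of true crime cell-weeks the model flags
    specificity = 0.90    # fraction of quiet cell-weeks correctly left alone

    true_pos = prevalence * sensitivity
    false_pos = (1 - prevalence) * (1 - specificity)
    accuracy = true_pos + (1 - prevalence) * specificity
    precision = true_pos / (true_pos + false_pos)

    print(f"accuracy  = {accuracy:.1%}")   # about 90%
    print(f"precision = {precision:.1%}")  # about 8%: most flags are false positives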

Criminal justice scholars, policing experts and technologists note that even if an algorithm is accurate, it can still be used by law enforcement to target people of color and those living in poorer neighborhoods for unjustified surveillance and monitoring.

Andrew Papachristos, a sociology professor at Northwestern University, said that when law enforcement uses algorithms to map and analyze crime, it often subjects people of color and low-income communities to more policing. When departments are criticized for over-policing certain neighborhoods, they often use data to justify the tactics, he said.

Papachristos said that if community groups could instead use the tools to figure out where to provide more social services, increase community engagement and address the root social causes of violence, that would be a better use of the technology. Still, he said, that is unlikely to happen, because the organizations doing this work are cash-strapped and skeptical of using data.

“They have seen data misused against them in court. They’ve seen it used to profile individuals,” he said. “So, if somebody rolls up like me and says, ‘Hey, we want to help you use data.’ It’s not an immediate like, ‘Oh my God, thank you.’ It’s like, ‘What data are you using?’ ”

Hollywood, of the RAND Corporation, agreed. He said that to truly reduce crime, police departments need to work in tandem with social workers and community groups to address issues of education, housing and civic engagement.

“[Algorithms] are a bright, shiny object,” he said. “These things tend to be distractions.”