Raw data for crime statistics is available at: http://bjs.ojp.usdoj.gov/index.cfm?ty=pbdetail&iid=4515
I decided to look at one small subset of crime statistics, number of arrests. In the mind of some, if you are arrested for a crime, then you are likely to have done it, again this is not a correct assumption, neither is the one that if you are convicted of the crime then you are clearly guilty, but I am not going to address either of these assumptions in this particular blog, that is a project for a later date.
Statistics about race and arrests are often used to promote ideology, to push for various laws and can have an impact on the way people view others based on race. It can be a surprise to some that over 91% of blacks between 1990 to 2010 have never been arrested. What is true is that the percent of blacks arrested (around 9%) is higher than when compared to that of whites (around 3.6%) as shown in the graph below.
Depending on how you look at the statistics and numbers, the picture can seem very different. The graph above clearly shows that while most blacks are not arrested, a larger percent of blacks are arrested compared to whites. Percents tell you something about the populations you are looking at, but you have to take raw numbers into consideration when you look at these statistics especially when the populations being described are not the same size.
For example you can look at the people from each a certain city that commit a specific crime. If you look at the statistics for 2 streets you may find that 10% of the people in one street committed a crime, while that number is 50% in another street. You might be justified in thinking wow, there is something going on in the street with the 50% crime rate, until you find out that one street has 10 people in it, and the other 2. The number of people from each street that were arrested for the crime was the same 1, but if you looked only at the statistics and not the raw numbers it would seem that more were arrested from one street than the other.
So here are the raw numbers for arrests that gave rise to the percentages seen above:
What this shows, which many forget when looking at this type of data, is that most people in the US are white, thus the actual number of arrests (while a smaller percentage of the total white population) are of white people, not blacks. What is interesting, is that you can get two almost exactly opposite graphs, simply by either plotting percent arrests (of total population) or raw numbers.
Here is the graph for percent arrests:
and here is the graph for number of arrests:
If you look at the graph for percentages carefully and look at the y axis, the axis only goes up to 10, this is a trick often used to make differences look bigger than they are (sometimes it is OK if you are looking at small, but statistically significant differences), below you can see what happens if you expand the graph to 100 percent as it was in the first graph (the same can be done for the second by using the scale of the similar graph above):
Sometimes population percentages are not presented as a percent of the total US population, but as a subset, for example as a percentage of hose arrested. The following graphs, use stats derived from the percent (or number) arrested per crime as a percentage of the arrested population (NOT the total population).
The above graph shows that when looking at the people being arrested and at the percentages of whites and blacks that make up that population, more whites are arrested than blacks, even though the percentage of blacks as a proportion of the black population being arrested is higher than that of whites, the result shown above is simply a result of the fact that there are so many more whites than blacks. Since most blacks are not arrested, and most of those who are arrested are white one has to question the logic of many racial profiling practices.
The graph above is for arrests for all crimes in general, however if you look at arrests broken down by type of crime, the picture is even more interesting. I will just post the graphs and you can take away the messages that you see fit. Note for the bubble graphs, I looked at total arrests for that crime (x axis) compared to arrests for either whites, or blacks for that particular crime, the size of the bubble is determined by what percent of the total arrests for that crime were represented by the particular race. So the bubbles in the lower left corner show low arrest rates as a total and per that particular race, while those in the upper right hand corner show that those arrests were larger in number and that the particular race had a higher number of arrests.
The graph above shows raw numbers for those arrested, what it shows is that most people (of both races) are arrested for assault, and very few for rape or murder. It also shows that more whites are arrested for assault, however it is also clear that blacks are over represented for those crimes. Two things that people often get confused, over represented, does not mean "most people of that race" or most arrests, but just that if you find arrest percentages above 13 for blacks, then they are over represented in that particular group, it is a subtle, but very important distinction.
I had to split up these two graphs as the number of rape and murder arrests were so much smaller than those for the other violent crimes (another thing that is often not readily obvious from the way the media and various politicians talk). But the arrests are about equal for murder (low for both races, very disproportionate for blacks), while they are much lower for blacks, than they are for whites.
In the graph above you can see that blacks are arrested at higher rates for robbery, but at much lower rates for both aggravated and simple assaults which happens to be the crime the most arrests. Compare the graph below to the similar one above the bubble graphs, here I am looking at each race as a percent (above we were looking at raw numbers).
This shows much the same as the bubble graphs, just a different way of looking at it, the numbers here the percent of the total arrested population is represented for each race for each crime. The last two bars show the respective percent of each race in the total US population, thus if the percent is higher than that number the group is over represented as is the case for blacks in almost each category, but highest for murder and robbery, while if it is lower, as for whites in almost every category, but especially in robbery and murder, then that group is under represented. Again this is representation, not raw numbers.
The graph above shows the raw numbers for some other crimes, what is striking is the very large different in arrests for all crimes involving alcohol between the races (blacks under represented, whites over represented), especially for driving under the influence. These differences hold up when you look at the percent arrested by race
And last but not least, here is a look at drug enforcement, raw numbers here and again whites make up the majority of those arrested for drug use and abuse, while both blacks and whites (at much lower rates, should that not be reversed), are arrested at similar rates.
I guess the moral to this story is that crime is a complex issue, even when looking at one facet, arrests. It is not (excuse the pun) black and white, (quite literally as I have not included the data for other racial groups, or split it along gender lines), and it can seem to say different things depending on how you look at it. So when this kind of data is presented, look carefully at what they are showing you, or listen carefully to what they are saying. Are they talking raw numbers or percentages? What do the axes look like on the graphs? Are they looking at percentages of the total population, or are they looking at percentages of those arrested, or convicted of crimes? Is the data a snapshot in time, or was it complied over a number of years?
Even with these numbers, what do arrest rates really reflect? Do they reflect true criminal activity, or do they reflect biased policing? If you are more likely to be arrested because you are black than white for the same crime, the you will be over represented in arrest percentages, if you are more in number, then you will also be over represented in the raw numbers. Are these numbers a reflection of differences in race when it comes to arrests, or is it a reflection of where you are more likely to live if you belong to a specific race? Much more analysis is needed to answer these questions.
What is clear, is that most blacks are not arrested for crimes, more whites than blacks are arrested for crimes, blacks are over represented when it comes to arrests, and the ratio of blacks to whites arrested varies by crime (and I warrant by state, city and community). Another thing that is often not taken into consideration is the effect small numbers relative to large numbers can have on statistics.
In the article linked below, there is an interesting story about how it seemed that people in rural areas were more affected by cancer and had worse outcomes, but when population size was taken into account, those differences vanished.
Effect of community population size on breast cancer screening, stage distribution, treatment use and outcomes.
Olson RA, Nichol A, Caron NR, Olivotto IA, Speers C, Chia S, Davidson A, Coldman A, Bajdik C, Tyldesley S
BC Cancer Agency, Centre for the North, Prince George, BC. rolson2@bccancer.bc.ca
Anyhow, it is good to know that I have basically come to the same conclusions as found in this paper, that the best predictor of crime, is population number, i.e. the more there are of you, the more likely you are to be involved in a crime, both as a victim (I looked at this before and may do so again for this blog) or a perpetrator. Using over, and under representation, on populations of different sizes to me does not make much sense, until you have a firm understanding that those differences are due to something about the population, and are not simply due to differences in numbers between the populations.