Entry Date:
May 14, 2015

On the Reidentifiability of Credit Card Metadata


Even when real names and other personal information are stripped from metadata datasets, it is often possible to use just a few pieces of information to identify a specific person. Here, we study three months of credit card records for 1.1 million people and show that four spatiotemporal points are enough to uniquely reidentify 90 percent of individuals. We show that knowing the price of a transaction increases the risk of reidentification by 22 percent, on average. Finally, we show that even data sets that provide coarse information at any or all of the dimensions provide little anonymity and that women are more reidentifiable than men in credit card metadata.