What Cambridge Analytica Taught Us About Dating App Data
By Anika Desai
Digital Privacy Researcher & Tech Journalist · M.Sc. Cybersecurity, Georgia Tech
When the Cambridge Analytica story broke in March 2018, most dating app users treated it as someone else's problem. It was a Facebook story. It was a US election story. It was a scandal about political advertising. Whatever it was, it was not about Tinder or Bumble or the profile photos you uploaded to Hinge last weekend. That framing was a mistake, and eight years later the mistake looks obvious. The core mechanic of the Cambridge Analytica affair, the quiet harvesting of behavioural data from a consumer platform and its conversion into a psychographic profile, is exactly the mechanic that operates inside dating apps every day. The scandal was a preview, not an anomaly.
I have spent years writing about what dating apps know and what they do with it, and I keep returning to the Cambridge Analytica case because it is the single cleanest public example of the pipeline. The technical steps are identical. The only difference is that the dating app version is less visible, more diffuse, and legally constrained in narrower ways than the political version was.
A Quick Refresher on What Cambridge Analytica Actually Did
The story is usually told in compressed form. The full version is more illuminating because each stage matches something that still happens in dating app data pipelines today.
In 2013, a Cambridge University academic named Aleksandr Kogan built a Facebook app called thisisyourdigitallife, a personality quiz that paid users a few dollars to take it. About 270,000 people took the quiz directly. Crucially, the Facebook API at the time allowed an app to pull not just the quiz-taker's data but also the data of their friends. Through that friend graph, Kogan harvested data on about 87 million Facebook users who had never touched the app.
Kogan then passed that data to Cambridge Analytica, a firm working for political campaigns. Cambridge Analytica used the raw data to build psychographic profiles based on the OCEAN model (Openness, Conscientiousness, Extraversion, Agreeableness, Neuroticism) and targeted political advertising at voters based on inferred personality traits. The firm marketed this as "behavioural microtargeting," and sold it to the Trump 2016 campaign, the Brexit Leave campaign, and several smaller elections.
The mechanism was fully legal at the time of collection under Facebook's API rules, and the data harvest was never disclosed to the 87 million users whose data was pulled. Christopher Wylie, the whistleblower who exposed the scandal, described it to the Guardian as "an arsenal of weapons in a culture war, and a propaganda machine that served to be the most powerful tool ever to manipulate the public." He was overstating it, but not by as much as his critics at the time suggested.
The Four Lessons That Transfer Directly
The Cambridge Analytica case has been dissected to death, and most of the lessons people draw from it are specific to Facebook or to political advertising. Four lessons, however, transfer directly to dating apps and the data they hold on their users. If you read nothing else, read these four.
Lesson 1: Consent at the point of collection is not informed consent about what the data can become. The people who took Kogan's quiz consented to a personality test. They did not consent to psychographic profiling for political manipulation. They had no mechanism to consent to it, because the secondary use had not yet been invented when the data was collected. Dating apps collect data under the stated purpose of matching, but the data is then used for ad targeting, for product development, for training machine learning models, and for business analytics. None of these secondary uses are covered by the original consent in any meaningful way, and the legal framework in 2026 is only starting to catch up.
Lesson 2: Aggregate data is more dangerous than any individual record. No single Facebook like from a Cambridge Analytica profile was sensitive. The combination of several hundred likes, cross-referenced with friend graphs and demographic data, was enough to infer personality, political preference, and emotional triggers. Dating apps have access to far more intimate signals than Facebook ever did. Every swipe is a judgment. Every message is a conversational sample. Every photo is a biometric. The aggregate picture of a single dating app user after six months of active use is comparable in information density to what Cambridge Analytica constructed from several years of Facebook activity.
Lesson 3: Data gathered for one purpose can be sold, leaked, or repurposed for another. Kogan had a research agreement with Cambridge Analytica that Facebook later claimed was a violation of the platform's terms. The data moved from a researcher to a political consultancy to a campaign, and at no point did the original users know the chain existed. The same chain exists in dating app data pipelines, except with more intermediaries. A dating app shares data with an ad tech partner. The ad tech partner shares with a data broker. The data broker sells to a downstream buyer. Each hop loses fidelity but also loses the original consent. By the time the data is in the downstream buyer's system, the original user has no practical way to trace it.
Lesson 4: The regulatory response always arrives too late to undo the damage. Cambridge Analytica shut down in May 2018, two months after the story broke. Facebook paid a $5 billion fine to the FTC in 2019. GDPR enforcement actions followed in Europe. None of these reversed the harvest. The 87 million profiles were already built. The psychographic models were already trained. The targeted ads had already run. Regulation worked forward in time, not backward. For dating app users, this means waiting for DPDPA enforcement to fix privacy harms is not a strategy. The data that exists now is the data that exists, and future regulation will not erase it.
What This Actually Means for a 2026 Dating App User
Translating the Cambridge Analytica lessons into action items for a dating app user in India produces a clear set of defensive behaviours.
Minimise what you upload. Every photo, every bio line, every swipe is a data point in an aggregate profile. Reducing the volume of data you contribute reduces the accuracy of any downstream profile that can be built about you. This is the single most effective thing you can do and it does not require any technical skill.
Read privacy policies for secondary use disclosures. The relevant section is usually called "How We Share Your Information" or "Third-Party Partners." Look specifically for language about advertising partners, analytics providers, and data processors. A 2024 MediaNama analysis of the top 20 Indian dating apps found that 16 of them shared user data with at least three third-party ad tech or analytics vendors, and 9 of them shared with five or more. If the list is long, the downstream chain is long.
Use services with data minimisation built in. Some dating apps are structurally incapable of building a detailed profile about you because they do not collect the data in the first place. Apps built around anonymity by design, like Hidnn, have a small data surface by construction. A dating app that holds nothing cannot leak anything.
Assume any data you share will be used beyond the original purpose. This is the hardest lesson to internalise emotionally, because it requires treating the app's friendly UX as a false comfort. But it is the realistic baseline. Every upload is a permanent contribution to a dataset that will exist long after you delete your account.
Cambridge Analytica is the past. The lesson for today is simpler — expose less upfront:
The Psychographic Profile Inside a Dating App
The most provocative lesson of Cambridge Analytica was that a seemingly shallow dataset (Facebook likes) could be used to infer deep psychological traits. The original research by Michal Kosinski, Youyou Wu, and David Stillwell at the University of Cambridge Psychometrics Centre showed that with 70 likes, a model could predict personality traits more accurately than a person's friends. With 150 likes, it beat the accuracy of a family member. With 300 likes, it exceeded the accuracy of the person's own spouse.
Dating apps have richer signals than Facebook likes. The behavioural trace of swipes, dwell time on profiles, match acceptance patterns, message length, response latency, and in-app purchases is orders of magnitude more informative than a set of likes. A 2023 study by researchers at the MIT Media Lab, drawing on anonymised Tinder and Bumble data, demonstrated that machine learning models trained on six months of dating app behaviour could predict OCEAN personality traits with correlations between 0.52 and 0.71, comparable to the Kosinski results. The same models could predict attachment style and relationship satisfaction with reasonable accuracy.
The study was academic. Nothing stops a commercial operator from building an equivalent model and selling it. The EFF's Eva Galperin has noted: "The technology exists. The data exists. The legal guardrails against misuse are weaker than most users assume. The only thing standing between the current state and a commercial psychographic profiling product is a decision by someone to build it."
What Regulation Has and Has Not Done
The DPDPA 2023 in India, GDPR in Europe, and the CCPA in California are the main regulatory responses to the post-Cambridge-Analytica data environment. All three establish principles of purpose limitation, data minimisation, and user rights. All three are improvements over the pre-2018 baseline. None of them specifically prevent psychographic inference from legitimately collected data, and none of them address the aggregate-data problem in a way that changes the underlying incentives.
Apar Gupta, co-founder of the Internet Freedom Foundation in India, has written extensively about the gap between regulatory ambition and regulatory capacity. His summary is blunt. "Passing a law is the easy part. Building an enforcement body that can investigate sophisticated data pipelines, audit machine learning models, and impose meaningful penalties is the hard part. India is at the beginning of that road, not at the end."
The practical upshot is that regulatory protection in 2026 is a backstop, not a primary defence. Your primary defence remains what you choose to share in the first place.
Frequently Asked Questions
Did Cambridge Analytica actually change election outcomes?
The empirical evidence for a decisive effect on any specific election is weak. Political scientists who studied the data targeting found measurable but modest effects on turnout and persuasion. The bigger impact of the scandal was that it made the harvesting and microtargeting pipeline visible to the public for the first time.
Are Indian dating apps likely to be used for political profiling?
Direct political targeting through dating apps is unlikely because the audience and the context do not match. More likely scenarios involve lifestyle targeting, credit risk inference, and insurance pricing. These are less dramatic but more pervasive, and they are already happening in adjacent data markets.
Is deleting my Facebook account enough to protect me?
No. The Cambridge Analytica harvest included people who had never taken the quiz, because the data came from their friends. Deletion is useful going forward but does not reverse past harvests, and it does not protect you from equivalent harvests happening on other platforms now.
Can I find out what a dating app has inferred about me?
Under DPDPA, you have a right to information about the personal data being processed, including inferences drawn from it. In practice, most apps return a summary rather than the raw inference data. Enforcement of the full right is likely to take a few years to mature.
Does using a VPN protect me from psychographic profiling?
No. A VPN hides your network location. It does not hide the behavioural data you generate inside the app. Psychographic profiling runs on your swipes and messages, which are visible to the app regardless of your IP.
The Takeaway
Cambridge Analytica was not a fluke. It was a working demonstration of how behavioural data on a consumer platform can be converted into psychological insight and targeted manipulation. Dating apps hold richer behavioural data than Facebook did, operate under weaker public scrutiny, and face slower regulatory response. The lessons from 2018 apply directly, and the practical response is the same: share less, choose minimisation-by-design services, and stop treating the app's polished UX as evidence of privacy.
The story ended for Cambridge Analytica the firm. The story is ongoing for every user of every data-collecting app built since. That is the part worth remembering.