OSINT

Sock Puppets Can’t Hide Their Slang: Linguistic OSINT for Detecting Fakes

Keystone Collective

07 Jul 2025 — 7 min read

So you want to run a sock puppet army online? Brilliant. Just don’t be the person who makes every fake account say “y’all,” obsess over Succession, and flood every post with the same tired three emojis. Because, surprise surprise, your “totally different” personalities start sounding like a broken record, and thanks to stylometry, the internet’s low-budget polygraph, you get caught faster than you can say “I’m not a bot.”

Stylometry is basically linguistic fingerprinting. Forget IP addresses or VPNs; it’s how you type that gives you away. So, before you create your next fake persona, maybe learn to write like a different person. Or just don’t be lazy.

What Is a Sock Puppet?

No, not the cute thing you had as a kid. A sock puppet is a fake online identity, created by one person to argue with themselves, push agendas, or troll with extra flair. The goal is to pretend you’re a crowd rather than a lonely troll hiding behind a screen.

But sock puppets are like bad magicians: they think no one will notice the same hand waving all the tricks. Spoiler alert: we notice.

Stylometry: Your Writing Style Betrays You

Stylometry is where math meets your bad habit of always putting a space before a question mark or obsessively using semicolons incorrectly. It crunches numbers on word choice, punctuation, sentence length, and vocabulary to create a “you-shaped” linguistic fingerprint. According to serious-sounding studies, it’s right 80–96% of the time when given enough text (Grieve, 2007, Stamatatos, 2009).

Try as you might to fake it, your brain inevitably slips up, especially when you’re angry, tired, or drunk-posting at 2 AM. That’s when stylometry sees your soul.

Slang and Dialect as Linguistic Tells

Slang and dialect are like unintentional signatures, practically screaming “I’m from here” or “I’m trying way too hard to sound like I’m from there.” Whether it’s the casual “y’all” giving away Southern roots or a misplaced “innit” trying to fake a British accent, these language quirks are hard to fake consistently. Sock puppets often either cling to their natural slang or awkwardly overcompensate, making them sound like a bad accent at a high school play.

This isn’t just armchair linguistics. Studies in sociolinguistics show that regional dialects and slang carry unique phonetic, lexical, and syntactic markers that can be identified computationally. For example, Tagliamonte and Denis (2008) demonstrated that subtle features like vowel shifts and particular lexical choices reliably cluster by region in English speakers.

More importantly for OSINT, research by Nguyen et al. (2017) found that slang usage patterns and dialect markers improve the accuracy of author profiling models, meaning the way someone throws around slang can narrow down their location, age, or social group. So when a sock puppet suddenly switches from “hella” to “deadass” without skipping a beat, it’s a red flag waving in the digital wind.

In short, slang isn’t just casual chat, it’s a linguistic tattoo that, once analyzed, exposes more about the speaker than their IP address ever could.

How to Spot a Sock Puppet

Think stylometry is some secret black magic only nerds understand? Think again. If you can read and scroll through social media without face-planting, you’re halfway there. Here are three easy steps for any wannabe online detective to start spotting sock puppets without a PhD in linguistics:

1. Look for the Copy-Paste Personality Syndrome
If two accounts argue like they’re trying out for a one-man play, chances are they’re just the same guy rehearsing different roles. Check if they share weird catchphrases, use identical slang, or have the same oddly specific pet peeves. If both accounts scream “I love bad pizza” or “Trust me, oat milk is superior,” congratulations, you’re on the scent.

2. Analyze the Writing Quirks and Grammar Fails
Pay attention to punctuation obsession and spelling oddities. Does every post have a misplaced semicolon? Do they randomly capitalize words like “TRUTH” for dramatic effect? Are their sentences unusually short or painfully long? And here’s the kicker: watch out for classic bad grammar blunders, missing articles, subject-verb disagreement, or weird tense shifts. Sock puppets often sound like they skipped English class, which ironically makes them easier to spot. It’s like North Korean cyber spooks trying to pass as Californian influencers, slapping ‘YOLO’ on propaganda posts like it’s still 2012. Adorable, but painfully obvious.

3. Check Posting Patterns and Behavior
Sock puppets often don’t have real lives, so they post at weird hours, spam the same links, or always pop up to back each other up. If you notice two or more accounts that show up exactly when your insomnia kicks in and use the same meme repertoire, you’re probably watching a puppet show.

For the brave souls ready to dive deeper, there are free stylometry tools like JStylo or Writeprints that can analyze writing style with nerd-level precision (Brennan & Greenstadt, 2009). But honestly, most sock puppets get caught because they’re just too lazy to pretend well.

Real Examples

Let’s talk about some sock puppets who thought they were slick but got caught red-handed by stylometry, because apparently, being lazy is their real crime.

Take the case of Terry A. Davis. The guy was a programming genius with a split personality, or so he tried to convince us. He ran multiple accounts on forums, probably thinking he was starring in some solo one-man show. But stylometry saw through the act. His unique writing quirks were all over those accounts like glitter at a bad party (Stamatatos, 2009). Sorry Terry, your sock puppets had one job, not sound exactly like you.

Then there’s Wikipedia’s sock puppet drama. When a bunch of wannabe puppet masters tried to rewrite history with fake accounts, the Wikimedia Foundation rolled out stylometry like the bouncer at a nightclub, kicking out all the phonies. Those sock puppets kept showing the same quirky word choices and posting habits, basically waving neon signs screaming “We’re all the same person!” (Kumar et al., 2017). Spoiler: Wikipedia does not like fakery, no matter how many conspiracy theories you spin.

So yeah, sock puppets can try all they want to act like a crowd, but when stylometry’s on the case, it’s more like one confused guy talking to himself in different accents, and everyone notices.

What to Do With Stylometric Clues

Stylometry alone won’t bust your sock puppet empire. Think of it like spotting a weird accent in a crowded room, it’s a clue, not a conviction. To actually move from “Hmm, suspicious” to “Gotcha,” you need to do two things right:

Combine Stylometry with Behavioral Patterns
Stylometric clues are helpful, but they only tell part of the story. That semicolon obsession? Could just be a grammar nerd. But when you pair writing quirks with when and how accounts behave, the puzzle starts to click. Are these accounts posting the same nonsense at 3 AM, flooding identical hashtags, or constantly liking each other’s painfully obvious propaganda? According to Hosseinia and Mukherjee (2017), combining linguistic features with behavioral clues like posting time and activity bursts significantly improves detecting sock puppets. So when stylometry flags a suspect, watching their digital footsteps is the next step.

Know When to Call in the Big Guns
Stylometry is your friendly neighborhood snitch, not the whole damn police force. If you’re just scrolling through memes and whining about fake accounts, these clues help you spot the obvious frauds, like that guy who still thinks “yeet” makes him sound cool. But if you’re playing in the big leagues of OSINT, journalism, cybersecurity, or just obsessively stalking your ex’s mysterious new friends, it’s time to call in the pros. Forensic linguists, for example, didn’t just sit around counting semicolons; they cracked the Unabomber case by dissecting his manifesto’s writing style, proving that even psychotic manifesto writers can’t hide their word vomit. Behavioral analysts have chased down coordinated disinformation mobs by tracking their bot-like posting schedules and recycled buzzwords flooding election hashtags like a bad virus. And cybersecurity teams? They’ve smashed troll farms pumping out political propaganda so blatant, you’d think it was written by an FSB operative trying way too hard to be a hip teenager online. So unless you want to waste hours yelling at shadows armed with nothing but your snark and a thesaurus, hand off your stylish little leads to these experts, because real impact means knowing when to quit pretending you’re Sherlock and call in the cavalry.

Final Thought: Keep Your Eyes Peeled and Your Snark Sharpened

So, you're knee-deep in the social media swamp, wading through bots, trolls, and suspiciously patriotic eggs named “_freedom1234.” You’re tired. We get it. The digital cesspool known as Twitter (sorry, “X”, because apparently the rebrand makes it less toxic?) has you questioning whether anyone is actually real anymore.

But here’s the thing: you’re not powerless. Every lazy sock puppet that forgets how punctuation works, every wannabe psyop that can’t fake slang without sounding like your dad pretending to be a Zoomer, every “totally real user” who shares copy-pasted rage at suspiciously identical intervals, they’re leaving behind fingerprints. And you, dear reader, are armed with stylometry and just enough cynicism to spot them.

Sure, the internet is a dumpster fire, but now you’ve got a magnifying glass to watch it burn more intelligently.

So keep going. Keep noticing. Keep asking why three accounts are all using the same catchphrase no one has uttered since 2008. Keep calling out that account that somehow posts in fluent English 90% of the time, but suddenly forgets basic grammar when trying to impersonate a boomer from Kent. Keep the good fight alive, even if it’s just by making fun of badly written bot posts from your couch.

Because as long as they keep faking it, and we keep spotting it, the puppet show stays exactly what it is: ridiculous. And honestly? That’s half the fun.

References:

Brennan, M. & Greenstadt, R., 2009. Practical Attacks Against Authorship Recognition Techniques. In: Proceedings of the Twenty-First International Joint Conference on Artificial Intelligence (IJCAI-09). [online] Available at: https://cdn.aaai.org/ocs/i/iaai0020/257-3903-1-PB.pdf (Accessed: 7 July 2025)

Grieve, J., 2007. Quantitative Authorship Attribution: An Evaluation of Techniques. Literary and Linguistic Computing, 22(3), pp.251–270. [online] Available at: https://doi.org/10.1093/llc/fqm020 (Accessed: 7 July 2025)

Hosseinia, M. & Mukherjee, A., 2017. A Comparative Study of Supervised Algorithms for Sockpuppet Detection in Deceptive Opinion Spam. Information Processing & Management, 53(6), pp.1292–1307. [online] Available at: https://arxiv.org/pdf/1703.03149 (Accessed: 7 July 2025)

Kumar, S., Cheng, J., Leskovec, J. & Subrahmanian, V.S., 2017. An Army of Me: Sockpuppets in Online Discussion Communities. In: Proceedings of the 26th International Conference on World Wide Web (WWW '17). [online] Available at: https://arxiv.org/pdf/1703.07355 (Accessed: 7 July 2025)

Nguyen, D., Trieschnigg, D., Dogruöz, A.S., Gravel, R., Theune, M., Meder, T. & van den Bosch, A., 2017. Why Gender and Age Prediction from Tweets is Hard: Lessons from a Crowdsourcing Experiment. In: Proceedings of COLING 2014. [online] Available at: https://aclanthology.org/C14-1184.pdf (Accessed: 7 July 2025)

Stamatatos, E., 2009. A Survey of Modern Authorship Attribution Methods. Journal of the American Society for Information Science and Technology, 60(3), pp.538–556. [online] Available at: https://doi.org/10.1002/asi.21001

Tagliamonte, S. & Denis, D., 2008. Linguistic Ruin? Lol! Instant Messaging and Teen Language. American Speech, 83(1), pp.3–34. [online] Available at: https://doi.org/10.1215/00031283-2008-001