Recent comments in /f/dataisbeautiful
Dravous t1_j6ahc3c wrote
Reply to comment by OhGodNotAnotherOne in [OC] How news stories evolve in the news cycle by PartisanPlayground
there's a reason, but that's all I can say.
mikeholczer t1_j6agh1p wrote
Reply to comment by kilopeter in Transition probabilities (shown as percentages) between successive letters in the names of girls born in 2021 in the USA [OC] by kilopeter
Following those rules the most likely outcome is “A”
simonezchen OP t1_j6afztf wrote
Reply to comment by worriedshuffle in [OC] Youtube has over 1 billion hours of videos, we Built an AI Search Engine that can find exact timestamps for anything on Youtube by simonezchen
Good point, you're right we should've labelled the y-axis. It's "Number of Books" as we calculated the numbers approximately to that unit in sheets.
[deleted] t1_j6afvfh wrote
Reply to comment by Kazko25 in [OC] How news stories evolve in the news cycle by PartisanPlayground
[removed]
coodgee33 t1_j6aft3z wrote
Reply to Transition probabilities (shown as percentages) between successive letters in the names of girls born in 2021 in the USA [OC] by kilopeter
Q to u transition is strong.
[deleted] t1_j6afsfv wrote
Reply to comment by iiioiia in [OC] How news stories evolve in the news cycle by PartisanPlayground
[removed]
[deleted] t1_j6afh3t wrote
Reply to comment by [deleted] in [OC] Youtube has over 1 billion hours of videos, we Built an AI Search Engine that can find exact timestamps for anything on Youtube by simonezchen
[deleted]
TaqPCR t1_j6aeh8d wrote
Reply to Transition probabilities (shown as percentages) between successive letters in the names of girls born in 2021 in the USA [OC] by kilopeter
If you start at the most common letter at the beginning of the name, A, then the highest value is to immediately end the name. But if you exclude that the highest value is N which then leads to another A with probability just above that of ending the name. So in a way this chart comes close to spelling out Ana just from the data presented. The probability being .17*.12*.25*.23 = 0.1173%
iiioiia t1_j6aedge wrote
Reply to comment by MEMENARDO_DANK_VINCI in [OC] How news stories evolve in the news cycle by PartisanPlayground
> What? How do I know that the human psyche changes when met with unknowns vs knowns?
You very well may not know it....it's not exactly common knowledge!
> Making a point to yourself is what you did, troll me is also what you did.
Here you are describing your experience. The experiences of others (more commonly known as "reality", or what "is") are not necessarily the same.
> I didn’t state a meme I informed you that your method of argument is lackluster in a polite discussion, though I didn’t use so many words.
"Not arguing in good faith" is a meme.
meme:
-
an image, video, piece of text, etc., typically humorous in nature, that is copied and spread rapidly by internet users, often with slight variations.
-
an element of a culture or system of behavior passed from one individual to another by imitation or other nongenetic means.
> It’s okay if you can’t articulate your point well just try your best! And I’ll help you refine what you actually wanted to say :)
Haha, I love it!! 🙏 You just earned yourself an updoot, partner!
kilopeter OP t1_j6adleq wrote
Reply to Transition probabilities (shown as percentages) between successive letters in the names of girls born in 2021 in the USA [OC] by kilopeter
Interpretation [EDITED]: Take all the names on U.S. Social Security card applications for girls born in 2021. Scan through the names one letter at a time, accounting for the start and end of each name. Count how many times each particular letter is followed by every other letter in the alphabet. Finally, convert your counts to probabilities (technically, percent chance = probability * 100%) to give the chance that any given letter in a name is followed by every other possible letter (or by the end of the name). This visualization summarizes the resulting transition matrix.
Background: The ongoing explosion of generative AI systems got me thinking about the statistical structure of words, and of given names in general. I started with the fairly basic idea of treating the sequence of characters in given names as a Markov process, i.e., the next state is chosen randomly according to transition probabilities that depend only on the current state, not prior states.
Code: https://github.com/pjleimbigler/baby-name-analysis
Data: https://www.ssa.gov/oact/babynames/limits.html
Generating new "names": The letters in given names are pretty clearly not generated by a Markov (memoryless) process. However, it's fun to assume the Markov property anyway and see what "names" fall out of the transition matrix.
To iterate through the transition matrix manually using this viz, follow these steps:
- Start with the bottom row,
(Name Start). - Each number in your current row is the percent chance (probability * 100%) of choosing your name's next letter as the letter corresponding to each column.
- Randomly choose the next letter of your name, based on the percent chances in the current row.
- The letter you just chose now becomes your "current letter." Look up its row.
- Repeat Steps 2–4 until you land on the last column,
(Name End), which signifies the end of your generated name.
This yields some hilarious yet often oddly plausible names, such as: Silian, Slya, Lialy, Maerollia, Chla, Zalah, Lay, Lonanaadievayle, Zoralepa, Peiemophaly, Dralesa, Wiada, Miea, Giaberosh, Bisodiaremanninenn, Seelanida, Einnn, Penasoetimala, Hepanelei, Mia, Mierolynakynisayloloneeloa, Sargandniamilida, Eldyanempe, Pinahanariloma, Alian, Melivevilllohayasisa, Olyna, Die, Mizaramiceatelyalla, Jon, Adelun, Cesklienzolena, Zolyryn, Ema, Leyla, Aclan, Bra, Maeylises, Bryn, Khiemi, Sly, Annnlis, Aisyasa, Xily, Kara, Handanaria, Manla, Pama, Heyanama, Eylisidr, Brah, Llee, Anelerynaevega, Ayatryalofa, Mediza, Caniesty, Oliceeelys, Sannllora, Dassole, Sonnasse, Mmatarieleteyneroselasylin.
markersquire t1_j6adi6i wrote
qwilfish chimecho and pincurchin surprising kinda
MEMENARDO_DANK_VINCI t1_j6ad9ck wrote
Reply to comment by iiioiia in [OC] How news stories evolve in the news cycle by PartisanPlayground
What? How do I know that the human psyche changes when met with unknowns vs knowns?
Making a point to yourself is what you did, troll me is also what you did. I didn’t state a meme I informed you that your method of argument is lackluster in a polite discussion, though I didn’t use so many words.
It’s okay if you can’t articulate your point well just try your best! And I’ll help you refine what you actually wanted to say :)
iiioiia t1_j6acsoo wrote
Reply to comment by MEMENARDO_DANK_VINCI in [OC] How news stories evolve in the news cycle by PartisanPlayground
> No, you come on and use your words to describe the point YOU want to make.
I've made it above, you are welcome to do with it as you please.
> Arguing by allusion is not arguing in good faith.
lol, memes are not effective on me, though I suspect they'll be rather influential on 3rd party observers (which is the point perhaps?).
> Any Interpretor would have to parse their thoughts through you not being forthcoming with yours.
Oh God....how do you know how people you've never met would experience the situation?
Sir: are you putting me on?
dbeats20 t1_j6acpf4 wrote
Reply to Transition probabilities (shown as percentages) between successive letters in the names of girls born in 2021 in the USA [OC] by kilopeter
Can you explain how to interpret it?
SnooKiwis3332 t1_j6acna1 wrote
Only reason I remember Lickitung was because of a Pokémon book I had as a kid and thought it was such a weird and funny one
MEMENARDO_DANK_VINCI t1_j6abvab wrote
Reply to comment by iiioiia in [OC] How news stories evolve in the news cycle by PartisanPlayground
No, you come on and use your words to describe the point YOU want to make. Arguing by allusion is not arguing in good faith. Any Interpretor would have to parse their thoughts through you not being forthcoming with yours.
iiioiia t1_j6absgj wrote
Reply to comment by merc08 in [OC] How news stories evolve in the news cycle by PartisanPlayground
Haha....funny that eh!
Elpacoverde t1_j6ab7ht wrote
I'm surprised, I didn't even remember Larvitar existing and was surprised when it dropped in Pokémon Go.
merc08 t1_j6ab510 wrote
Reply to comment by iiioiia in [OC] How news stories evolve in the news cycle by PartisanPlayground
Discovering something like that seems like that kind of thing that would drive a person to commit suicide by shooting themselves three times in the back of the head.
Plokmijn27 t1_j6aak6e wrote
ukraine tanks and debt ceiling look interesting
levoniust t1_j6aa85t wrote
Is there a website for this? I would love to go back in history even a few months back and take a look at what was super popular in a chart like this. Or just every day see a snapshot of what is being popular. Having something that is arguably unbiased and simply showing the topics that are most popular I find quite intriguing and useful.
askLubich OP t1_j6aa51r wrote
Reply to comment by Training-Purpose802 in [OC] Total Formula 1 laps completed by each driver on the 2023 F1 grid by askLubich
It's the amount of laps over their entire career. So there isn't really an upper limit.
[deleted] t1_j6a9tse wrote
[removed]
Training-Purpose802 t1_j6a9i7e wrote
what's the max possible? anyone at 100%?
StachedCrusader t1_j6aj297 wrote
Reply to comment by spicer2 in [OC] Herd you liek Mudkips: the most (and least) memorable Pokemon by spicer2
Totally anecdotal, but I remember failing at guessing certain Pokemon due to not remembering how to spell them (I actually had to check how to spell Lickitung's name just now). When I'd fail at spelling it the first few times, I'd just decide to leave it for later to maximize my score.
Maybe the complexity of the name also has a factor?