Recent comments in /f/dataisbeautiful

StachedCrusader t1_j6aj297 wrote

Totally anecdotal, but I remember failing at guessing certain Pokemon due to not remembering how to spell them (I actually had to check how to spell Lickitung's name just now). When I'd fail at spelling it the first few times, I'd just decide to leave it for later to maximize my score.

Maybe the complexity of the name also has a factor?

5

TaqPCR t1_j6aeh8d wrote

If you start at the most common letter at the beginning of the name, A, then the highest value is to immediately end the name. But if you exclude that the highest value is N which then leads to another A with probability just above that of ending the name. So in a way this chart comes close to spelling out Ana just from the data presented. The probability being .17*.12*.25*.23 = 0.1173%

6

iiioiia t1_j6aedge wrote

> What? How do I know that the human psyche changes when met with unknowns vs knowns?

You very well may not know it....it's not exactly common knowledge!

> Making a point to yourself is what you did, troll me is also what you did.

Here you are describing your experience. The experiences of others (more commonly known as "reality", or what "is") are not necessarily the same.

> I didn’t state a meme I informed you that your method of argument is lackluster in a polite discussion, though I didn’t use so many words.

"Not arguing in good faith" is a meme.

meme:

  • an image, video, piece of text, etc., typically humorous in nature, that is copied and spread rapidly by internet users, often with slight variations.

  • an element of a culture or system of behavior passed from one individual to another by imitation or other nongenetic means.

> It’s okay if you can’t articulate your point well just try your best! And I’ll help you refine what you actually wanted to say :)

Haha, I love it!! 🙏 You just earned yourself an updoot, partner!

1

kilopeter OP t1_j6adleq wrote

Interpretation [EDITED]: Take all the names on U.S. Social Security card applications for girls born in 2021. Scan through the names one letter at a time, accounting for the start and end of each name. Count how many times each particular letter is followed by every other letter in the alphabet. Finally, convert your counts to probabilities (technically, percent chance = probability * 100%) to give the chance that any given letter in a name is followed by every other possible letter (or by the end of the name). This visualization summarizes the resulting transition matrix.

Background: The ongoing explosion of generative AI systems got me thinking about the statistical structure of words, and of given names in general. I started with the fairly basic idea of treating the sequence of characters in given names as a Markov process, i.e., the next state is chosen randomly according to transition probabilities that depend only on the current state, not prior states.

Code: https://github.com/pjleimbigler/baby-name-analysis

Data: https://www.ssa.gov/oact/babynames/limits.html

Generating new "names": The letters in given names are pretty clearly not generated by a Markov (memoryless) process. However, it's fun to assume the Markov property anyway and see what "names" fall out of the transition matrix.

To iterate through the transition matrix manually using this viz, follow these steps:

  1. Start with the bottom row, (Name Start).
  2. Each number in your current row is the percent chance (probability * 100%) of choosing your name's next letter as the letter corresponding to each column.
  3. Randomly choose the next letter of your name, based on the percent chances in the current row.
  4. The letter you just chose now becomes your "current letter." Look up its row.
  5. Repeat Steps 2–4 until you land on the last column, (Name End), which signifies the end of your generated name.

This yields some hilarious yet often oddly plausible names, such as: Silian, Slya, Lialy, Maerollia, Chla, Zalah, Lay, Lonanaadievayle, Zoralepa, Peiemophaly, Dralesa, Wiada, Miea, Giaberosh, Bisodiaremanninenn, Seelanida, Einnn, Penasoetimala, Hepanelei, Mia, Mierolynakynisayloloneeloa, Sargandniamilida, Eldyanempe, Pinahanariloma, Alian, Melivevilllohayasisa, Olyna, Die, Mizaramiceatelyalla, Jon, Adelun, Cesklienzolena, Zolyryn, Ema, Leyla, Aclan, Bra, Maeylises, Bryn, Khiemi, Sly, Annnlis, Aisyasa, Xily, Kara, Handanaria, Manla, Pama, Heyanama, Eylisidr, Brah, Llee, Anelerynaevega, Ayatryalofa, Mediza, Caniesty, Oliceeelys, Sannllora, Dassole, Sonnasse, Mmatarieleteyneroselasylin.

20

MEMENARDO_DANK_VINCI t1_j6ad9ck wrote

What? How do I know that the human psyche changes when met with unknowns vs knowns?

Making a point to yourself is what you did, troll me is also what you did. I didn’t state a meme I informed you that your method of argument is lackluster in a polite discussion, though I didn’t use so many words.

It’s okay if you can’t articulate your point well just try your best! And I’ll help you refine what you actually wanted to say :)

2

iiioiia t1_j6acsoo wrote

> No, you come on and use your words to describe the point YOU want to make.

I've made it above, you are welcome to do with it as you please.

> Arguing by allusion is not arguing in good faith.

lol, memes are not effective on me, though I suspect they'll be rather influential on 3rd party observers (which is the point perhaps?).

> Any Interpretor would have to parse their thoughts through you not being forthcoming with yours.

Oh God....how do you know how people you've never met would experience the situation?

Sir: are you putting me on?

0

levoniust t1_j6aa85t wrote

Is there a website for this? I would love to go back in history even a few months back and take a look at what was super popular in a chart like this. Or just every day see a snapshot of what is being popular. Having something that is arguably unbiased and simply showing the topics that are most popular I find quite intriguing and useful.

1