Recent comments in /f/dataisbeautiful
myspicename t1_j6551tv wrote
Reply to comment by Jrubas in [OC] Search interest for "Trump" vs "Biden" (2014-2022) by 1brt
Ok what does that have to do with the graph?
GhotiGhetoti t1_j654lav wrote
Reply to comment by LiverOfStyx in [OC] A map of the backroads in Finland, perfectly captured by OpenStreetMap and visualized with QGIS. This map is a definitive not-a-guide to rally driving in Finland! by Geodienst
Not even close to accurate because they’re missing one road?
CeeMX t1_j6528wz wrote
Reply to comment by [deleted] in [OC] Youtube has over 1 billion hours of videos, we Built an AI Search Engine that can find exact timestamps for anything on Youtube by simonezchen
Most likely it's the count of videos compared to the number of books in the library, which is a weird metric, as a book contains much more content than a video. On the other hand, by amount of data YouTube would rank first by far.
Lyndon91 t1_j650wlw wrote
Reply to [OC] Youtube has over 1 billion hours of videos, we Built an AI Search Engine that can find exact timestamps for anything on Youtube by simonezchen
Don’t get how it makes sense. Is the book equivalent to the video once it’s been transcribed?
voracious_villain t1_j64y8x2 wrote
Reply to [OC] A map of the backroads in Finland, perfectly captured by OpenStreetMap and visualized with QGIS. This map is a definitive not-a-guide to rally driving in Finland! by Geodienst
Curious about the methodology used for the extraction; I use QGIS and OSM every day at work.
Chramir t1_j64xwtj wrote
Reply to [OC] Youtube has over 1 billion hours of videos, we Built an AI Search Engine that can find exact timestamps for anything on Youtube by simonezchen
They made an estimate of how many words there are in every YouTube video uploaded. That estimate is calculated as the total runtime of all the videos multiplied by the average word count of a conversation per unit of time. The total word count is then divided by the number of words in an average book to get a 'book size'.
I don't know, but that just seems kinda iffy. First, YouTube videos are rarely a back-and-forth conversation. And secondly, it's like pointing to a skyscraper and saying it's like a big sandcastle because sand is used in concrete.
Edit: grammar and added the 'word count' estimate explanation.
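The estimate described in the comment above can be sketched in a few lines of Python. The 1 billion hours figure comes from the post title; the speaking rate of 150 words per minute and the 90,000-word book length are illustrative assumptions, not figures from the post, and the function name is hypothetical.

```python
def books_equivalent(total_hours, words_per_minute, words_per_book):
    """Convert total video runtime into a count of 'average books'."""
    # Total spoken words = runtime in minutes * assumed speaking rate.
    total_words = total_hours * 60 * words_per_minute
    # Divide by the assumed length of an average book.
    return total_words / words_per_book


TOTAL_HOURS = 1_000_000_000   # "over 1 billion hours", from the post title
WORDS_PER_MINUTE = 150        # ASSUMPTION: conversational speaking rate
WORDS_PER_BOOK = 90_000       # ASSUMPTION: length of an "average" book

estimate = books_equivalent(TOTAL_HOURS, WORDS_PER_MINUTE, WORDS_PER_BOOK)
print(f"{estimate:,.0f} book-equivalents")
```

The result scales linearly with both assumed constants, so halving the speaking rate (for example, to account for videos with little or no speech) halves the book count. That sensitivity is part of why the metric looks iffy.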
0_0_0 t1_j64x7fi wrote
Reply to comment by MettaRosvo in [OC] A map of the backroads in Finland, perfectly captured by OpenStreetMap and visualized with QGIS. This map is a definitive not-a-guide to rally driving in Finland! by Geodienst
My hypothesis is it's a road inside the area Finland rents from Russia for the Saimaa Canal.
Or a coordinate mistake.
Scared-Conflict-653 t1_j64vnxs wrote
Reply to comment by NaturalNines in [OC] Search interest for "Trump" vs "Biden" (2014-2022) by 1brt
Yes. Right-leaning and left-leaning media both talking about Trump every damn day, in either an overly positive or an overly negative light. Trump kept leaning into it, either by making statements on Twitter or by rambling on TV shows. Knowing when to shut up is an important lesson.
Veggies-are-okay t1_j64uxbn wrote
Reply to [OC] Puerto Rico, with 3.3M people or 0.4% of LatAm's population, is the birthplace of 6 of the region's top 10 most streamed artists on Spotify. 🇵🇷 by latinometrics
Woohooo another thing America can pillage from Puerto Rico!!!
[deleted] t1_j64uoba wrote
Reply to comment by Scorpian42 in [OC] Nintendo SwitchGames Review Analysis by matman89
[removed]
LiverOfStyx t1_j64sdsj wrote
Reply to [OC] A map of the backroads in Finland, perfectly captured by OpenStreetMap and visualized with QGIS. This map is a definitive not-a-guide to rally driving in Finland! by Geodienst
Not even close to accurate. I know one road that is missing and how perfect it would be for rallying. My family owns a share of that road, and the government pays for its upkeep and maintenance because it is quite a neat shortcut between two highways and can handle a tank... The whole area is a black void in this map when I know it is full of backroads that are in excellent condition. They cut through marshlands that form natural obstacles, unless you use those roads...
ar243 t1_j64sc3g wrote
Reply to [OC] Nintendo SwitchGames Review Analysis by matman89
The new Kirby game is fantastic! I enjoyed star allies too, but not as much.
Plushhorizon t1_j64oxms wrote
Reply to comment by Putoigituresse in [OC] Youtube has over 1 billion hours of videos, we Built an AI Search Engine that can find exact timestamps for anything on Youtube by simonezchen
What about the entire internet?
malachai926 t1_j64oqam wrote
Reply to comment by terrykrohe in [OC] best-fit lines, correlations: ed spending vs evangelical –– 2020 election by terrykrohe
To be frank, it's just poor presentation. Statisticians like myself will see lots of problems with this. If I am confused, I guarantee that the layperson will be even more so.
>red = Rep states in 2020 election
>blue = Dem states in 2020 election
Even here, you aren't being clear enough. Are they "republican" because their votes for president in the 2020 election were majority in favor of the Republican candidate? Republican because they elected more Republican House congresspeople / senators? I can infer that you're likely referring to the electoral college result, but when people have to infer what you mean with your data, that's just bad practice that is bound to get you in trouble in the future.
>t-tests are usually reported using the p-value
Not always, no. A lot of published research will tell you both the t-statistic AND the p-value. If you're giving us a p-value, you should say it's a p-value, end of story.
>the t-test is sensitive to small mean variations: the top right plot shows the means separated by a SD, which is NOT a small difference ( t-test = 0.000015).
That's great, but why didn't you state that result in the graph? And again, don't say "t-test equals", at least say "t-test p-value equals". It's nonsense to say that a test equals something. The test generates a statistic and a p-value which equal something, but the test itself is a test. It pays to be explicit with what you are saying, or else other statisticians could misinterpret what you are saying. In this case, if someone thought you meant the t-statistic was 0.000015, that would mean the results were highly non-significant and would think you screwed up your calculation.
You seem to have some idea in your mind of how things are "typically" interpreted by various groups of people, but you should NOT rely on those assumptions because inevitably someone will interpret gray area in a way you didn't intend. It is always far, far preferable to be as explicit as you can with your definitions of things.
Again, I think showing this as a sorted scatterplot is just weird. You really ought to show this data as a histogram. You're using a t-test, yeah? So it's really incumbent on you to demonstrate that the data in each group is at least roughly normally distributed, which is what the t-test assumes, to prove to your audience that such a test is acceptable. A histogram achieves that; this scatterplot does not.
Finally, maybe it's just me, but grouping these things together on a state level just feels like you're losing so much detail and misclassifying so much data that I really question the validity of your results. Maybe this is the best you have to work with, but you are classifying a state that went 51% in favor of the Democrat as 100% Democratic and vice versa, which then classifies every single school district in that state, including the likely numerous rural school districts where people are more likely to be conservative, as "Democratic" school districts contributing however much money they contributed towards education. You'd get a lot more robust data and far less of this kind of error if you were able to get this data by school district. If you don't have that data, it is what it is, but the end result is that I'll consider everything I said here and think "eh, this is kinda just bad analysis and is meaningless" and it gets disregarded. And I imagine you wouldn't want the analysis you spent all of this time and effort on to be disregarded, yeah?
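To illustrate the distinction drawn in the comment above between a t-statistic and a p-value, here is a minimal standard-library sketch. It assumes Welch's unequal-variance t-test (the original post does not say which variant was used), integrates the t-distribution numerically with Simpson's rule, and runs on made-up sample values that are not the post's data.

```python
import math
from statistics import mean, variance


def welch_t(a, b):
    """Return (t_statistic, degrees_of_freedom) for Welch's t-test."""
    va, vb = variance(a) / len(a), variance(b) / len(b)
    t = (mean(a) - mean(b)) / math.sqrt(va + vb)
    # Welch-Satterthwaite approximation for the degrees of freedom.
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df


def t_pdf(x, df):
    """Density of Student's t-distribution with df degrees of freedom."""
    log_c = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
             - 0.5 * math.log(df * math.pi))
    return math.exp(log_c - (df + 1) / 2 * math.log1p(x * x / df))


def two_sided_p(t, df, steps=2000):
    """Two-sided p-value: 1 - 2 * integral of the pdf from 0 to |t|."""
    upper = abs(t)
    if upper == 0:
        return 1.0
    h = upper / steps  # steps must be even for Simpson's rule
    total = t_pdf(0, df) + t_pdf(upper, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    return max(0.0, 1 - 2 * total * h / 3)


# HYPOTHETICAL sample values for two groups of states (not the post's data).
group_red = [9.1, 10.4, 8.7, 9.9, 10.2, 9.5]
group_blue = [12.3, 13.1, 11.8, 12.9, 13.4, 12.0]

t_stat, df = welch_t(group_red, group_blue)
p_value = two_sided_p(t_stat, df)
print(f"t-statistic = {t_stat:.3f}, df = {df:.1f}, p-value = {p_value:.6f}")
```

The test produces two different numbers: a t-statistic far from zero goes with a p-value near zero, which is why a bare "t-test = 0.000015" is ambiguous. The numerical integration loses precision for extremely small p-values, so a statistics library is the better choice for real analysis.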
GoodAfternoon2459 t1_j64onzs wrote
Reply to [OC] Russia new cars sales 2005/2022 by SeeYouHenTee
Now, they care about the environment.
BradMH88 t1_j64ohh8 wrote
Reply to [OC] Youtube has over 1 billion hours of videos, we Built an AI Search Engine that can find exact timestamps for anything on Youtube by simonezchen
I feel like we’ve all let Reddit down. Look how small it is. It’s time to increase our Reddit participation. This is just embarrassing. I have to imagine there are more random safes or something to generate mini hysteria.
latinometrics OP t1_j64nw5z wrote
Reply to [OC] Puerto Rico, with 3.3M people or 0.4% of LatAm's population, is the birthplace of 6 of the region's top 10 most streamed artists on Spotify. 🇵🇷 by latinometrics
Source: https://kworb.net/spotify/artists.html
Tools: Excel, Rawgraphs, Affinity Designer
latinometrics OP t1_j64npvc wrote
Reply to [OC] Puerto Rico, with 3.3M people or 0.4% of LatAm's population, is the birthplace of 6 of the region's top 10 most streamed artists on Spotify. 🇵🇷 by latinometrics
From our newsletter:
Is Puerto Rico the music capital of the world? 🇵🇷
Puerto Rico, with 3.3M people or 0.4% of LatAm's population, is the birthplace of 6 of the region's top 10 most streamed artists on Spotify. Many of them are also top artists worldwide.
There is no way such a stat is a product of chance.
There must be an incredible force behind the success of so many artists from a tiny island roughly the size of Connecticut, the US's 3rd smallest state.
For over a hundred years, the island has been the motherland of original music genres:
• Bomba by enslaved Africans
• Plena by Jíbaros (native farmers)
• Danza (adapted from Europe's contradanza)
• and more recently, Reggaeton and Latin trap
The US territory, the only one that maintains Spanish as the official language, has one of the world's highest concentrations of music stars per capita (perhaps the highest).
When looking at Spotify streams, singers like Ricky Martin or Chayanne are at a disadvantage because they became big well before Spotify existed, so they do not appear on our chart.
However, the last 20 or so years have brought about a new era of rappers like Residente and reggaeton superstars, best exemplified by Bad Bunny, currently the most streamed artist on the planet for three years in a row.
Bad Bunny was inspired by “the King of reggaeton,” Daddy Yankee, who is 4th on the list despite also having somewhat of a disadvantage. His iconic song, Gasolina, came out in 2004, when your writer still burned custom CDs and used a Walkman.
Gasolina was listed at #50 on Rolling Stone's 500 Greatest Songs of All Time, and there's absolutely no way you haven’t heard it before.
Colombian J Balvin is number two on the list and has more streams than Dua Lipa and Taylor Swift. Behind every great artist, there's a great producer.
In J Balvin's case, that person is “Sky Rompiendo,” who is responsible for some of J Balvin's greatest hits and collaborations like Safari with Pharrell Williams. Sky has also produced songs for Ozuna and Maluma, also top 10 artists, and many other Latin stars.
So, undoubtedly, Puerto Rico and Colombia are LatAm's music capitals. The big inexplicable question is: why are there 0 artists from Brazil and Mexico in Latin America's top 10? 🇲🇽🇧🇷
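The "no way such a stat is a product of chance" claim in the newsletter above can be checked with a rough binomial calculation. This sketch assumes, purely as a null model, that each top-10 artist is drawn independently and uniformly from LatAm's population; that assumption and the function name are illustrative, not part of the original analysis.

```python
from math import comb


def prob_at_least(k_min, n, p):
    """P(X >= k_min) for X ~ Binomial(n, p)."""
    return sum(comb(n, k) * p**k * (1 - p) ** (n - k)
               for k in range(k_min, n + 1))


POP_SHARE = 0.004  # Puerto Rico's share of LatAm's population (from the post)
TOP_N = 10         # size of the top list
FROM_PR = 6        # artists on the list born in Puerto Rico

# How many times larger the share of the list is than the share of population.
over_representation = (FROM_PR / TOP_N) / POP_SHARE

# ASSUMPTION: artists are independent uniform draws from the population.
chance = prob_at_least(FROM_PR, TOP_N, POP_SHARE)

print(f"Over-representation: {over_representation:.0f}x")
print(f"Probability under the null model: {chance:.2e}")
```

A tiny probability only rules out this crude null model; it says nothing about which cause (language, industry, genre history) explains the concentration.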
Jrubas t1_j64nlyo wrote
Reply to comment by myspicename in [OC] Search interest for "Trump" vs "Biden" (2014-2022) by 1brt
Fox News is a lying propaganda machine too. None of these media outlets is objective. They're all biased one way or another and staffed by lying scumbags. And that's just the news team. The pundits are actively evil and work night and day to continue dividing us. If there's a wound on the body politic, you can count on these assholes to rub dirt and broken glass in it.
[deleted] t1_j64nkb2 wrote
ar243 t1_j64mzpn wrote
JoffeJoffer t1_j64moop wrote
Reply to comment by miskathonic in [OC] Youtube has over 1 billion hours of videos, we Built an AI Search Engine that can find exact timestamps for anything on Youtube by simonezchen
Tbf, that would be the case for a significant portion of the British Library as well.
Levi_MS OP t1_j64m79k wrote
Reply to comment by porncornroz in [OC] Current World-Champions and Gold-Medalists in most popular team sports at the end of 2022 by Levi_MS
you mean Basketball ;-)
ReturnToOdessa t1_j655ajm wrote
Reply to comment by Veggies-are-okay in [OC] Puerto Rico, with 3.3M people or 0.4% of LatAm's population, is the birthplace of 6 of the region's top 10 most streamed artists on Spotify. 🇵🇷 by latinometrics
In what way?