suflaj t1_j4pdd6o wrote on January 17, 2023 at 9:11 AM

Reply to comment by elf7979 in Is 100 mega byte text corpus big enought to train? by elf7979

You're closer, but not quite there yet - the smaller Google News dataset that W2V is trained on is 10 GB. The full one is around 300 GB, IIRC.

2thleZ t1_j4ooqax wrote on January 17, 2023 at 4:33 AM

Reply to comment by myth_drannon in Is doing TensorFlow certificate worth the time and effort? by MACKBULLERZ

I've heard a lot of people say the same thing, but could you explain why PyTorch is better than TF?

elf7979 OP t1_j4o933j wrote on January 17, 2023 at 2:38 AM

Reply to comment by BellyDancerUrgot in Is 100 mega byte text corpus big enought to train? by elf7979

I will check Gensim documentation. Thank you

elf7979 OP t1_j4o90u3 wrote on January 17, 2023 at 2:37 AM

Reply to comment by suflaj in Is 100 mega byte text corpus big enought to train? by elf7979

I think transcripts of company conference calls have certain characteristics of their own, since business professionals may use particular verbs or expressions. I haven't checked out the w2v datasets you mentioned yet. Is there an existing corpus that's business-oriented?

What if the dataset size increases to 1 gigabyte? Is that big enough?

BellyDancerUrgot t1_j4nw2ku wrote on January 17, 2023 at 1:06 AM

Reply to comment by suflaj in Is 100 mega byte text corpus big enought to train? by elf7979

The Gensim documentation itself lists them, along with the arguments needed to download and use them.
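For anyone landing here later, a rough sketch of what that looks like with Gensim's downloader module. Assumptions: `gensim` is installed, the machine has network access, and `word2vec-google-news-300` is the catalogue name of the Google News vectors (it is a large download, so check the catalogue first). The helper names `load_pretrained_vectors` and `nearest_words` are made up for this example.

```python
# Assumption: gensim is installed (pip install gensim) and network access is
# available. The model name is taken from the gensim-data catalogue.
DEFAULT_MODEL = "word2vec-google-news-300"


def load_pretrained_vectors(name: str = DEFAULT_MODEL):
    """Download the vectors on first use (cached afterwards) and return them."""
    import gensim.downloader as api  # imported lazily so the module loads without gensim

    return api.load(name)


def nearest_words(vectors, word: str, topn: int = 5):
    """Return the topn most similar words, or an empty list for unknown words."""
    if word not in vectors:
        return []
    return vectors.most_similar(word, topn=topn)
```

Usage would be something like `vectors = load_pretrained_vectors()` followed by `nearest_words(vectors, "revenue")`. Words missing from the pretrained vocabulary come back as an empty list, which is a quick way to see how much of a business-specific vocabulary is already covered.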

Moises-Tohias t1_j4mw2nc wrote on January 16, 2023 at 9:04 PM

Reply to comment by agentfuzzy999 in Retrieve voice from noisy audio file by BackgroundPass2082

VAD won't do it, since the speech and the noise overlap

bhargavkartik t1_j4mqdsf wrote on January 16, 2023 at 8:28 PM

Reply to comment by suflaj in Is 100 mega byte text corpus big enought to train? by elf7979

This.

SometimesZero t1_j4mo6ra wrote on January 16, 2023 at 8:15 PM

Reply to comment by myth_drannon in Is doing TensorFlow certificate worth the time and effort? by MACKBULLERZ

Thanks for the info. I knew PyTorch was popular, but didn’t know about TF’s waning popularity. Good to know.

myth_drannon t1_j4mmfha wrote on January 16, 2023 at 8:04 PM

Reply to comment by SometimesZero in Is doing TensorFlow certificate worth the time and effort? by MACKBULLERZ

TF out, PyTorch in. Google has always struggled to build long-lasting ecosystems for its products. Angular was first, but then Facebook swooped in with React. The same happened with TF and PyTorch.

[deleted] t1_j4mho46 wrote on January 16, 2023 at 7:34 PM

Reply to comment by SometimesZero in Is doing TensorFlow certificate worth the time and effort? by MACKBULLERZ

PyTorch is where it’s at!

SometimesZero t1_j4mgn69 wrote on January 16, 2023 at 7:27 PM

Reply to comment by myth_drannon in Is doing TensorFlow certificate worth the time and effort? by MACKBULLERZ

Can you say more about “on its way out?” Are other libraries more popular in your opinion?

SupremeChampionOfDi t1_j4lwzff wrote on January 16, 2023 at 5:26 PM

Reply to comment by Legitimate-Gold-8711 in Help with deep learning project "autocorrection" by Legitimate-Gold-8711

This is how a Chinese person I know speaks.

Legitimate-Gold-8711 OP t1_j4l9swg wrote on January 16, 2023 at 2:54 PM

Reply to comment by SupremeChampionOfDi in Help with deep learning project "autocorrection" by Legitimate-Gold-8711

Which reasons :D

SupremeChampionOfDi t1_j4l4r6u wrote on January 16, 2023 at 2:16 PM

Reply to Help with deep learning project "autocorrection" by Legitimate-Gold-8711

I read this in a funny Chinese accent for some reason.

suflaj t1_j4l1i5l wrote on January 16, 2023 at 1:49 PM

Reply to Is 100 mega byte text corpus big enought to train? by elf7979

Likely not enough, at least not for what is considered good. But I fail to see why you'd want to train it yourself; there are plenty of readily available w2v weights or vocabularies.

myth_drannon t1_j4l048h wrote on January 16, 2023 at 1:37 PM

Reply to Is doing TensorFlow certificate worth the time and effort? by MACKBULLERZ

TF is on its way out. But in general, get certificates for fundamentals, not for frameworks that change every couple of years. Certificates are popular in the ops domain, so if you are planning on doing MLOps, get something related to that.

Trick2206 t1_j4kkijl wrote on January 16, 2023 at 10:42 AM

Reply to Is doing TensorFlow certificate worth the time and effort? by MACKBULLERZ

IMO not really. From what I've heard and seen, most jobs don't really care about random certs from courses. You can learn everything the course teaches from free resources, so just search online for the topics you're interested in.

Legitimate-Gold-8711 OP t1_j4kfxo9 wrote on January 16, 2023 at 9:40 AM

Reply to comment by thatoneboii in Help with deep learning project "autocorrection" by Legitimate-Gold-8711

I tried the Levenshtein distance algorithm, but it doesn't work the way I want.

thatoneboii t1_j4jf8h5 wrote on January 16, 2023 at 3:23 AM

Reply to Help with deep learning project "autocorrection" by Legitimate-Gold-8711

Do you absolutely need to use deep learning? There are tons of much faster autocorrect implementations that use Levenshtein distance and other non-DL techniques, such as SymSpell or Norvig’s algorithm. DL is complicated, expensive, and requires tons of data to train on - I would stay away from it unless you’re doing this for your own enrichment or a school project.
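For reference, a minimal sketch of the non-DL route: edit distance computed with the standard dynamic-programming recurrence, plus a brute-force lookup against a word list. The `suggest` helper, the `max_distance` cutoff, and the toy vocabulary are assumptions for illustration. This is neither SymSpell nor Norvig's corrector, both of which add smarter candidate generation and word frequencies on top of the same distance idea.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions, or substitutions."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop to save memory
    previous = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, char_a in enumerate(a, start=1):
        current = [i]  # distance from a[:i] to ""
        for j, char_b in enumerate(b, start=1):
            substitution_cost = 0 if char_a == char_b else 1
            current.append(
                min(
                    previous[j] + 1,  # delete char_a
                    current[j - 1] + 1,  # insert char_b
                    previous[j - 1] + substitution_cost,  # substitute or match
                )
            )
        previous = current
    return previous[-1]


def suggest(word: str, vocabulary: list[str], max_distance: int = 2) -> str:
    """Return the closest vocabulary word, or the input if nothing is close enough."""
    if not vocabulary:
        return word
    # Ties are broken alphabetically so the result is deterministic.
    best = min(vocabulary, key=lambda candidate: (levenshtein(word, candidate), candidate))
    return best if levenshtein(word, best) <= max_distance else word
```

Scanning the whole vocabulary is linear in its size per query, which is fine for a few thousand words but slow for a full dictionary. That cost is the main thing SymSpell-style precomputed indexes are designed to avoid.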

shmollerup t1_j4htyot wrote on January 15, 2023 at 8:52 PM

Reply to Help with deep learning project "autocorrection" by Legitimate-Gold-8711

You could try something that works on a character level, like a sequence-to-sequence model, or maybe an RNN approach like char2vec. Both approaches should work pretty well if you have enough training data.
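To make the character-level idea concrete, here is a small sketch of the data preparation step only: turning (misspelled, correct) pairs into padded integer sequences that a seq2seq model could consume. The special tokens, the helper names, and the toy pairs are assumptions for this example, and the model itself is left out.

```python
PAD, SOS, EOS = "<pad>", "<sos>", "<eos>"  # hypothetical special tokens


def build_char_vocab(texts: list[str]) -> dict[str, int]:
    """Map the special tokens and every distinct character to an integer id."""
    chars = sorted({ch for text in texts for ch in text})
    return {token: idx for idx, token in enumerate([PAD, SOS, EOS] + chars)}


def encode(text: str, vocab: dict[str, int], max_len: int) -> list[int]:
    """Wrap text in start/end markers, map to ids, and right-pad to max_len."""
    ids = [vocab[SOS]] + [vocab[ch] for ch in text] + [vocab[EOS]]
    if len(ids) > max_len:
        raise ValueError(f"{text!r} needs {len(ids)} positions, max_len is {max_len}")
    return ids + [vocab[PAD]] * (max_len - len(ids))


# Toy training pairs: (noisy input, clean target).
pairs = [("teh", "the"), ("recieve", "receive")]
vocab = build_char_vocab([text for pair in pairs for text in pair])
max_len = max(len(text) for pair in pairs for text in pair) + 2  # room for SOS and EOS
encoded_pairs = [(encode(src, vocab, max_len), encode(tgt, vocab, max_len)) for src, tgt in pairs]
```

Because the vocabulary is just characters, the model never sees an out-of-vocabulary word, which is the main appeal of working at this level for spelling correction.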

tsgiannis t1_j4fcuo3 wrote on January 15, 2023 at 8:41 AM

Reply to Building an NBA game prediction model - failing to improve between epochs by vagartha

I meant on the betting site

ivan_kudryavtsev t1_j4c5702 wrote on January 14, 2023 at 5:45 PM

Reply to comment by Infamous_Age_7731 in Cloud VM GPU is much slower than my local GPU by Infamous_Age_7731

RAM performance may also be affected by the Meltdown and Spectre patches.

vagartha OP t1_j4c3wer wrote on January 14, 2023 at 5:37 PM

Reply to comment by tsgiannis in Building an NBA game prediction model - failing to improve between epochs by vagartha

Simulate? How would I go about doing that? Sorry if that's a silly/involved question, but I'm not sure how I would simulate NBA games.

tsgiannis t1_j4c3d7c wrote on January 14, 2023 at 5:34 PM

Reply to Building an NBA game prediction model - failing to improve between epochs by vagartha

You can always simulate

vagartha OP t1_j4c36zw wrote on January 14, 2023 at 5:32 PM

Reply to comment by tsgiannis in Building an NBA game prediction model - failing to improve between epochs by vagartha

Haha, I live in CA, so sports gambling is out of the question...

I was actually hoping to maybe write a paper or something and submit it to something like the Sloan conference, or send it in to 538, as an add-on to my resume?

Also, my model uses data from seasons going back all the way to 2014 as of right now. Larger datasets would make a better model, right? So why not use more historical data?

Recent comments in /f/deeplearning