Recent comments in /f/deeplearning
JJJJJJtti t1_j17zluh wrote
Reply to Backprop, beginners question by Matthyou200
Watch Karpathy's videos about it and you good to go!
veb101 t1_j17yqh7 wrote
Reply to Backprop, beginners question by Matthyou200
Shameful plug, but I didn't want copy and format links on mobile.
https://github.com/veb-101/Neural-Networks-from-scratch/tree/master/Multi-Layer-Perceptron
In the readme I've listed the links that I used for learning and implementing neural networks from scratch.
Nerveregenerator t1_j16yfnw wrote
Reply to Backprop, beginners question by Matthyou200
Write all the equations out one paper, then do one forward and backward pass on paper as well with a simple mlp. I believe bias can be easily incorporated using an extra 1 in the input and using an extra weight as the bias, so it’s updated the same as any other weight. Also learn the basics of matrix multiplication.
Outrageous_Room_3167 OP t1_j16i2km wrote
Reply to comment by peder2tm in Biggest 3090 deep learning rigs? 4x ? 8x? 64x? by Outrageous_Room_3167
Cool, this seems interesting. Do you know what CPUs they are ?
Outrageous_Room_3167 OP t1_j16hyqb wrote
Reply to comment by VinnyVeritas in Biggest 3090 deep learning rigs? 4x ? 8x? 64x? by Outrageous_Room_3167
That would be funny infrastructure startup that doesn't know anything about infrastructure LOL we're a robotics company :)
Nerveregenerator t1_j1572fl wrote
Reply to comment by SwanningNonchalantly in Course/Book for intermediate to advanced knowledge by SwanningNonchalantly
Deep learning is much different from typical programming topics in that it is composed of a large amount of mathematical and complex theoretical concepts that are not avoidable using a library. Getting the code to run is relatively easy, and the choice of library has mostly to do with deployment goals and utilizing existing implementations. When things aren't working, theres not really a compiler error as to what's wrong with the model/data pipeline, and deep theoretical knowledge comes into play.
jetnew_sg t1_j14agqq wrote
I very much enjoyed The Kaggle Book, which shares many useful insights from Kaggle grandmasters from their competition experience. Many of them are also active on Twitter where they share more practical tips.
Alone_Bee_6221 t1_j13x5dr wrote
Reply to comment by No_Client_8130 in Course/Book for intermediate to advanced knowledge by SwanningNonchalantly
Deep Learning by Goodfellow really gave me some good insights and tips for training a model, also covering the mathematical background around the ideas.
rubbledubbletrubble t1_j13ejfu wrote
Reply to How to train a model to distinguish images of class 'A' from images of class 'B'. The model can only be trained on images of class 'A'. by 1kay7
Would a Siamese network work here?
[deleted] t1_j137ek9 wrote
GrumpyGeologist t1_j12yawz wrote
Reply to How to train a model to distinguish images of class 'A' from images of class 'B'. The model can only be trained on images of class 'A'. by 1kay7
Train a GAN on the images of class A. The generator will draw samples from the distribution outlined by the images in class A. The discriminator measures the distance between given sample and this distribution. So once you finish training on class A, the critic will tell you whether or not a given image belongs to class A.
An alternative approach is to do self-supervised representation learning (like BYOL) and compare the projection distance between a pair of A and B images.
SwanningNonchalantly OP t1_j12st1d wrote
Reply to comment by Nerveregenerator in Course/Book for intermediate to advanced knowledge by SwanningNonchalantly
Interesting - so it’s not only about the practical details of using those libraries but also contains general nuggets of info? I use PyTorch.
No_Client_8130 t1_j12if26 wrote
I always use the deeplearningbook from Goodfellow and Mathematics for machine learning as reference. Although old, the first book is definitely worth the effort.
danja t1_j121q19 wrote
Reply to How to train a model to distinguish images of class 'A' from images of class 'B'. The model can only be trained on images of class 'A'. by 1kay7
You're a bit stuck, surely..? As far as the model is concerned, there is no class 'B'. Would mashing up images in 'A' be allowed? Random images? Noise?
[deleted] t1_j11ae9i wrote
Reply to How to train a model to distinguish images of class 'A' from images of class 'B'. The model can only be trained on images of class 'A'. by 1kay7
come back to this planet in 10 years and everybody will be screaming 1s and 0s at each other.
Nerveregenerator t1_j10ylwt wrote
Hands on machine learning with tensorflow and scikit learn
mschrimpf t1_j10xnch wrote
[deleted] t1_j10a7y0 wrote
trajo123 t1_j1001dl wrote
Reply to How to train a model to distinguish images of class 'A' from images of class 'B'. The model can only be trained on images of class 'A'. by 1kay7
Look into OOD (Out of distribution) sample detection. If you go down the auto-encoder route then this paper can give you some pointers: Detecting Out-of-distribution Samples via Variational Auto-encoder with Reliable Uncertainty Estimation
Please note that OOD sample detection is an open problem and active research topic.
sayoonarachu t1_j0zlw4i wrote
Reply to comment by macORnvidia in laptop for Data Science and Scientific Computing: proart vs legion 7i vs thinkpad p16/p1-gen5 by macORnvidia
No. I was just using pandas (cpu) for simple quick regex and removing and replacing text rows. It was just for a hobby project. The data was scraped from Midjourney and Stable diffusion discord so there were millions of rows of duplicate prompts and poor quality prompts which I had pandas delete and in the end the number of unique rows with more than 50 characters amounted to about 700k which was then used to train gpt-neo 125m.
I didn't know about cudf. Thanks 😅
PredictorX1 t1_j0z7qtr wrote
Reply to How to train a model to distinguish images of class 'A' from images of class 'B'. The model can only be trained on images of class 'A'. by 1kay7
This is known as one-class learning or one-class classification. You could try obtaining "background class" images (images similar to yours in resolution, overall brightness, ...) and training an ordinary classifier on the combination of the two. Obviously, the background class images cannot contain food, but searches for things unrelated to food ("nail", "dancer", "floor", "statue", ...) followed up by quick visual inspection should serve.
macORnvidia OP t1_j0z24ie wrote
Reply to comment by sayoonarachu in laptop for Data Science and Scientific Computing: proart vs legion 7i vs thinkpad p16/p1-gen5 by macORnvidia
>. For example, the largest Parque file I've cleaned in pandas was about 7 million rows and about 10gb in size of just text. It can run queries through it in a few seconds.
Using rapids? Like cudf?
sayoonarachu t1_j0yn75o wrote
Reply to comment by macORnvidia in laptop for Data Science and Scientific Computing: proart vs legion 7i vs thinkpad p16/p1-gen5 by macORnvidia
I've only starting learning DL a month ago so mostly have been doing simple ANN. But inferencing larger param NLP models, GANS, Diffusion models, etc is fine. It's no desktop 3090s or enterprise grade GPU but for a laptop it's by far the best on the market. For example, the largest Parque file I've cleaned in pandas was about 7 million rows and about 10gb in size of just text. It can run queries through it in a few seconds.
Guess it depends on what kind of data science or dl you're looking to do. The 3080s probably won't be able to fine tune something like BLOOM model but can fine tune stable diffusion models with enough optimizations.
For modeling in blender or procedural generation in something like Houdini, I haven't had issues. I've made procedurally generated 20km height maps in Houdini to export to Unreal Engine and was not a problem.
mr_birrd t1_j0ybbj8 wrote
Reply to comment by MeMyself_And_Whateva in Biggest 3090 deep learning rigs? 4x ? 8x? 64x? by Outrageous_Room_3167
mining rigs are for mining not dl
ReallySeriousFrog t1_j1dzzvg wrote
Reply to Student & Need Help by alla_n_barakat
You could try to ask your University if they have a computer cluster/server for these things or alternatively ask if they would reimburse the cost for colab resources