Recent comments in /f/deeplearning
tartenpi0n t1_j5qvv9e wrote
I personally started DeepLearning with this free course : Intro to DeepLearning with Pytorch and I really recommend it. It's long but you don't have to finish it entirely to understand key principles. This course does not learn how to deal with segmentation tasks, but one you've understand key principles of DeepLearning, you will understand any blogpost on segmentation.
suflaj t1_j5qb32y wrote
There is this: https://www.microsoft.com/en-us/research/blog/%C2%B5transfer-a-technique-for-hyperparameter-tuning-of-enormous-neural-networks/
However, it's unlikely to help in your case. The best thing you can do is grid search if you know something about the problem, or just random search. I prefer random search even if I'm am expert for the problem, ESPECIALLY with ML models.
But I'm curious how it takes a long time. You don't have to train the whole dataset. Take 10% for training and 10% for validation, or less if that dataset is huge. You just need enough data to learn something. Then your optimal hyperparameters are a good enough approximation.
Also, it might help to just not tune redundant hyperparameters. Layer sizes are usually such, as is almost any hyperparameter in the Adam family of optimizers besides learning rate and to a lesser extent first momentum. Which ones are you optimizing?
Ayakalam t1_j5pvihn wrote
Reply to comment by BullyMaguireJr in arXiv Feed: Keep up with AI research, the easy way. by BullyMaguireJr
Ah, so you hand pick LLM papers and compute their embedding s essentially ?
BullyMaguireJr OP t1_j5pq2uj wrote
Reply to comment by Ayakalam in arXiv Feed: Keep up with AI research, the easy way. by BullyMaguireJr
Calculating embeddings for the paper using OpenAI's API.
Ayakalam t1_j5pa0uo wrote
Reply to comment by BullyMaguireJr in arXiv Feed: Keep up with AI research, the easy way. by BullyMaguireJr
Ok, but when you say you are “indexing papers”, what do you mean exactly ?
SimulatedAnnealing t1_j5p78wg wrote
The best way to learn anything (also deep learning), especially for beginners, tends to be IMO to follow some structured approach and stick to it. So pick a good book or course and try to finish it (or the fundamental parts). It may take a bit longer to see progress than randomly following youtobe tutorials or notebooks, but builds a solid base that makes learning in the mid/long term more efficient.
operator_alpha t1_j5ouj8o wrote
FastestLearner t1_j5ogton wrote
It actually depends on what you want to achieve. For example, if you want to do research in DL, the best way is not to start with DL at all and instead do some fundamental math courses like LinAlg, Prob/Stats, Intermediate and Advanced Calc, etc., then turn to traditional ML, and only after that you do DL. This is the bottom-up approach and it is a long journey that takes years. But from your post, it seems that you are looking for a quick top-down approach. For that, I would suggest you simply look into some medium.com articles, youtube videos, udemy courses and most importantly the dive head first into coding (try running as many examples from github as you can). Try reproducing some basic results, like getting >90% accuracy on CIFAR-10 classification with a ResNet model. You could also try getting into a bootcamp if there's one going on nearby.
JJJJJJtti t1_j5oe17b wrote
There's this thing called google where you can insert text, type 'deep learning' and, after that, press the search button. You'll be amazed by how many results you'll get in a fraction of a second. Then you click on, say, a blog post, you start reading it. The first technical word that you see and don't know what it means you select it, right click it and press 'Search 'word' with google' and open whatever file that seems plausible. Do this for all unknown words on all posts, articles, papers, videos until your recursion ends. Also, don't forget to get your hands dirty!
Blue_mecha_ t1_j5o4bu5 wrote
I started learning machine and deep learning with two youtube channel.
Daniel Bourke -> https://www.youtube.com/@mrdbourke
TechWithTim -> https://www.youtube.com/@TechWithTim
​
Once you learned the basics, you should start a short deep learning project, in this project, you will make mistakes wich you will learn a lot from.
maxwell-alive t1_j5o1j25 wrote
This is a good series for understanding the fundamental ideas of deep learning: https://www.youtube.com/playlist?list=PLZHQObOWTQDNU6R1_67000Dx_ZCJB-3pi
Other than that, I would recommend installing PyTorch and running/modifying some projects from GitHub. I personally learn the most from looking at other people's code.
BullyMaguireJr OP t1_j5n3itx wrote
Reply to comment by Ayakalam in arXiv Feed: Keep up with AI research, the easy way. by BullyMaguireJr
Thanks! I'm currently just using the arXiv API with OpenAI embeddings for semantic search.
idktbhfamsenpai t1_j5mlxvb wrote
Kinda lame content for this sub
ImpeccableLlama t1_j5lpnc5 wrote
oh my goodness the ears
Not_DavidGrinsfelder t1_j5loz4p wrote
This honestly just looks like really shitty photoshop
Ayakalam t1_j5loxsx wrote
Awesome tool!
Some qs, how do you actually index those papers exactly ? Do you use openalex or ?…
Secondly, are you using a library for semantic search ?
bbbastiannnn t1_j5lnnda wrote
Reply to Tensorflow or Pytorch by ContributionWild5778
Darknet
zabirauf t1_j5lebj0 wrote
Reply to comment by fasync in arXiv Feed: Keep up with AI research, the easy way. by BullyMaguireJr
+1 to adding RSS feed
Hairy_Advice6669 t1_j5lbbfx wrote
So he also sweat through all of them? Including the leather jacket? Thats some impressive sweating.
Internal_Bell_3142 t1_j5l5nsn wrote
Reply to comment by mining4goldwinsmith in Chris Hemsworth Dressed in different attires using AI by oridnary_artist
I think it’s instruct pix2pix. Check
[deleted] t1_j5l5i14 wrote
[deleted]
mining4goldwinsmith t1_j5l2to9 wrote
Reply to comment by developers_hutt in Chris Hemsworth Dressed in different attires using AI by oridnary_artist
meant what ai is op using
developers_hutt t1_j5l2qzw wrote
Reply to comment by mining4goldwinsmith in Chris Hemsworth Dressed in different attires using AI by oridnary_artist
Yup
mining4goldwinsmith t1_j5l2p1q wrote
Reply to comment by developers_hutt in Chris Hemsworth Dressed in different attires using AI by oridnary_artist
what ai are you using?
agentfuzzy999 t1_j5qy82t wrote
Reply to Best cloud to train models with 100-200 GB of data? by Zealousideal-Copy463
“Should I just buy a 4090”
Ok Jeff Bezos
4090 clock speed is going to be faster than similar instances that use T4s, plus wayyyyyyy more CUDA cores. Training will be significantly faster, if you can fit the model on the 4090. If you can “business expense” a 4090 for your own machine, good lord do that.