Recent comments in /f/deeplearning
saintshing t1_jc26rk0 wrote
Reply to Which topic in deep learning do you think will become relevant or popular in the future? by gokulPRO
I feel like it should be possible to extend diffusion transformer technique to code generation for web development.
You can input a screenshot of a static webpage, then use a text prompt like 'Change the style to fit a futuristic theme', or just input a low fidelity UI wireframe and it can generate a detailed webpage with the html and css. We can get training data from the internet for self supervised learning.
Also retrieval transformers or models that know how to query APIs, databases and prompt other models.
Buddy77777 t1_jc1ytfu wrote
Reply to Which topic in deep learning do you think will become relevant or popular in the future? by gokulPRO
RL has potential
N0bb1 t1_jc12rsa wrote
Reply to Which topic in deep learning do you think will become relevant or popular in the future? by gokulPRO
Symbolic AI
atm_vestibule t1_jc0shl5 wrote
Reply to Which topic in deep learning do you think will become relevant or popular in the future? by gokulPRO
Which continual learning papers are you referring to?
amhotw t1_jc0r0nn wrote
Reply to comment by nirnamous in Recommendations sources for Understanding Advanced Mathematical Concepts in Research Papers? by nirnamous
I just meant this would take significant amount of time. I think it is impossible to do research in a quantitative field without understanding these so I would say it is well worth the investment. But most people are not concerned with research or even understanding the methods.
Sanavesa t1_jc0qosn wrote
Reply to comment by hondajacka in Which topic in deep learning do you think will become relevant or popular in the future? by gokulPRO
I guess they're referring to GPT4's release
hondajacka t1_jc0o7my wrote
Reply to comment by SweatyBicycle9758 in Which topic in deep learning do you think will become relevant or popular in the future? by gokulPRO
Why next week will be big?
nirnamous OP t1_jc0n7vx wrote
Reply to comment by amhotw in Recommendations sources for Understanding Advanced Mathematical Concepts in Research Papers? by nirnamous
Just asking (Not trying to offend you or anything)
Why you asked whether this is serious ? Are above are very basic things ?
nirnamous OP t1_jc0mvz9 wrote
Reply to comment by amhotw in Recommendations sources for Understanding Advanced Mathematical Concepts in Research Papers? by nirnamous
Thank you very much.
Your comment is very helpful.
I'll refer these sources.
amhotw t1_jc0mf55 wrote
Reply to Recommendations sources for Understanding Advanced Mathematical Concepts in Research Papers? by nirnamous
If you are serious, I would recommend working on Rudin's Principles of Math Analysis. It might take a day (or more...) to wrap your head around a single proof but at the end you'll be ready to read anything (of course you might need to check some definitions.)
For KL divergence, entropy etc., Info Theory book by Mackay is great.
For hessian, well it is just calculus; the second derivative of a multivariate function. To understand its uses, you would need some understanding of numerical analysis and concave programming. For the latter, Boyd's optimization book is a classic. I don't remember a good book on numerical analysis but some diff. eqn.s books have nice chapters on it.
SweatyBicycle9758 t1_jc0meri wrote
Reply to comment by Philpax in Which topic in deep learning do you think will become relevant or popular in the future? by gokulPRO
It’s already happening and also next week will be big
deephugs t1_jc0lhog wrote
Reply to Recommendations sources for Understanding Advanced Mathematical Concepts in Research Papers? by nirnamous
First try and understand every symbol in the equation, there are cheat sheets online. Second, most math concepts have a wikipedia page you can read, go down those rabbit holes and sooner or later you will find common threads and start to build an understanding. Finally, just put the time in, math is like everything else and just takes lots and lots of practice.
Philpax t1_jc0jcie wrote
Reply to Which topic in deep learning do you think will become relevant or popular in the future? by gokulPRO
The usual answer to this is "multimodal" and I think that's still true, especially with recent advances. We'll see in the next few months :)
RoboiosMut t1_jc00wmj wrote
Reply to Recommendations sources for Understanding Advanced Mathematical Concepts in Research Papers? by nirnamous
I have an idea, maybe ask Chatgpt to make some stories to explain those abstract concepts
Nerveregenerator t1_jbzo0x3 wrote
Reply to Recommendations sources for Understanding Advanced Mathematical Concepts in Research Papers? by nirnamous
Do problems involving the equations on paper and also read and copy down articles that are written on them.
eugene129 OP t1_jbwdfl2 wrote
Reply to comment by [deleted] in what exactly is Variance(Xt) during the Forward Process in Diffusion model ? by eugene129
As fas as I know, N(Xt ; ... , BtI) means that the V(Xt) = Bt, But if it is so, the equation two equation in the picture seems to be contradictory.
eugene129 OP t1_jbwd8i7 wrote
Reply to comment by [deleted] in what exactly is Variance(Xt) during the Forward Process in Diffusion model ? by eugene129
So... N(Xt ; ... , BtI) doesn't mean that the V(Xt) = Bt ?
mmeeh t1_jbwcubc wrote
Reply to How to code a PPO neural network in java by SpigotNerd
oh no, not java -_-
SpigotNerd OP t1_jbulrzc wrote
Reply to comment by stillworkin in How to code a PPO neural network in java by SpigotNerd
My primary interest is RL, because I need it for my project (I don't think, that standard NN would work, because the enviroment changes so rapidly, that I can't use multiple agents, bec the results won't be comparable). I know some basics - I used TensorFlow in python for few times and I've built my own simple NN using matrix multiplication in unity few years ago. I just don't know much about RL and I don't know anything about neural networks in java, but I am trying to mod a java game and I didn't want to write the project in two languages before asking if there is some less wired solution.
SpigotNerd OP t1_jbukdj8 wrote
Reply to comment by deepneuralnetwork in How to code a PPO neural network in java by SpigotNerd
I need to add it as a mod for a java game and I am trying to avoid having a second program in a different language because I am too lazy to do it, so I was asking if there is some better solution.
No_Dust_9578 t1_jbujxkt wrote
Reply to How to code a PPO neural network in java by SpigotNerd
Remind me 999999 years
seamacke t1_jbubl55 wrote
You can learn with lower cards. I still use a GTX1050Ti for training on some models. Works great. The important thing is learning how to use resources efficiently. If you can use Collab or a lower card well, then you’ll know the moment you really, actually need a better card (or cards).
morifo t1_jbu6rs8 wrote
Reply to How to code a PPO neural network in java by SpigotNerd
OP u ok mate?
deepneuralnetwork t1_jbu43a1 wrote
Reply to How to code a PPO neural network in java by SpigotNerd
> in Java
but why?
mcottondesign t1_jc2dygc wrote
Reply to Using GANs to generate defective data by Tekno-12345
You can increase the amount of your existing defect images but flipping, rotating, cropping the images in a pre-processing step.
It isn’t a perfect answer but it is a great way to augment the limited data you already have.