Recent comments in /f/deeplearning

mcottondesign t1_jc2dygc wrote

You can increase the amount of your existing defect images but flipping, rotating, cropping the images in a pre-processing step.

It isn’t a perfect answer but it is a great way to augment the limited data you already have.

7

saintshing t1_jc26rk0 wrote

I feel like it should be possible to extend diffusion transformer technique to code generation for web development.

You can input a screenshot of a static webpage, then use a text prompt like 'Change the style to fit a futuristic theme', or just input a low fidelity UI wireframe and it can generate a detailed webpage with the html and css. We can get training data from the internet for self supervised learning.

Also retrieval transformers or models that know how to query APIs, databases and prompt other models.

2

amhotw t1_jc0mf55 wrote

If you are serious, I would recommend working on Rudin's Principles of Math Analysis. It might take a day (or more...) to wrap your head around a single proof but at the end you'll be ready to read anything (of course you might need to check some definitions.)

For KL divergence, entropy etc., Info Theory book by Mackay is great.

For hessian, well it is just calculus; the second derivative of a multivariate function. To understand its uses, you would need some understanding of numerical analysis and concave programming. For the latter, Boyd's optimization book is a classic. I don't remember a good book on numerical analysis but some diff. eqn.s books have nice chapters on it.

8

deephugs t1_jc0lhog wrote

First try and understand every symbol in the equation, there are cheat sheets online. Second, most math concepts have a wikipedia page you can read, go down those rabbit holes and sooner or later you will find common threads and start to build an understanding. Finally, just put the time in, math is like everything else and just takes lots and lots of practice.

3

SpigotNerd OP t1_jbulrzc wrote

My primary interest is RL, because I need it for my project (I don't think, that standard NN would work, because the enviroment changes so rapidly, that I can't use multiple agents, bec the results won't be comparable). I know some basics - I used TensorFlow in python for few times and I've built my own simple NN using matrix multiplication in unity few years ago. I just don't know much about RL and I don't know anything about neural networks in java, but I am trying to mod a java game and I didn't want to write the project in two languages before asking if there is some less wired solution.

1

seamacke t1_jbubl55 wrote

You can learn with lower cards. I still use a GTX1050Ti for training on some models. Works great. The important thing is learning how to use resources efficiently. If you can use Collab or a lower card well, then you’ll know the moment you really, actually need a better card (or cards).

2