DigThatData
DigThatData t1_ivtx4y1 wrote
the summary reporting you offer describes some of the net differences, but I'd be interested to see numbers describing the distribution of what your team considered to be incorrect labels in the original dataset.
DigThatData t1_iuibdo1 wrote
Reply to comment by __mantissa__ in [D] DL Practitioners, Do You Use Layer Visualization Tools s.a GradCam in Your Process? by DisWastingMyTime
Judea Pearl - "The Book of Why"
DigThatData t1_iue7pne wrote
Reply to comment by jellyfishwhisperer in [D] DL Practitioners, Do You Use Layer Visualization Tools s.a GradCam in Your Process? by DisWastingMyTime
very thought provoking stuff! I wonder if maybe an alternative interpretation of these observations might be something along the lines of deep image prior, i.e. maybe randomly initialized deep architectures are capable of performing edge detection just by virtue of how the gradient responds to the stacked operators?
DigThatData t1_iue3h89 wrote
Reply to comment by Borky_ in [D] DL Practitioners, Do You Use Layer Visualization Tools s.a GradCam in Your Process? by DisWastingMyTime
I think "recently" started about two years after pytorch was released.
DigThatData t1_iue34mz wrote
Reply to comment by slippu in [D] What are the bottlenecks in your ML project lifecycle? What tools would you like to see more widely used? by Fine-Topic-6127
it's called "python"
DigThatData t1_iuadfxx wrote
Reply to [D] What are the bottlenecks in your ML project lifecycle? What tools would you like to see more widely used? by Fine-Topic-6127
i'd like to see more researchers publish their code with a setup.py
DigThatData t1_iu9ppwr wrote
Reply to comment by __mantissa__ in [D] DL Practitioners, Do You Use Layer Visualization Tools s.a GradCam in Your Process? by DisWastingMyTime
have you played with any techniques from causal inference, like counterfactual explanations?
DigThatData t1_iu6zlbo wrote
Reply to comment by Borky_ in [D] DL Practitioners, Do You Use Layer Visualization Tools s.a GradCam in Your Process? by DisWastingMyTime
i have tunnel vision on the pytorch ecosystem (with the occasional jax cameo)
DigThatData t1_iu4rr8y wrote
Reply to [D] DL Practitioners, Do You Use Layer Visualization Tools s.a GradCam in Your Process? by DisWastingMyTime
sometimes, usually more likely to bust something like this out if I have a specific need than it being part of my general process. simpler metrics like gradient magnitude often get the job done well enough.
in any event, it sounds like you're interested in the tooling space so here are a few projects I think are interesting, regardless of whether or not I use them myself:
- https://github.com/f-dangel/cockpit
- https://github.com/f-dangel/backpack
- https://github.com/f-dangel/vivit
- https://github.com/DistrictDataLabs/yellowbrick
- https://github.com/tomgoldstein/loss-landscape
- https://github.com/hila-chefer/Transformer-MM-Explainability
- https://github.com/pytorch/captum
- https://github.com/cleverhans-lab/cleverhans
- https://github.com/TorchDrift/TorchDrift
- https://github.com/MAIF/shapash
- https://github.com/uncertainty-toolbox/uncertainty-toolbox
- https://github.com/ropas/pytea
- https://github.com/oegedijk/explainerdashboard
- https://github.com/deepchecks/deepchecks
- https://github.com/Trusted-AI/AIX360
- https://github.com/delve-team/delve
- https://github.com/CalculatedContent/WeightWatcher
- https://github.com/archinetai/surgeon-pytorch
- https://github.com/xl0/lovely-tensors
DigThatData t1_iu3eb42 wrote
Reply to comment by yaosio in [R] "Re3: Generating Longer Stories With Recursive Reprompting and Revision" - Generating stories of 2000+ words (or even much longer) by 0xWTC
computer vision often overshadows NLP. Hard to compete when something novel is making the rounds with pretty pictures to go with it.
DigThatData t1_iu3dzo3 wrote
Reply to comment by FutureIsMine in [R] "Re3: Generating Longer Stories With Recursive Reprompting and Revision" - Generating stories of 2000+ words (or even much longer) by 0xWTC
i like to think of it as "oh sweet, they did the work for me, now I can jump straight into that other idea that built on top of this one."
DigThatData t1_itvh0i8 wrote
Reply to comment by EnvironmentalBar338 in [D]Cheating in AAAI 2023 rebuttal by [deleted]
> Since the author already knows me, if I do anything he would know it is me.
who cares? you're not the one violating protocol here, they are.
DigThatData t1_ittx0rf wrote
Reply to comment by pommedeterresautee in [P] Up to 12X faster GPU inference on Bert, T5 and other transformers with OpenAI Triton kernels by pommedeterresautee
name your next project "java"
DigThatData t1_itqbww4 wrote
CLIP is definitely what you want here, and it's unclear to me why you are so convinced that a categorical text representation is an important feature considering you're planning on projecting it to a dense text embedding anyway.
You should really learn about CLIP or at least survey the state of multi-modal representation learning before committing to your current layout.
DigThatData t1_it8rc38 wrote
Reply to comment by respeckKnuckles in [D] Accurate blogs on machine learning? by likeamanyfacedgod
that's a variant that people definitely do sometimes. If you think adding score annotations a particular way should be an out-of-the-box feature in a particular tool you use, you should create an issue on their gh to recommend it or implement it yourself and submit a PR.
DigThatData t1_it8hvbj wrote
Reply to comment by respeckKnuckles in [D] Accurate blogs on machine learning? by likeamanyfacedgod
each point on the curve represents a decision threshold. given a particular decision threshold, your model will classify points a certain way. as you increment the threshold, it will hit the score of one or more observations, creating a step function as observations are moved from one bin to another as the decision threshold moves across their score.
DigThatData t1_it59gaq wrote
Reply to [D] Is it worth paying a data sourcing company to crowdsource a bespoke dataset? by quantifiedvagabond
it depends on the data. considering the kind of data your working with is one of the least mature media in the analytics industry (video), it might be both significantly more cost effective and likely to produce a high-quality result if you buy the dataset. That said, if you were thinking of spinning up an in-house data annotation resource, this might be a good opportunity to go that route, and I'm sure the ML team wouldn't have any complaints if you gave them a persistent data generating resource like that.
DigThatData t1_it43hfu wrote
Reply to [D] Discussion Panel for FOSS Instruct by FerretDude
> FerretDude
sus.
DigThatData t1_isu432w wrote
it sounds like this was probably a job you wouldn't have wanted anyway. I'm sorry you had such a frustrating experience, but I strongly suspect you dodged a bullet here
DigThatData t1_iscbmnp wrote
Reply to comment by Technical-Vast1314 in [R] detrex: the open source toolbox for Transformer based object detection algorithms by Technical-Vast1314
glad to hear it, lay that referring expression segmentation on me!
DigThatData t1_isb57ij wrote
Reply to [R] detrex: the open source toolbox for Transformer based object detection algorithms by Technical-Vast1314
no love for MDETR?
DigThatData t1_is60e8i wrote
Reply to comment by M4xM9450 in [D] Are GAN(s) still relevant as a research topic? or is there any idea regarding research on generative modeling? by aozorahime
touche!
DigThatData t1_is4jiv3 wrote
Reply to [D] Are GAN(s) still relevant as a research topic? or is there any idea regarding research on generative modeling? by aozorahime
totally still relevant. also, you never know when an older line of research will experience some innovation and renew interest. That's basically what happened with diffusion.
DigThatData t1_irtg94z wrote
Reply to [N] Using machine learning to find an optimal mixture of metals to create a desired alloy by cyphersanthosh
could you link to the article next time?
DigThatData t1_iw5mqyx wrote
Reply to [D] When was the last time you wrote a custom neural net? by cautioushedonist
i was implementing something from a paper that didn't have a public implementation and I wanted to play with it.