Recent comments in /f/MachineLearning
marcus_hk t1_jdfjp25 wrote
Reply to [N] ChatGPT plugins by Singularian2501
How is this different from prompt engineering with langchain? They don't say.
Steve____Stifler t1_jdfjo7z wrote
Reply to comment by Puzzleheaded_Acadia1 in [N] ChatGPT plugins by Singularian2501
ChatGPT: Wolfram Alpha is a website that you can use to get answers to questions and do calculations on a wide range of topics, from science and math to history and finance. It's like having a really powerful calculator and encyclopedia that you can access anytime from your computer or mobile device.
ironmagnesiumzinc t1_jdfjdbj wrote
Reply to comment by wendten in [D] What is the best open source chatbot AI to do transfer learning on? by to4life4
When fine-tuning alpaca, would it make sense to use an unsupervised approach with raw text data (say only 100mb) or would a supervised QA approach be way better?
BraianP t1_jdfjbq4 wrote
Reply to comment by RoyalCities in [P] The next generation of Stanford Alpaca by [deleted]
so, open assistant?
tOSUfever t1_jdfj8k9 wrote
Reply to comment by C0demunkee in [Project] Alpaca-30B: Facebook's 30b parameter LLaMa fine-tuned on the Alpaca dataset by imgonnarelph
where are you finding 24gb p40's for $150?
Puzzleheaded_Acadia1 t1_jdfiiqr wrote
Reply to comment by endless_sea_of_stars in [N] ChatGPT plugins by Singularian2501
Cool but pls explain what is Wolfram i see it alot but I don't know what it is
zy415 OP t1_jdfhjy4 wrote
Reply to comment by zhaoyl18 in [D] ICML 2023 Reviewer-Author Discussion by zy415
Agreed. It’s basically a joke
to4life4 OP t1_jdfhhup wrote
Reply to comment by wendten in [D] What is the best open source chatbot AI to do transfer learning on? by to4life4
"Best" I suppose meaning closest to the latest ChatGPT on the usual benchmarks.
First on my own gpu to test (3080ti), then on a cluster if I can prove out the concept.
Thanks I'll definitely look into Alpaca. It can be customized to work with human ratings of generated output?
wendten t1_jdfh6ya wrote
best is a very vague term. Do you have access to a gpu cluster, or do you plan to run it on an office laptop. However id say the Alpaca model would be a good candidate. you can follow their guidance and make your own custom model from one of metas Llama models
zhaoyl18 t1_jdfh2p5 wrote
Reply to [D] ICML 2023 Reviewer-Author Discussion by zy415
So my only hope is the low-score-reviewer will not show up on the last day and say his/her mind is not changing.
Wacov t1_jdfh05b wrote
Reply to comment by signed7 in [N] ChatGPT plugins by Singularian2501
Don't typical home assistants already do voice recognition in the cloud? It's just the attention phrase ("ok Google" etc) they recognize locally
YaAbsolyutnoNikto t1_jdffurk wrote
Reply to comment by rautap3nis in [N] ChatGPT plugins by Singularian2501
Bing Create Image?
YaAbsolyutnoNikto t1_jdfft8y wrote
Reply to comment by modeless in [N] ChatGPT plugins by Singularian2501
Well, you can use facebook, youtube, google calendar, etc. through safari/chrome/etc. on your phone too. Doesn't mean the experience isn't better when it is tailored to the platform you're using.
Having a lot of these platforms converted into chatGPT in the most ideal manner seems like a better way and more practical way to use it.
RedditLovingSun t1_jdfds8b wrote
Reply to comment by signed7 in [N] ChatGPT plugins by Singularian2501
I'm optimistic, between the hardware and algorithmic advances being made
light24bulbs t1_jdfd2yz wrote
Reply to comment by sebzim4500 in [N] ChatGPT plugins by Singularian2501
Oh, yeah, understanding what the tools do isn't the problem.
The thing changing its mind about how to fill out the prompt is the issue, forgetting the prompt altogether, etc. And then you have to have smarter and smarter regexs and..yeah. it's rough.
It's POSSIBLE to get it to work but it's a pain. And it introduces lots of round trips to their slow API and multiplies the token costs.
lIllIIIllIllIIlIlllI t1_jdfczea wrote
Reply to [N] ChatGPT plugins by Singularian2501
I feel like I can't fully grok the implications of this because I'm so exhausted from keeping up with all the recent developments in ML. Can we have one day without new product launches or research breakthroughs 😩
signed7 t1_jdfcyxr wrote
Reply to comment by RedditLovingSun in [N] ChatGPT plugins by Singularian2501
Models need to get a lot smaller (without sacrificing too much capability) and/or phone TPUs need to get a lot better first
keelezibel1990 t1_jdfcttb wrote
Reply to comment by _Arsenie_Boca_ in [N] ChatGPT plugins by Singularian2501
There is still utility for on Prem deployment of LLMs
signed7 t1_jdfcly9 wrote
Reply to comment by wind_dude in [N] ChatGPT plugins by Singularian2501
It's a shame that 'Open'AI has become so closed. Would be so cool to see a proper paper with technical details on how this works...
ghostfaceschiller t1_jdfc5uj wrote
Reply to [D] "Sparks of Artificial General Intelligence: Early experiments with GPT-4" contained unredacted comments by QQII
Leaving out the best part: a commented out line reveals that the original/alternate title of the paper was “First Contact With An AGI System”
zhaoyl18 t1_jdf9tvo wrote
Reply to comment by sleeplessinseattle00 in [D] ICML 2023 Reviewer-Author Discussion by zy415
same. None reaction from any reviewer. In this case the 'discussion period' is vague and pretty funny
iJfbQd t1_jdf9cqi wrote
Reply to comment by ---AI--- in [N] ChatGPT plugins by Singularian2501
I've just been parsing the json output using a json5 parser (ie in Python, import json5 as json). In my experience, this catches all of the occasional json output syntax errors (like putting a comma after the terminal element).
jakderrida t1_jdf8zip wrote
Reply to comment by VictorMollo in [R] Introducing SIFT: A New Family of Sparse Iso-FLOP Transformations to Improve the Accuracy of Computer Vision and Language Models by CS-fan-101
Whoever is downvoting you just doesn't get it.
My joke was that "structural" was so meaningless that it's obviously a backronym solely in service of my pun.
/r/VictorMollo 's joke is that we should all just go off the deep-end and double down on blatantly obvious backronyms.
Notice he used the word "Widget" instead of freaking "Weighted"? He obviously chose to Taylor it that way because he appreciates my puns.
Educational-Walk8098 t1_jdf8ctj wrote
Reply to [D] ICML 2023 Reviewer-Author Discussion by zy415
We obtain 7/7/5 with confidence score 5/4/4. Our rebuttal addresses the questions and comments of each reviewer but we could not make it on time for an additional experiment that one of the reviewer may find interesting. Our rebuttal was well-written but there are some points we wish that we could express in another way. I'm now extremely worried if it could negatively affect the final recommendations of the reviewers. None of the reviewers respond to my rebuttal yet...
JimiSlew3 t1_jdfk435 wrote
Reply to [D] Simple Questions Thread by AutoModerator
Nublet question: is there anything linking LLMs and data analyst and visualizations yet? I saw a bit with MS Copilot and Excel. I want to know if there is anymore advanced in the works. Thanks!