Recent comments in /f/singularity
turnip_burrito t1_j9j46th wrote
Reply to What. The. ***k. [less than 1B parameter model outperforms GPT 3.5 in science multiple choice questions] by Destiny_Knight
One author on this paper posted on Reddit here, if you're interested in their comments.
https://www.reddit.com/r/MachineLearning/comments/10svwch/comment/j79i4jj/
Savings-Juice-9517 t1_j9j43yd wrote
Reply to comment by [deleted] in OpenAI has privately announced a new developer product called Foundry by flowday
You completely bypassed the points being made and instead were trying to make a pedantic semantics argument about two terms that are two halves of the same coin
AllCommiesRFascists t1_j9j3zw5 wrote
Reply to comment by WithoutReason1729 in "Starlink is far crazier than most people realize. Feels almost inevitable when I look at this" by maxtility
> I think the major car manufacturers are doing a better job
Which ones and in what way?
> I’m very disappointed they went closed-source in spite of open literally being in the name.
Elon is disappointed in this too apparently
turnip_burrito t1_j9j3pea wrote
Reply to comment by sumane12 in What. The. ***k. [less than 1B parameter model outperforms GPT 3.5 in science multiple choice questions] by Destiny_Knight
Yeah it's fucking nuts.
turnip_burrito t1_j9j3k3y wrote
Reply to comment by Electronic_Source_70 in OpenAI has privately announced a new developer product called Foundry by flowday
Neuroscience has basically no relationship to machine learning at this point (Neural networks are just """inspired"""^(TM) by neuroscience) so I wouldn't trust anyone but an AI specialist.
GoldenRain t1_j9j3fb1 wrote
Reply to comment by TFenrir in OpenAI has privately announced a new developer product called Foundry by flowday
I wonder how expensive each prompt is though.
sumane12 t1_j9j3b9j wrote
Reply to comment by turnip_burrito in What. The. ***k. [less than 1B parameter model outperforms GPT 3.5 in science multiple choice questions] by Destiny_Knight
Fucking wow!
turnip_burrito t1_j9j2sg5 wrote
Reply to comment by sumane12 in What. The. ***k. [less than 1B parameter model outperforms GPT 3.5 in science multiple choice questions] by Destiny_Knight
Yes, and it does it with only 0.4% the size of GPT3, possibly enough to run on a single graphics card.
It uses language and pictures together instead of just language.
ironborn123 t1_j9j2512 wrote
Reply to comment by drekmonger in A German AI startup just might have a GPT-4 competitor this year. It is 300 billion parameters model by Dr_Singularity
All else being equal, number of model parameters does matter. Well funded startups can acquire the needed data, compute resources, and human talent to build the models. Just like how OpenAI beat Google at this game.
[deleted] t1_j9j1pqt wrote
Reply to comment by Savings-Juice-9517 in OpenAI has privately announced a new developer product called Foundry by flowday
[deleted]
turnip_burrito t1_j9j1pe8 wrote
Reply to comment by gONzOglIzlI in OpenAI has privately announced a new developer product called Foundry by flowday
Why would it expand the token budget exponentially?
Also we have nowhere near enough qubits to handle these kinds of computations. The number of bits you need to run these models is huge (GPT3 ~170bil or 10^11 parameters). Quantum computers nowadays are lucky to be around 10^3 qubits, and they decohere too quickly to be used for very long (about 10^-4 seconds). * numbers pulled from a quick Google search.
That said, new (classical computer) architectures do exist that can use longer context windows: H3 (Hungry Hungry Hippos) and RWVST or whatever it's called.
Ziggy5010 t1_j9j1id1 wrote
Reply to comment by spreadlove5683 in A German AI startup just might have a GPT-4 competitor this year. It is 300 billion parameters model by Dr_Singularity
Agreed
tomorrow_today_yes t1_j9j0w44 wrote
He is a bit like Paul Ehrlich, the population doomster of the 1970’s. Ehrlich is very smart and a good arguer but not someone who can accept criticism easily. Ehrlich, like Yudkowsky, reached his conclusions based on simply extrapolating trends. Ehrlich refused to acknowledge that his simple extrapolation was made invalid by green revolution and improved technology, the possibility of this had been pointed out to him, notably by Julian Simon, but Ehrlich dismissed this as hopium. I really hope that we can ridicule Yudkowsky in the future for the same reasons, not because I don’t like him but because that would mean the AI alignment problem was solved.
[deleted] t1_j9j0w0g wrote
Reply to comment by sumane12 in What. The. ***k. [less than 1B parameter model outperforms GPT 3.5 in science multiple choice questions] by Destiny_Knight
[deleted]
sumane12 t1_j9j0pi7 wrote
Reply to What. The. ***k. [less than 1B parameter model outperforms GPT 3.5 in science multiple choice questions] by Destiny_Knight
My guy, correct me if I'm wrong, but doesn't it outperform humans, in everything but social sciences?...
lilezekias t1_j9j0bnp wrote
Doesn’t matter how well it was written by Ai, the letter was to address a very important and somber matter. For them to be lazy enough to have it be generated by Ai shows how oblivious they were to the importance of the matter.
[deleted] t1_j9j01s2 wrote
Reply to comment by diabeetis in OpenAI has privately announced a new developer product called Foundry by flowday
[deleted]
gONzOglIzlI t1_j9izs1r wrote
Reply to comment by turnip_burrito in OpenAI has privately announced a new developer product called Foundry by flowday
I'm I the only one wondering how quantum computers will factor in to all of this?
Feels like a hidden wild card, could expend the token budged exponentially.
Destiny_Knight OP t1_j9iysi7 wrote
Reply to What. The. ***k. [less than 1B parameter model outperforms GPT 3.5 in science multiple choice questions] by Destiny_Knight
The paper: https://arxiv.org/pdf/2302.00923.pdf
The questions: "Our method is evaluated on the ScienceQA benchmark (Lu et al., 2022a). ScienceQA is the first large-scale multimodal science question dataset that annotates the answers with de- tailed lectures and explanations. It contains 21k multimodal multiple choice questions with rich domain diversity across 3 subjects, 26 topics, 127 categories, and 379 skills. The benchmark dataset is split into training, validation, and test splits with 12726, 4241, and 4241 examples, respectively."
AnimalSpirits2021 t1_j9iys06 wrote
ThoughtSafe9928 t1_j9ixtjq wrote
Reply to comment by Honest-Cauliflower64 in Does anyone else feel people don't have a clue about what's happening? by Destiny_Knight
Yup I have nothing to lose and everything to gain from this. Come at me, world changing AGI. i’m ready.
[deleted] t1_j9ixliy wrote
Reply to comment by End3rWi99in in Two Deans suspended after using ChatGPT to write email to students by Neurogence
[deleted]
Savings-Juice-9517 t1_j9ix6v8 wrote
Reply to comment by IndependenceRound453 in OpenAI has privately announced a new developer product called Foundry by flowday
Exactly. I’m a full time programmer and AI, at least in its current form, definitely improves my productivity but is no where near the level where it will replace programmers or software engineers. Less than 5% of a programmers time is spent physically writing code but this subreddit seems to think that’s what programmers do all day
[deleted] t1_j9j4e9w wrote
Reply to comment by Savings-Juice-9517 in OpenAI has privately announced a new developer product called Foundry by flowday
[deleted]