Recent comments in /f/singularity

BadassGhost t1_j7pprrd wrote

I feel like an unrestricted LLM-powered chatbot is pretty close to proto-AGI. OpenAI is basically lobotomizing ChatGPT to avoid headlines about it claiming sentience or emotions or making controversial statements, so it's not much to go off of.

We haven't been able to play with PaLM or any next-gen versions of it (Flan-PaLM and U-PaLM), but the benchmark comparisons between that and others seem enormous. If you build PaLM with an embedded dataset and cross-attention like Retro, I think that would probably be proto-AGI.

And then the next step from there to actual AGI would be making a multi-modal version of that, like Gato. The only missing ingredient there is getting the model to use one modality to inform about other modalities, which they did not achieve with Gato but are supposedly actively working on

22

BadassGhost t1_j7pov2k wrote

2019 was GPT-2 which rocked the boat. 2020 was GPT-3 which sank the boat. Those were partially responsible for kicking off this whole scaling up of transformers

There was also LaMDA in 2021, and I'm sure many other big events in that period that I'm forgetting

5

Iunaml t1_j7pexov wrote

Text is more easily searchable, editable and accessible compared to jpeg. Jpeg images are not easily indexed by search engines, which can negatively affect their discoverability. Text can be easily copied, pasted, and edited, while with a jpeg, the text cannot be edited and is limited in terms of accessibility. Additionally, a jpeg may not display correctly on all devices, whereas text can be viewed on any device with a compatible software. These advantages of text make it a preferred format for sharing information over jpeg images.

2

easy_c_5 t1_j7pb4sx wrote

Apparently you haven't seen any of the the uncountable javascript libraries released durint that time, or the hundreds of startups tackling similar subjects or the tens of thousands of research papers on distributed systems, animation etc. (just because there are non-groundbreaking research papers in the list above too) .

The list above is nothing groundbreaking, just copies over copies of the same stuff we've had for quite a while + productivising it.

The real summary of the past months and days:

The good: AI is going public.

The bad: we still don't have any real clue on how to get to AGI.

The worse: AI is getting regulated and people are fighting back.

3

controltheweb t1_j7paze2 wrote

Image to Text:

Al Progress of February, 2023 Week 1 (1 Feb - 7 Feb) by pro_raze

  1. Over 1 million researchers have used Deepmind's Alphafold Protein Structure Database
  2. Google Al releases the Flan T5 Language Model Collection
  3. Meta Al trained blind Al agents that can navigate similar to blind humans
  4. ChatGPT Plus announced for $20 per month with waitlist (US only for now) - ChatGPT Users Topped 100 Million in January
  5. Microsoft announces Teams Premium powered by GPT-3.5
  6. Perplexity Ask (Al Search Engine) available as a Chrome extension
  7. Microsoft boosts Viva Sales with new GPT seller experience (integration)
  8. AudioLDM Text to Audio Generation available on Huggingface to use
  9. Meta releases a 30B param "OPT+IML" model fine tuned on 2000 tasks
  10. Google Al Open Sourced Vizier: a scaled blackbox optimization system
  11. Dreamix: Video Diffusion Models are General Video Editors
  12. SceneDreamer: Generating 3D Scenes From 2D Image Collections
  13. SceneScape: Text-Driven Consistent Scene Generation
  14. RobustNeRF: Basically improves quality of NeRFs
  15. OpenAl's New Paper: A proof of concept for using Al-assisted human feedback to scale the supervision of ML systems
  16. Deepmind Paper: Accelerating Large Language Model Decoding with Speculative Sampling (2-2.5x speedup)
  17. Amazon Al: Multimodal-CoT outperforms GPT-3.5 by 16% (75.17% -> 91.68%) on ScienceQA and even surpasses human performance
  18. Sundar Pichai announced: LaMDA language model within "coming weeks and months"
  19. AutumnSynth synthesizes the source code of a 2D video game from seconds of play
  20. Nvidia Paper: Enabling Simulated Characters To Perform Scene Interaction Tasks In Natural/Lifelike Manner
  21. Poe, a ChatGPT like bot launched from the creators of Quora. They are also making API for it. Currently iOS only.
  22. Google invests $300 million in Anthropic Al (Done in 2022, reported now)
  23. BLIP-2 demo available on Huggingface: LLM that can understand images
  24. Humata.ai launched: Basically ChatGPT for your own files
  25. Bing+ GPT integration images leaked
  26. Google's new Real-time tracking of wildfire boundaries using satellite imagery
  27. LAION AI introduces Open Assistant: Chatbot project that understands tasks, interacts with third-party systems, and retrieve information dynamically (open source)
  28. Apple CEO Tim Cook says Al will eventually 'affect every product and service we have'
  29. Epic-Sounds: A Large-scale Dataset of Actions That Sound Released
  30. announcing stable attribution - a tool which lets anyone find the human creators behind a.i generated images
  31. presenting TEXTure, a novel method for text-guided generation, editing, and transfer of textures for 3D shapes
  32. -Tune-A-Video available to use and also open sourced (turns Al Generated Images into gifs or videos)
  33. Filechat.io now available - ChatGPT for your own data and no limits (with premium tier)
  34. BioGPT-Large by Microsoft now available on Huggingface to try
  35. Google announces Bard, powered by LaMDA coming soon as an Al conversational service. It will be integrated with Search.
  36. Microsoft announces surprise event for tomorrow with Bing ChatGPT expected (Feb 7)
  37. Language Models Secretly Perform Gradient Descent as Meta-Optimizers Paper - In-context-learning, the ability for LLMs to learn new abilities from examples in a prompt alone
  38. Apple to hold in-person 'Al summit' event for employees at Steve Jobs Theater
  39. -Seek Al introduces DeepCuts, the AI SQL app that lets you explore your Spotify data with natural language
  40. KickResume's Al Resume Builder can rewrite, format, and grade a resume
  41. Introducing Polymath: The open-source tool that converts any music-library into a sample-library with machine learning
  42. Microsoft & OpenAI Announce: Bing and Edge + Al: a new way to search starts today
15