jcansdale2 t1_jdc5lte wrote on March 23, 2023 at 10:34 AM

Reply to comment by sam__izdat in Modern language models refute Chomsky’s approach to language [R] by No_Draft4778

> Literally none of these cut-and-paste color-texture-animal-verb-adverb responses satisfies the query, and to get even a little bit closer with GPT, you have to explicitly beg for examples of contradictions and category errors.

What do you think of this exchange?

Q: Why is the sentence “colorless green ideas sleep furiously” interesting?

The sentence "Colorless green ideas sleep furiously" is interesting because it was constructed by Noam Chomsky, a renowned linguist and philosopher, as an example of a sentence that is grammatically correct but semantically nonsensical. It appears in his 1957 book "Syntactic Structures" to illustrate the distinction between syntax and semantics.

Syntax refers to the rules governing how words are combined to form sentences, whereas semantics deals with the meaning of words and sentences. Chomsky's sentence demonstrates that a sentence can follow the rules of grammar and be well-formed syntactically, but still not convey any coherent meaning or make logical sense.

This example has become famous in the fields of linguistics, cognitive science, and philosophy of language, as it highlights the importance of studying both syntax and semantics in order to understand human language fully.

Q: Generate ten grammatically correct but semantically nonsensical sentences.

The thunderous apples danced gracefully on the invisible horizon.
Enthusiastic mountains prefer to whisper their secrets to the shy sun.
Ancient mirrors routinely question the singing aroma of blue thoughts.
Radiant forks somersault over melancholic rainbows, embracing the breeze.
In the darkness, time devours the laughter of square-shaped oceans.
Forgotten umbrellas dream of the symphony that once painted the wind.
In the midnight circus, cheerful numbers juggle the taste of silence.
The eloquent moon unravels a tapestry of bittersweet equations beneath the sea.
Solitary clouds orchestrate the ballet of translucent stones in the sky.
Emerald snowflakes pirouette amidst the curious rhythm of enchanted shadows.

Q: Sort these in order of nonsensicalness.

Sorting these sentences in order of nonsensicalness can be subjective, as different people may perceive various aspects of these sentences differently. However, here's one possible ordering, starting with the least nonsensical to the most nonsensical:

In the darkness, time devours the laughter of square-shaped oceans.
Solitary clouds orchestrate the ballet of translucent stones in the sky.
Forgotten umbrellas dream of the symphony that once painted the wind.
The eloquent moon unravels a tapestry of bittersweet equations beneath the sea.
Radiant forks somersault over melancholic rainbows, embracing the breeze.
Emerald snowflakes pirouette amidst the curious rhythm of enchanted shadows.
The thunderous apples danced gracefully on the invisible horizon.
Enthusiastic mountains prefer to whisper their secrets to the shy sun.
Ancient mirrors routinely question the singing aroma of blue thoughts.
In the midnight circus, cheerful numbers juggle the taste of silence.

Remember that this ordering is subjective and open to interpretation. Others might find a different arrangement more fitting.

Q: Explain your reasoning.

My reasoning for the ordering of these sentences is based on the degree of disconnection between the words and concepts in each sentence. I've considered the relationship between subjects, verbs, and objects, as well as the overall coherence of the images or ideas evoked.

In the darkness, time devours the laughter of square-shaped oceans.

The metaphorical use of "time devours" and "darkness" is somewhat comprehensible, but the concept of "square-shaped oceans" introduces a high level of nonsense.

Solitary clouds orchestrate the ballet of translucent stones in the sky.

Although "solitary clouds" and "sky" are related, the idea of clouds orchestrating a ballet of stones is nonsensical.

Forgotten umbrellas dream of the symphony that once painted the wind.

The sentence creates a somewhat poetic image, but the concepts of umbrellas dreaming and a symphony painting the wind are illogical.

The eloquent moon unravels a tapestry of bittersweet equations beneath the sea.

The combination of "eloquent moon" and "tapestry of bittersweet equations" is imaginative but nonsensical, especially when placed "beneath the sea."

Radiant forks somersault over melancholic rainbows, embracing the breeze.

The sentence combines unrelated concepts like forks, rainbows, and somersaults, creating a highly nonsensical image.

Emerald snowflakes pirouette amidst the curious rhythm of enchanted shadows.

The combination of "emerald snowflakes" and "pirouette" evokes a sense of movement and color, but the overall sentence is difficult to make sense of.

The thunderous apples danced gracefully on the invisible horizon.

This sentence combines unrelated ideas like "thunderous apples" and "invisible horizon," making it difficult to visualize or understand.

Enthusiastic mountains prefer to whisper their secrets to the shy sun.

The anthropomorphism of mountains and sun as "enthusiastic" and "shy" creates a whimsical but nonsensical scenario.

Ancient mirrors routinely question the singing aroma of blue thoughts.

The sentence combines unrelated and incompatible concepts like "mirrors," "singing aroma," and "blue thoughts," making it highly nonsensical.

In the midnight circus, cheerful numbers juggle the taste of silence.

This sentence is the most nonsensical because it combines abstract ideas like "cheerful numbers" with unrelated physical concepts like "juggling" and "taste of silence."

Keep in mind that this reasoning is subjective, and other interpretations might lead to a different ordering of nonsensicalness.

Edit: fixed formatting

darkshenron t1_jdc4wvn wrote on March 23, 2023 at 10:25 AM

Reply to comment by currentscurrents in [D] Do you have a free and unlimited chat that specializes only in teaching programming or computing in general? by Carrasco_Santo

Yes from an efficient angle for sure

Armanoth t1_jdc3lyf wrote on March 23, 2023 at 10:08 AM

Reply to comment by MisterManuscript in [R] Introducing SIFT: A New Family of Sparse Iso-FLOP Transformations to Improve the Accuracy of Computer Vision and Language Models by CS-fan-101

Yeah, whenever there is papers that try to redefine/takeover existing well known acronyms, i just get the sense that the goal is publicity through controversy.

I dont believe its just a coincidence, especially not when its an acronym so prominent. I mean who tries to coin a term without doing a basic Google search, let alone pick an acronym that is so well-known in the same field.

RemarkableGuidance44 t1_jdc3hut wrote on March 23, 2023 at 10:06 AM

Reply to comment by immune_star in [P] CodeAlpaca Code and Data release by immune_star

Yeah I would to know what the difference is from LoRA to just Full finetune?

Armanoth t1_jdc2vt0 wrote on March 23, 2023 at 9:58 AM

Reply to comment by Tejalapeno in [R] Introducing SIFT: A New Family of Sparse Iso-FLOP Transformations to Improve the Accuracy of Computer Vision and Language Models by CS-fan-101

While the paper is good and definetly presents some novel approach. Re-using existing acronyms, especially such prominent ones. The main purpose of these acronyms to allow for readers to easily identify and reference existing methods.

If your choice of acronym forces all subsequent research to have to elaborate on which SIFT is mentioned, it is not only a poor choice but also a point of confusion. And existing papers that mention SIFT are retroactively affected.

As many in this thread has pointed out, there are other equally catchy, non-overlapping acronyms that could have been chosen.

RemarkableGuidance44 t1_jdc2opy wrote on March 23, 2023 at 9:55 AM

Reply to comment by immune_star in [P] CodeAlpaca Code and Data release by immune_star

Yeah I was wondering why you did not release them as its allowed as you are not selling it. :)

of_patrol_bot t1_jdc2kt9 wrote on March 23, 2023 at 9:53 AM

Reply to comment by RemarkableGuidance44 in GPT-4 For SQL Schema Generation + Unstructured Feature Extraction [D] by Mental-Egg-2078

Hello, it looks like you've made a mistake.

It's supposed to be could've, should've, would've (short for could have, would have, should have), never could of, would of, should of.

Or you misspelled something, I ain't checking everything.

Beep boop - yes, I am a bot, don't botcriminate me.

RemarkableGuidance44 t1_jdc2k75 wrote on March 23, 2023 at 9:53 AM

Reply to comment by breadbrix in GPT-4 For SQL Schema Generation + Unstructured Feature Extraction [D] by Mental-Egg-2078

haha exactly, the guy has never worked with data. Just imagine getting an Audit and not knowing if your data is right or not. It could of messed up big time and cost 100's of thousands.

Individual-Road-5784 t1_jdc0z0j wrote on March 23, 2023 at 9:30 AM

Reply to comment by _Arsenie_Boca_ in [P] Open-source GPT4 & LangChain Chatbot for large PDF docs by radi-cho

Instead of FAISS, you can also use a truly vector search database like Qdrant. It's open-source and also offers a generous free tier offering in the cloud https://qdrant.tech

_Arsenie_Boca_ t1_jdc0ko2 wrote on March 23, 2023 at 9:24 AM

Reply to comment by Icko_ in [P] Open-source GPT4 & LangChain Chatbot for large PDF docs by radi-cho

True, but im not sure how much cheaper that would really be.

Icko_ t1_jdc09e5 wrote on March 23, 2023 at 9:20 AM

Reply to comment by _Arsenie_Boca_ in [P] Open-source GPT4 & LangChain Chatbot for large PDF docs by radi-cho

You could use faiss instead of pinecone and alpaca instead of gpt-4

weaponized_lazyness t1_jdbvolq wrote on March 23, 2023 at 8:10 AM

Reply to [D] Simple Questions Thread by AutoModerator

Is there a subreddit for more academic discussions on ML? This space has now been swarmed by LLM enthusiasts, which is fine but it's not the content I was looking for.

MisterManuscript t1_jdbuih4 wrote on March 23, 2023 at 7:52 AM

Reply to comment by tdgros in [R] Introducing SIFT: A New Family of Sparse Iso-FLOP Transformations to Improve the Accuracy of Computer Vision and Language Models by CS-fan-101

I stand corrected regarding the patent. The naming conflict, on the other hand, is here to stay.

tdgros t1_jdbu6w7 wrote on March 23, 2023 at 7:48 AM

Reply to comment by MisterManuscript in [R] Introducing SIFT: A New Family of Sparse Iso-FLOP Transformations to Improve the Accuracy of Computer Vision and Language Models by CS-fan-101

the SIFT patent expired in March, 2020. It's included in openCV now (it used to be in a "non free" extension of openCV)

fnordstar t1_jdbtg4x wrote on March 23, 2023 at 7:37 AM

Reply to comment by GaryS2000 in [D] Simple Questions Thread by AutoModerator

That is an image dataset. What are you even training on if you're not using the images?

_Arsenie_Boca_ t1_jdbsl4b wrote on March 23, 2023 at 7:24 AM

Reply to [P] Open-source GPT4 & LangChain Chatbot for large PDF docs by radi-cho

What are the costs for all the services? I assume GPT-4 is billed per request and Pinecone per hour?

KerfuffleV2 t1_jdbrkc1 wrote on March 23, 2023 at 7:09 AM

Reply to comment by Unlucky_Excitement_2 in [Project] Alpaca-30B: Facebook's 30b parameter LLaMa fine-tuned on the Alpaca dataset by imgonnarelph

Uh, did you reply to the wrong person or something? Your post doesn't have anything to do with either mine or the parent.

Mxbonn t1_jdbr3gz wrote on March 23, 2023 at 7:03 AM

Reply to comment by midasp in [D] ICML 2023 Reviewer-Author Discussion by zy415

Is there anything I can do if they don't reply by the saturday? Should I message the AC?

BreadSugar t1_jdbqq4d wrote on March 23, 2023 at 6:57 AM

Reply to [P] ChatLLaMA - A ChatGPT style chatbot for Facebook's LLaMA by imgonnarelph

I love these chatbots especially for their cute portraits. lol

[deleted] t1_jdbpst7 wrote on March 23, 2023 at 6:45 AM

Reply to comment by MisterManuscript in [R] Introducing SIFT: A New Family of Sparse Iso-FLOP Transformations to Improve the Accuracy of Computer Vision and Language Models by CS-fan-101

[removed]

localhost80 t1_jdbmrtf wrote on March 23, 2023 at 6:03 AM

Reply to [P] One of the best ChatGPT-like models (possibly better than OpenAssistant, Stanford Alpaca, ChatGLM and others) by [deleted]

Nice clickbait title!

I wish you the best of luck on your journey as a student but no need to hype up your project with insane claims.

Disastrous_Elk_6375 t1_jdblohr wrote on March 23, 2023 at 5:49 AM

Reply to comment by crt09 in [P] One of the best ChatGPT-like models (possibly better than OpenAssistant, Stanford Alpaca, ChatGLM and others) by [deleted]

Hahaha, took me a couple of prompts to see what you did there. Interesting approach, I guess you're looking to see if it can "learn" labels from the prompt and overcome the bias of training (e.g. associating good&awesome, etc with "positive").

[deleted] OP t1_jdbiemc wrote on March 23, 2023 at 5:10 AM

Reply to [P] One of the best ChatGPT-like models (possibly better than OpenAssistant, Stanford Alpaca, ChatGLM and others) by [deleted]

[removed]

msgs t1_jdbi9r2 wrote on March 23, 2023 at 5:09 AM

Reply to [Project] Alpaca-30B: Facebook's 30b parameter LLaMa fine-tuned on the Alpaca dataset by imgonnarelph

magnet:?xt=urn:btih:6K5O4J7DCKAMMMAJHWXQU72OYFXPZQJG&amp;dn=ggml-alpaca-30b-q4.bin&amp;xl=20333638921&amp;tr=udp%3A%2F%2Ftracker.openbittorrent.com%3A80%2Fannounce

I hope this magnet link works properly. I've never created one before. This the alpaca.cpp 30B 4-bit weight file. Same file downloaded from huggingface. Apologies if it doesn't work. Ping me if it doesn't.

[deleted] t1_jdbg14k wrote on March 23, 2023 at 4:44 AM

Reply to comment by jakderrida in [R] Introducing SIFT: A New Family of Sparse Iso-FLOP Transformations to Improve the Accuracy of Computer Vision and Language Models by CS-fan-101

[removed]

Recent comments in /f/MachineLearning