derek_ml t1_jcudkzh wrote on March 19, 2023 at 5:12 PM

C:\Users\****\Downloads\alpaca\alpaca.cpp&gt;make chat
I llama.cpp build info: I UNAME_S:  CYGWIN_NT-10.0 I UNAME_P:  
unknown I UNAME_M:  x86_64 I CFLAGS:   -I.              -O3 -
DNDEBUG -std=c11   -fPIC -mfma -mf16c -mavx -mavx2 I 
CXXFLAGS: -I. -I./examples -O3 -DNDEBUG -std=c++11 -fPIC I 
LDFLAGS: I CC:       cc (GCC) 10.2.0 I CXX:      g++ (GCC) 
10.2.0
g++ -I. -I./examples -O3 -DNDEBUG -std=c++11 -fPIC chat.cpp 
ggml.o utils.o -o chat chat.cpp: In function 'int main(int, 
char**)': chat.cpp:883:26: error: aggregate 'main(int, 
char**)::sigaction sigint_action' has incomplete type and 
cannot be defined 883 |         struct sigaction 
sigint_action; |                          ~~~~~~~~~~~~ 
chat.cpp:885:9: error: 'sigemptyset' was not declared in this 
scope 885 |         sigemptyset (&amp;sigint_action.sa_mask); |         
~~~~~~~~~~ chat.cpp:887:47: error: invalid use of incomplete 
type 'struct main(int, char**)::sigaction' 887 |         
sigaction(SIGINT, &amp;sigint_action, NULL); |                                               
^ chat.cpp:883:16: note: forward declaration of 'struct 
main(int, char**)::sigaction' 883 |         struct sigaction 
sigint_action; |                ~~~~~~~~ make: *** [Makefile:195: chat] Error 1

evangelion-unit-two t1_jcu2o0f wrote on March 19, 2023 at 3:56 PM

Reply to comment by farmingvillein in [R] ChatGLM-6B - an open source 6.2 billion parameter Eng/Chinese bilingual LLM trained on 1T tokens, supplemented by supervised fine-tuning, feedback bootstrap, and RLHF. Runs on consumer grade GPUs by MysteryInc152

What are they going to do if I violate it? Cry like a baby?

simpleuserhere OP t1_jcu2ikl wrote on March 19, 2023 at 3:55 PM

Reply to comment by ninjasaid13 in [Research] Alpaca 7B language model running on my Pixel 7 by simpleuserhere

For Windows you need Visual C++ compiler, so install Visual Studio C++ 2019 build tools, follow the instruction here https://github.com/rupeshs/alpaca.cpp#windows

evangelion-unit-two t1_jcu2fw9 wrote on March 19, 2023 at 3:54 PM

Reply to comment by gkaykck in [R] ChatGLM-6B - an open source 6.2 billion parameter Eng/Chinese bilingual LLM trained on 1T tokens, supplemented by supervised fine-tuning, feedback bootstrap, and RLHF. Runs on consumer grade GPUs by MysteryInc152

Tankie detected

ninjasaid13 t1_jcu1odb wrote on March 19, 2023 at 3:49 PM

Reply to comment by simpleuserhere in [Research] Alpaca 7B language model running on my Pixel 7 by simpleuserhere

I have a problem with

C:\Users\****\source\repos\alpaca.cpp\build&gt;make chat
make: *** No rule to make target 'chat'.  Stop.

and

C:\Users\****\source\repos\alpaca.cpp&gt;make chat
I llama.cpp build info: I UNAME_S:  CYGWIN_NT-10.0 I 
UNAME_P:  unknown I UNAME_M:  x86_64 I CFLAGS:   -I.              
-O3 -DNDEBUG -std=c11   -fPIC -mfma -mf16c -mavx -
mavx2 I CXXFLAGS: -I. -I./examples -O3 -DNDEBUG -
std=c++11 -fPIC I LDFLAGS: I CC:       cc (GCC) 
10.2.0 I CXX:      g++ (GCC) 10.2.0
cc  -I.              -O3 -DNDEBUG -std=c11   -fPIC -
mfma -mf16c -mavx -mavx2   -c ggml.c -o ggml.o g++ -
I. -I./examples -O3 -DNDEBUG -std=c++11 -fPIC -c 
utils.cpp -o utils.o g++ -I. -I./examples -O3 -
DNDEBUG -std=c++11 -fPIC chat.cpp ggml.o utils.o -o 
chat chat.cpp: In function 'int main(int, char**)': 
chat.cpp:883:26: error: aggregate 'main(int, 
char**)::sigaction sigint_action' has incomplete type 
and cannot be defined 883 |         struct sigaction 
sigint_action; |                          
~~~~~~~~~~~~ chat.cpp:885:9: error: 'sigemptyset' was 
not declared in this scope 885 |         sigemptyset 
(&amp;sigint_action.sa_mask); |         ~~~~~~~~~~ 
chat.cpp:887:47: error: invalid use of incomplete 
type 'struct main(int, char**)::sigaction' 887 |         
sigaction(SIGINT, &amp;sigint_action, NULL); |                                               
^ chat.cpp:883:16: note: forward declaration of 
'struct main(int, char**)::sigaction' 883 |         
struct sigaction sigint_action; |                
~~~~~~~~ make: *** [Makefile:195: chat] Error 1

using windows.

michaelthwan_ai OP t1_jctw0gx wrote on March 19, 2023 at 3:09 PM

Reply to comment by ramtingxf in [P] searchGPT - a bing-like LLM-based Grounded Search Engine (with Demo, github) by michaelthwan_ai

It is a little bit exaggerated but thanks! I believe Bing and some companies are using similar tech, but a highly polished one, to solve similar issues.

michaelthwan_ai OP t1_jctvmqe wrote on March 19, 2023 at 3:07 PM

Reply to comment by [deleted] in [P] searchGPT - a bing-like LLM-based Grounded Search Engine (with Demo, github) by michaelthwan_ai

Cool! Thanks for the sharing.

During my development, I've also found 5+ projects, some open-source and some are closed, where they are doing similar things.

In exact, it is called retrieval-based language model.

Some discussion on that:

https://ai.stanford.edu/blog/retrieval-based-NLP/

michaelthwan_ai OP t1_jctvcas wrote on March 19, 2023 at 3:05 PM

Reply to comment by rowleboat in [P] searchGPT - a bing-like LLM-based Grounded Search Engine (with Demo, github) by michaelthwan_ai

Theoretically yes but in exact the objective you want to do is crucial.

SQL database don't support similarity/elastic search, which is very useful in natural language. It may limit what you can do or make your product less good.

michaelthwan_ai OP t1_jctv2tm wrote on March 19, 2023 at 3:03 PM

Reply to comment by Tostino in [P] searchGPT - a bing-like LLM-based Grounded Search Engine (with Demo, github) by michaelthwan_ai

Thank you.
Due to people close to me and my googling, my choices of indexer is like this

pyterrier -> faiss -> native embedding

Then I found llama-index, but it currently won't give extra values to me so I didn't adopt.

I have stories on pros/cons on those lib...

michaelthwan_ai OP t1_jcturwz wrote on March 19, 2023 at 3:01 PM

Reply to comment by KingsmanVince in [P] searchGPT - a bing-like LLM-based Grounded Search Engine (with Demo, github) by michaelthwan_ai

I believe it is a frontend problem. We are not frontend developers thus but we think that Gradio is too plain to show the result, thus we built a minimal UI.
That markdown (``` <code> ```) is currently not supported to pretty print like ChatGPT one.

fullstackai t1_jcttgyd wrote on March 19, 2023 at 2:51 PM

Reply to comment by gamerx88 in [D] Unit and Integration Testing for ML Pipelines by Fender6969

Should have been more precise. 100% of what goes into any pipeline or the deployment gets tested. We deploy many models on the edge in manufacturing. If the model fails, the production line might stand still. Can't risk that.

gamerx88 t1_jctqruk wrote on March 19, 2023 at 2:32 PM

Reply to [D] Unit and Integration Testing for ML Pipelines by Fender6969

For ETL, write unit tests to handle some input edge cases. E.g Null values, mis-formatting, values out of range as well as some simple working cases.

For model training, the test focus is on having "valid" hyperparams and configurations. I write test cases to try to overfit on a small training set. i.e Confirm the model learns. There are also some robustness tests that I sometimes run post training, but those are very specific to certain NLP tasks, applications.

For model serving, successful parsing of the request and subsequent feature transformation (if any), very similar to ETL.