Recent comments in /f/MachineLearning
simpleuserhere OP t1_jcuc25e wrote
Reply to comment by ninjasaid13 in [Research] Alpaca 7B language model running on my Pixel 7 by simpleuserhere
Yes,
ninjasaid13 t1_jcubyue wrote
Reply to comment by simpleuserhere in [Research] Alpaca 7B language model running on my Pixel 7 by simpleuserhere
so it won't work? do I need to install MinGW?
simpleuserhere OP t1_jcubta9 wrote
Reply to comment by ninjasaid13 in [Research] Alpaca 7B language model running on my Pixel 7 by simpleuserhere
I haven't tried cygwin for Alpaca.cpp.
ninjasaid13 t1_jcuajwh wrote
Reply to comment by simpleuserhere in [Research] Alpaca 7B language model running on my Pixel 7 by simpleuserhere
yes I have cygwin.
simpleuserhere OP t1_jcu9x05 wrote
Reply to comment by ninjasaid13 in [Research] Alpaca 7B language model running on my Pixel 7 by simpleuserhere
Are you using cygwin?
ninjasaid13 t1_jcu9nfv wrote
Reply to comment by simpleuserhere in [Research] Alpaca 7B language model running on my Pixel 7 by simpleuserhere
I believe I already have the build.
I still get this error
C:\Users\****\Downloads\alpaca\alpaca.cpp>make chat
I llama.cpp build info: I UNAME_S: CYGWIN_NT-10.0 I UNAME_P:
unknown I UNAME_M: x86_64 I CFLAGS: -I. -O3 -
DNDEBUG -std=c11 -fPIC -mfma -mf16c -mavx -mavx2 I
CXXFLAGS: -I. -I./examples -O3 -DNDEBUG -std=c++11 -fPIC I
LDFLAGS: I CC: cc (GCC) 10.2.0 I CXX: g++ (GCC)
10.2.0
g++ -I. -I./examples -O3 -DNDEBUG -std=c++11 -fPIC chat.cpp
ggml.o utils.o -o chat chat.cpp: In function 'int main(int,
char**)': chat.cpp:883:26: error: aggregate 'main(int,
char**)::sigaction sigint_action' has incomplete type and
cannot be defined 883 | struct sigaction
sigint_action; | ~~~~~~~~~~~~
chat.cpp:885:9: error: 'sigemptyset' was not declared in this
scope 885 | sigemptyset (&sigint_action.sa_mask); |
~~~~~~~~~~ chat.cpp:887:47: error: invalid use of incomplete
type 'struct main(int, char**)::sigaction' 887 |
sigaction(SIGINT, &sigint_action, NULL); |
^ chat.cpp:883:16: note: forward declaration of 'struct
main(int, char**)::sigaction' 883 | struct sigaction
sigint_action; | ~~~~~~~~ make: *** [Makefile:195: chat] Error 1
evangelion-unit-two t1_jcu2o0f wrote
Reply to comment by farmingvillein in [R] ChatGLM-6B - an open source 6.2 billion parameter Eng/Chinese bilingual LLM trained on 1T tokens, supplemented by supervised fine-tuning, feedback bootstrap, and RLHF. Runs on consumer grade GPUs by MysteryInc152
What are they going to do if I violate it? Cry like a baby?
simpleuserhere OP t1_jcu2ikl wrote
Reply to comment by ninjasaid13 in [Research] Alpaca 7B language model running on my Pixel 7 by simpleuserhere
For Windows you need Visual C++ compiler, so install Visual Studio C++ 2019 build tools, follow the instruction here https://github.com/rupeshs/alpaca.cpp#windows
evangelion-unit-two t1_jcu2fw9 wrote
ninjasaid13 t1_jcu1odb wrote
Reply to comment by simpleuserhere in [Research] Alpaca 7B language model running on my Pixel 7 by simpleuserhere
I have a problem with
C:\Users\****\source\repos\alpaca.cpp\build>make chat
make: *** No rule to make target 'chat'. Stop.
and
C:\Users\****\source\repos\alpaca.cpp>make chat
I llama.cpp build info: I UNAME_S: CYGWIN_NT-10.0 I
UNAME_P: unknown I UNAME_M: x86_64 I CFLAGS: -I.
-O3 -DNDEBUG -std=c11 -fPIC -mfma -mf16c -mavx -
mavx2 I CXXFLAGS: -I. -I./examples -O3 -DNDEBUG -
std=c++11 -fPIC I LDFLAGS: I CC: cc (GCC)
10.2.0 I CXX: g++ (GCC) 10.2.0
cc -I. -O3 -DNDEBUG -std=c11 -fPIC -
mfma -mf16c -mavx -mavx2 -c ggml.c -o ggml.o g++ -
I. -I./examples -O3 -DNDEBUG -std=c++11 -fPIC -c
utils.cpp -o utils.o g++ -I. -I./examples -O3 -
DNDEBUG -std=c++11 -fPIC chat.cpp ggml.o utils.o -o
chat chat.cpp: In function 'int main(int, char**)':
chat.cpp:883:26: error: aggregate 'main(int,
char**)::sigaction sigint_action' has incomplete type
and cannot be defined 883 | struct sigaction
sigint_action; |
~~~~~~~~~~~~ chat.cpp:885:9: error: 'sigemptyset' was
not declared in this scope 885 | sigemptyset
(&sigint_action.sa_mask); | ~~~~~~~~~~
chat.cpp:887:47: error: invalid use of incomplete
type 'struct main(int, char**)::sigaction' 887 |
sigaction(SIGINT, &sigint_action, NULL); |
^ chat.cpp:883:16: note: forward declaration of
'struct main(int, char**)::sigaction' 883 |
struct sigaction sigint_action; |
~~~~~~~~ make: *** [Makefile:195: chat] Error 1
using windows.
michaelthwan_ai OP t1_jctw0gx wrote
Reply to comment by ramtingxf in [P] searchGPT - a bing-like LLM-based Grounded Search Engine (with Demo, github) by michaelthwan_ai
It is a little bit exaggerated but thanks! I believe Bing and some companies are using similar tech, but a highly polished one, to solve similar issues.
michaelthwan_ai OP t1_jctvmqe wrote
Reply to comment by [deleted] in [P] searchGPT - a bing-like LLM-based Grounded Search Engine (with Demo, github) by michaelthwan_ai
Cool! Thanks for the sharing.
During my development, I've also found 5+ projects, some open-source and some are closed, where they are doing similar things.
In exact, it is called retrieval-based language model.
Some discussion on that:
michaelthwan_ai OP t1_jctvcas wrote
Reply to comment by rowleboat in [P] searchGPT - a bing-like LLM-based Grounded Search Engine (with Demo, github) by michaelthwan_ai
Theoretically yes but in exact the objective you want to do is crucial.
SQL database don't support similarity/elastic search, which is very useful in natural language. It may limit what you can do or make your product less good.
michaelthwan_ai OP t1_jctv2tm wrote
Reply to comment by Tostino in [P] searchGPT - a bing-like LLM-based Grounded Search Engine (with Demo, github) by michaelthwan_ai
Thank you.
Due to people close to me and my googling, my choices of indexer is like this
pyterrier -> faiss -> native embedding
Then I found llama-index, but it currently won't give extra values to me so I didn't adopt.
I have stories on pros/cons on those lib...
michaelthwan_ai OP t1_jcturwz wrote
Reply to comment by KingsmanVince in [P] searchGPT - a bing-like LLM-based Grounded Search Engine (with Demo, github) by michaelthwan_ai
I believe it is a frontend problem. We are not frontend developers thus but we think that Gradio is too plain to show the result, thus we built a minimal UI.
That markdown (``` <code> ```) is currently not supported to pretty print like ChatGPT one.
fullstackai t1_jcttgyd wrote
Reply to comment by gamerx88 in [D] Unit and Integration Testing for ML Pipelines by Fender6969
Should have been more precise. 100% of what goes into any pipeline or the deployment gets tested. We deploy many models on the edge in manufacturing. If the model fails, the production line might stand still. Can't risk that.
gamerx88 t1_jctqruk wrote
For ETL, write unit tests to handle some input edge cases. E.g Null values, mis-formatting, values out of range as well as some simple working cases.
For model training, the test focus is on having "valid" hyperparams and configurations. I write test cases to try to overfit on a small training set. i.e Confirm the model learns. There are also some robustness tests that I sometimes run post training, but those are very specific to certain NLP tasks, applications.
For model serving, successful parsing of the request and subsequent feature transformation (if any), very similar to ETL.
ramtingxf t1_jctqpvo wrote
Reply to [P] searchGPT - a bing-like LLM-based Grounded Search Engine (with Demo, github) by michaelthwan_ai
Google need to be worried
[deleted] t1_jctq5xu wrote
Tostino t1_jctq5az wrote
Reply to comment by rowleboat in [P] searchGPT - a bing-like LLM-based Grounded Search Engine (with Demo, github) by michaelthwan_ai
Look into llama-index
rowleboat t1_jctpu8c wrote
Reply to comment by michaelthwan_ai in [P] searchGPT - a bing-like LLM-based Grounded Search Engine (with Demo, github) by michaelthwan_ai
Can this use a SQL database as an external reference?
KingsmanVince t1_jctpr5l wrote
Reply to comment by michaelthwan_ai in [P] searchGPT - a bing-like LLM-based Grounded Search Engine (with Demo, github) by michaelthwan_ai
Not sure this is frontend problem or not, but the python code is printed without identation.
gamerx88 t1_jctp6px wrote
Reply to comment by fullstackai in [D] Unit and Integration Testing for ML Pipelines by Fender6969
Is there a reason why you feel there is need for such rigour? 100% is quite an overkill even for the typical software projects IMO.
You probably end up having to write tests for even simple one liner functions which gets exhausting.
michaelthwan_ai OP t1_jctooa2 wrote
Reply to comment by squareOfTwo in [P] searchGPT - a bing-like LLM-based Grounded Search Engine (with Demo, github) by michaelthwan_ai
Of course we do! Open-source projects are cool
derek_ml t1_jcudkzh wrote
Reply to [P] searchGPT - a bing-like LLM-based Grounded Search Engine (with Demo, github) by michaelthwan_ai
Any chance you will add this as a Hugging Face Space?