Update sever README build/testing instructions (#461)

Mozilla-Ocho · Jun 1, 2024 · 9cd8d70 · 9cd8d70
1 parent 7d8dd1b
commit 9cd8d70
Showing 1 changed file with 9 additions and 1 deletion.
diff --git a/llama.cpp/server/README.md b/llama.cpp/server/README.md
@@ -34,7 +34,7 @@ Command line options:
 - `--grp-attn-w`: Set the group attention width to extend context size through self-extend(default: 512), used together with group attention factor `--grp-attn-n`
 ## Build
 
-server is build alongside everything else from the root of the project
+The server is vendored from `llama.cpp` and built at the root of the `llamafile` as part of the compilation of `llama.cpp/main/main`. To build it, compile the project at root. 
 
 - Using `make`:
 
@@ -48,6 +48,14 @@ server is build alongside everything else from the root of the project
  cmake --build . --config Release
  ```
 
+Then, to test your compiled artifacts, run the `--server` CLI command at the root of the repo where you pointed your compiled artifacts to from your root `sudo make install` command. 
+
+Example:
+
+```
+$PREFIX/bin/llamafile --server -m /mistral-7b-instruct-v0.1.Q4_K_M.gguf 
+```
+
 ## Quick Start
 
 To get started right away, run the following command, making sure to use the correct path for the model you have: