npmpackage.info

node-llama-cpp

Run AI models locally on your machine with node.js bindings for llama.cpp. Enforce a JSON schema on the model output on the generation level

3.3.1

1,132

MIT

TypeScript

19.71 MB

Installations

npm install node-llama-cpp

Developer Guide

BETA

Typescript

Yes

Module System

ESM

Min. Node Version

>=18.0.0

Node Version

20.18.1

NPM Version

10.8.1 Score

Supply Chain

94.9

Quality

90.3

Maintenance

100

Vulnerability

98.6

License

Pull Requests

Open

0

Total

183

Closed

12

Merged

171

Issues

Open

6

Total

99

Closed

93

Releases

101

v3.3.1

Published on 09 Dec 2024

v3.3.0

Published on 02 Dec 2024

v3.2.0

Published on 31 Oct 2024

v3.1.1

Published on 06 Oct 2024

v3.1.0

Published on 05 Oct 2024

v3.0.3

Published on 25 Sept 2024

View all 101 releases

Contributors

View all 6 contributors

Languages

TypeScript

C++

CSS

Vue

JavaScript

Shell

CMake

HTML

TypeScript (90.58%)

C++ (4.74%)

CSS (1.96%)

Vue (1.19%)

JavaScript (0.98%)

Shell (0.33%)

CMake (0.2%)

HTML (0.02%)

C (0.01%)

Developer

withcatai

Download Statistics

Total Downloads

Last Day

Last Week

Last Month

Last Year

GitHub Statistics

1,132 Stars

185 Commits

100 Forks

14 Watching

1 Branches

6 Contributors

Sponsor this package

https://github.com/sponsors/giladgd

Maintainers

Package Meta Information

Latest Version

3.3.1

Package Id

node-llama-cpp@3.3.1

Unpacked Size

19.71 MB

Size

17.61 MB

File Count

769

NPM Version

10.8.1

Node Version

20.18.1

Publised On

09 Dec 2024

Total Downloads

Cumulative downloads

Total Downloads

0

Last day

Compared to previous day

Last week

Compared to previous week

Last month

Compared to previous month

Last year

Compared to previous year

Daily Downloads

Weekly Downloads

Monthly Downloads

Yearly Downloads

Dependencies

Peer Dependencies

typescript

Dev Dependencies

Optional Dependencies

@node-llama-cpp/linux-arm64 @node-llama-cpp/linux-armv7l @node-llama-cpp/linux-x64 @node-llama-cpp/linux-x64-cuda @node-llama-cpp/linux-x64-vulkan @node-llama-cpp/mac-arm64-metal @node-llama-cpp/mac-x64 @node-llama-cpp/win-arm64 @node-llama-cpp/win-x64 @node-llama-cpp/win-x64-cuda @node-llama-cpp/win-x64-vulkan

Versions

node-llama-cpp

Run AI models locally on your machine

_{Pre-built bindings are provided with a fallback to building from source with cmake}

✨ v3.0 is here! ✨

Features

Run LLMs locally on your machine
Metal, CUDA and Vulkan support
Pre-built binaries are provided, with a fallback to building from source without node-gyp or Python
Adapts to your hardware automatically, no need to configure anything
A Complete suite of everything you need to use LLMs in your projects
Use the CLI to chat with a model without writing any code
Up-to-date with the latest llama.cpp. Download and compile the latest release with a single CLI command
Enforce a model to generate output in a parseable format, like JSON, or even force it to follow a specific JSON schema
Provide a model with functions it can call on demand to retrieve information of perform actions
Embedding support
Great developer experience with full TypeScript support, and complete documentation
Much more

Documentation

Try It Without Installing

Chat with a model in your terminal using a single command:

1npx -y node-llama-cpp chat

Installation

1npm install node-llama-cpp

This package comes with pre-built binaries for macOS, Linux and Windows.

If binaries are not available for your platform, it'll fallback to download a release of llama.cpp and build it from source with cmake. To disable this behavior, set the environment variable NODE_LLAMA_CPP_SKIP_DOWNLOAD to true.

Usage

1import {fileURLToPath} from "url";
2import path from "path";
3import {getLlama, LlamaChatSession} from "node-llama-cpp";
4
5const __dirname = path.dirname(fileURLToPath(import.meta.url));
6
7const llama = await getLlama();
8const model = await llama.loadModel({
9    modelPath: path.join(__dirname, "models", "Meta-Llama-3.1-8B-Instruct.Q4_K_M.gguf")
10});
11const context = await model.createContext();
12const session = new LlamaChatSession({
13    contextSequence: context.getSequence()
14});
15
16
17const q1 = "Hi there, how are you?";
18console.log("User: " + q1);
19
20const a1 = await session.prompt(q1);
21console.log("AI: " + a1);
22
23
24const q2 = "Summarize what you said";
25console.log("User: " + q2);
26
27const a2 = await session.prompt(q2);
28console.log("AI: " + a2);