
Added llamacpp model runtime with docker based deployment #85

Open
AhmedSeemalK wants to merge 2 commits into opea-project:main from AhmedSeemalK:llamacpp-docker

Conversation

@AhmedSeemalK
Collaborator

This pull request introduces a new Dockerfile and README for running llama.cpp with the Intel oneAPI compilers and oneMKL BLAS, providing optimized CPU inference via llama-server. The Dockerfile sets up the environment, builds llama.cpp from source with Intel optimizations, and configures the container for easy model serving. The README gives step-by-step instructions for building, running, and testing the container.
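The build described above could be sketched roughly as follows. This is an illustration only, not the PR's actual Dockerfile: the base image, clone path, CMake flags, and port are assumptions (llama.cpp's upstream build does document `-DGGML_BLAS_VENDOR=Intel10_64lp` for oneMKL, and `icx`/`icpx` are the oneAPI compilers).

```dockerfile
# Sketch only — the Dockerfile in this PR may differ in every detail.
FROM intel/oneapi-basekit:latest

# Build llama.cpp from source with the Intel compilers and oneMKL BLAS
RUN git clone https://github.com/ggerganov/llama.cpp /opt/llama.cpp
WORKDIR /opt/llama.cpp
RUN cmake -B build \
        -DCMAKE_C_COMPILER=icx -DCMAKE_CXX_COMPILER=icpx \
        -DGGML_BLAS=ON -DGGML_BLAS_VENDOR=Intel10_64lp \
    && cmake --build build --config Release -j

# llama-server listens on this port; models are mounted at runtime
EXPOSE 8080
ENTRYPOINT ["./build/bin/llama-server", "--host", "0.0.0.0", "--port", "8080"]
```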

Documentation and usage instructions:

  • Added README.md with detailed build, run, and test instructions for the new Dockerfile. The README explains how to build the image, start and stop the container, configure model caching, access logs, and test the server endpoint.
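The README workflow described in the bullet above (build, run with a model cache, view logs, test the endpoint, stop) might look something like this. The image name, tag, mount path, and model filename here are placeholders, not the README's actual values; the `/v1/chat/completions` route is llama-server's OpenAI-compatible endpoint.

```shell
# Build the image (name is a placeholder)
docker build -t llamacpp-oneapi .

# Run detached, mounting a host directory as the model cache
docker run -d --name llamacpp \
  -p 8080:8080 \
  -v "$HOME/models:/models" \
  llamacpp-oneapi -m /models/model.gguf

# Follow server logs
docker logs -f llamacpp

# Test the server endpoint
curl http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"messages":[{"role":"user","content":"Hello"}]}'

# Stop and remove the container
docker stop llamacpp && docker rm llamacpp
```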

Added llamacpp as blueprint with docker based deployment

Signed-off-by: AhmedSeemalK <ahmed.seemal@intel.com>
@AhmedSeemalK AhmedSeemalK requested a review from psurabh April 7, 2026 07:48
@AhmedSeemalK AhmedSeemalK changed the title Added llamacpp model runtime as docker based deployment Added llamacpp model runtime with docker based deployment Apr 7, 2026
@AhmedSeemalK AhmedSeemalK changed the base branch from dev to main April 17, 2026 05:11
psurabh
psurabh previously approved these changes Apr 17, 2026
Comment threads on blueprints/llamacpp/README.md (one marked Outdated)
addressed review comments

Signed-off-by: AhmedSeemalK <ahmed.seemal@intel.com>
@AhmedSeemalK AhmedSeemalK requested a review from psurabh April 22, 2026 03:22


6 participants