Quickstart > Inference
This folder contains scripts to get you started with inference on Meta Llama models.
- Code Llama (code_llama/) contains scripts for code generation tasks using Code Llama.
- Local Inference (local_inference/) contains scripts for memory-efficient inference on servers and local machines.
- Mobile Inference (mobile_inference/) has scripts that use MLC to serve Llama on Android (h/t to OctoAI for the contribution!).
- Model Update Example (modelUpgradeExample.py) shows how to replace a Llama 3 model with a Llama 3.1 model.
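As a rough illustration of what a model upgrade can look like, here is a minimal sketch that assumes the model is loaded through the Hugging Face `transformers` text-generation pipeline. It is not a copy of modelUpgradeExample.py. The helper names `upgrade_model_id` and `load_generator` are hypothetical, and the model IDs in the comments are assumptions about how the checkpoints are named on the Hugging Face Hub; check the IDs you actually have access to.

```python
def upgrade_model_id(model_id: str) -> str:
    """Map a Llama 3 model ID to its Llama 3.1 counterpart (hypothetical helper).

    Assumes Hub-style IDs such as "meta-llama/Meta-Llama-3-8B-Instruct",
    where the version appears as "Llama-3-" inside the name.
    """
    if "Llama-3.1-" in model_id:
        # Already a Llama 3.1 ID; nothing to change.
        return model_id
    if "Llama-3-" not in model_id:
        raise ValueError(f"Not a recognized Llama 3 model ID: {model_id}")
    # Swap only the version part and keep size and variant (8B, Instruct, ...).
    return model_id.replace("Llama-3-", "Llama-3.1-", 1)


def load_generator(model_id: str):
    """Build a text-generation pipeline for the given model ID.

    Imports are done lazily so the ID helper above works without
    torch/transformers installed. device_map="auto" needs `accelerate`.
    """
    import torch
    import transformers

    return transformers.pipeline(
        "text-generation",
        model=model_id,
        torch_dtype=torch.bfloat16,
        device_map="auto",
    )
```

In this sketch the upgrade comes down to changing the model ID passed to the pipeline, for example `load_generator(upgrade_model_id("meta-llama/Meta-Llama-3-8B-Instruct"))`. The rest of the calling code stays the same, provided the account has access to the gated Llama 3.1 weights.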