This website works better with JavaScript
Página inicial
Explorar
Ajuda
Entrar
WangKang
/
QC_Specialized_Model_Training_Main
Observar
1
Favorito
0
Fork
0
Arquivos
Issues
0
Pull Requests
0
Wiki
Branch:
master
Branches
Tags
master
QC_Specialized_...
/
recipes
/
quickstart
/
inference
WangKang
ea091b08ce
init
6 meses atrás
..
code_llama
ea091b08ce
init
6 meses atrás
local_inference
ea091b08ce
init
6 meses atrás
mobile_inference
ea091b08ce
init
6 meses atrás
README.md
ea091b08ce
init
6 meses atrás
modelUpgradeExample.py
ea091b08ce
init
6 meses atrás
README.md
Quickstart > Inference
This folder contains scripts to get you started with inference on Meta Llama models.
Code Llama
contains scripts for tasks relating to code generation using CodeLlama
Local Inference
contains scripts to do memory efficient inference on servers and local machines
Mobile Inference
has scripts using MLC to serve Llama on Android (h/t to OctoAI for the contribution!)
Model Update Example
shows an example of replacing a Llama 3 model with a Llama 3.1 model.