This website works better with JavaScript
Inicio
Explorar
Ayuda
Iniciar sesión
WangKang
/
QC_Specialized_Model_Training_Main
Seguir
1
Destacar
0
Fork
0
Archivos
Incidencias
0
Pull Requests
0
Wiki
Rama:
master
Ramas
Etiquetas
master
QC_Specialized_...
/
recipes
/
quickstart
/
inference
WangKang
ea091b08ce
init
hace 6 meses
..
code_llama
ea091b08ce
init
hace 6 meses
local_inference
ea091b08ce
init
hace 6 meses
mobile_inference
ea091b08ce
init
hace 6 meses
README.md
ea091b08ce
init
hace 6 meses
modelUpgradeExample.py
ea091b08ce
init
hace 6 meses
README.md
Quickstart > Inference
This folder contains scripts to get you started with inference on Meta Llama models.
Code Llama
contains scripts for tasks relating to code generation using CodeLlama
Local Inference
contains scripts to do memory efficient inference on servers and local machines
Mobile Inference
has scripts using MLC to serve Llama on Android (h/t to OctoAI for the contribution!)
Model Update Example
shows an example of replacing a Llama 3 model with a Llama 3.1 model.