arxiv:2305.09781
Zhihao Zhang
JackFram
AI & ML interests
None yet
Recent Activity
commented
a paper
about 2 months ago
TidalDecode: Fast and Accurate LLM Decoding with Position Persistent
Sparse Attention
authored
a paper
about 2 months ago
SpecInfer: Accelerating Generative LLM Serving with Speculative
Inference and Token Tree Verification
Organizations
Papers
1
models
7
JackFram/llama-68m
Text Generation
•
Updated
•
628k
•
24
JackFram/llama-160m
Text Generation
•
Updated
•
244k
•
32
JackFram/llama-160m-base
Text Generation
•
Updated
•
6
JackFram/llama-160m-cbt-4
Text Generation
•
Updated
•
7
JackFram/llama-160m-cbt-3
Text Generation
•
Updated
•
7
JackFram/llama-160m-cbt-2
Text Generation
•
Updated
•
9
JackFram/llama-160m-cbt-1
Text Generation
•
Updated
•
8
datasets
None public yet