Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
LoftQ
/
Llama-2-7b-hf-4bit-64rank
like
1
Text Generation
Transformers
Safetensors
English
llama
quantization
lora
text-generation-inference
Inference Endpoints
4-bit precision
bitsandbytes
arxiv:
2310.08659
License:
mit
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
Llama-2-7b-hf-4bit-64rank
2 contributors
History:
25 commits
LoftQ
Update README.md
a412479
verified
5 months ago
gsm8k
convert to bin
9 months ago
loftq_init
Update loftq_init/adapter_config.json
10 months ago
.gitattributes
1.52 kB
initial commit
10 months ago
README.md
3.3 kB
Update README.md
5 months ago
config.json
1.17 kB
Upload folder using huggingface_hub
5 months ago
generation_config.json
183 Bytes
Upload folder using huggingface_hub
5 months ago
model.safetensors
4.17 GB
LFS
Upload folder using huggingface_hub
5 months ago
special_tokens_map.json
414 Bytes
Upload LoftQ models
10 months ago
tokenizer.json
1.84 MB
Upload LoftQ models
10 months ago
tokenizer.model
500 kB
LFS
Upload LoftQ models
10 months ago
tokenizer_config.json
867 Bytes
Upload LoftQ models
10 months ago