Source: Hugging Face documentation.

My code:

My error log:
ImportError: Using bitsandbytes 8-bit quantization requires Accelerate: pip install accelerate and the latest version of bitsandbytes: pip install -i https://pypi.org/simple/ bitsandbytes
Note: I have already installed both accelerate and bitsandbytes.
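One common cause of this ImportError (an assumption worth ruling out, not a confirmed diagnosis) is that the packages were installed into a different Python environment than the one running the script. A quick way to check whether the interpreter you actually use can see them:

```shell
# Verify that accelerate and bitsandbytes are importable from the SAME
# Python interpreter that runs your script. If this command fails,
# the packages live in a different environment (conda env, venv, etc.).
python -c "import accelerate, bitsandbytes; print(accelerate.__version__, bitsandbytes.__version__)"
```

If this prints two version numbers, the environment is fine and the error likely comes from something else, such as a missing CUDA-capable GPU.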
One thing still confuses me: the log says that 8-bit quantization requires Accelerate and the latest bitsandbytes, but I am doing 4-bit quantization.
I am trying to quantize the model and expecting it to download.
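Since the original code is not shown, here is a minimal sketch of a typical 4-bit load with `BitsAndBytesConfig` for comparison; the model id is just an example placeholder, and it assumes transformers, accelerate, bitsandbytes, and a CUDA GPU are all available in the same environment. Note that in some transformers versions this code path raises the same error message worded as "8-bit quantization" even when `load_in_4bit=True` is used, which may explain the confusing wording in the log.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

# 4-bit quantization config (NF4 is the commonly recommended quant type)
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)

model_id = "facebook/opt-350m"  # example model id, swap in your own

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",  # device placement handled by accelerate
)
```

Running `from_pretrained` with a `quantization_config` triggers the environment check that produces the ImportError in the log, so fixing the environment (correct interpreter, GPU available) matters more than the 4-bit vs. 8-bit distinction.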
I got the same error. It felt like accelerate is effectively a wrapper around CUDA, so a GPU is still required. With a GPU, I can use 4-bit quantization, though.