
Molmo-7B-O BnB 4bit quant

Model size: 30GB -> 7GB

approx. 12GB VRAM required

base model for more information:

example code:
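The following is a minimal loading sketch rather than the author's own example. It assumes the repository id `cyan2k/molmo-7B-O-bnb-4bit`, that the 4-bit quantization settings are stored in the repository's config (so no explicit quantization config is passed), and that `transformers`, `accelerate`, and `bitsandbytes` are installed on a machine with a CUDA GPU. The helper name `load_molmo_4bit` is hypothetical.

```python
# Repository id of this quant (assumed; adjust if you use a local copy).
MODEL_ID = "cyan2k/molmo-7B-O-bnb-4bit"


def load_molmo_4bit(model_id: str = MODEL_ID):
    """Return (processor, model) for the pre-quantized checkpoint.

    Hypothetical helper: relies on the quantization settings saved in the
    repo config and needs transformers, accelerate and bitsandbytes.
    """
    # Imported lazily so this module can be imported without the heavy deps.
    from transformers import AutoModelForCausalLM, AutoProcessor

    # The repo ships custom modeling code, so remote code must be trusted.
    processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        trust_remote_code=True,
        torch_dtype="auto",   # keep the dtypes stored in the checkpoint
        device_map="auto",    # let accelerate place the weights on the GPU
    )
    return processor, model
```

Calling `processor, model = load_molmo_4bit()` downloads the weights on first use. Image preprocessing and generation go through the custom code shipped with the model, so check the base model card for the exact calls.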

Performance metrics and benchmarks comparing this quant with the base model will follow over the next week.

Safetensors

Model size: 4.35B params

Tensor type: F32


Model repository: cyan2k/molmo-7B-O-bnb-4bit (a quantized version of the base model)