arxiv:2411.02355
Eldar Kurtić
ekurtic
AI & ML interests: Efficient inference
Recent Activity
updated a model about 3 hours ago: ekurtic/Qwen2.5-VL-7B-Instruct-weight-only-INT8-fake-quant
published a model about 3 hours ago: ekurtic/Qwen2.5-VL-7B-Instruct-weight-only-INT8-fake-quant
updated a model about 4 hours ago: ekurtic/Qwen2.5-VL-7B-Instruct-weight-only-FP8-fake-quant