Optimum library.">

🤗Optimum (original) (raw)

Topic Replies Views Activity
About the 🤗 Optimum category 0 1537 March 25, 2022
[Guide] Quantize LLM CoreML to int8 on Mac ARM (TinyLlama, May 2025, tested workflow & script) 0 29 May 26, 2025
Trying to convert DeepSeek-R1 into onnx 1 51 March 13, 2025
Optimum library optimization and quantization fails 8 1512 February 22, 2025
Paligemma2 onnx export KeyError: "Unknown task: image-text-to-text 4 99 February 11, 2025
Optimum-habana not working! 2 21 February 10, 2025
Incorrect Cross Attention Values from Generate Function of ORTModelForVision2Seq 3 51 February 1, 2025
Inference on models with custom head 1 19 January 28, 2025
Supported models 6 92 January 14, 2025
Qwen/Qwen1.5-7B-Chat RuntimeError: The serialized model is larger than the 2GiB ORTModelForCausalLM 2 363 January 1, 2025
Error when running examples in optimum habana 2 589 October 30, 2024
Question about the infernce flow for optimum exported decoder merged onnx model 4 50 October 11, 2024
Compiling SD1.5 for Neuron with resolution other than 512x512 fails 5 95 September 26, 2024
Error while optimizing seq2seq model using optimum 1 59 September 16, 2024
Neuron StableDiffusion ControlNet Pipeline fails when used with 2 controlnets 4 63 September 11, 2024
How can I export a transformers model into onnx that not supported with optimum yet 9 474 August 30, 2024
Can I convert llama 2 "Chat" model into onnx using llama/convert_to_onnx.py script? 5 1762 August 26, 2024
Optimum Failed download of jina-embeddings-v2-base-es 4 393 August 19, 2024
Optimum/Neuron: RuntimeError: forward() is missing value for argument 'argument_4' 2 32 August 13, 2024
What value should the sequence_length parameter be when converting to TFLite 0 17 August 10, 2024
How to export a fine-tuned SDXL model? 5 179 August 9, 2024
Make Text Embedding Server compatible 2 238 August 8, 2024
Exporting SegFormer Image Processor to ONNX Format Using "optimum.exporters.onnx.onnx_export_from_model" 0 87 July 30, 2024
How to export mistralai/Mistral-7B-v0.1 to Tflite to use in TensorFlow Autocomplete? 1 646 June 28, 2024
Optimum - exporting Tensorflow based transformers to openvino 0 83 June 27, 2024
Is it possible to make the first batch as fast as the subsequent ones? 1 82 June 25, 2024
Are object detection models supported in optimum? 7 823 June 21, 2024
UnboundLocalError: cannot access local variable 'all_files' where it is not associated with a value 4 1517 June 13, 2024
Not able run all nodes on DML with optimum 4 376 June 6, 2024
Not able to run on DML with pipeline 2 337 June 6, 2024