作者 | SHA1 备注 | 提交日期 |
---|---|---|
|
2ea7f57991 convertion missing preprocessor_config.json. | 6 月之前 |
|
e2f77dbc21 fix quant config | 9 月之前 |
|
6ef9a78458 Fix issues with quantization_config == None | 9 月之前 |
|
0920b1a415 Fix quantization for inference | 9 月之前 |
|
d51d2cce9c adding sdpa for flash attn | 1 年之前 |
|
db8af96ff0 update the model load with native flash attn | 1 年之前 |
|
4c9cc7d223 Move modules into separate src folder | 1 年之前 |