chore: Update lmms-eval to support video evaluations for LLaVA models · EvolvingLMMs-Lab/lmms-eval@cbeee20 (original) (raw)

`@@ -8,6 +8,7 @@

`

8

8

``

9

9

`🏠 LMMs-Lab Homepage | 🎉 Blog | 📚 Documentation | 🤗 Huggingface Datasets | Discord_Thread discord/lmms-eval

`

10

10

``

``

11

`+


`

11

12

``

12

13

`# Annoucement

`

13

14

``

`@@ -206,14 +207,41 @@ Please refer to our documentation.

`

206

207

``

207

208

`lmms_eval is a fork of lm-eval-harness. We recommend you to read through the docs of lm-eval-harness for relevant information.

`

208

209

``

``

210

`+


`

``

211

+

209

212

`Below are the changes we made to the original API:

`

210

213

`- Build context now only pass in idx and process image and doc during the model responding phase. This is due to the fact that dataset now contains lots of images and we can't store them in the doc like the original lm-eval-harness other wise the cpu memory would explode.

`

211

214

`- Instance.args (lmms_eval/api/instance.py) now contains a list of images to be inputted to lmms.

`

212

215

`- lm-eval-harness supports all HF language models as single model class. Currently this is not possible of lmms because the input/output format of lmms in HF are not yet unified. Thererfore, we have to create a new class for each lmms model. This is not ideal and we will try to unify them in the future.

`

213

216

``

214

``

`-

We also thank:

`

``

217

`+


`

``

218

+

``

219

`+

During the initial stage of our project, we thank:

`

215

220

`- Xiang Yue, Jingkang Yang, Dong Guo and Sheng Shen for early discussion and testing.

`

216

221

``

``

222

`+


`

``

223

+

``

224

`` +

During the v0.1 to v0.2, we thank the community support from pull requests (PRs):

``

``

225

+

``

226

`+

Datasets:

`

``

227

+

``

228

`+

`

``

229

`+

`

``

230

`+

`

``

231

`+

`

``

232

`+

`

``

233

`+

`

``

234

`+

`

``

235

`+

`

``

236

`+

`

``

237

+

``

238

`+

Models:

`

``

239

+

``

240

`+

`

``

241

`+

`

``

242

`+

`

``

243

`+

`

``

244

+

217

245

`## Citations

`

218

246

``

219

247

```` ```shell

````