📓 Text Generation docs rework (original) (raw)

What is this?

This is an issue to discuss and track the rework of the docs for text generation. Comments and feedback are appreciated, as always 🤗

Current issues

  1. Our main reference for text generation is not in the docs and is quite outdated
  2. The docs regarding text generation are scattered, and it is not simple to navigate between them -- the reader has to know where to look for them
  3. We lack examples beyond the simplest forms of text generation
  4. We have undocumented advanced use cases, such as setting a custom stopping criteria
  5. We are not clear about what the user can't do

Proposed plan

EDIT:

I'd like to split the plan into three parts:

  1. Designing a simpler entry point to text generation, from which all related documentation is discoverable
  2. Upgrading the developer guides to cover the full potential of text generation
  3. Make our code more self-documenting and other code changes

1. Designing a simpler entry point for text generation docs

Tackles issues 1 and 2.

This part is further divided into two actions:

Related docs:

  1. Tasks
  2. Related developer guides
  3. API reference
  4. Outside transformers (e.g. optimum, text-generation-inference, LLM leaderboard, non-HF libs like autogptq?)

2. Upgrading the developer guides

Tackles issues 3 and 4.

We currently have one developer guide, which writes about the API and a few basic ways to manipulate text generation. I propose we improve the existing one and add 2 new guides, preferably with examples that cover more modalities and use cases:

3. Self-documenting code and other code changes

Tackles issues 3 and 5.