eli5_retrieval_large_lm/requirements_a100.txt · master · Joel Wemboembo / google-research

Jules Gagnon-Marchand authored Dec 11, 2020

The objective is to test a reasonably large language model
(GPT2-XL, 1.5B parameters) on the KILT (https://ai.facebook.com/tools/kilt/)
version of the ELI5 (https://www.aclweb.org/anthology/P19-1346/) task when
combined with a retriever (REALM, https://arxiv.org/abs/2002.08909, in this case).

The goal is to observe whether larger causal language models can make use of the
retrieved reference data, as their reasoning capacities are stronger than those
of previously tested models.

Inspecting what kinds of retrievals are useful and why would be a next step, as
well as investigating the effects of the retrieval on factual consistency in
generation, which is a problem of major interest right now.

PiperOrigin-RevId: 347074648

9518bd77