Despite the simplicity of the task, the model learns a great deal about the language: its structure, its vocabulary, its syntax, its semantics, and so on.
Just another LLM?
So, Galactica is just another LLM? Not really.
Galactica is proposed as an LM for science.
Similar to GPT-3, Galactica is able to mimic the style found in its training data and generate text that is fluent and scientifically styled.
Is Galactica able to generate text that is guaranteed to be factually correct? NO! It is not intended to solve factual correctness in NLP.
While Galactica is very similar to other LLMs, it also differs in some important aspects.
Dataset
Galactica is trained on a corpus containing 48 million scientific papers (83% of the total size of the corpus), 2 million code snippets, 8 million documents from reference material, 2 million entries from knowledge bases, and other documents both from the web and from the scientific domain.
For reference, OPT is trained on a mix of Common Crawl, The Pile, and other datasets (180B tokens for OPT vs 106B tokens for Galactica).
Working Memory Token, <work>
Transformer-based architectures lack an explicit working memory capability, which means a single forward pass has limited efficacy.
Chain-of-thought prompting is a technique used to enhance the performance of LLMs by letting them generate a step-by-step process to solve a problem. In Galactica, the authors encode this idea directly in the model by adding a dedicated token, <work>, to the vocabulary.
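As a minimal sketch of how this can be tried in practice (assuming the publicly released facebook/galactica-1.3b checkpoint on Hugging Face and the standard transformers API, neither of which is described in this post), appending <work> to a question asks the model to emit its step-by-step working before the final answer:

# Hedged sketch: prompting Galactica with the <work> token.
# Assumes the facebook/galactica-1.3b checkpoint and the transformers library.
from transformers import AutoTokenizer, OPTForCausalLM

tokenizer = AutoTokenizer.from_pretrained("facebook/galactica-1.3b")
model = OPTForCausalLM.from_pretrained("facebook/galactica-1.3b")

prompt = "What is the average of 4, 8, 15, 16, 23 and 42?\n\n<work>"
inputs = tokenizer(prompt, return_tensors="pt")
# The generated continuation should contain the intermediate steps,
# followed by the final answer after the working section.
outputs = model.generate(inputs.input_ids, max_new_tokens=128)
print(tokenizer.decode(outputs[0]))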
Citation tokens, [START_REF], [END_REF]
One of the distinctive features of scientific papers is the presence of citations. In Galactica, the authors propose to encode this idea directly in the model by adding specific tokens to the vocabulary.
The authors evaluate different citation formats, but the best results are obtained with the following format:
[START_REF] <paper title>, <first author last name> [END_REF]
It makes sense to use the paper title as it includes relevant information about the topic of the paper.
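For illustration (reusing the FP-growth reference that appears later in this post, not a verbatim training sample), a sentence in the training corpus carries its citation inline like this:
FP-growth mines frequent patterns without candidate generation [START_REF] Mining frequent patterns without candidate generation, Han [END_REF]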
Prompt Pre-Training
Another important difference with respect to other LLMs is the use of prompt pre-training: Galactica is also trained with prompts included in the pre-training data, both to boost performance on downstream tasks (e.g., question answering) and to augment the training data. A few considerations (an illustrative example follows this list):
The number of prompts is quite limited.
It surely helps to improve the performance of the model on QA-like tasks.
Could model performance be improved further by using a larger and more diverse set of prompts during training?
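As a purely illustrative sketch (this exact prompt is not taken from the paper's prompt set), a prompted pre-training example is just a question-answer pair serialized as plain text alongside the ordinary corpus documents:
Question: What is the SI unit of electric resistance?
Answer: ohm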
Results
The results of Galactica are quite impressive. The model is tested on a plethora of tasks and it performs very well. Does this mean that Galactica is a good model for science? The choice is yours.
Before diving into the discussion, let's summarize some takeaways from the paper.
Results - Multiple epochs
The authors show that the performance of Galactica continues to improve even after multiple epochs of pre-training. Why?
The dataset used for pre-training is very large but curated.
The dataset includes scientific text that can have a higher value per token than other datasets (e.g., Common Crawl).
Results - LaTeX equations
When prompted with a description of a formula, Galactica generates the corresponding LaTeX code.
Can you imagine how useful this can be?
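Reusing the tokenizer and model from the earlier hedged sketch, the prompt is just a natural-language description of the formula; the commented completion below is only the kind of output to expect, not a verbatim run:

prompt = "The Kullback-Leibler divergence between two distributions P and Q is defined as:\n\n"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(inputs.input_ids, max_new_tokens=64)
print(tokenizer.decode(outputs[0]))
# Expected style of completion (illustrative only):
# D_{KL}(P \| Q) = \sum_x P(x) \log \frac{P(x)}{Q(x)}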
Results - Reasoning
The <work> token is used to wrap the detailed intermediate steps that lead to the final result. When compared with other LLMs, Galactica generates the result of a multi-step process more accurately.
It is worth noting that the Galactica 30B model already achieves very good performance considering its size.
Results - Scientific NLP
The authors evaluate Galactica on a set of scientific NLP tasks. The model is asked to answer knowledge-intensive questions from multiple domains and knowledge levels (e.g., high school, college, and graduate school).
A few interesting observations:
The authors report that the model may be biased towards graduate-level knowledge.
Considering in-domain performance, the model is able to outperform all competitors.
Considering out-of-domain performance, the model shows good results overall.
The GPT-3 results are only available for a few tasks. Why? I don't know. Even for the other competitors, results are not available for all tasks.
Results - Citations prediction
Here's a task that I find very interesting: the model is evaluated on predicting the citation for a paper given the left-hand context.
The model is prompted with a text similar to the following one:
FP-growth is a frequent pattern mining algorithm for transactional databases [START_REF]
and asked to generate the following text:
FP-growth is a frequent pattern mining algorithm for transactional databases
[START_REF]Mining frequent patterns without candidate generation, Han[END_REF]
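Again reusing the setup from the earlier hedged sketch, citation prediction is plain left-to-right generation starting from the [START_REF] token:

prompt = "FP-growth is a frequent pattern mining algorithm for transactional databases [START_REF]"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(inputs.input_ids, max_new_tokens=32)
print(tokenizer.decode(outputs[0]))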
The model is compared with sparse and dense retrieval models (those are the ones we use today for citation lookup).
One possible criticism, in this case, is that the model could be biased towards the most-cited papers. That may be true, but the authors report that this bias is mitigated by the size of the model.
Results
A lot of additional results are reported in the paper. They include biological understanding, drug discovery, chemical understanding and many more. I strongly encourage you to read the paper to learn more about the results.
Discussion
The paper proposes Galactica as an LLM for science. What are the threats to this claim?
The authors advertise Galactica as a model that can be used to generate scientific papers, and they provide a public demo to show off the impressive capabilities of the model.
I think that the authors are a bit too optimistic about the capabilities of the model and have managed to create a lot of hype around it.
Let's discuss a few points!
Discussion - Factuality
Generative models, especially in the NLP domain, can generate very fluent text, but they cannot guarantee the factual correctness of the generated text.
Galactica is not an exception.
The problem of hallucinated content is a very common problem in the NLP domain and the authors are aware of it.
I wish most of the tweets about Galactica had included a disclaimer on whether the tweeter had actually read the paper or not.
Discussion - Misinformation
The model can be used to generate fake information that really sounds like it is correct.
The model is just amazing at speaking the language of science. It can generate text that is very similar to the text written by a scientist. Is that a problem? The choice is yours.
My take? I think that it is not.
Generating fake news is a problem, but it is not a problem of the model. It is a problem of the people using the model. The model is just a tool.
Let's stop for a moment and discuss this.
Discussion - Academic companion
The best way to think about Galactica is as an academic companion. It can help you to generate text that is very similar to the text written in a scientific paper.
The model can help you to write a paper, but it must not write a paper for you.
It should be a tool for scientists. You are the researcher and you know the domain, the problem, the data, the methods, and so on. You should interact with the model.
Non-native speakers can use the model to generate a first draft of a paper or to get suggestions on academic writing.
Be aware that the model will generate the text you ask for. If you ask for a section of a paper on the benefits of eating glass, the model will follow your instructions and will do its best to generate scientific-sounding text. Don't blame the model for that.
Standing on the shoulders of a giant, I report here a tweet from Prof. Yann LeCun that best summarizes my opinion about the model:
This tool is to paper writing as driving assistance is to driving. It won't write papers automatically for you, but it will greatly reduce your cognitive load while you write them. https://t.co/0WgR8DWUV6