Transformer (machine learning model)


Topic history | v1 (current) | created by janarez

Details

| created by janarez | Add resource "The Annotated Transformer"
Title
Transformer (machine learning model)
Description
A transformer is a deep learning model built around the attention mechanism, which weighs each part of the input data differently according to its significance. It is used primarily in natural language processing (NLP) and computer vision (CV). Like recurrent neural networks (RNNs), transformers are designed to handle sequential input data, such as natural language, for tasks such as translation and text summarization. Unlike RNNs, however, transformers do not necessarily process the data in order. Instead, the attention mechanism provides context for any position in the input sequence, so each position can draw on every other position directly.
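
To make the description concrete, here is a minimal sketch of how attention can weigh every position of a sequence against every other. It assumes one common form, single-head scaled dot-product self-attention, which the description above does not itself specify. The names `self_attention`, `w_q`, `w_k`, and `w_v` are illustrative, and the random weights stand in for parameters that a real model would learn.

```python
import numpy as np


def softmax(x, axis=-1):
    # Subtract the maximum along the axis for numerical stability.
    shifted = x - np.max(x, axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / np.sum(exp, axis=axis, keepdims=True)


def self_attention(x, w_q, w_k, w_v):
    """Single-head scaled dot-product self-attention (illustrative sketch).

    x has shape (seq_len, d_model); each projection matrix has shape
    (d_model, d_k). Returns the attended output and the attention weights.
    """
    q = x @ w_q  # queries: what each position is looking for
    k = x @ w_k  # keys: what each position offers
    v = x @ w_v  # values: the content that gets mixed together
    d_k = q.shape[-1]
    # Score every position against every other position in one matrix product,
    # so no left-to-right pass over the sequence is needed.
    scores = q @ k.T / np.sqrt(d_k)
    # Each row becomes a probability distribution over input positions.
    weights = softmax(scores, axis=-1)
    return weights @ v, weights


# Hypothetical toy input: 4 positions, model width 8, random "learned" weights.
rng = np.random.default_rng(0)
seq_len, d_model, d_k = 4, 8, 8
x = rng.normal(size=(seq_len, d_model))
w_q = rng.normal(size=(d_model, d_k))
w_k = rng.normal(size=(d_model, d_k))
w_v = rng.normal(size=(d_model, d_k))

output, weights = self_attention(x, w_q, w_k, w_v)
print(output.shape)   # one output vector per input position
print(weights.shape)  # one weight per (query position, key position) pair
```

Row i of `weights` shows how strongly position i draws on each position of the input, which is the "differential weighing" the description refers to. Because all rows are computed together, nothing in this sketch depends on processing the sequence in order.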
Link
https://en.wikipedia.org/?curid=61603971

resources

discussed in The Annotated Transformer
v1 | attached by janarez | Add resource "The Annotated Transformer"

authors

This topic has no history of related authors.

topics

tool for Deep learning
v1 | attached by janarez | Add resource "The Annotated Transformer"