Publication record · 18.cifr/2014.sutskever.sequence-to-sequence-learning-with-neura
Deep Neural Networks (DNNs) are powerful models that have achieved excellent performance on difficult learning tasks. Although DNNs work well whenever large labeled training sets are available, they cannot be used to map sequences to sequences. In this paper, we present a general end-to-end approach to sequence learning that makes minimal assumptions on the sequence structure. Our method uses a multilayered Long Short-Term Memory (LSTM) to map the input sequence to a vector of a fixed dimensionality, and then another deep LSTM to decode the target sequence from the vector.
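The abstract's two-LSTM pipeline can be sketched compactly. Below is a minimal, hedged illustration assuming PyTorch; the class name Seq2Seq and its interface are our own, while the rough hyperparameters (4 layers, 1000 units, 1000-dimensional embeddings, 160k/80k vocabularies) follow the paper's description. The paper also reverses the source word order before encoding, which it reports improves results.

```python
# Minimal sketch of the paper's encoder-decoder architecture (not the
# authors' code): one deep LSTM compresses the source into a fixed-size
# state, and a second deep LSTM decodes the target conditioned on it.
import torch
import torch.nn as nn

class Seq2Seq(nn.Module):
    def __init__(self, src_vocab, tgt_vocab, emb_dim=1000, hidden=1000, layers=4):
        super().__init__()
        self.src_emb = nn.Embedding(src_vocab, emb_dim)
        self.tgt_emb = nn.Embedding(tgt_vocab, emb_dim)
        self.encoder = nn.LSTM(emb_dim, hidden, layers, batch_first=True)
        self.decoder = nn.LSTM(emb_dim, hidden, layers, batch_first=True)
        self.out = nn.Linear(hidden, tgt_vocab)

    def forward(self, src, tgt):
        # Encode: the final (h, c) state is the fixed-dimensional summary vector.
        # (Per the paper, src would be fed in reversed word order.)
        _, state = self.encoder(self.src_emb(src))
        # Decode: condition the target-side LSTM on that state.
        dec_out, _ = self.decoder(self.tgt_emb(tgt), state)
        return self.out(dec_out)  # logits over the target vocabulary

# Usage with toy sizes (the paper's scale is 4 layers, 1000 units,
# 160k source / 80k target vocabularies):
model = Seq2Seq(src_vocab=200, tgt_vocab=200, emb_dim=32, hidden=32, layers=2)
logits = model(torch.randint(0, 200, (2, 12)), torch.randint(0, 200, (2, 10)))
```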
The fixed-length encoding vector is an information bottleneck that limits performance on long sentences; attention mechanisms, which let the decoder consult all encoder states rather than a single summary vector, were later introduced to address it. Handling of out-of-vocabulary words, which are all collapsed to a single <UNK> token, can likewise be improved with subword models. The authors also suggest exploring convolutional encoders and other architectures as alternatives to the LSTM encoder.
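For concreteness, here is a hedged sketch of the attention idea referenced above (it is follow-up work, not part of this paper): instead of decoding from one fixed vector, each decoder step computes a weighted sum over all encoder states. The sketch uses Luong-style dot-product scoring, assuming PyTorch and illustrative shapes.

```python
# Minimal dot-product attention sketch (follow-up work, not this paper's
# method): the decoder re-weights the encoder states at every step,
# removing the single fixed-length bottleneck.
import torch
import torch.nn.functional as F

def attend(dec_state, enc_states):
    # dec_state: (batch, hidden) current decoder hidden state
    # enc_states: (batch, src_len, hidden) all encoder outputs
    scores = torch.bmm(enc_states, dec_state.unsqueeze(2)).squeeze(2)  # (batch, src_len)
    weights = F.softmax(scores, dim=1)            # attention distribution over source
    context = torch.bmm(weights.unsqueeze(1), enc_states).squeeze(1)   # (batch, hidden)
    return context, weights
```

The returned context vector would be combined with the decoder state before predicting the next token, so the effective summary of the source is recomputed at every step instead of being fixed once.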