当前位置：首页 > 软件库 > 神经网络/人工智能 > 自然语言处理 >

pytorch-seq2seq-example

授权协议 Readme

开发语言 Python

所属分类神经网络/人工智能、自然语言处理

软件类型开源软件

地区不详

投递者端木冷勋

操作系统跨平台

开源组织无

适用人群未知

软件概览

Batched Seq2Seq ExampleBased on the seq2seq-translation-batched.ipynb from practical-pytorch, but more extra features.

This example runs grammatical error correction task where the source sequence is a grammatically erroneuous English sentence and the target sequence is an grammatically correct English sentence. The corpus and evaluation script can be download at: https://github.com/keisks/jfleg.

Extra features

Cleaner codebase
Very detailed comments for learners
Implement Pytorch native dataset and dataloader for batching
Correctly handle the hidden state from bidirectional encoder and past to the decoder as initial hidden state.
Fully batched attention mechanism computation (only implement general attention but it's sufficient). Note: The original code still uses for-loop to compute, which is very slow.
Support LSTM instead of only GRU
Shared embeddings (encoder's input embedding and decoder's input embedding)
Pretrained Glove embedding
Fixed embedding
Tie embeddings (decoder's input embedding and decoder's output embedding)
Tensorboard visualization
Load and save checkpoint
Replace unknown words by selecting the source token with the highest attention score. (Translation)

Cons

Comparing to the state-of-the-art seq2seq library, OpenNMT-py, there are some stuffs that aren't optimized in this codebase:

Use CuDNN when possible (always on encoder, on decoder when input_feed=0)
Always avoid indexing / loops and use torch primitives.
When possible, batch softmax operations across time. (this is the second complicated part of the code)
Batch inference and beam search for translation (this is the most complicated part of the code)

How to speed up RNN training?

Several ways to speed up RNN training:

Batching
Static padding
Dynamic padding
Bucketing
Truncated BPTT

See "Sequence Models and the RNN API (TensorFlow Dev Summit 2017)" for understanding those techniques.

You can use torchtext or OpenNMT's data iterator for speeding up the training. It can be 7x faster! (ex: 7 hours for an epoch -> 1 hour!)

Acknowledgement

Thanks to the author of OpenNMT-py @srush for answering the questions for me! See https://github.com/OpenNMT/OpenNMT-py/issues/552

使用案例

pytorch入门（3）pytorch-seq2seq模型

pytorch入门（3）pytorch-seq2seq模型 https://github.com/IBM/pytorch-seq2seq/ 此模型不包含embedding，且最大长度为10 Get Started Prepare toy dataset # Run script to generate the reverse toy dataset # The generated data is
Pytorch-Lightning基本方法介绍

LIGHTNINGMODULE LightningModule将PyTorch代码整理成5个部分： Computations (init). Train loop (training_step) Validation loop (validation_step) Test loop (test_step) Optimizers (configure_optimizers) Minimal Exam
pytorch做seq2seq注意力模型的翻译

以下是对pytorch 1.0版本的seq2seq+注意力模型做法语--英语翻译的理解（这个代码在pytorch0.4上也可以正常跑）： 1 # -*- coding: utf-8 -*- 2 """ 3 Translation with a Sequence to Sequence Network and Attention 4 ************************
d2lzh动手学深度学习-pytorch-d2lzh_pytorch

之前在学习深度学习的时候，一直这个包困扰着我，看了看网上的资源，找了找，给大家分享一下。如果有需要的请拿走，不谢。 import collections import math import os import random import sys import tarfile import time import zipfile from tqdm import tqdm from IPytho
wordembedding-paddle-pytorch-tf

构造词典，要把输入的字符串转换id，首先把字符和id的映射定义好字典 WORD_DICT_URL = "https://paddlenlp.bj.bcebos.com/data/dict.txt" # Loads vocab. vocab_path = "./dict.txt" if not os.path.exists(vocab_path): # d
（pytorch-深度学习）循环神经网络的从零开始实现

循环神经网络的从零开始实现首先，我们读取周杰伦专辑歌词数据集： import time import math import numpy as np import torch from torch import nn, optim import torch.nn.functional as F import sys sys.path.append("..") device = torch.d

pytorch-seq2seq-example

Extra features

Cons

How to speed up RNN training?

Acknowledgement

同类工具

相关阅读

相关文章

相关问答

相关文档