Guy/fairseq

Message	Author	SHA1	Date
fix flag copy paste (decoder-normalize-before)	alexeib	12a47a64ce	6 years ago
add support for averaging last n checkpoints	Alexei Baevski	f6a5a54e24	6 years ago
make attn dropout 0.1 default for big en-de transformer	Alexei Baevski	23211c457d	6 years ago
fix to adding tokens to dictionary while thresholding	Angela Fan	d85b61d65e	6 years ago
Fix --prefix-size	Myle Ott	7f538f54d9	6 years ago
make sure tensor used to index is cuda if on gpu	Alexei Baevski	2a681d99e6	6 years ago
Remove src-padding from generation output	Myle Ott	88df72c0cf	6 years ago
Fix tests	Myle Ott	8afb77612c	6 years ago
Support --warmup-updates with fixed LR schedule	Myle Ott	7c7634f638	6 years ago
Save and restore wall time in checkpoints	Myle Ott	0daba38ecb	6 years ago
Simplify train.py (merge with singleprocess_train.py)	Myle Ott	dc40ac58f7	6 years ago
Fix embedding initialization for padding	Alexei Baevski	c6d4386c15	6 years ago
Use eval() to parse args.lr	Myle Ott	1ec5f0a0e4	6 years ago
Fix preprocess.py	Myle Ott	fa7c575a1c	6 years ago
Small optimization for LSTM	Myle Ott	f607d9e89a	6 years ago
Fix Flake8	Myle Ott	8fcdb9b726	6 years ago
remove completed sentences from batch	Alexei Baevski	2a84f46bf0	6 years ago
No more magical --fp16	Myle Ott	bcdc27dcf1	6 years ago
Pad dictionary to be a multiple of 8 in preprocessing	Myle Ott	745d5fbd7f	6 years ago
Revert "Make dictionary size a multiple of 8"	Myle Ott	4cd2bb702b	6 years ago
Make dictionary size a multiple of 8	Myle Ott	26f87c7d6b	6 years ago
Add FP16 support	Myle Ott	7ee1d28458	6 years ago
Fix batching during generation	Myle Ott	73a87327ed	6 years ago
Allow schedule for update-freq	Myle Ott	47b3b81c0d	6 years ago
Improve dataloader speed and deprecate concept of batch_offset (use --sample-without-replacement instead)	Myle Ott	4fa8760e9a	6 years ago
better batching	Sergey Edunov	c52f6ea4fc	6 years ago
Use FP32 for multi-head attention softmax	Myle Ott	d6be0c7e00	6 years ago
Simulated big batches	Sergey Edunov	2d27ae084a	6 years ago
More improvements to weight init and FP16 support	Myle Ott	60c4081b06	6 years ago
Use PyTorch LayerNorm and improve weight init	Myle Ott	36e360d907	6 years ago

Newer Older

Guy / fairseq

Guy
/
fairseq