Guy/fairseq

Message	Author	SHA1	Date
fix alignment when using uneven batches and left pad	Alexei Baevski	9f1b37ddcd	6 years ago
implement batching in interactive mode	Alexei Baevski	663fd8060e	6 years ago
Sampling doesn't work with interactive	Sergey Edunov	4ce453b18f	6 years ago
Fix --remove-bpe to strip trailing BPE symbols	Myle Ott	a04c4cf46f	6 years ago
Remove padding from --score-reference	Myle Ott	c53b2ee06a	6 years ago
fix flag copy paste (decoder-normalize-before)	alexeib	12a47a64ce	6 years ago
add support for averaging last n checkpoints	Alexei Baevski	f6a5a54e24	6 years ago
make attn dropout 0.1 default for big en-de transformer	Alexei Baevski	23211c457d	6 years ago
fix to adding tokens to dictionary while thresholding	Angela Fan	d85b61d65e	6 years ago
Fix --prefix-size	Myle Ott	7f538f54d9	6 years ago
make sure tensor used to index is cuda if on gpu	Alexei Baevski	2a681d99e6	6 years ago
Remove src-padding from generation output	Myle Ott	88df72c0cf	6 years ago
Fix tests	Myle Ott	8afb77612c	6 years ago
Support --warmup-updates with fixed LR schedule	Myle Ott	7c7634f638	6 years ago
Save and restore wall time in checkpoints	Myle Ott	0daba38ecb	6 years ago
Simplify train.py (merge with singleprocess_train.py)	Myle Ott	dc40ac58f7	6 years ago
Fix embedding initialization for padding	Alexei Baevski	c6d4386c15	6 years ago
Use eval() to parse args.lr	Myle Ott	1ec5f0a0e4	6 years ago
Fix preprocess.py	Myle Ott	fa7c575a1c	6 years ago
Small optimization for LSTM	Myle Ott	f607d9e89a	6 years ago
Fix Flake8	Myle Ott	8fcdb9b726	6 years ago
remove completed sentences from batch	Alexei Baevski	2a84f46bf0	6 years ago
No more magical --fp16	Myle Ott	bcdc27dcf1	6 years ago
Pad dictionary to be a multiple of 8 in preprocessing	Myle Ott	745d5fbd7f	6 years ago
Revert "Make dictionary size a multiple of 8"	Myle Ott	4cd2bb702b	6 years ago
Make dictionary size a multiple of 8	Myle Ott	26f87c7d6b	6 years ago
Add FP16 support	Myle Ott	7ee1d28458	6 years ago
Fix batching during generation	Myle Ott	73a87327ed	6 years ago
Allow schedule for update-freq	Myle Ott	47b3b81c0d	6 years ago
Improve dataloader speed and deprecate concept of batch_offset (use --sample-without-replacement instead)	Myle Ott	4fa8760e9a	6 years ago

Newer Older

Guy / fairseq

Guy
/
fairseq