Scripts for training models with various parameters.

Default parameters

These models are trained with the default parameters, which are as close as possible to Tufano et al.'s highest performing model. Refer to HephaestusModel.train for more information on what the default parameters are.

These models are trained with CompoundOperation machine strings in general form.

Model paths

Variable name Value
MODEL_DEFAULT_CONTROL "../models/default_params/control"
MODEL_DEFAULT_BASIC "../models/default_params/basic_ops"
MODEL_DEFAULT_STRICT "../models/default_params/strict_ops"
MODEL_DEFAULT_LOOSE "../models/default_params/loose_ops"

Models

Control -- train only on AbstractMethods (no EditOperations):

modelDefaultControl = HephaestusModel(MODEL_DEFAULT_CONTROL)
modelDefaultControl.train(
    DATA_SMALL_METHODS_TRAIN_BUGGY,
    DATA_SMALL_METHODS_TRAIN_FIXED,
    DATA_SMALL_METHODS_VALID_BUGGY,
    DATA_SMALL_METHODS_VALID_FIXED
)
[2021-04-23 00:53:06,526 INFO] Counter vocab from -1 samples.
[2021-04-23 00:53:06,526 INFO] n_sample=-1: Build vocab on full datasets.
[2021-04-23 00:53:06,530 INFO] corpus_1's transforms: TransformPipe()
[2021-04-23 00:53:06,530 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 00:53:07,131 INFO] Counters src:429
[2021-04-23 00:53:07,131 INFO] Counters tgt:423
[2021-04-23 00:53:07,131 WARNING] path ../models/default_params/control/save_data.vocab.src exists, may overwrite...
[2021-04-23 00:53:07,132 WARNING] path ../models/default_params/control/save_data.vocab.tgt exists, may overwrite...
[2021-04-23 00:53:07,607 INFO] Parsed 2 corpora from -data.
[2021-04-23 00:53:07,607 INFO] Get special vocabs from Transforms: {'src': set(), 'tgt': set()}.
[2021-04-23 00:53:07,607 INFO] Loading vocab from text file...
[2021-04-23 00:53:07,607 INFO] Loading src vocabulary from ../models/default_params/control/save_data.vocab.src
[2021-04-23 00:53:07,608 INFO] Loaded src vocab has 429 tokens.
[2021-04-23 00:53:07,608 INFO] Loading tgt vocabulary from ../models/default_params/control/save_data.vocab.tgt
[2021-04-23 00:53:07,610 INFO] Loaded tgt vocab has 423 tokens.
[2021-04-23 00:53:07,610 INFO] Building fields with vocab in counters...
[2021-04-23 00:53:07,610 INFO]  * tgt vocab size: 427.
[2021-04-23 00:53:07,611 INFO]  * src vocab size: 431.
[2021-04-23 00:53:07,611 INFO]  * src vocab size = 431
[2021-04-23 00:53:07,611 INFO]  * tgt vocab size = 427
[2021-04-23 00:53:07,612 INFO] Building model...
[2021-04-23 00:53:08,835 INFO] NMTModel(
  (encoder): RNNEncoder(
    (embeddings): Embeddings(
      (make_embedding): Sequential(
        (emb_luts): Elementwise(
          (0): Embedding(431, 512, padding_idx=1)
        )
      )
    )
    (rnn): LSTM(512, 256, num_layers=2, dropout=0.2)
  )
  (decoder): InputFeedRNNDecoder(
    (embeddings): Embeddings(
      (make_embedding): Sequential(
        (emb_luts): Elementwise(
          (0): Embedding(427, 512, padding_idx=1)
        )
      )
    )
    (dropout): Dropout(p=0.2, inplace=False)
    (rnn): StackedLSTM(
      (dropout): Dropout(p=0.2, inplace=False)
      (layers): ModuleList(
        (0): LSTMCell(768, 256)
        (1): LSTMCell(256, 256)
      )
    )
    (attn): GlobalAttention(
      (linear_context): Linear(in_features=256, out_features=256, bias=False)
      (linear_query): Linear(in_features=256, out_features=256, bias=True)
      (v): Linear(in_features=256, out_features=1, bias=False)
      (linear_out): Linear(in_features=512, out_features=256, bias=True)
    )
  )
  (generator): Sequential(
    (0): Linear(in_features=256, out_features=427, bias=True)
    (1): Cast()
    (2): LogSoftmax(dim=-1)
  )
)
[2021-04-23 00:53:08,835 INFO] encoder: 1535488
[2021-04-23 00:53:08,835 INFO] decoder: 2168235
[2021-04-23 00:53:08,835 INFO] * number of parameters: 3703723
[2021-04-23 00:53:08,836 INFO] Starting training on GPU: [0]
[2021-04-23 00:53:08,836 INFO] Start training loop and validate every 5000 steps...
[2021-04-23 00:53:08,837 INFO] corpus_1's transforms: TransformPipe()
[2021-04-23 00:53:08,837 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 00:53:18,915 INFO] Step 50/50000; acc:  10.93; ppl: 155.59; xent: 5.05; lr: 0.00010; 9894/9391 tok/s;     10 sec
[2021-04-23 00:53:29,040 INFO] Step 100/50000; acc:  16.06; ppl: 38.52; xent: 3.65; lr: 0.00010; 10233/9586 tok/s;     20 sec
[2021-04-23 00:53:38,899 INFO] Step 150/50000; acc:  20.89; ppl: 28.01; xent: 3.33; lr: 0.00010; 10108/9625 tok/s;     30 sec
[2021-04-23 00:53:49,008 INFO] Step 200/50000; acc:  24.08; ppl: 25.04; xent: 3.22; lr: 0.00010; 10061/9517 tok/s;     40 sec
[2021-04-23 00:53:59,093 INFO] Step 250/50000; acc:  30.11; ppl: 21.26; xent: 3.06; lr: 0.00010; 10095/9529 tok/s;     50 sec
[2021-04-23 00:54:09,210 INFO] Step 300/50000; acc:  37.33; ppl: 17.73; xent: 2.88; lr: 0.00010; 9963/9469 tok/s;     60 sec
[2021-04-23 00:54:19,542 INFO] Step 350/50000; acc:  44.23; ppl: 13.86; xent: 2.63; lr: 0.00010; 10160/9532 tok/s;     71 sec
[2021-04-23 00:54:29,668 INFO] Step 400/50000; acc:  48.21; ppl: 11.27; xent: 2.42; lr: 0.00010; 9963/9431 tok/s;     81 sec
[2021-04-23 00:54:39,904 INFO] Step 450/50000; acc:  50.71; ppl:  9.63; xent: 2.26; lr: 0.00010; 10119/9488 tok/s;     91 sec
[2021-04-23 00:54:49,880 INFO] Step 500/50000; acc:  52.70; ppl:  8.51; xent: 2.14; lr: 0.00010; 9946/9436 tok/s;    101 sec
[2021-04-23 00:55:00,036 INFO] Step 550/50000; acc:  54.26; ppl:  7.57; xent: 2.02; lr: 0.00010; 10108/9559 tok/s;    111 sec
[2021-04-23 00:55:10,017 INFO] Step 600/50000; acc:  55.33; ppl:  7.16; xent: 1.97; lr: 0.00010; 10231/9576 tok/s;    121 sec
[2021-04-23 00:55:20,033 INFO] Step 650/50000; acc:  56.95; ppl:  6.51; xent: 1.87; lr: 0.00010; 9977/9410 tok/s;    131 sec
[2021-04-23 00:55:30,198 INFO] Step 700/50000; acc:  57.41; ppl:  6.26; xent: 1.83; lr: 0.00010; 10127/9467 tok/s;    141 sec
[2021-04-23 00:55:31,014 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 00:55:40,190 INFO] Step 750/50000; acc:  58.94; ppl:  5.87; xent: 1.77; lr: 0.00010; 10031/9498 tok/s;    151 sec
[2021-04-23 00:55:50,352 INFO] Step 800/50000; acc:  58.87; ppl:  5.90; xent: 1.77; lr: 0.00010; 10214/9586 tok/s;    162 sec
[2021-04-23 00:56:00,172 INFO] Step 850/50000; acc:  60.17; ppl:  5.50; xent: 1.70; lr: 0.00010; 10021/9582 tok/s;    171 sec
[2021-04-23 00:56:10,478 INFO] Step 900/50000; acc:  60.42; ppl:  5.34; xent: 1.67; lr: 0.00010; 10118/9514 tok/s;    182 sec
[2021-04-23 00:56:20,394 INFO] Step 950/50000; acc:  61.81; ppl:  5.01; xent: 1.61; lr: 0.00010; 10066/9548 tok/s;    192 sec
[2021-04-23 00:56:30,521 INFO] Step 1000/50000; acc:  61.48; ppl:  4.99; xent: 1.61; lr: 0.00010; 9993/9480 tok/s;    202 sec
[2021-04-23 00:56:40,738 INFO] Step 1050/50000; acc:  62.27; ppl:  4.77; xent: 1.56; lr: 0.00010; 10142/9544 tok/s;    212 sec
[2021-04-23 00:56:50,789 INFO] Step 1100/50000; acc:  63.04; ppl:  4.60; xent: 1.53; lr: 0.00010; 10079/9527 tok/s;    222 sec
[2021-04-23 00:57:01,097 INFO] Step 1150/50000; acc:  63.37; ppl:  4.55; xent: 1.51; lr: 0.00010; 10007/9388 tok/s;    232 sec
[2021-04-23 00:57:11,181 INFO] Step 1200/50000; acc:  63.97; ppl:  4.42; xent: 1.49; lr: 0.00010; 10035/9500 tok/s;    242 sec
[2021-04-23 00:57:21,329 INFO] Step 1250/50000; acc:  64.05; ppl:  4.33; xent: 1.46; lr: 0.00010; 10200/9585 tok/s;    252 sec
[2021-04-23 00:57:31,219 INFO] Step 1300/50000; acc:  64.94; ppl:  4.19; xent: 1.43; lr: 0.00010; 10061/9564 tok/s;    262 sec
[2021-04-23 00:57:41,411 INFO] Step 1350/50000; acc:  65.50; ppl:  4.07; xent: 1.40; lr: 0.00010; 10124/9455 tok/s;    273 sec
[2021-04-23 00:57:51,445 INFO] Step 1400/50000; acc:  66.20; ppl:  3.96; xent: 1.38; lr: 0.00010; 10027/9400 tok/s;    283 sec
[2021-04-23 00:57:59,523 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 00:58:01,409 INFO] Step 1450/50000; acc:  66.66; ppl:  3.85; xent: 1.35; lr: 0.00010; 10089/9568 tok/s;    293 sec
[2021-04-23 00:58:11,647 INFO] Step 1500/50000; acc:  66.81; ppl:  3.83; xent: 1.34; lr: 0.00010; 10083/9441 tok/s;    303 sec
[2021-04-23 00:58:21,624 INFO] Step 1550/50000; acc:  67.50; ppl:  3.72; xent: 1.31; lr: 0.00010; 10034/9477 tok/s;    313 sec
[2021-04-23 00:58:31,649 INFO] Step 1600/50000; acc:  67.34; ppl:  3.72; xent: 1.31; lr: 0.00010; 10234/9777 tok/s;    323 sec
[2021-04-23 00:58:41,651 INFO] Step 1650/50000; acc:  68.39; ppl:  3.56; xent: 1.27; lr: 0.00010; 9944/9374 tok/s;    333 sec
[2021-04-23 00:58:51,935 INFO] Step 1700/50000; acc:  68.64; ppl:  3.50; xent: 1.25; lr: 0.00010; 10092/9523 tok/s;    343 sec
[2021-04-23 00:59:01,910 INFO] Step 1750/50000; acc:  69.13; ppl:  3.43; xent: 1.23; lr: 0.00010; 10066/9571 tok/s;    353 sec
[2021-04-23 00:59:12,088 INFO] Step 1800/50000; acc:  69.39; ppl:  3.38; xent: 1.22; lr: 0.00010; 10100/9520 tok/s;    363 sec
[2021-04-23 00:59:22,383 INFO] Step 1850/50000; acc:  69.72; ppl:  3.33; xent: 1.20; lr: 0.00010; 10012/9420 tok/s;    374 sec
[2021-04-23 00:59:32,488 INFO] Step 1900/50000; acc:  70.28; ppl:  3.26; xent: 1.18; lr: 0.00010; 9877/9349 tok/s;    384 sec
[2021-04-23 00:59:42,801 INFO] Step 1950/50000; acc:  70.21; ppl:  3.26; xent: 1.18; lr: 0.00010; 10059/9415 tok/s;    394 sec
[2021-04-23 00:59:52,819 INFO] Step 2000/50000; acc:  71.12; ppl:  3.14; xent: 1.14; lr: 0.00010; 10085/9574 tok/s;    404 sec
[2021-04-23 01:00:03,041 INFO] Step 2050/50000; acc:  71.07; ppl:  3.15; xent: 1.15; lr: 0.00010; 10184/9492 tok/s;    414 sec
[2021-04-23 01:00:12,853 INFO] Step 2100/50000; acc:  72.13; ppl:  3.03; xent: 1.11; lr: 0.00010; 10058/9505 tok/s;    424 sec
[2021-04-23 01:00:22,949 INFO] Step 2150/50000; acc:  71.92; ppl:  3.01; xent: 1.10; lr: 0.00010; 10107/9544 tok/s;    434 sec
[2021-04-23 01:00:28,201 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 01:00:32,943 INFO] Step 2200/50000; acc:  72.42; ppl:  2.98; xent: 1.09; lr: 0.00010; 10194/9582 tok/s;    444 sec
[2021-04-23 01:00:43,039 INFO] Step 2250/50000; acc:  72.66; ppl:  2.95; xent: 1.08; lr: 0.00010; 9991/9410 tok/s;    454 sec
[2021-04-23 01:00:53,025 INFO] Step 2300/50000; acc:  72.75; ppl:  2.93; xent: 1.07; lr: 0.00010; 10292/9716 tok/s;    464 sec
[2021-04-23 01:01:03,025 INFO] Step 2350/50000; acc:  73.72; ppl:  2.82; xent: 1.04; lr: 0.00010; 9947/9486 tok/s;    474 sec
[2021-04-23 01:01:13,224 INFO] Step 2400/50000; acc:  73.93; ppl:  2.80; xent: 1.03; lr: 0.00010; 10125/9547 tok/s;    484 sec
[2021-04-23 01:01:23,101 INFO] Step 2450/50000; acc:  74.43; ppl:  2.74; xent: 1.01; lr: 0.00010; 10007/9487 tok/s;    494 sec
[2021-04-23 01:01:33,531 INFO] Step 2500/50000; acc:  74.49; ppl:  2.73; xent: 1.00; lr: 0.00010; 10098/9490 tok/s;    505 sec
[2021-04-23 01:01:43,602 INFO] Step 2550/50000; acc:  75.14; ppl:  2.67; xent: 0.98; lr: 0.00010; 10044/9503 tok/s;    515 sec
[2021-04-23 01:01:53,806 INFO] Step 2600/50000; acc:  75.47; ppl:  2.65; xent: 0.98; lr: 0.00010; 9977/9383 tok/s;    525 sec
[2021-04-23 01:02:03,982 INFO] Step 2650/50000; acc:  75.55; ppl:  2.63; xent: 0.97; lr: 0.00010; 10066/9499 tok/s;    535 sec
[2021-04-23 01:02:14,148 INFO] Step 2700/50000; acc:  76.18; ppl:  2.58; xent: 0.95; lr: 0.00010; 9888/9371 tok/s;    545 sec
[2021-04-23 01:02:24,168 INFO] Step 2750/50000; acc:  76.32; ppl:  2.56; xent: 0.94; lr: 0.00010; 10284/9659 tok/s;    555 sec
[2021-04-23 01:02:34,216 INFO] Step 2800/50000; acc:  76.89; ppl:  2.51; xent: 0.92; lr: 0.00010; 10115/9474 tok/s;    565 sec
[2021-04-23 01:02:44,366 INFO] Step 2850/50000; acc:  76.88; ppl:  2.50; xent: 0.92; lr: 0.00010; 10143/9501 tok/s;    576 sec
[2021-04-23 01:02:54,174 INFO] Step 2900/50000; acc:  77.76; ppl:  2.42; xent: 0.89; lr: 0.00010; 10088/9574 tok/s;    585 sec
[2021-04-23 01:02:56,704 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 01:03:04,409 INFO] Step 2950/50000; acc:  77.72; ppl:  2.44; xent: 0.89; lr: 0.00010; 10051/9466 tok/s;    596 sec
[2021-04-23 01:03:14,380 INFO] Step 3000/50000; acc:  78.01; ppl:  2.41; xent: 0.88; lr: 0.00010; 10183/9591 tok/s;    606 sec
[2021-04-23 01:03:24,287 INFO] Step 3050/50000; acc:  78.51; ppl:  2.38; xent: 0.87; lr: 0.00010; 10087/9587 tok/s;    615 sec
[2021-04-23 01:03:34,450 INFO] Step 3100/50000; acc:  79.02; ppl:  2.33; xent: 0.85; lr: 0.00010; 10129/9528 tok/s;    626 sec
[2021-04-23 01:03:44,442 INFO] Step 3150/50000; acc:  79.54; ppl:  2.29; xent: 0.83; lr: 0.00010; 9973/9533 tok/s;    636 sec
[2021-04-23 01:03:54,547 INFO] Step 3200/50000; acc:  79.70; ppl:  2.28; xent: 0.82; lr: 0.00010; 10200/9587 tok/s;    646 sec
[2021-04-23 01:04:04,701 INFO] Step 3250/50000; acc:  80.08; ppl:  2.23; xent: 0.80; lr: 0.00010; 9982/9448 tok/s;    656 sec
[2021-04-23 01:04:14,949 INFO] Step 3300/50000; acc:  79.89; ppl:  2.24; xent: 0.81; lr: 0.00010; 10168/9540 tok/s;    666 sec
[2021-04-23 01:04:24,964 INFO] Step 3350/50000; acc:  80.52; ppl:  2.21; xent: 0.79; lr: 0.00010; 10015/9487 tok/s;    676 sec
[2021-04-23 01:04:35,198 INFO] Step 3400/50000; acc:  80.63; ppl:  2.21; xent: 0.79; lr: 0.00010; 9972/9375 tok/s;    686 sec
[2021-04-23 01:04:45,296 INFO] Step 3450/50000; acc:  81.08; ppl:  2.17; xent: 0.77; lr: 0.00010; 10111/9566 tok/s;    696 sec
[2021-04-23 01:04:55,150 INFO] Step 3500/50000; acc:  81.48; ppl:  2.13; xent: 0.76; lr: 0.00010; 10240/9664 tok/s;    706 sec
[2021-04-23 01:05:05,364 INFO] Step 3550/50000; acc:  81.63; ppl:  2.11; xent: 0.75; lr: 0.00010; 10091/9394 tok/s;    717 sec
[2021-04-23 01:05:15,381 INFO] Step 3600/50000; acc:  81.88; ppl:  2.11; xent: 0.75; lr: 0.00010; 10058/9509 tok/s;    727 sec
[2021-04-23 01:05:18,692 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 01:05:25,621 INFO] Step 3650/50000; acc:  82.13; ppl:  2.09; xent: 0.74; lr: 0.00010; 10151/9521 tok/s;    737 sec
[2021-04-23 01:05:35,513 INFO] Step 3700/50000; acc:  82.46; ppl:  2.08; xent: 0.73; lr: 0.00010; 10037/9513 tok/s;    747 sec
[2021-04-23 01:05:45,520 INFO] Step 3750/50000; acc:  82.37; ppl:  2.08; xent: 0.73; lr: 0.00010; 10198/9622 tok/s;    757 sec
[2021-04-23 01:05:55,548 INFO] Step 3800/50000; acc:  82.92; ppl:  2.03; xent: 0.71; lr: 0.00010; 10065/9553 tok/s;    767 sec
[2021-04-23 01:06:05,516 INFO] Step 3850/50000; acc:  83.20; ppl:  2.00; xent: 0.70; lr: 0.00010; 10070/9555 tok/s;    777 sec
[2021-04-23 01:06:15,663 INFO] Step 3900/50000; acc:  83.49; ppl:  1.99; xent: 0.69; lr: 0.00010; 10116/9484 tok/s;    787 sec
[2021-04-23 01:06:25,750 INFO] Step 3950/50000; acc:  83.79; ppl:  1.97; xent: 0.68; lr: 0.00010; 9961/9493 tok/s;    797 sec
[2021-04-23 01:06:36,059 INFO] Step 4000/50000; acc:  83.86; ppl:  1.95; xent: 0.67; lr: 0.00010; 10204/9577 tok/s;    807 sec
[2021-04-23 01:06:46,083 INFO] Step 4050/50000; acc:  83.90; ppl:  1.95; xent: 0.67; lr: 0.00010; 9910/9389 tok/s;    817 sec
[2021-04-23 01:06:56,310 INFO] Step 4100/50000; acc:  83.97; ppl:  1.96; xent: 0.67; lr: 0.00010; 10194/9579 tok/s;    827 sec
[2021-04-23 01:07:06,384 INFO] Step 4150/50000; acc:  84.27; ppl:  1.94; xent: 0.66; lr: 0.00010; 9934/9395 tok/s;    838 sec
[2021-04-23 01:07:16,461 INFO] Step 4200/50000; acc:  84.50; ppl:  1.91; xent: 0.65; lr: 0.00010; 10119/9578 tok/s;    848 sec
[2021-04-23 01:07:26,527 INFO] Step 4250/50000; acc:  84.72; ppl:  1.89; xent: 0.64; lr: 0.00010; 10231/9568 tok/s;    858 sec
[2021-04-23 01:07:36,545 INFO] Step 4300/50000; acc:  85.18; ppl:  1.87; xent: 0.62; lr: 0.00010; 9976/9404 tok/s;    868 sec
[2021-04-23 01:07:46,637 INFO] Step 4350/50000; acc:  84.87; ppl:  1.88; xent: 0.63; lr: 0.00010; 10190/9532 tok/s;    878 sec
[2021-04-23 01:07:47,076 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 01:07:56,717 INFO] Step 4400/50000; acc:  85.50; ppl:  1.86; xent: 0.62; lr: 0.00010; 10046/9493 tok/s;    888 sec
[2021-04-23 01:08:06,945 INFO] Step 4450/50000; acc:  85.15; ppl:  1.88; xent: 0.63; lr: 0.00010; 10169/9542 tok/s;    898 sec
[2021-04-23 01:08:16,668 INFO] Step 4500/50000; acc:  85.37; ppl:  1.86; xent: 0.62; lr: 0.00010; 10068/9642 tok/s;    908 sec
[2021-04-23 01:08:26,888 INFO] Step 4550/50000; acc:  85.80; ppl:  1.82; xent: 0.60; lr: 0.00010; 10040/9465 tok/s;    918 sec
[2021-04-23 01:08:36,914 INFO] Step 4600/50000; acc:  85.94; ppl:  1.81; xent: 0.59; lr: 0.00010; 10102/9555 tok/s;    928 sec
[2021-04-23 01:08:46,947 INFO] Step 4650/50000; acc:  86.13; ppl:  1.81; xent: 0.59; lr: 0.00010; 9945/9430 tok/s;    938 sec
[2021-04-23 01:08:57,332 INFO] Step 4700/50000; acc:  86.28; ppl:  1.79; xent: 0.58; lr: 0.00010; 10080/9492 tok/s;    948 sec
[2021-04-23 01:09:07,370 INFO] Step 4750/50000; acc:  86.36; ppl:  1.78; xent: 0.58; lr: 0.00010; 10043/9515 tok/s;    959 sec
[2021-04-23 01:09:17,736 INFO] Step 4800/50000; acc:  86.51; ppl:  1.78; xent: 0.58; lr: 0.00010; 9968/9341 tok/s;    969 sec
[2021-04-23 01:09:27,726 INFO] Step 4850/50000; acc:  86.70; ppl:  1.78; xent: 0.57; lr: 0.00010; 9971/9447 tok/s;    979 sec
[2021-04-23 01:09:37,983 INFO] Step 4900/50000; acc:  86.83; ppl:  1.76; xent: 0.57; lr: 0.00010; 10151/9559 tok/s;    989 sec
[2021-04-23 01:09:47,855 INFO] Step 4950/50000; acc:  86.82; ppl:  1.75; xent: 0.56; lr: 0.00010; 10199/9634 tok/s;    999 sec
[2021-04-23 01:09:57,949 INFO] Step 5000/50000; acc:  87.12; ppl:  1.73; xent: 0.55; lr: 0.00010; 10121/9487 tok/s;   1009 sec
[2021-04-23 01:09:57,949 INFO] valid's transforms: TransformPipe()
[2021-04-23 01:09:57,952 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/abstract_methods/valid_fixed.txt, align=None)...
[2021-04-23 01:10:07,594 INFO] Validation perplexity: 1.65393
[2021-04-23 01:10:07,594 INFO] Validation accuracy: 88.5469
[2021-04-23 01:10:07,596 INFO] Saving checkpoint ../models/default_params/control/model_step_5000.pt
[2021-04-23 01:10:18,329 INFO] Step 5050/50000; acc:  87.17; ppl:  1.74; xent: 0.55; lr: 0.00010; 4976/4673 tok/s;   1029 sec
[2021-04-23 01:10:25,980 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 01:10:28,327 INFO] Step 5100/50000; acc:  87.27; ppl:  1.72; xent: 0.54; lr: 0.00010; 10081/9518 tok/s;   1039 sec
[2021-04-23 01:10:38,502 INFO] Step 5150/50000; acc:  87.43; ppl:  1.73; xent: 0.55; lr: 0.00010; 10116/9511 tok/s;   1050 sec
[2021-04-23 01:10:48,545 INFO] Step 5200/50000; acc:  87.41; ppl:  1.72; xent: 0.54; lr: 0.00010; 10066/9485 tok/s;   1060 sec
[2021-04-23 01:10:58,645 INFO] Step 5250/50000; acc:  87.49; ppl:  1.72; xent: 0.54; lr: 0.00010; 10187/9693 tok/s;   1070 sec
[2021-04-23 01:11:08,680 INFO] Step 5300/50000; acc:  87.72; ppl:  1.70; xent: 0.53; lr: 0.00010; 9882/9340 tok/s;   1080 sec
[2021-04-23 01:11:18,777 INFO] Step 5350/50000; acc:  87.98; ppl:  1.68; xent: 0.52; lr: 0.00010; 10117/9563 tok/s;   1090 sec
[2021-04-23 01:11:28,935 INFO] Step 5400/50000; acc:  88.08; ppl:  1.68; xent: 0.52; lr: 0.00010; 10027/9493 tok/s;   1100 sec
[2021-04-23 01:11:39,006 INFO] Step 5450/50000; acc:  88.36; ppl:  1.66; xent: 0.51; lr: 0.00010; 10085/9531 tok/s;   1110 sec
[2021-04-23 01:11:49,340 INFO] Step 5500/50000; acc:  87.88; ppl:  1.68; xent: 0.52; lr: 0.00010; 10045/9457 tok/s;   1121 sec
[2021-04-23 01:11:59,447 INFO] Step 5550/50000; acc:  88.31; ppl:  1.67; xent: 0.51; lr: 0.00010; 9841/9335 tok/s;   1131 sec
[2021-04-23 01:12:09,759 INFO] Step 5600/50000; acc:  88.47; ppl:  1.65; xent: 0.50; lr: 0.00010; 10088/9447 tok/s;   1141 sec
[2021-04-23 01:12:19,646 INFO] Step 5650/50000; acc:  88.47; ppl:  1.65; xent: 0.50; lr: 0.00010; 10037/9561 tok/s;   1151 sec
[2021-04-23 01:12:29,878 INFO] Step 5700/50000; acc:  88.34; ppl:  1.65; xent: 0.50; lr: 0.00010; 10215/9503 tok/s;   1161 sec
[2021-04-23 01:12:39,761 INFO] Step 5750/50000; acc:  88.58; ppl:  1.64; xent: 0.50; lr: 0.00010; 10111/9521 tok/s;   1171 sec
[2021-04-23 01:12:49,788 INFO] Step 5800/50000; acc:  88.73; ppl:  1.63; xent: 0.49; lr: 0.00010; 10110/9556 tok/s;   1181 sec
[2021-04-23 01:12:54,737 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 01:13:00,009 INFO] Step 5850/50000; acc:  88.60; ppl:  1.64; xent: 0.50; lr: 0.00010; 10034/9445 tok/s;   1191 sec
[2021-04-23 01:13:10,112 INFO] Step 5900/50000; acc:  88.81; ppl:  1.63; xent: 0.49; lr: 0.00010; 10014/9415 tok/s;   1201 sec
[2021-04-23 01:13:20,062 INFO] Step 5950/50000; acc:  88.71; ppl:  1.63; xent: 0.49; lr: 0.00010; 10284/9729 tok/s;   1211 sec
[2021-04-23 01:13:30,163 INFO] Step 6000/50000; acc:  89.11; ppl:  1.61; xent: 0.48; lr: 0.00010; 9963/9470 tok/s;   1221 sec
[2021-04-23 01:13:40,460 INFO] Step 6050/50000; acc:  89.07; ppl:  1.61; xent: 0.48; lr: 0.00010; 10044/9474 tok/s;   1232 sec
[2021-04-23 01:13:50,214 INFO] Step 6100/50000; acc:  89.12; ppl:  1.61; xent: 0.48; lr: 0.00010; 10083/9556 tok/s;   1241 sec
[2021-04-23 01:14:00,541 INFO] Step 6150/50000; acc:  89.39; ppl:  1.59; xent: 0.46; lr: 0.00010; 10067/9489 tok/s;   1252 sec
[2021-04-23 01:14:10,645 INFO] Step 6200/50000; acc:  89.45; ppl:  1.59; xent: 0.46; lr: 0.00010; 10136/9560 tok/s;   1262 sec
[2021-04-23 01:14:20,761 INFO] Step 6250/50000; acc:  89.24; ppl:  1.60; xent: 0.47; lr: 0.00010; 9928/9365 tok/s;   1272 sec
[2021-04-23 01:14:30,967 INFO] Step 6300/50000; acc:  89.47; ppl:  1.59; xent: 0.46; lr: 0.00010; 10137/9540 tok/s;   1282 sec
[2021-04-23 01:14:41,016 INFO] Step 6350/50000; acc:  89.33; ppl:  1.59; xent: 0.46; lr: 0.00010; 9942/9397 tok/s;   1292 sec
[2021-04-23 01:14:51,071 INFO] Step 6400/50000; acc:  89.43; ppl:  1.58; xent: 0.46; lr: 0.00010; 10300/9709 tok/s;   1302 sec
[2021-04-23 01:15:01,031 INFO] Step 6450/50000; acc:  89.61; ppl:  1.58; xent: 0.46; lr: 0.00010; 10021/9430 tok/s;   1312 sec
[2021-04-23 01:15:11,210 INFO] Step 6500/50000; acc:  89.64; ppl:  1.57; xent: 0.45; lr: 0.00010; 10160/9495 tok/s;   1322 sec
[2021-04-23 01:15:21,072 INFO] Step 6550/50000; acc:  89.54; ppl:  1.58; xent: 0.45; lr: 0.00010; 10185/9637 tok/s;   1332 sec
[2021-04-23 01:15:23,169 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 01:15:31,148 INFO] Step 6600/50000; acc:  89.73; ppl:  1.57; xent: 0.45; lr: 0.00010; 10117/9547 tok/s;   1342 sec
[2021-04-23 01:15:41,228 INFO] Step 6650/50000; acc:  89.82; ppl:  1.57; xent: 0.45; lr: 0.00010; 10133/9547 tok/s;   1352 sec
[2021-04-23 01:15:51,145 INFO] Step 6700/50000; acc:  89.68; ppl:  1.57; xent: 0.45; lr: 0.00010; 10085/9560 tok/s;   1362 sec
[2021-04-23 01:16:01,279 INFO] Step 6750/50000; acc:  89.95; ppl:  1.55; xent: 0.44; lr: 0.00010; 10157/9584 tok/s;   1372 sec
[2021-04-23 01:16:11,330 INFO] Step 6800/50000; acc:  89.92; ppl:  1.55; xent: 0.44; lr: 0.00010; 10015/9524 tok/s;   1382 sec
[2021-04-23 01:16:21,486 INFO] Step 6850/50000; acc:  90.12; ppl:  1.55; xent: 0.44; lr: 0.00010; 10169/9565 tok/s;   1393 sec
[2021-04-23 01:16:31,603 INFO] Step 6900/50000; acc:  90.45; ppl:  1.52; xent: 0.42; lr: 0.00010; 9980/9434 tok/s;   1403 sec
[2021-04-23 01:16:41,752 INFO] Step 6950/50000; acc:  89.89; ppl:  1.55; xent: 0.44; lr: 0.00010; 10114/9530 tok/s;   1413 sec
[2021-04-23 01:16:51,887 INFO] Step 7000/50000; acc:  90.06; ppl:  1.55; xent: 0.44; lr: 0.00010; 10024/9436 tok/s;   1423 sec
[2021-04-23 01:17:02,078 INFO] Step 7050/50000; acc:  90.15; ppl:  1.54; xent: 0.43; lr: 0.00010; 9889/9368 tok/s;   1433 sec
[2021-04-23 01:17:12,207 INFO] Step 7100/50000; acc:  90.19; ppl:  1.54; xent: 0.43; lr: 0.00010; 10161/9592 tok/s;   1443 sec
[2021-04-23 01:17:22,066 INFO] Step 7150/50000; acc:  90.43; ppl:  1.53; xent: 0.42; lr: 0.00010; 10202/9620 tok/s;   1453 sec
[2021-04-23 01:17:32,338 INFO] Step 7200/50000; acc:  90.50; ppl:  1.52; xent: 0.42; lr: 0.00010; 10058/9398 tok/s;   1464 sec
[2021-04-23 01:17:42,181 INFO] Step 7250/50000; acc:  90.23; ppl:  1.54; xent: 0.43; lr: 0.00010; 10044/9507 tok/s;   1473 sec
[2021-04-23 01:17:45,192 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 01:17:52,437 INFO] Step 7300/50000; acc:  90.40; ppl:  1.52; xent: 0.42; lr: 0.00010; 10220/9553 tok/s;   1484 sec
[2021-04-23 01:18:02,467 INFO] Step 7350/50000; acc:  90.37; ppl:  1.53; xent: 0.43; lr: 0.00010; 10000/9490 tok/s;   1494 sec
[2021-04-23 01:18:12,391 INFO] Step 7400/50000; acc:  90.42; ppl:  1.52; xent: 0.42; lr: 0.00010; 10187/9587 tok/s;   1504 sec
[2021-04-23 01:18:22,421 INFO] Step 7450/50000; acc:  90.44; ppl:  1.52; xent: 0.42; lr: 0.00010; 10153/9639 tok/s;   1514 sec
[2021-04-23 01:18:32,417 INFO] Step 7500/50000; acc:  90.67; ppl:  1.51; xent: 0.41; lr: 0.00010; 10054/9533 tok/s;   1524 sec
[2021-04-23 01:18:42,565 INFO] Step 7550/50000; acc:  90.73; ppl:  1.51; xent: 0.41; lr: 0.00010; 10086/9486 tok/s;   1534 sec
[2021-04-23 01:18:52,731 INFO] Step 7600/50000; acc:  90.76; ppl:  1.50; xent: 0.41; lr: 0.00010; 10014/9509 tok/s;   1544 sec
[2021-04-23 01:19:03,014 INFO] Step 7650/50000; acc:  90.92; ppl:  1.49; xent: 0.40; lr: 0.00010; 10231/9577 tok/s;   1554 sec
[2021-04-23 01:19:12,968 INFO] Step 7700/50000; acc:  90.51; ppl:  1.51; xent: 0.41; lr: 0.00010; 9940/9449 tok/s;   1564 sec
[2021-04-23 01:19:23,125 INFO] Step 7750/50000; acc:  90.86; ppl:  1.50; xent: 0.41; lr: 0.00010; 10112/9551 tok/s;   1574 sec
[2021-04-23 01:19:33,245 INFO] Step 7800/50000; acc:  90.87; ppl:  1.50; xent: 0.40; lr: 0.00010; 10017/9427 tok/s;   1584 sec
[2021-04-23 01:19:43,185 INFO] Step 7850/50000; acc:  90.74; ppl:  1.50; xent: 0.40; lr: 0.00010; 10128/9600 tok/s;   1594 sec
[2021-04-23 01:19:53,360 INFO] Step 7900/50000; acc:  90.88; ppl:  1.49; xent: 0.40; lr: 0.00010; 10228/9543 tok/s;   1605 sec
[2021-04-23 01:20:03,310 INFO] Step 7950/50000; acc:  91.12; ppl:  1.48; xent: 0.39; lr: 0.00010; 9963/9382 tok/s;   1614 sec
[2021-04-23 01:20:13,510 INFO] Step 8000/50000; acc:  90.77; ppl:  1.50; xent: 0.41; lr: 0.00010; 10147/9537 tok/s;   1625 sec
[2021-04-23 01:20:13,519 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 01:20:23,492 INFO] Step 8050/50000; acc:  90.92; ppl:  1.49; xent: 0.40; lr: 0.00010; 9961/9407 tok/s;   1635 sec
[2021-04-23 01:20:33,732 INFO] Step 8100/50000; acc:  90.97; ppl:  1.49; xent: 0.40; lr: 0.00010; 10209/9591 tok/s;   1645 sec
[2021-04-23 01:20:43,533 INFO] Step 8150/50000; acc:  90.88; ppl:  1.50; xent: 0.41; lr: 0.00010; 10105/9663 tok/s;   1655 sec
[2021-04-23 01:20:53,769 INFO] Step 8200/50000; acc:  91.09; ppl:  1.48; xent: 0.39; lr: 0.00010; 9961/9367 tok/s;   1665 sec
[2021-04-23 01:21:03,810 INFO] Step 8250/50000; acc:  91.20; ppl:  1.48; xent: 0.39; lr: 0.00010; 10168/9637 tok/s;   1675 sec
[2021-04-23 01:21:13,832 INFO] Step 8300/50000; acc:  91.25; ppl:  1.48; xent: 0.39; lr: 0.00010; 9976/9454 tok/s;   1685 sec
[2021-04-23 01:21:24,180 INFO] Step 8350/50000; acc:  91.36; ppl:  1.46; xent: 0.38; lr: 0.00010; 10077/9486 tok/s;   1695 sec
[2021-04-23 01:21:34,315 INFO] Step 8400/50000; acc:  90.98; ppl:  1.48; xent: 0.39; lr: 0.00010; 10057/9508 tok/s;   1705 sec
[2021-04-23 01:21:44,705 INFO] Step 8450/50000; acc:  91.32; ppl:  1.46; xent: 0.38; lr: 0.00010; 9935/9310 tok/s;   1716 sec
[2021-04-23 01:21:54,694 INFO] Step 8500/50000; acc:  91.23; ppl:  1.48; xent: 0.39; lr: 0.00010; 9957/9436 tok/s;   1726 sec
[2021-04-23 01:22:04,859 INFO] Step 8550/50000; acc:  91.36; ppl:  1.46; xent: 0.38; lr: 0.00010; 10090/9532 tok/s;   1736 sec
[2021-04-23 01:22:14,836 INFO] Step 8600/50000; acc:  91.19; ppl:  1.47; xent: 0.38; lr: 0.00010; 10230/9593 tok/s;   1746 sec
[2021-04-23 01:22:24,790 INFO] Step 8650/50000; acc:  91.33; ppl:  1.46; xent: 0.38; lr: 0.00010; 10087/9533 tok/s;   1756 sec
[2021-04-23 01:22:34,924 INFO] Step 8700/50000; acc:  91.36; ppl:  1.46; xent: 0.38; lr: 0.00010; 10116/9461 tok/s;   1766 sec
[2021-04-23 01:22:42,202 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 01:22:44,954 INFO] Step 8750/50000; acc:  91.33; ppl:  1.46; xent: 0.38; lr: 0.00010; 10013/9462 tok/s;   1776 sec
[2021-04-23 01:22:55,185 INFO] Step 8800/50000; acc:  91.43; ppl:  1.46; xent: 0.38; lr: 0.00010; 10105/9500 tok/s;   1786 sec
[2021-04-23 01:23:05,104 INFO] Step 8850/50000; acc:  91.29; ppl:  1.47; xent: 0.39; lr: 0.00010; 10003/9461 tok/s;   1796 sec
[2021-04-23 01:23:15,292 INFO] Step 8900/50000; acc:  91.42; ppl:  1.46; xent: 0.38; lr: 0.00010; 10170/9672 tok/s;   1806 sec
[2021-04-23 01:23:25,377 INFO] Step 8950/50000; acc:  91.37; ppl:  1.46; xent: 0.38; lr: 0.00010; 9931/9377 tok/s;   1817 sec
[2021-04-23 01:23:35,401 INFO] Step 9000/50000; acc:  91.53; ppl:  1.46; xent: 0.38; lr: 0.00010; 10110/9565 tok/s;   1827 sec
[2021-04-23 01:23:45,615 INFO] Step 9050/50000; acc:  91.60; ppl:  1.45; xent: 0.37; lr: 0.00010; 10069/9541 tok/s;   1837 sec
[2021-04-23 01:23:55,732 INFO] Step 9100/50000; acc:  91.75; ppl:  1.44; xent: 0.36; lr: 0.00010; 10055/9495 tok/s;   1847 sec
[2021-04-23 01:24:06,027 INFO] Step 9150/50000; acc:  91.46; ppl:  1.45; xent: 0.37; lr: 0.00010; 10052/9432 tok/s;   1857 sec
[2021-04-23 01:24:16,149 INFO] Step 9200/50000; acc:  91.57; ppl:  1.45; xent: 0.37; lr: 0.00010; 9943/9396 tok/s;   1867 sec
[2021-04-23 01:24:26,460 INFO] Step 9250/50000; acc:  91.73; ppl:  1.44; xent: 0.36; lr: 0.00010; 10079/9464 tok/s;   1878 sec
[2021-04-23 01:24:36,268 INFO] Step 9300/50000; acc:  91.53; ppl:  1.45; xent: 0.37; lr: 0.00010; 10095/9602 tok/s;   1887 sec
[2021-04-23 01:24:46,384 INFO] Step 9350/50000; acc:  91.70; ppl:  1.44; xent: 0.37; lr: 0.00010; 10186/9532 tok/s;   1898 sec
[2021-04-23 01:24:56,381 INFO] Step 9400/50000; acc:  91.66; ppl:  1.45; xent: 0.37; lr: 0.00010; 10128/9466 tok/s;   1908 sec
[2021-04-23 01:25:06,308 INFO] Step 9450/50000; acc:  91.65; ppl:  1.44; xent: 0.36; lr: 0.00010; 10066/9567 tok/s;   1917 sec
[2021-04-23 01:25:10,893 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 01:25:16,547 INFO] Step 9500/50000; acc:  91.63; ppl:  1.45; xent: 0.37; lr: 0.00010; 10113/9485 tok/s;   1928 sec
[2021-04-23 01:25:26,532 INFO] Step 9550/50000; acc:  91.71; ppl:  1.44; xent: 0.36; lr: 0.00010; 10068/9477 tok/s;   1938 sec
[2021-04-23 01:25:36,553 INFO] Step 9600/50000; acc:  91.70; ppl:  1.44; xent: 0.37; lr: 0.00010; 10275/9751 tok/s;   1948 sec
[2021-04-23 01:25:46,548 INFO] Step 9650/50000; acc:  91.64; ppl:  1.45; xent: 0.37; lr: 0.00010; 9884/9410 tok/s;   1958 sec
[2021-04-23 01:25:56,864 INFO] Step 9700/50000; acc:  91.63; ppl:  1.44; xent: 0.36; lr: 0.00010; 10076/9518 tok/s;   1968 sec
[2021-04-23 01:26:06,761 INFO] Step 9750/50000; acc:  91.73; ppl:  1.44; xent: 0.37; lr: 0.00010; 10065/9518 tok/s;   1978 sec
[2021-04-23 01:26:16,969 INFO] Step 9800/50000; acc:  92.13; ppl:  1.42; xent: 0.35; lr: 0.00010; 10111/9528 tok/s;   1988 sec
[2021-04-23 01:26:27,135 INFO] Step 9850/50000; acc:  91.86; ppl:  1.43; xent: 0.36; lr: 0.00010; 10138/9530 tok/s;   1998 sec
[2021-04-23 01:26:37,220 INFO] Step 9900/50000; acc:  91.87; ppl:  1.43; xent: 0.36; lr: 0.00010; 9958/9404 tok/s;   2008 sec
[2021-04-23 01:26:47,467 INFO] Step 9950/50000; acc:  91.81; ppl:  1.43; xent: 0.36; lr: 0.00010; 10082/9511 tok/s;   2019 sec
[2021-04-23 01:26:57,572 INFO] Step 10000/50000; acc:  91.81; ppl:  1.43; xent: 0.36; lr: 0.00010; 10001/9452 tok/s;   2029 sec
[2021-04-23 01:26:57,575 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/abstract_methods/valid_fixed.txt, align=None)...
[2021-04-23 01:27:07,195 INFO] Validation perplexity: 1.40688
[2021-04-23 01:27:07,195 INFO] Validation accuracy: 92.4207
[2021-04-23 01:27:07,197 INFO] Saving checkpoint ../models/default_params/control/model_step_10000.pt
[2021-04-23 01:27:17,867 INFO] Step 10050/50000; acc:  91.89; ppl:  1.43; xent: 0.36; lr: 0.00010; 5108/4801 tok/s;   2049 sec
[2021-04-23 01:27:27,793 INFO] Step 10100/50000; acc:  91.97; ppl:  1.42; xent: 0.35; lr: 0.00010; 10014/9458 tok/s;   2059 sec
[2021-04-23 01:27:37,837 INFO] Step 10150/50000; acc:  91.93; ppl:  1.42; xent: 0.35; lr: 0.00010; 10152/9485 tok/s;   2069 sec
[2021-04-23 01:27:47,782 INFO] Step 10200/50000; acc:  91.82; ppl:  1.43; xent: 0.36; lr: 0.00010; 10235/9622 tok/s;   2079 sec
[2021-04-23 01:27:49,437 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 01:27:57,794 INFO] Step 10250/50000; acc:  91.97; ppl:  1.42; xent: 0.35; lr: 0.00010; 10049/9550 tok/s;   2089 sec
[2021-04-23 01:28:07,895 INFO] Step 10300/50000; acc:  91.93; ppl:  1.43; xent: 0.36; lr: 0.00010; 10195/9547 tok/s;   2099 sec
[2021-04-23 01:28:17,777 INFO] Step 10350/50000; acc:  91.75; ppl:  1.43; xent: 0.36; lr: 0.00010; 10074/9586 tok/s;   2109 sec
[2021-04-23 01:28:27,934 INFO] Step 10400/50000; acc:  91.91; ppl:  1.42; xent: 0.35; lr: 0.00010; 10159/9603 tok/s;   2119 sec
[2021-04-23 01:28:37,843 INFO] Step 10450/50000; acc:  91.91; ppl:  1.43; xent: 0.36; lr: 0.00010; 9985/9485 tok/s;   2129 sec
[2021-04-23 01:28:48,110 INFO] Step 10500/50000; acc:  92.22; ppl:  1.41; xent: 0.35; lr: 0.00010; 10136/9573 tok/s;   2139 sec
[2021-04-23 01:28:58,302 INFO] Step 10550/50000; acc:  92.34; ppl:  1.40; xent: 0.34; lr: 0.00010; 10021/9429 tok/s;   2149 sec
[2021-04-23 01:29:08,372 INFO] Step 10600/50000; acc:  91.97; ppl:  1.42; xent: 0.35; lr: 0.00010; 10093/9498 tok/s;   2160 sec
[2021-04-23 01:29:18,580 INFO] Step 10650/50000; acc:  92.07; ppl:  1.42; xent: 0.35; lr: 0.00010; 10049/9471 tok/s;   2170 sec
[2021-04-23 01:29:28,796 INFO] Step 10700/50000; acc:  91.99; ppl:  1.42; xent: 0.35; lr: 0.00010; 9865/9354 tok/s;   2180 sec
[2021-04-23 01:29:38,827 INFO] Step 10750/50000; acc:  92.13; ppl:  1.41; xent: 0.35; lr: 0.00010; 10226/9673 tok/s;   2190 sec
[2021-04-23 01:29:48,833 INFO] Step 10800/50000; acc:  92.30; ppl:  1.41; xent: 0.34; lr: 0.00010; 10187/9544 tok/s;   2200 sec
[2021-04-23 01:29:59,118 INFO] Step 10850/50000; acc:  92.41; ppl:  1.40; xent: 0.34; lr: 0.00010; 10053/9389 tok/s;   2210 sec
[2021-04-23 01:30:08,916 INFO] Step 10900/50000; acc:  91.96; ppl:  1.42; xent: 0.35; lr: 0.00010; 10061/9541 tok/s;   2220 sec
[2021-04-23 01:30:11,460 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 01:30:19,092 INFO] Step 10950/50000; acc:  92.19; ppl:  1.41; xent: 0.35; lr: 0.00010; 10131/9503 tok/s;   2230 sec
[2021-04-23 01:30:29,116 INFO] Step 11000/50000; acc:  92.21; ppl:  1.41; xent: 0.34; lr: 0.00010; 10149/9578 tok/s;   2240 sec
[2021-04-23 01:30:38,976 INFO] Step 11050/50000; acc:  92.03; ppl:  1.41; xent: 0.35; lr: 0.00010; 10099/9572 tok/s;   2250 sec
[2021-04-23 01:30:49,097 INFO] Step 11100/50000; acc:  92.17; ppl:  1.41; xent: 0.34; lr: 0.00010; 10167/9603 tok/s;   2260 sec
[2021-04-23 01:30:59,031 INFO] Step 11150/50000; acc:  92.11; ppl:  1.41; xent: 0.34; lr: 0.00010; 10055/9559 tok/s;   2270 sec
[2021-04-23 01:31:09,186 INFO] Step 11200/50000; acc:  92.29; ppl:  1.41; xent: 0.34; lr: 0.00010; 10123/9530 tok/s;   2280 sec
[2021-04-23 01:31:19,164 INFO] Step 11250/50000; acc:  92.40; ppl:  1.40; xent: 0.34; lr: 0.00010; 10037/9574 tok/s;   2290 sec
[2021-04-23 01:31:29,493 INFO] Step 11300/50000; acc:  92.50; ppl:  1.39; xent: 0.33; lr: 0.00010; 10242/9563 tok/s;   2301 sec
[2021-04-23 01:31:39,500 INFO] Step 11350/50000; acc:  91.98; ppl:  1.41; xent: 0.35; lr: 0.00010; 9986/9471 tok/s;   2311 sec
[2021-04-23 01:31:49,565 INFO] Step 11400/50000; acc:  92.43; ppl:  1.40; xent: 0.34; lr: 0.00010; 10134/9561 tok/s;   2321 sec
[2021-04-23 01:31:59,741 INFO] Step 11450/50000; acc:  92.28; ppl:  1.40; xent: 0.34; lr: 0.00010; 10046/9462 tok/s;   2331 sec
[2021-04-23 01:32:09,706 INFO] Step 11500/50000; acc:  92.21; ppl:  1.40; xent: 0.34; lr: 0.00010; 10116/9564 tok/s;   2341 sec
[2021-04-23 01:32:19,903 INFO] Step 11550/50000; acc:  92.34; ppl:  1.39; xent: 0.33; lr: 0.00010; 10171/9525 tok/s;   2351 sec
[2021-04-23 01:32:29,828 INFO] Step 11600/50000; acc:  92.43; ppl:  1.39; xent: 0.33; lr: 0.00010; 10108/9496 tok/s;   2361 sec
[2021-04-23 01:32:39,557 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 01:32:40,059 INFO] Step 11650/50000; acc:  92.23; ppl:  1.40; xent: 0.33; lr: 0.00010; 10133/9514 tok/s;   2371 sec
[2021-04-23 01:32:49,975 INFO] Step 11700/50000; acc:  92.31; ppl:  1.40; xent: 0.34; lr: 0.00010; 9980/9421 tok/s;   2381 sec
[2021-04-23 01:33:00,192 INFO] Step 11750/50000; acc:  92.36; ppl:  1.40; xent: 0.34; lr: 0.00010; 10072/9504 tok/s;   2391 sec
[2021-04-23 01:33:10,057 INFO] Step 11800/50000; acc:  92.23; ppl:  1.41; xent: 0.34; lr: 0.00010; 10184/9713 tok/s;   2401 sec
[2021-04-23 01:33:20,154 INFO] Step 11850/50000; acc:  92.33; ppl:  1.40; xent: 0.33; lr: 0.00010; 9946/9369 tok/s;   2411 sec
[2021-04-23 01:33:30,309 INFO] Step 11900/50000; acc:  92.46; ppl:  1.39; xent: 0.33; lr: 0.00010; 10164/9621 tok/s;   2421 sec
[2021-04-23 01:33:40,305 INFO] Step 11950/50000; acc:  92.46; ppl:  1.40; xent: 0.33; lr: 0.00010; 9969/9452 tok/s;   2431 sec
[2021-04-23 01:33:50,638 INFO] Step 12000/50000; acc:  92.74; ppl:  1.38; xent: 0.32; lr: 0.00010; 10134/9535 tok/s;   2442 sec
[2021-04-23 01:34:00,650 INFO] Step 12050/50000; acc:  92.30; ppl:  1.39; xent: 0.33; lr: 0.00010; 9996/9451 tok/s;   2452 sec
[2021-04-23 01:34:11,023 INFO] Step 12100/50000; acc:  92.42; ppl:  1.39; xent: 0.33; lr: 0.00010; 9993/9379 tok/s;   2462 sec
[2021-04-23 01:34:21,107 INFO] Step 12150/50000; acc:  92.39; ppl:  1.39; xent: 0.33; lr: 0.00010; 9989/9450 tok/s;   2472 sec
[2021-04-23 01:34:31,236 INFO] Step 12200/50000; acc:  92.44; ppl:  1.39; xent: 0.33; lr: 0.00010; 10039/9488 tok/s;   2482 sec
[2021-04-23 01:34:41,314 INFO] Step 12250/50000; acc:  92.47; ppl:  1.38; xent: 0.33; lr: 0.00010; 10222/9562 tok/s;   2492 sec
[2021-04-23 01:34:51,272 INFO] Step 12300/50000; acc:  92.58; ppl:  1.38; xent: 0.32; lr: 0.00010; 10070/9502 tok/s;   2502 sec
[2021-04-23 01:35:01,395 INFO] Step 12350/50000; acc:  92.55; ppl:  1.38; xent: 0.32; lr: 0.00010; 10103/9478 tok/s;   2513 sec
[2021-04-23 01:35:08,230 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 01:35:11,446 INFO] Step 12400/50000; acc:  92.44; ppl:  1.39; xent: 0.33; lr: 0.00010; 10104/9528 tok/s;   2523 sec
[2021-04-23 01:35:21,674 INFO] Step 12450/50000; acc:  92.57; ppl:  1.39; xent: 0.33; lr: 0.00010; 10143/9538 tok/s;   2533 sec
[2021-04-23 01:35:31,468 INFO] Step 12500/50000; acc:  92.39; ppl:  1.39; xent: 0.33; lr: 0.00010; 10082/9560 tok/s;   2543 sec
[2021-04-23 01:35:41,537 INFO] Step 12550/50000; acc:  92.43; ppl:  1.39; xent: 0.33; lr: 0.00010; 10144/9645 tok/s;   2553 sec
[2021-04-23 01:35:51,602 INFO] Step 12600/50000; acc:  92.50; ppl:  1.39; xent: 0.33; lr: 0.00010; 10068/9465 tok/s;   2563 sec
[2021-04-23 01:36:01,591 INFO] Step 12650/50000; acc:  92.47; ppl:  1.39; xent: 0.33; lr: 0.00010; 10012/9553 tok/s;   2573 sec
[2021-04-23 01:36:11,874 INFO] Step 12700/50000; acc:  92.67; ppl:  1.38; xent: 0.32; lr: 0.00010; 10107/9516 tok/s;   2583 sec
[2021-04-23 01:36:21,974 INFO] Step 12750/50000; acc:  92.70; ppl:  1.37; xent: 0.32; lr: 0.00010; 10007/9479 tok/s;   2593 sec
[2021-04-23 01:36:32,288 INFO] Step 12800/50000; acc:  92.42; ppl:  1.39; xent: 0.33; lr: 0.00010; 10075/9472 tok/s;   2603 sec
[2021-04-23 01:36:42,316 INFO] Step 12850/50000; acc:  92.54; ppl:  1.39; xent: 0.33; lr: 0.00010; 9863/9334 tok/s;   2613 sec
[2021-04-23 01:36:52,651 INFO] Step 12900/50000; acc:  92.76; ppl:  1.37; xent: 0.31; lr: 0.00010; 10102/9472 tok/s;   2624 sec
[2021-04-23 01:37:02,534 INFO] Step 12950/50000; acc:  92.64; ppl:  1.38; xent: 0.32; lr: 0.00010; 10161/9596 tok/s;   2634 sec
[2021-04-23 01:37:12,553 INFO] Step 13000/50000; acc:  92.66; ppl:  1.38; xent: 0.32; lr: 0.00010; 10187/9580 tok/s;   2644 sec
[2021-04-23 01:37:22,616 INFO] Step 13050/50000; acc:  92.67; ppl:  1.38; xent: 0.32; lr: 0.00010; 10139/9482 tok/s;   2654 sec
[2021-04-23 01:37:32,607 INFO] Step 13100/50000; acc:  92.59; ppl:  1.38; xent: 0.32; lr: 0.00010; 10019/9499 tok/s;   2664 sec
[2021-04-23 01:37:36,737 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 01:37:42,791 INFO] Step 13150/50000; acc:  92.59; ppl:  1.38; xent: 0.32; lr: 0.00010; 10131/9531 tok/s;   2674 sec
[2021-04-23 01:37:52,830 INFO] Step 13200/50000; acc:  92.67; ppl:  1.38; xent: 0.32; lr: 0.00010; 10125/9486 tok/s;   2684 sec
[2021-04-23 01:38:02,926 INFO] Step 13250/50000; acc:  92.66; ppl:  1.37; xent: 0.32; lr: 0.00010; 10220/9685 tok/s;   2694 sec
[2021-04-23 01:38:12,886 INFO] Step 13300/50000; acc:  92.50; ppl:  1.38; xent: 0.33; lr: 0.00010; 9885/9430 tok/s;   2704 sec
[2021-04-23 01:38:23,046 INFO] Step 13350/50000; acc:  92.60; ppl:  1.38; xent: 0.32; lr: 0.00010; 10077/9528 tok/s;   2714 sec
[2021-04-23 01:38:33,080 INFO] Step 13400/50000; acc:  92.59; ppl:  1.38; xent: 0.32; lr: 0.00010; 10061/9495 tok/s;   2724 sec
[2021-04-23 01:38:43,268 INFO] Step 13450/50000; acc:  92.94; ppl:  1.36; xent: 0.31; lr: 0.00010; 10001/9464 tok/s;   2734 sec
[2021-04-23 01:38:53,468 INFO] Step 13500/50000; acc:  92.75; ppl:  1.37; xent: 0.31; lr: 0.00010; 10179/9536 tok/s;   2745 sec
[2021-04-23 01:39:03,581 INFO] Step 13550/50000; acc:  92.64; ppl:  1.37; xent: 0.32; lr: 0.00010; 9887/9386 tok/s;   2755 sec
[2021-04-23 01:39:13,833 INFO] Step 13600/50000; acc:  92.75; ppl:  1.37; xent: 0.31; lr: 0.00010; 10130/9502 tok/s;   2765 sec
[2021-04-23 01:39:23,763 INFO] Step 13650/50000; acc:  92.69; ppl:  1.38; xent: 0.32; lr: 0.00010; 9991/9497 tok/s;   2775 sec
[2021-04-23 01:39:33,926 INFO] Step 13700/50000; acc:  92.90; ppl:  1.36; xent: 0.31; lr: 0.00010; 10270/9635 tok/s;   2785 sec
[2021-04-23 01:39:43,936 INFO] Step 13750/50000; acc:  92.74; ppl:  1.36; xent: 0.31; lr: 0.00010; 10026/9434 tok/s;   2795 sec
[2021-04-23 01:39:53,903 INFO] Step 13800/50000; acc:  92.80; ppl:  1.37; xent: 0.31; lr: 0.00010; 10171/9532 tok/s;   2805 sec
[2021-04-23 01:40:04,000 INFO] Step 13850/50000; acc:  92.71; ppl:  1.37; xent: 0.31; lr: 0.00010; 10148/9547 tok/s;   2815 sec
[2021-04-23 01:40:05,228 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 01:40:14,014 INFO] Step 13900/50000; acc:  92.85; ppl:  1.37; xent: 0.32; lr: 0.00010; 10061/9558 tok/s;   2825 sec
[2021-04-23 01:40:24,103 INFO] Step 13950/50000; acc:  92.77; ppl:  1.37; xent: 0.31; lr: 0.00010; 10173/9526 tok/s;   2835 sec
[2021-04-23 01:40:34,044 INFO] Step 14000/50000; acc:  92.65; ppl:  1.38; xent: 0.32; lr: 0.00010; 10117/9628 tok/s;   2845 sec
[2021-04-23 01:40:44,254 INFO] Step 14050/50000; acc:  92.69; ppl:  1.36; xent: 0.31; lr: 0.00010; 10150/9573 tok/s;   2855 sec
[2021-04-23 01:40:54,034 INFO] Step 14100/50000; acc:  92.69; ppl:  1.37; xent: 0.32; lr: 0.00010; 10059/9558 tok/s;   2865 sec
[2021-04-23 01:41:04,219 INFO] Step 14150/50000; acc:  92.93; ppl:  1.36; xent: 0.31; lr: 0.00010; 10085/9528 tok/s;   2875 sec
[2021-04-23 01:41:14,531 INFO] Step 14200/50000; acc:  93.11; ppl:  1.35; xent: 0.30; lr: 0.00010; 10027/9436 tok/s;   2886 sec
[2021-04-23 01:41:24,557 INFO] Step 14250/50000; acc:  92.68; ppl:  1.37; xent: 0.31; lr: 0.00010; 10007/9475 tok/s;   2896 sec
[2021-04-23 01:41:34,790 INFO] Step 14300/50000; acc:  92.84; ppl:  1.36; xent: 0.31; lr: 0.00010; 10103/9469 tok/s;   2906 sec
[2021-04-23 01:41:44,929 INFO] Step 14350/50000; acc:  92.77; ppl:  1.36; xent: 0.31; lr: 0.00010; 9891/9383 tok/s;   2916 sec
[2021-04-23 01:41:54,985 INFO] Step 14400/50000; acc:  92.85; ppl:  1.36; xent: 0.31; lr: 0.00010; 10254/9703 tok/s;   2926 sec
[2021-04-23 01:42:04,882 INFO] Step 14450/50000; acc:  92.83; ppl:  1.36; xent: 0.31; lr: 0.00010; 10119/9507 tok/s;   2936 sec
[2021-04-23 01:42:15,157 INFO] Step 14500/50000; acc:  93.09; ppl:  1.35; xent: 0.30; lr: 0.00010; 10099/9424 tok/s;   2946 sec
[2021-04-23 01:42:25,126 INFO] Step 14550/50000; acc:  92.64; ppl:  1.37; xent: 0.31; lr: 0.00010; 10011/9443 tok/s;   2956 sec
[2021-04-23 01:42:27,229 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 01:42:35,201 INFO] Step 14600/50000; acc:  92.95; ppl:  1.36; xent: 0.31; lr: 0.00010; 10146/9565 tok/s;   2966 sec
[2021-04-23 01:42:45,345 INFO] Step 14650/50000; acc:  92.95; ppl:  1.36; xent: 0.31; lr: 0.00010; 10115/9513 tok/s;   2977 sec
[2021-04-23 01:42:55,239 INFO] Step 14700/50000; acc:  92.76; ppl:  1.36; xent: 0.31; lr: 0.00010; 10076/9567 tok/s;   2986 sec
[2021-04-23 01:43:05,375 INFO] Step 14750/50000; acc:  92.84; ppl:  1.36; xent: 0.31; lr: 0.00010; 10141/9592 tok/s;   2997 sec
[2021-04-23 01:43:15,370 INFO] Step 14800/50000; acc:  92.81; ppl:  1.36; xent: 0.31; lr: 0.00010; 10099/9574 tok/s;   3007 sec
[2021-04-23 01:43:25,559 INFO] Step 14850/50000; acc:  93.03; ppl:  1.35; xent: 0.30; lr: 0.00010; 10094/9486 tok/s;   3017 sec
[2021-04-23 01:43:35,575 INFO] Step 14900/50000; acc:  93.00; ppl:  1.36; xent: 0.31; lr: 0.00010; 9982/9541 tok/s;   3027 sec
[2021-04-23 01:43:45,754 INFO] Step 14950/50000; acc:  93.02; ppl:  1.35; xent: 0.30; lr: 0.00010; 10217/9580 tok/s;   3037 sec
[2021-04-23 01:43:55,878 INFO] Step 15000/50000; acc:  92.83; ppl:  1.36; xent: 0.30; lr: 0.00010; 10013/9424 tok/s;   3047 sec
[2021-04-23 01:43:55,880 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/abstract_methods/valid_fixed.txt, align=None)...
[2021-04-23 01:44:05,499 INFO] Validation perplexity: 1.35858
[2021-04-23 01:44:05,499 INFO] Validation accuracy: 93.1009
[2021-04-23 01:44:05,501 INFO] Saving checkpoint ../models/default_params/control/model_step_15000.pt
[2021-04-23 01:44:16,147 INFO] Step 15050/50000; acc:  92.98; ppl:  1.36; xent: 0.30; lr: 0.00010; 4963/4706 tok/s;   3067 sec
[2021-04-23 01:44:26,364 INFO] Step 15100/50000; acc:  92.99; ppl:  1.35; xent: 0.30; lr: 0.00010; 10107/9494 tok/s;   3078 sec
[2021-04-23 01:44:36,271 INFO] Step 15150/50000; acc:  93.01; ppl:  1.35; xent: 0.30; lr: 0.00010; 10121/9566 tok/s;   3087 sec
[2021-04-23 01:44:46,494 INFO] Step 15200/50000; acc:  93.09; ppl:  1.34; xent: 0.29; lr: 0.00010; 10194/9554 tok/s;   3098 sec
[2021-04-23 01:44:56,373 INFO] Step 15250/50000; acc:  92.94; ppl:  1.35; xent: 0.30; lr: 0.00010; 9958/9390 tok/s;   3108 sec
[2021-04-23 01:45:05,795 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 01:45:06,629 INFO] Step 15300/50000; acc:  92.99; ppl:  1.35; xent: 0.30; lr: 0.00010; 10166/9545 tok/s;   3118 sec
[2021-04-23 01:45:16,635 INFO] Step 15350/50000; acc:  92.88; ppl:  1.36; xent: 0.31; lr: 0.00010; 10011/9442 tok/s;   3128 sec
[2021-04-23 01:45:26,747 INFO] Step 15400/50000; acc:  93.03; ppl:  1.36; xent: 0.30; lr: 0.00010; 10106/9502 tok/s;   3138 sec
[2021-04-23 01:45:36,697 INFO] Step 15450/50000; acc:  92.90; ppl:  1.36; xent: 0.31; lr: 0.00010; 10177/9723 tok/s;   3148 sec
[2021-04-23 01:45:46,816 INFO] Step 15500/50000; acc:  92.85; ppl:  1.36; xent: 0.30; lr: 0.00010; 9934/9372 tok/s;   3158 sec
[2021-04-23 01:45:56,942 INFO] Step 15550/50000; acc:  93.03; ppl:  1.35; xent: 0.30; lr: 0.00010; 10155/9570 tok/s;   3168 sec
[2021-04-23 01:46:07,043 INFO] Step 15600/50000; acc:  93.10; ppl:  1.35; xent: 0.30; lr: 0.00010; 9996/9508 tok/s;   3178 sec
[2021-04-23 01:46:17,366 INFO] Step 15650/50000; acc:  93.23; ppl:  1.34; xent: 0.29; lr: 0.00010; 10147/9533 tok/s;   3189 sec
[2021-04-23 01:46:27,322 INFO] Step 15700/50000; acc:  92.91; ppl:  1.35; xent: 0.30; lr: 0.00010; 10028/9504 tok/s;   3198 sec
[2021-04-23 01:46:37,605 INFO] Step 15750/50000; acc:  92.96; ppl:  1.35; xent: 0.30; lr: 0.00010; 9913/9336 tok/s;   3209 sec
[2021-04-23 01:46:47,773 INFO] Step 15800/50000; acc:  93.07; ppl:  1.34; xent: 0.30; lr: 0.00010; 10040/9462 tok/s;   3219 sec
[2021-04-23 01:46:57,785 INFO] Step 15850/50000; acc:  92.96; ppl:  1.35; xent: 0.30; lr: 0.00010; 10010/9468 tok/s;   3229 sec
[2021-04-23 01:47:07,926 INFO] Step 15900/50000; acc:  93.08; ppl:  1.34; xent: 0.29; lr: 0.00010; 10238/9566 tok/s;   3239 sec
[2021-04-23 01:47:17,897 INFO] Step 15950/50000; acc:  93.05; ppl:  1.34; xent: 0.30; lr: 0.00010; 10020/9465 tok/s;   3249 sec
[2021-04-23 01:47:28,072 INFO] Step 16000/50000; acc:  93.19; ppl:  1.34; xent: 0.29; lr: 0.00010; 10106/9480 tok/s;   3259 sec
[2021-04-23 01:47:34,502 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 01:47:37,965 INFO] Step 16050/50000; acc:  92.91; ppl:  1.35; xent: 0.30; lr: 0.00010; 10085/9521 tok/s;   3269 sec
[2021-04-23 01:47:48,290 INFO] Step 16100/50000; acc:  93.12; ppl:  1.35; xent: 0.30; lr: 0.00010; 10112/9523 tok/s;   3279 sec
[2021-04-23 01:47:58,101 INFO] Step 16150/50000; acc:  92.97; ppl:  1.35; xent: 0.30; lr: 0.00010; 10171/9605 tok/s;   3289 sec
[2021-04-23 01:48:08,174 INFO] Step 16200/50000; acc:  93.01; ppl:  1.35; xent: 0.30; lr: 0.00010; 10062/9577 tok/s;   3299 sec
[2021-04-23 01:48:18,300 INFO] Step 16250/50000; acc:  93.08; ppl:  1.35; xent: 0.30; lr: 0.00010; 10085/9523 tok/s;   3309 sec
[2021-04-23 01:48:28,264 INFO] Step 16300/50000; acc:  92.99; ppl:  1.35; xent: 0.30; lr: 0.00010; 10040/9527 tok/s;   3319 sec
[2021-04-23 01:48:38,560 INFO] Step 16350/50000; acc:  93.20; ppl:  1.34; xent: 0.29; lr: 0.00010; 10079/9493 tok/s;   3330 sec
[2021-04-23 01:48:48,727 INFO] Step 16400/50000; acc:  93.27; ppl:  1.33; xent: 0.29; lr: 0.00010; 10066/9499 tok/s;   3340 sec
[2021-04-23 01:48:59,103 INFO] Step 16450/50000; acc:  93.03; ppl:  1.34; xent: 0.30; lr: 0.00010; 10011/9391 tok/s;   3350 sec
[2021-04-23 01:49:08,979 INFO] Step 16500/50000; acc:  92.99; ppl:  1.35; xent: 0.30; lr: 0.00010; 9989/9472 tok/s;   3360 sec
[2021-04-23 01:49:19,230 INFO] Step 16550/50000; acc:  93.26; ppl:  1.33; xent: 0.29; lr: 0.00010; 10031/9461 tok/s;   3370 sec
[2021-04-23 01:49:29,151 INFO] Step 16600/50000; acc:  93.20; ppl:  1.34; xent: 0.29; lr: 0.00010; 10258/9654 tok/s;   3380 sec
[2021-04-23 01:49:39,085 INFO] Step 16650/50000; acc:  93.17; ppl:  1.34; xent: 0.29; lr: 0.00010; 10129/9553 tok/s;   3390 sec
[2021-04-23 01:49:49,150 INFO] Step 16700/50000; acc:  93.22; ppl:  1.34; xent: 0.29; lr: 0.00010; 10231/9523 tok/s;   3400 sec
[2021-04-23 01:49:59,143 INFO] Step 16750/50000; acc:  92.97; ppl:  1.34; xent: 0.30; lr: 0.00010; 9965/9492 tok/s;   3410 sec
[2021-04-23 01:50:02,898 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 01:50:09,444 INFO] Step 16800/50000; acc:  93.26; ppl:  1.34; xent: 0.29; lr: 0.00010; 10079/9447 tok/s;   3421 sec
[2021-04-23 01:50:19,382 INFO] Step 16850/50000; acc:  93.18; ppl:  1.34; xent: 0.29; lr: 0.00010; 10014/9463 tok/s;   3431 sec
[2021-04-23 01:50:29,524 INFO] Step 16900/50000; acc:  93.15; ppl:  1.34; xent: 0.29; lr: 0.00010; 10235/9672 tok/s;   3441 sec
[2021-04-23 01:50:39,522 INFO] Step 16950/50000; acc:  92.97; ppl:  1.35; xent: 0.30; lr: 0.00010; 9966/9481 tok/s;   3451 sec
[2021-04-23 01:50:49,567 INFO] Step 17000/50000; acc:  93.17; ppl:  1.34; xent: 0.29; lr: 0.00010; 10114/9550 tok/s;   3461 sec
[2021-04-23 01:50:59,652 INFO] Step 17050/50000; acc:  93.15; ppl:  1.34; xent: 0.29; lr: 0.00010; 10099/9551 tok/s;   3471 sec
[2021-04-23 01:51:09,807 INFO] Step 17100/50000; acc:  93.42; ppl:  1.33; xent: 0.28; lr: 0.00010; 10053/9490 tok/s;   3481 sec
[2021-04-23 01:51:19,967 INFO] Step 17150/50000; acc:  93.19; ppl:  1.33; xent: 0.29; lr: 0.00010; 10178/9552 tok/s;   3491 sec
[2021-04-23 01:51:30,140 INFO] Step 17200/50000; acc:  93.09; ppl:  1.34; xent: 0.29; lr: 0.00010; 9940/9421 tok/s;   3501 sec
[2021-04-23 01:51:40,458 INFO] Step 17250/50000; acc:  93.28; ppl:  1.33; xent: 0.29; lr: 0.00010; 10083/9439 tok/s;   3512 sec
[2021-04-23 01:51:50,319 INFO] Step 17300/50000; acc:  93.05; ppl:  1.34; xent: 0.29; lr: 0.00010; 10021/9545 tok/s;   3521 sec
[2021-04-23 01:52:00,318 INFO] Step 17350/50000; acc:  93.27; ppl:  1.33; xent: 0.29; lr: 0.00010; 10291/9689 tok/s;   3531 sec
[2021-04-23 01:52:10,437 INFO] Step 17400/50000; acc:  93.26; ppl:  1.33; xent: 0.28; lr: 0.00010; 10038/9393 tok/s;   3542 sec
[2021-04-23 01:52:20,277 INFO] Step 17450/50000; acc:  93.19; ppl:  1.34; xent: 0.29; lr: 0.00010; 10171/9589 tok/s;   3551 sec
[2021-04-23 01:52:30,473 INFO] Step 17500/50000; acc:  93.24; ppl:  1.33; xent: 0.28; lr: 0.00010; 10137/9498 tok/s;   3562 sec
[2021-04-23 01:52:31,281 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 01:52:40,548 INFO] Step 17550/50000; acc:  93.17; ppl:  1.34; xent: 0.30; lr: 0.00010; 9965/9478 tok/s;   3572 sec
[2021-04-23 01:52:50,666 INFO] Step 17600/50000; acc:  93.24; ppl:  1.33; xent: 0.29; lr: 0.00010; 10182/9528 tok/s;   3582 sec
[2021-04-23 01:53:00,493 INFO] Step 17650/50000; acc:  92.95; ppl:  1.35; xent: 0.30; lr: 0.00010; 10058/9611 tok/s;   3592 sec
[2021-04-23 01:53:10,779 INFO] Step 17700/50000; acc:  93.23; ppl:  1.33; xent: 0.28; lr: 0.00010; 10113/9523 tok/s;   3602 sec
[2021-04-23 01:53:20,706 INFO] Step 17750/50000; acc:  93.10; ppl:  1.34; xent: 0.29; lr: 0.00010; 10038/9500 tok/s;   3612 sec
[2021-04-23 01:53:30,909 INFO] Step 17800/50000; acc:  93.30; ppl:  1.33; xent: 0.29; lr: 0.00010; 9986/9481 tok/s;   3622 sec
[2021-04-23 01:53:41,186 INFO] Step 17850/50000; acc:  93.60; ppl:  1.31; xent: 0.27; lr: 0.00010; 10150/9555 tok/s;   3632 sec
[2021-04-23 01:53:51,222 INFO] Step 17900/50000; acc:  93.05; ppl:  1.34; xent: 0.29; lr: 0.00010; 10002/9434 tok/s;   3642 sec
[2021-04-23 01:54:01,340 INFO] Step 17950/50000; acc:  93.32; ppl:  1.33; xent: 0.29; lr: 0.00010; 10175/9565 tok/s;   3653 sec
[2021-04-23 01:54:11,557 INFO] Step 18000/50000; acc:  93.27; ppl:  1.33; xent: 0.29; lr: 0.00010; 9929/9390 tok/s;   3663 sec
[2021-04-23 01:54:21,714 INFO] Step 18050/50000; acc:  93.35; ppl:  1.33; xent: 0.28; lr: 0.00010; 10178/9613 tok/s;   3673 sec
[2021-04-23 01:54:31,578 INFO] Step 18100/50000; acc:  93.28; ppl:  1.33; xent: 0.28; lr: 0.00010; 10122/9535 tok/s;   3683 sec
[2021-04-23 01:54:41,748 INFO] Step 18150/50000; acc:  93.41; ppl:  1.32; xent: 0.28; lr: 0.00010; 10056/9413 tok/s;   3693 sec
[2021-04-23 01:54:51,786 INFO] Step 18200/50000; acc:  93.18; ppl:  1.33; xent: 0.29; lr: 0.00010; 10072/9446 tok/s;   3703 sec
[2021-04-23 01:54:53,446 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 01:55:01,798 INFO] Step 18250/50000; acc:  93.20; ppl:  1.33; xent: 0.29; lr: 0.00010; 10059/9530 tok/s;   3713 sec
[2021-04-23 01:55:11,957 INFO] Step 18300/50000; acc:  93.49; ppl:  1.33; xent: 0.28; lr: 0.00010; 10199/9591 tok/s;   3723 sec
[2021-04-23 01:55:21,799 INFO] Step 18350/50000; acc:  93.13; ppl:  1.33; xent: 0.29; lr: 0.00010; 10061/9541 tok/s;   3733 sec
[2021-04-23 01:55:32,035 INFO] Step 18400/50000; acc:  93.26; ppl:  1.33; xent: 0.29; lr: 0.00010; 10106/9547 tok/s;   3743 sec
[2021-04-23 01:55:41,910 INFO] Step 18450/50000; acc:  93.20; ppl:  1.33; xent: 0.29; lr: 0.00010; 10021/9536 tok/s;   3753 sec
[2021-04-23 01:55:52,169 INFO] Step 18500/50000; acc:  93.49; ppl:  1.32; xent: 0.28; lr: 0.00010; 10099/9522 tok/s;   3763 sec
[2021-04-23 01:56:02,268 INFO] Step 18550/50000; acc:  93.35; ppl:  1.33; xent: 0.28; lr: 0.00010; 10019/9504 tok/s;   3773 sec
[2021-04-23 01:56:12,361 INFO] Step 18600/50000; acc:  93.46; ppl:  1.32; xent: 0.28; lr: 0.00010; 10190/9577 tok/s;   3784 sec
[2021-04-23 01:56:22,586 INFO] Step 18650/50000; acc:  93.21; ppl:  1.33; xent: 0.28; lr: 0.00010; 10016/9416 tok/s;   3794 sec
[2021-04-23 01:56:32,580 INFO] Step 18700/50000; acc:  93.38; ppl:  1.33; xent: 0.28; lr: 0.00010; 10059/9546 tok/s;   3804 sec
[2021-04-23 01:56:42,770 INFO] Step 18750/50000; acc:  93.41; ppl:  1.32; xent: 0.28; lr: 0.00010; 10121/9517 tok/s;   3814 sec
[2021-04-23 01:56:52,798 INFO] Step 18800/50000; acc:  93.36; ppl:  1.33; xent: 0.28; lr: 0.00010; 10122/9536 tok/s;   3824 sec
[2021-04-23 01:57:03,066 INFO] Step 18850/50000; acc:  93.45; ppl:  1.31; xent: 0.27; lr: 0.00010; 10164/9504 tok/s;   3834 sec
[2021-04-23 01:57:12,880 INFO] Step 18900/50000; acc:  93.35; ppl:  1.32; xent: 0.28; lr: 0.00010; 9983/9453 tok/s;   3844 sec
[2021-04-23 01:57:21,827 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 01:57:22,992 INFO] Step 18950/50000; acc:  93.29; ppl:  1.32; xent: 0.28; lr: 0.00010; 10155/9559 tok/s;   3854 sec
[2021-04-23 01:57:33,091 INFO] Step 19000/50000; acc:  93.33; ppl:  1.33; xent: 0.28; lr: 0.00010; 10040/9436 tok/s;   3864 sec
[2021-04-23 01:57:43,089 INFO] Step 19050/50000; acc:  93.36; ppl:  1.33; xent: 0.28; lr: 0.00010; 10076/9498 tok/s;   3874 sec
[2021-04-23 01:57:53,142 INFO] Step 19100/50000; acc:  93.39; ppl:  1.32; xent: 0.28; lr: 0.00010; 10193/9706 tok/s;   3884 sec
[2021-04-23 01:58:03,176 INFO] Step 19150/50000; acc:  93.19; ppl:  1.33; xent: 0.28; lr: 0.00010; 9958/9422 tok/s;   3894 sec
[2021-04-23 01:58:13,352 INFO] Step 19200/50000; acc:  93.38; ppl:  1.32; xent: 0.28; lr: 0.00010; 10144/9565 tok/s;   3905 sec
[2021-04-23 01:58:23,313 INFO] Step 19250/50000; acc:  93.46; ppl:  1.32; xent: 0.28; lr: 0.00010; 9971/9491 tok/s;   3914 sec
[2021-04-23 01:58:33,717 INFO] Step 19300/50000; acc:  93.59; ppl:  1.32; xent: 0.27; lr: 0.00010; 10113/9489 tok/s;   3925 sec
[2021-04-23 01:58:43,767 INFO] Step 19350/50000; acc:  93.32; ppl:  1.32; xent: 0.28; lr: 0.00010; 10056/9506 tok/s;   3935 sec
[2021-04-23 01:58:53,968 INFO] Step 19400/50000; acc:  93.32; ppl:  1.33; xent: 0.28; lr: 0.00010; 9906/9342 tok/s;   3945 sec
[2021-04-23 01:59:04,257 INFO] Step 19450/50000; acc:  93.46; ppl:  1.32; xent: 0.27; lr: 0.00010; 10010/9393 tok/s;   3955 sec
[2021-04-23 01:59:14,224 INFO] Step 19500/50000; acc:  93.39; ppl:  1.32; xent: 0.28; lr: 0.00010; 10069/9552 tok/s;   3965 sec
[2021-04-23 01:59:24,342 INFO] Step 19550/50000; acc:  93.51; ppl:  1.31; xent: 0.27; lr: 0.00010; 10218/9569 tok/s;   3976 sec
[2021-04-23 01:59:34,332 INFO] Step 19600/50000; acc:  93.35; ppl:  1.32; xent: 0.28; lr: 0.00010; 10124/9494 tok/s;   3985 sec
[2021-04-23 01:59:44,552 INFO] Step 19650/50000; acc:  93.50; ppl:  1.31; xent: 0.27; lr: 0.00010; 10071/9500 tok/s;   3996 sec
[2021-04-23 01:59:50,463 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 01:59:54,336 INFO] Step 19700/50000; acc:  93.31; ppl:  1.32; xent: 0.28; lr: 0.00010; 10160/9586 tok/s;   4005 sec
[2021-04-23 02:00:04,621 INFO] Step 19750/50000; acc:  93.40; ppl:  1.32; xent: 0.28; lr: 0.00010; 10022/9451 tok/s;   4016 sec
[2021-04-23 02:00:14,423 INFO] Step 19800/50000; acc:  93.33; ppl:  1.32; xent: 0.28; lr: 0.00010; 10299/9702 tok/s;   4026 sec
[2021-04-23 02:00:24,448 INFO] Step 19850/50000; acc:  93.27; ppl:  1.32; xent: 0.28; lr: 0.00010; 9962/9505 tok/s;   4036 sec
[2021-04-23 02:00:34,656 INFO] Step 19900/50000; acc:  93.49; ppl:  1.32; xent: 0.27; lr: 0.00010; 10108/9538 tok/s;   4046 sec
[2021-04-23 02:00:44,615 INFO] Step 19950/50000; acc:  93.36; ppl:  1.32; xent: 0.28; lr: 0.00010; 9995/9471 tok/s;   4056 sec
[2021-04-23 02:00:54,905 INFO] Step 20000/50000; acc:  93.55; ppl:  1.31; xent: 0.27; lr: 0.00010; 10125/9551 tok/s;   4066 sec
[2021-04-23 02:00:54,908 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/abstract_methods/valid_fixed.txt, align=None)...
[2021-04-23 02:01:04,549 INFO] Validation perplexity: 1.33746
[2021-04-23 02:01:04,549 INFO] Validation accuracy: 93.2964
[2021-04-23 02:01:04,551 INFO] Saving checkpoint ../models/default_params/control/model_step_20000.pt
[2021-04-23 02:01:15,163 INFO] Step 20050/50000; acc:  93.43; ppl:  1.31; xent: 0.27; lr: 0.00010; 4966/4697 tok/s;   4086 sec
[2021-04-23 02:01:25,615 INFO] Step 20100/50000; acc:  93.48; ppl:  1.31; xent: 0.27; lr: 0.00010; 9983/9342 tok/s;   4097 sec
[2021-04-23 02:01:35,645 INFO] Step 20150/50000; acc:  93.34; ppl:  1.32; xent: 0.28; lr: 0.00010; 9967/9447 tok/s;   4107 sec
[2021-04-23 02:01:45,799 INFO] Step 20200/50000; acc:  93.60; ppl:  1.31; xent: 0.27; lr: 0.00010; 10046/9478 tok/s;   4117 sec
[2021-04-23 02:01:55,835 INFO] Step 20250/50000; acc:  93.51; ppl:  1.31; xent: 0.27; lr: 0.00010; 10223/9618 tok/s;   4127 sec
[2021-04-23 02:02:05,776 INFO] Step 20300/50000; acc:  93.48; ppl:  1.31; xent: 0.27; lr: 0.00010; 10128/9546 tok/s;   4137 sec
[2021-04-23 02:02:15,817 INFO] Step 20350/50000; acc:  93.51; ppl:  1.31; xent: 0.27; lr: 0.00010; 10208/9511 tok/s;   4147 sec
[2021-04-23 02:02:25,869 INFO] Step 20400/50000; acc:  93.45; ppl:  1.31; xent: 0.27; lr: 0.00010; 10041/9534 tok/s;   4157 sec
[2021-04-23 02:02:29,137 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 02:02:36,166 INFO] Step 20450/50000; acc:  93.64; ppl:  1.31; xent: 0.27; lr: 0.00010; 10077/9464 tok/s;   4167 sec
[2021-04-23 02:02:45,974 INFO] Step 20500/50000; acc:  93.44; ppl:  1.32; xent: 0.28; lr: 0.00010; 10121/9564 tok/s;   4177 sec
[2021-04-23 02:02:56,039 INFO] Step 20550/50000; acc:  93.37; ppl:  1.32; xent: 0.28; lr: 0.00010; 10158/9637 tok/s;   4187 sec
[2021-04-23 02:03:06,091 INFO] Step 20600/50000; acc:  93.46; ppl:  1.32; xent: 0.27; lr: 0.00010; 10053/9498 tok/s;   4197 sec
[2021-04-23 02:03:16,128 INFO] Step 20650/50000; acc:  93.43; ppl:  1.31; xent: 0.27; lr: 0.00010; 9974/9504 tok/s;   4207 sec
[2021-04-23 02:03:26,255 INFO] Step 20700/50000; acc:  93.61; ppl:  1.31; xent: 0.27; lr: 0.00010; 10158/9547 tok/s;   4217 sec
[2021-04-23 02:03:36,420 INFO] Step 20750/50000; acc:  93.69; ppl:  1.30; xent: 0.26; lr: 0.00010; 10005/9466 tok/s;   4228 sec
[2021-04-23 02:03:46,572 INFO] Step 20800/50000; acc:  93.54; ppl:  1.31; xent: 0.27; lr: 0.00010; 10202/9581 tok/s;   4238 sec
[2021-04-23 02:03:56,655 INFO] Step 20850/50000; acc:  93.47; ppl:  1.32; xent: 0.28; lr: 0.00010; 9874/9378 tok/s;   4248 sec
[2021-04-23 02:04:06,979 INFO] Step 20900/50000; acc:  93.68; ppl:  1.31; xent: 0.27; lr: 0.00010; 10122/9467 tok/s;   4258 sec
[2021-04-23 02:04:16,939 INFO] Step 20950/50000; acc:  93.49; ppl:  1.32; xent: 0.27; lr: 0.00010; 10034/9534 tok/s;   4268 sec
[2021-04-23 02:04:26,858 INFO] Step 21000/50000; acc:  93.70; ppl:  1.30; xent: 0.26; lr: 0.00010; 10305/9683 tok/s;   4278 sec
[2021-04-23 02:04:37,089 INFO] Step 21050/50000; acc:  93.64; ppl:  1.30; xent: 0.26; lr: 0.00010; 10008/9361 tok/s;   4288 sec
[2021-04-23 02:04:47,009 INFO] Step 21100/50000; acc:  93.43; ppl:  1.31; xent: 0.27; lr: 0.00010; 10092/9554 tok/s;   4298 sec
[2021-04-23 02:04:51,162 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 02:04:57,135 INFO] Step 21150/50000; acc:  93.53; ppl:  1.31; xent: 0.27; lr: 0.00010; 10193/9525 tok/s;   4308 sec
[2021-04-23 02:05:07,281 INFO] Step 21200/50000; acc:  93.48; ppl:  1.32; xent: 0.28; lr: 0.00010; 10002/9494 tok/s;   4318 sec
[2021-04-23 02:05:17,338 INFO] Step 21250/50000; acc:  93.65; ppl:  1.30; xent: 0.26; lr: 0.00010; 10245/9613 tok/s;   4329 sec
[2021-04-23 02:05:27,127 INFO] Step 21300/50000; acc:  93.24; ppl:  1.33; xent: 0.28; lr: 0.00010; 10063/9623 tok/s;   4338 sec
[2021-04-23 02:05:37,242 INFO] Step 21350/50000; acc:  93.59; ppl:  1.31; xent: 0.27; lr: 0.00010; 10138/9558 tok/s;   4348 sec
[2021-04-23 02:05:47,235 INFO] Step 21400/50000; acc:  93.55; ppl:  1.31; xent: 0.27; lr: 0.00010; 10103/9540 tok/s;   4358 sec
[2021-04-23 02:05:57,315 INFO] Step 21450/50000; acc:  93.52; ppl:  1.31; xent: 0.27; lr: 0.00010; 9974/9480 tok/s;   4368 sec
[2021-04-23 02:06:07,641 INFO] Step 21500/50000; acc:  93.96; ppl:  1.29; xent: 0.26; lr: 0.00010; 10204/9564 tok/s;   4379 sec
[2021-04-23 02:06:17,703 INFO] Step 21550/50000; acc:  93.37; ppl:  1.31; xent: 0.27; lr: 0.00010; 9918/9423 tok/s;   4389 sec
[2021-04-23 02:06:27,884 INFO] Step 21600/50000; acc:  93.76; ppl:  1.30; xent: 0.26; lr: 0.00010; 10157/9521 tok/s;   4399 sec
[2021-04-23 02:06:37,917 INFO] Step 21650/50000; acc:  93.54; ppl:  1.31; xent: 0.27; lr: 0.00010; 9924/9404 tok/s;   4409 sec
[2021-04-23 02:06:48,142 INFO] Step 21700/50000; acc:  93.64; ppl:  1.30; xent: 0.26; lr: 0.00010; 10191/9619 tok/s;   4419 sec
[2021-04-23 02:06:58,031 INFO] Step 21750/50000; acc:  93.68; ppl:  1.30; xent: 0.26; lr: 0.00010; 10192/9568 tok/s;   4429 sec
[2021-04-23 02:07:08,134 INFO] Step 21800/50000; acc:  93.69; ppl:  1.30; xent: 0.26; lr: 0.00010; 10041/9445 tok/s;   4439 sec
[2021-04-23 02:07:18,219 INFO] Step 21850/50000; acc:  93.55; ppl:  1.31; xent: 0.27; lr: 0.00010; 10103/9458 tok/s;   4449 sec
[2021-04-23 02:07:19,464 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 02:07:28,261 INFO] Step 21900/50000; acc:  93.40; ppl:  1.31; xent: 0.27; lr: 0.00010; 10029/9491 tok/s;   4459 sec
[2021-04-23 02:07:38,436 INFO] Step 21950/50000; acc:  93.79; ppl:  1.30; xent: 0.27; lr: 0.00010; 10159/9541 tok/s;   4470 sec
[2021-04-23 02:07:48,334 INFO] Step 22000/50000; acc:  93.47; ppl:  1.31; xent: 0.27; lr: 0.00010; 10121/9632 tok/s;   4479 sec
[2021-04-23 02:07:58,585 INFO] Step 22050/50000; acc:  93.56; ppl:  1.31; xent: 0.27; lr: 0.00010; 10110/9505 tok/s;   4490 sec
[2021-04-23 02:08:08,415 INFO] Step 22100/50000; acc:  93.50; ppl:  1.31; xent: 0.27; lr: 0.00010; 10040/9561 tok/s;   4500 sec
[2021-04-23 02:08:18,625 INFO] Step 22150/50000; acc:  93.68; ppl:  1.30; xent: 0.27; lr: 0.00010; 9986/9454 tok/s;   4510 sec
[2021-04-23 02:08:28,853 INFO] Step 22200/50000; acc:  93.76; ppl:  1.30; xent: 0.26; lr: 0.00010; 10054/9483 tok/s;   4520 sec
[2021-04-23 02:08:38,888 INFO] Step 22250/50000; acc:  93.64; ppl:  1.30; xent: 0.26; lr: 0.00010; 10090/9532 tok/s;   4530 sec
[2021-04-23 02:08:49,254 INFO] Step 22300/50000; acc:  93.57; ppl:  1.30; xent: 0.27; lr: 0.00010; 9981/9353 tok/s;   4540 sec
[2021-04-23 02:08:59,276 INFO] Step 22350/50000; acc:  93.70; ppl:  1.30; xent: 0.26; lr: 0.00010; 9978/9463 tok/s;   4550 sec
[2021-04-23 02:09:09,392 INFO] Step 22400/50000; acc:  93.81; ppl:  1.30; xent: 0.26; lr: 0.00010; 10221/9616 tok/s;   4561 sec
[2021-04-23 02:09:20,934 INFO] Step 22450/50000; acc:  93.57; ppl:  1.30; xent: 0.27; lr: 0.00010; 8649/8195 tok/s;   4572 sec
[2021-04-23 02:09:34,133 INFO] Step 22500/50000; acc:  93.79; ppl:  1.29; xent: 0.25; lr: 0.00010; 7939/7421 tok/s;   4585 sec
[2021-04-23 02:09:46,831 INFO] Step 22550/50000; acc:  93.68; ppl:  1.30; xent: 0.26; lr: 0.00010; 7818/7355 tok/s;   4598 sec
[2021-04-23 02:09:57,717 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 02:09:59,704 INFO] Step 22600/50000; acc:  93.64; ppl:  1.30; xent: 0.26; lr: 0.00010; 7915/7468 tok/s;   4611 sec
[2021-04-23 02:10:12,727 INFO] Step 22650/50000; acc:  93.73; ppl:  1.30; xent: 0.26; lr: 0.00010; 7858/7381 tok/s;   4624 sec
[2021-04-23 02:10:25,581 INFO] Step 22700/50000; acc:  93.69; ppl:  1.30; xent: 0.27; lr: 0.00010; 7825/7372 tok/s;   4637 sec
[2021-04-23 02:10:38,367 INFO] Step 22750/50000; acc:  93.63; ppl:  1.30; xent: 0.26; lr: 0.00010; 7993/7632 tok/s;   4650 sec
[2021-04-23 02:10:51,523 INFO] Step 22800/50000; acc:  93.56; ppl:  1.31; xent: 0.27; lr: 0.00010; 7691/7242 tok/s;   4663 sec
[2021-04-23 02:11:04,588 INFO] Step 22850/50000; acc:  93.80; ppl:  1.30; xent: 0.26; lr: 0.00010; 7904/7459 tok/s;   4676 sec
[2021-04-23 02:11:17,326 INFO] Step 22900/50000; acc:  93.60; ppl:  1.31; xent: 0.27; lr: 0.00010; 7780/7411 tok/s;   4688 sec
[2021-04-23 02:11:30,447 INFO] Step 22950/50000; acc:  93.88; ppl:  1.29; xent: 0.26; lr: 0.00010; 7905/7442 tok/s;   4702 sec
[2021-04-23 02:11:43,478 INFO] Step 23000/50000; acc:  93.61; ppl:  1.30; xent: 0.26; lr: 0.00010; 7842/7381 tok/s;   4715 sec
[2021-04-23 02:11:56,542 INFO] Step 23050/50000; acc:  93.62; ppl:  1.31; xent: 0.27; lr: 0.00010; 7640/7246 tok/s;   4728 sec
[2021-04-23 02:12:09,705 INFO] Step 23100/50000; acc:  93.83; ppl:  1.29; xent: 0.26; lr: 0.00010; 7897/7382 tok/s;   4741 sec
[2021-04-23 02:12:22,533 INFO] Step 23150/50000; acc:  93.65; ppl:  1.30; xent: 0.26; lr: 0.00010; 7785/7398 tok/s;   4754 sec
[2021-04-23 02:12:35,502 INFO] Step 23200/50000; acc:  93.84; ppl:  1.29; xent: 0.25; lr: 0.00010; 8014/7493 tok/s;   4767 sec
[2021-04-23 02:12:46,747 INFO] Step 23250/50000; acc:  93.59; ppl:  1.30; xent: 0.26; lr: 0.00010; 8817/8314 tok/s;   4778 sec
[2021-04-23 02:12:56,947 INFO] Step 23300/50000; acc:  93.81; ppl:  1.29; xent: 0.26; lr: 0.00010; 10153/9559 tok/s;   4788 sec
[2021-04-23 02:13:02,536 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 02:13:06,831 INFO] Step 23350/50000; acc:  93.62; ppl:  1.30; xent: 0.26; lr: 0.00010; 10171/9577 tok/s;   4798 sec
[2021-04-23 02:13:17,020 INFO] Step 23400/50000; acc:  93.73; ppl:  1.30; xent: 0.27; lr: 0.00010; 10046/9449 tok/s;   4808 sec
[2021-04-23 02:13:26,935 INFO] Step 23450/50000; acc:  93.65; ppl:  1.30; xent: 0.26; lr: 0.00010; 10258/9709 tok/s;   4818 sec
[2021-04-23 02:13:36,948 INFO] Step 23500/50000; acc:  93.63; ppl:  1.30; xent: 0.26; lr: 0.00010; 9984/9510 tok/s;   4828 sec
[2021-04-23 02:13:47,119 INFO] Step 23550/50000; acc:  93.76; ppl:  1.30; xent: 0.26; lr: 0.00010; 10111/9532 tok/s;   4838 sec
[2021-04-23 02:13:57,163 INFO] Step 23600/50000; acc:  93.68; ppl:  1.30; xent: 0.26; lr: 0.00010; 10021/9484 tok/s;   4848 sec
[2021-04-23 02:14:07,488 INFO] Step 23650/50000; acc:  93.85; ppl:  1.29; xent: 0.26; lr: 0.00010; 10128/9527 tok/s;   4859 sec
[2021-04-23 02:14:17,451 INFO] Step 23700/50000; acc:  93.75; ppl:  1.29; xent: 0.25; lr: 0.00010; 10049/9521 tok/s;   4869 sec
[2021-04-23 02:14:27,743 INFO] Step 23750/50000; acc:  93.70; ppl:  1.30; xent: 0.26; lr: 0.00010; 9978/9370 tok/s;   4879 sec
[2021-04-23 02:14:37,840 INFO] Step 23800/50000; acc:  93.74; ppl:  1.29; xent: 0.25; lr: 0.00010; 10047/9494 tok/s;   4889 sec
[2021-04-23 02:14:48,012 INFO] Step 23850/50000; acc:  93.79; ppl:  1.29; xent: 0.26; lr: 0.00010; 9880/9344 tok/s;   4899 sec
[2021-04-23 02:14:58,101 INFO] Step 23900/50000; acc:  93.83; ppl:  1.29; xent: 0.26; lr: 0.00010; 10244/9652 tok/s;   4909 sec
[2021-04-23 02:15:08,042 INFO] Step 23950/50000; acc:  93.75; ppl:  1.29; xent: 0.25; lr: 0.00010; 10099/9509 tok/s;   4919 sec
[2021-04-23 02:15:18,154 INFO] Step 24000/50000; acc:  93.84; ppl:  1.29; xent: 0.26; lr: 0.00010; 10172/9475 tok/s;   4929 sec
[2021-04-23 02:15:28,041 INFO] Step 24050/50000; acc:  93.62; ppl:  1.30; xent: 0.26; lr: 0.00010; 10044/9533 tok/s;   4939 sec
[2021-04-23 02:15:31,021 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 02:15:38,354 INFO] Step 24100/50000; acc:  93.93; ppl:  1.29; xent: 0.25; lr: 0.00010; 10123/9519 tok/s;   4950 sec
[2021-04-23 02:15:48,245 INFO] Step 24150/50000; acc:  93.86; ppl:  1.29; xent: 0.26; lr: 0.00010; 10133/9569 tok/s;   4959 sec
[2021-04-23 02:15:58,251 INFO] Step 24200/50000; acc:  93.65; ppl:  1.30; xent: 0.26; lr: 0.00010; 10138/9601 tok/s;   4969 sec
[2021-04-23 02:16:08,335 INFO] Step 24250/50000; acc:  93.78; ppl:  1.29; xent: 0.26; lr: 0.00010; 10102/9558 tok/s;   4979 sec
[2021-04-23 02:16:18,424 INFO] Step 24300/50000; acc:  93.69; ppl:  1.29; xent: 0.26; lr: 0.00010; 9934/9476 tok/s;   4990 sec
[2021-04-23 02:16:28,495 INFO] Step 24350/50000; acc:  93.85; ppl:  1.29; xent: 0.26; lr: 0.00010; 10188/9551 tok/s;   5000 sec
[2021-04-23 02:16:38,754 INFO] Step 24400/50000; acc:  93.97; ppl:  1.28; xent: 0.25; lr: 0.00010; 10044/9495 tok/s;   5010 sec
[2021-04-23 02:16:48,942 INFO] Step 24450/50000; acc:  93.76; ppl:  1.29; xent: 0.25; lr: 0.00010; 10168/9545 tok/s;   5020 sec
[2021-04-23 02:16:58,938 INFO] Step 24500/50000; acc:  93.69; ppl:  1.30; xent: 0.26; lr: 0.00010; 9927/9413 tok/s;   5030 sec
[2021-04-23 02:17:09,181 INFO] Step 24550/50000; acc:  93.86; ppl:  1.29; xent: 0.25; lr: 0.00010; 10048/9446 tok/s;   5040 sec
[2021-04-23 02:17:19,266 INFO] Step 24600/50000; acc:  93.84; ppl:  1.29; xent: 0.25; lr: 0.00010; 10036/9505 tok/s;   5050 sec
[2021-04-23 02:17:29,068 INFO] Step 24650/50000; acc:  93.85; ppl:  1.29; xent: 0.25; lr: 0.00010; 10283/9692 tok/s;   5060 sec
[2021-04-23 02:17:39,338 INFO] Step 24700/50000; acc:  93.91; ppl:  1.28; xent: 0.25; lr: 0.00010; 10070/9370 tok/s;   5071 sec
[2021-04-23 02:17:49,246 INFO] Step 24750/50000; acc:  93.68; ppl:  1.30; xent: 0.26; lr: 0.00010; 10042/9548 tok/s;   5080 sec
[2021-04-23 02:17:53,042 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 02:17:59,458 INFO] Step 24800/50000; acc:  93.89; ppl:  1.29; xent: 0.25; lr: 0.00010; 10171/9508 tok/s;   5091 sec
[2021-04-23 02:18:09,410 INFO] Step 24850/50000; acc:  93.69; ppl:  1.30; xent: 0.26; lr: 0.00010; 10001/9493 tok/s;   5101 sec
[2021-04-23 02:18:19,541 INFO] Step 24900/50000; acc:  93.94; ppl:  1.28; xent: 0.25; lr: 0.00010; 10239/9619 tok/s;   5111 sec
[2021-04-23 02:18:29,447 INFO] Step 24950/50000; acc:  93.61; ppl:  1.30; xent: 0.26; lr: 0.00010; 10049/9587 tok/s;   5121 sec
[2021-04-23 02:18:39,489 INFO] Step 25000/50000; acc:  93.82; ppl:  1.29; xent: 0.25; lr: 0.00010; 10144/9573 tok/s;   5131 sec
[2021-04-23 02:18:39,491 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/abstract_methods/valid_fixed.txt, align=None)...
[2021-04-23 02:18:49,130 INFO] Validation perplexity: 1.33406
[2021-04-23 02:18:49,130 INFO] Validation accuracy: 93.46
[2021-04-23 02:18:49,133 INFO] Saving checkpoint ../models/default_params/control/model_step_25000.pt
[2021-04-23 02:18:59,853 INFO] Step 25050/50000; acc:  93.92; ppl:  1.29; xent: 0.25; lr: 0.00010; 4991/4708 tok/s;   5151 sec
[2021-04-23 02:19:09,927 INFO] Step 25100/50000; acc:  93.82; ppl:  1.29; xent: 0.26; lr: 0.00010; 10007/9509 tok/s;   5161 sec
[2021-04-23 02:19:20,194 INFO] Step 25150/50000; acc:  94.13; ppl:  1.28; xent: 0.24; lr: 0.00010; 10209/9586 tok/s;   5171 sec
[2021-04-23 02:19:30,326 INFO] Step 25200/50000; acc:  93.61; ppl:  1.29; xent: 0.26; lr: 0.00010; 9982/9426 tok/s;   5181 sec
[2021-04-23 02:19:40,542 INFO] Step 25250/50000; acc:  93.91; ppl:  1.28; xent: 0.25; lr: 0.00010; 10149/9537 tok/s;   5192 sec
[2021-04-23 02:19:50,501 INFO] Step 25300/50000; acc:  93.79; ppl:  1.29; xent: 0.25; lr: 0.00010; 9941/9430 tok/s;   5202 sec
[2021-04-23 02:20:00,636 INFO] Step 25350/50000; acc:  93.88; ppl:  1.28; xent: 0.25; lr: 0.00010; 10136/9586 tok/s;   5212 sec
[2021-04-23 02:20:10,625 INFO] Step 25400/50000; acc:  94.02; ppl:  1.28; xent: 0.25; lr: 0.00010; 10226/9577 tok/s;   5222 sec
[2021-04-23 02:20:20,606 INFO] Step 25450/50000; acc:  93.98; ppl:  1.28; xent: 0.25; lr: 0.00010; 10006/9426 tok/s;   5232 sec
[2021-04-23 02:20:30,735 INFO] Step 25500/50000; acc:  93.81; ppl:  1.29; xent: 0.25; lr: 0.00010; 10185/9510 tok/s;   5242 sec
[2021-04-23 02:20:31,579 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 02:20:40,723 INFO] Step 25550/50000; acc:  93.71; ppl:  1.29; xent: 0.26; lr: 0.00010; 10018/9510 tok/s;   5252 sec
[2021-04-23 02:20:50,886 INFO] Step 25600/50000; acc:  93.99; ppl:  1.28; xent: 0.25; lr: 0.00010; 10219/9573 tok/s;   5262 sec
[2021-04-23 02:21:00,680 INFO] Step 25650/50000; acc:  93.61; ppl:  1.30; xent: 0.26; lr: 0.00010; 10032/9610 tok/s;   5272 sec
[2021-04-23 02:21:10,987 INFO] Step 25700/50000; acc:  93.90; ppl:  1.29; xent: 0.25; lr: 0.00010; 10109/9494 tok/s;   5282 sec
[2021-04-23 02:21:21,451 INFO] Step 25750/50000; acc:  93.88; ppl:  1.28; xent: 0.25; lr: 0.00010; 9553/9069 tok/s;   5293 sec
[2021-04-23 02:21:34,382 INFO] Step 25800/50000; acc:  94.00; ppl:  1.29; xent: 0.25; lr: 0.00010; 7818/7402 tok/s;   5306 sec
[2021-04-23 02:21:47,619 INFO] Step 25850/50000; acc:  94.06; ppl:  1.28; xent: 0.25; lr: 0.00010; 7836/7384 tok/s;   5319 sec
[2021-04-23 02:22:00,497 INFO] Step 25900/50000; acc:  93.84; ppl:  1.28; xent: 0.25; lr: 0.00010; 7863/7443 tok/s;   5332 sec
[2021-04-23 02:22:13,718 INFO] Step 25950/50000; acc:  93.86; ppl:  1.28; xent: 0.25; lr: 0.00010; 7790/7302 tok/s;   5345 sec
[2021-04-23 02:22:26,819 INFO] Step 26000/50000; acc:  93.97; ppl:  1.28; xent: 0.25; lr: 0.00010; 7740/7312 tok/s;   5358 sec
[2021-04-23 02:22:39,787 INFO] Step 26050/50000; acc:  93.98; ppl:  1.28; xent: 0.24; lr: 0.00010; 7985/7521 tok/s;   5371 sec
[2021-04-23 02:22:52,437 INFO] Step 26100/50000; acc:  93.91; ppl:  1.28; xent: 0.25; lr: 0.00010; 7862/7462 tok/s;   5384 sec
[2021-04-23 02:23:05,470 INFO] Step 26150/50000; acc:  94.04; ppl:  1.27; xent: 0.24; lr: 0.00010; 7901/7395 tok/s;   5397 sec
[2021-04-23 02:23:18,366 INFO] Step 26200/50000; acc:  93.82; ppl:  1.29; xent: 0.25; lr: 0.00010; 7804/7314 tok/s;   5410 sec
[2021-04-23 02:23:28,605 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 02:23:31,106 INFO] Step 26250/50000; acc:  93.88; ppl:  1.28; xent: 0.25; lr: 0.00010; 7898/7482 tok/s;   5422 sec
[2021-04-23 02:23:44,224 INFO] Step 26300/50000; acc:  93.93; ppl:  1.28; xent: 0.25; lr: 0.00010; 7877/7364 tok/s;   5435 sec
[2021-04-23 02:23:57,018 INFO] Step 26350/50000; acc:  93.93; ppl:  1.29; xent: 0.25; lr: 0.00010; 7809/7387 tok/s;   5448 sec
[2021-04-23 02:24:09,907 INFO] Step 26400/50000; acc:  93.80; ppl:  1.29; xent: 0.25; lr: 0.00010; 7970/7609 tok/s;   5461 sec
[2021-04-23 02:24:22,732 INFO] Step 26450/50000; acc:  93.74; ppl:  1.29; xent: 0.26; lr: 0.00010; 7759/7325 tok/s;   5474 sec
[2021-04-23 02:24:35,756 INFO] Step 26500/50000; acc:  94.03; ppl:  1.28; xent: 0.25; lr: 0.00010; 7964/7505 tok/s;   5487 sec
[2021-04-23 02:24:48,236 INFO] Step 26550/50000; acc:  93.93; ppl:  1.28; xent: 0.25; lr: 0.00010; 8047/7646 tok/s;   5499 sec
[2021-04-23 02:24:58,426 INFO] Step 26600/50000; acc:  94.22; ppl:  1.27; xent: 0.24; lr: 0.00010; 10112/9520 tok/s;   5510 sec
[2021-04-23 02:25:08,670 INFO] Step 26650/50000; acc:  93.83; ppl:  1.28; xent: 0.25; lr: 0.00010; 10038/9456 tok/s;   5520 sec
[2021-04-23 02:25:18,820 INFO] Step 26700/50000; acc:  93.87; ppl:  1.28; xent: 0.25; lr: 0.00010; 9851/9350 tok/s;   5530 sec
[2021-04-23 02:25:29,069 INFO] Step 26750/50000; acc:  94.08; ppl:  1.27; xent: 0.24; lr: 0.00010; 10105/9453 tok/s;   5540 sec
[2021-04-23 02:25:39,113 INFO] Step 26800/50000; acc:  94.05; ppl:  1.27; xent: 0.24; lr: 0.00010; 10062/9564 tok/s;   5550 sec
[2021-04-23 02:25:49,277 INFO] Step 26850/50000; acc:  94.02; ppl:  1.27; xent: 0.24; lr: 0.00010; 10227/9526 tok/s;   5560 sec
[2021-04-23 02:25:59,420 INFO] Step 26900/50000; acc:  93.86; ppl:  1.28; xent: 0.25; lr: 0.00010; 9739/9185 tok/s;   5571 sec
[2021-04-23 02:26:12,437 INFO] Step 26950/50000; acc:  94.05; ppl:  1.28; xent: 0.24; lr: 0.00010; 7845/7419 tok/s;   5584 sec
[2021-04-23 02:26:19,146 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 02:26:25,248 INFO] Step 27000/50000; acc:  93.91; ppl:  1.28; xent: 0.25; lr: 0.00010; 7941/7474 tok/s;   5596 sec
[2021-04-23 02:26:38,231 INFO] Step 27050/50000; acc:  94.00; ppl:  1.28; xent: 0.25; lr: 0.00010; 7792/7315 tok/s;   5609 sec
[2021-04-23 02:26:51,037 INFO] Step 27100/50000; acc:  93.87; ppl:  1.28; xent: 0.25; lr: 0.00010; 8010/7569 tok/s;   5622 sec
[2021-04-23 02:27:03,858 INFO] Step 27150/50000; acc:  93.79; ppl:  1.29; xent: 0.25; lr: 0.00010; 7754/7396 tok/s;   5635 sec
[2021-04-23 02:27:17,002 INFO] Step 27200/50000; acc:  93.96; ppl:  1.28; xent: 0.25; lr: 0.00010; 7856/7416 tok/s;   5648 sec
[2021-04-23 02:27:29,576 INFO] Step 27250/50000; acc:  93.96; ppl:  1.28; xent: 0.25; lr: 0.00010; 7854/7448 tok/s;   5661 sec
[2021-04-23 02:27:42,846 INFO] Step 27300/50000; acc:  94.15; ppl:  1.27; xent: 0.24; lr: 0.00010; 7943/7456 tok/s;   5674 sec
[2021-04-23 02:27:55,703 INFO] Step 27350/50000; acc:  94.12; ppl:  1.27; xent: 0.24; lr: 0.00010; 7871/7449 tok/s;   5687 sec
[2021-04-23 02:28:08,777 INFO] Step 27400/50000; acc:  93.94; ppl:  1.28; xent: 0.25; lr: 0.00010; 7795/7331 tok/s;   5700 sec
[2021-04-23 02:28:21,718 INFO] Step 27450/50000; acc:  94.00; ppl:  1.27; xent: 0.24; lr: 0.00010; 7906/7458 tok/s;   5713 sec
[2021-04-23 02:28:34,746 INFO] Step 27500/50000; acc:  93.95; ppl:  1.28; xent: 0.24; lr: 0.00010; 7713/7292 tok/s;   5726 sec
[2021-04-23 02:28:47,412 INFO] Step 27550/50000; acc:  94.17; ppl:  1.27; xent: 0.24; lr: 0.00010; 8138/7661 tok/s;   5739 sec
[2021-04-23 02:29:00,306 INFO] Step 27600/50000; acc:  94.03; ppl:  1.27; xent: 0.24; lr: 0.00010; 7882/7390 tok/s;   5751 sec
[2021-04-23 02:29:13,309 INFO] Step 27650/50000; acc:  94.07; ppl:  1.27; xent: 0.24; lr: 0.00010; 7911/7403 tok/s;   5764 sec
[2021-04-23 02:29:25,868 INFO] Step 27700/50000; acc:  93.89; ppl:  1.28; xent: 0.25; lr: 0.00010; 7897/7501 tok/s;   5777 sec
[2021-04-23 02:29:28,580 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 02:29:36,229 INFO] Step 27750/50000; acc:  94.07; ppl:  1.27; xent: 0.24; lr: 0.00010; 9921/9355 tok/s;   5787 sec
[2021-04-23 02:29:46,188 INFO] Step 27800/50000; acc:  94.08; ppl:  1.28; xent: 0.24; lr: 0.00010; 10185/9577 tok/s;   5797 sec
[2021-04-23 02:29:56,062 INFO] Step 27850/50000; acc:  93.88; ppl:  1.28; xent: 0.25; lr: 0.00010; 10117/9606 tok/s;   5807 sec
[2021-04-23 02:30:06,188 INFO] Step 27900/50000; acc:  94.00; ppl:  1.27; xent: 0.24; lr: 0.00010; 10184/9598 tok/s;   5817 sec
[2021-04-23 02:30:16,207 INFO] Step 27950/50000; acc:  93.92; ppl:  1.28; xent: 0.25; lr: 0.00010; 9935/9486 tok/s;   5827 sec
[2021-04-23 02:30:26,317 INFO] Step 28000/50000; acc:  94.14; ppl:  1.27; xent: 0.24; lr: 0.00010; 10201/9580 tok/s;   5837 sec
[2021-04-23 02:30:36,488 INFO] Step 28050/50000; acc:  94.14; ppl:  1.27; xent: 0.24; lr: 0.00010; 9958/9434 tok/s;   5848 sec
[2021-04-23 02:30:46,759 INFO] Step 28100/50000; acc:  94.05; ppl:  1.27; xent: 0.24; lr: 0.00010; 10149/9511 tok/s;   5858 sec
[2021-04-23 02:30:56,773 INFO] Step 28150/50000; acc:  93.98; ppl:  1.28; xent: 0.24; lr: 0.00010; 10015/9495 tok/s;   5868 sec
[2021-04-23 02:31:07,045 INFO] Step 28200/50000; acc:  94.09; ppl:  1.27; xent: 0.24; lr: 0.00010; 9944/9364 tok/s;   5878 sec
[2021-04-23 02:31:17,118 INFO] Step 28250/50000; acc:  94.11; ppl:  1.27; xent: 0.24; lr: 0.00010; 10124/9575 tok/s;   5888 sec
[2021-04-23 02:31:27,007 INFO] Step 28300/50000; acc:  94.15; ppl:  1.27; xent: 0.24; lr: 0.00010; 10216/9636 tok/s;   5898 sec
[2021-04-23 02:31:37,227 INFO] Step 28350/50000; acc:  94.24; ppl:  1.26; xent: 0.23; lr: 0.00010; 10081/9390 tok/s;   5908 sec
[2021-04-23 02:31:47,168 INFO] Step 28400/50000; acc:  93.93; ppl:  1.28; xent: 0.25; lr: 0.00010; 10116/9561 tok/s;   5918 sec
[2021-04-23 02:31:50,485 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 02:31:57,390 INFO] Step 28450/50000; acc:  94.09; ppl:  1.27; xent: 0.24; lr: 0.00010; 10193/9550 tok/s;   5929 sec
[2021-04-23 02:32:07,324 INFO] Step 28500/50000; acc:  93.97; ppl:  1.28; xent: 0.25; lr: 0.00010; 9975/9484 tok/s;   5938 sec
[2021-04-23 02:32:17,287 INFO] Step 28550/50000; acc:  94.16; ppl:  1.27; xent: 0.24; lr: 0.00010; 10236/9623 tok/s;   5948 sec
[2021-04-23 02:32:27,256 INFO] Step 28600/50000; acc:  93.90; ppl:  1.28; xent: 0.25; lr: 0.00010; 10134/9610 tok/s;   5958 sec
[2021-04-23 02:32:37,238 INFO] Step 28650/50000; acc:  93.98; ppl:  1.27; xent: 0.24; lr: 0.00010; 10066/9562 tok/s;   5968 sec
[2021-04-23 02:32:47,398 INFO] Step 28700/50000; acc:  94.15; ppl:  1.27; xent: 0.24; lr: 0.00010; 10098/9474 tok/s;   5979 sec
[2021-04-23 02:32:57,470 INFO] Step 28750/50000; acc:  94.04; ppl:  1.28; xent: 0.24; lr: 0.00010; 9989/9520 tok/s;   5989 sec
[2021-04-23 02:33:07,705 INFO] Step 28800/50000; acc:  94.37; ppl:  1.26; xent: 0.23; lr: 0.00010; 10271/9644 tok/s;   5999 sec
[2021-04-23 02:33:17,702 INFO] Step 28850/50000; acc:  93.88; ppl:  1.28; xent: 0.24; lr: 0.00010; 9927/9400 tok/s;   6009 sec
[2021-04-23 02:33:27,956 INFO] Step 28900/50000; acc:  94.28; ppl:  1.27; xent: 0.24; lr: 0.00010; 10169/9561 tok/s;   6019 sec
[2021-04-23 02:33:37,998 INFO] Step 28950/50000; acc:  94.08; ppl:  1.27; xent: 0.24; lr: 0.00010; 9968/9433 tok/s;   6029 sec
[2021-04-23 02:33:48,047 INFO] Step 29000/50000; acc:  94.07; ppl:  1.27; xent: 0.24; lr: 0.00010; 10156/9604 tok/s;   6039 sec
[2021-04-23 02:33:58,108 INFO] Step 29050/50000; acc:  94.27; ppl:  1.26; xent: 0.23; lr: 0.00010; 10243/9573 tok/s;   6049 sec
[2021-04-23 02:34:08,089 INFO] Step 29100/50000; acc:  94.12; ppl:  1.27; xent: 0.24; lr: 0.00010; 9987/9401 tok/s;   6059 sec
[2021-04-23 02:34:18,239 INFO] Step 29150/50000; acc:  94.02; ppl:  1.27; xent: 0.24; lr: 0.00010; 10150/9515 tok/s;   6069 sec
[2021-04-23 02:34:18,676 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 02:34:28,339 INFO] Step 29200/50000; acc:  94.15; ppl:  1.27; xent: 0.24; lr: 0.00010; 10017/9469 tok/s;   6080 sec
[2021-04-23 02:34:38,564 INFO] Step 29250/50000; acc:  94.20; ppl:  1.27; xent: 0.24; lr: 0.00010; 10174/9546 tok/s;   6090 sec
[2021-04-23 02:34:48,256 INFO] Step 29300/50000; acc:  93.91; ppl:  1.28; xent: 0.24; lr: 0.00010; 10097/9673 tok/s;   6099 sec
[2021-04-23 02:34:58,495 INFO] Step 29350/50000; acc:  94.07; ppl:  1.27; xent: 0.24; lr: 0.00010; 10038/9436 tok/s;   6110 sec
[2021-04-23 02:35:08,468 INFO] Step 29400/50000; acc:  94.10; ppl:  1.27; xent: 0.24; lr: 0.00010; 10148/9617 tok/s;   6120 sec
[2021-04-23 02:35:18,488 INFO] Step 29450/50000; acc:  94.11; ppl:  1.27; xent: 0.24; lr: 0.00010; 9962/9441 tok/s;   6130 sec
[2021-04-23 02:35:28,843 INFO] Step 29500/50000; acc:  94.31; ppl:  1.26; xent: 0.23; lr: 0.00010; 10111/9524 tok/s;   6140 sec
[2021-04-23 02:35:38,850 INFO] Step 29550/50000; acc:  93.98; ppl:  1.27; xent: 0.24; lr: 0.00010; 10067/9526 tok/s;   6150 sec
[2021-04-23 02:35:49,183 INFO] Step 29600/50000; acc:  94.19; ppl:  1.26; xent: 0.23; lr: 0.00010; 9975/9362 tok/s;   6160 sec
[2021-04-23 02:35:59,185 INFO] Step 29650/50000; acc:  94.17; ppl:  1.27; xent: 0.24; lr: 0.00010; 9979/9458 tok/s;   6170 sec
[2021-04-23 02:36:09,454 INFO] Step 29700/50000; acc:  94.20; ppl:  1.26; xent: 0.23; lr: 0.00010; 10142/9554 tok/s;   6181 sec
[2021-04-23 02:36:19,340 INFO] Step 29750/50000; acc:  94.17; ppl:  1.26; xent: 0.23; lr: 0.00010; 10185/9591 tok/s;   6191 sec
[2021-04-23 02:36:29,428 INFO] Step 29800/50000; acc:  94.20; ppl:  1.26; xent: 0.23; lr: 0.00010; 10107/9525 tok/s;   6201 sec
[2021-04-23 02:36:39,506 INFO] Step 29850/50000; acc:  94.11; ppl:  1.27; xent: 0.24; lr: 0.00010; 10068/9426 tok/s;   6211 sec
[2021-04-23 02:36:47,171 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 02:36:49,509 INFO] Step 29900/50000; acc:  94.04; ppl:  1.26; xent: 0.23; lr: 0.00010; 10088/9522 tok/s;   6221 sec
[2021-04-23 02:36:59,636 INFO] Step 29950/50000; acc:  94.15; ppl:  1.27; xent: 0.24; lr: 0.00010; 10171/9553 tok/s;   6231 sec
[2021-04-23 02:37:09,668 INFO] Step 30000/50000; acc:  94.15; ppl:  1.27; xent: 0.24; lr: 0.00010; 10069/9500 tok/s;   6241 sec
[2021-04-23 02:37:09,670 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/abstract_methods/valid_fixed.txt, align=None)...
[2021-04-23 02:37:19,286 INFO] Validation perplexity: 1.32736
[2021-04-23 02:37:19,286 INFO] Validation accuracy: 93.4833
[2021-04-23 02:37:19,288 INFO] Saving checkpoint ../models/default_params/control/model_step_30000.pt
[2021-04-23 02:37:30,006 INFO] Step 30050/50000; acc:  94.11; ppl:  1.27; xent: 0.24; lr: 0.00010; 5064/4817 tok/s;   6261 sec
[2021-04-23 02:37:39,967 INFO] Step 30100/50000; acc:  93.98; ppl:  1.27; xent: 0.24; lr: 0.00010; 9937/9396 tok/s;   6271 sec
[2021-04-23 02:37:50,055 INFO] Step 30150/50000; acc:  94.16; ppl:  1.26; xent: 0.23; lr: 0.00010; 10128/9586 tok/s;   6281 sec
[2021-04-23 02:38:00,154 INFO] Step 30200/50000; acc:  94.23; ppl:  1.26; xent: 0.23; lr: 0.00010; 10096/9560 tok/s;   6291 sec
[2021-04-23 02:38:10,300 INFO] Step 30250/50000; acc:  94.25; ppl:  1.26; xent: 0.23; lr: 0.00010; 10019/9475 tok/s;   6301 sec
[2021-04-23 02:38:20,619 INFO] Step 30300/50000; acc:  94.07; ppl:  1.26; xent: 0.23; lr: 0.00010; 10055/9432 tok/s;   6312 sec
[2021-04-23 02:38:30,661 INFO] Step 30350/50000; acc:  94.11; ppl:  1.27; xent: 0.24; lr: 0.00010; 9901/9387 tok/s;   6322 sec
[2021-04-23 02:38:40,918 INFO] Step 30400/50000; acc:  94.31; ppl:  1.25; xent: 0.23; lr: 0.00010; 10125/9494 tok/s;   6332 sec
[2021-04-23 02:38:50,796 INFO] Step 30450/50000; acc:  94.18; ppl:  1.26; xent: 0.23; lr: 0.00010; 10058/9579 tok/s;   6342 sec
[2021-04-23 02:39:01,024 INFO] Step 30500/50000; acc:  94.34; ppl:  1.25; xent: 0.22; lr: 0.00010; 10228/9522 tok/s;   6352 sec
[2021-04-23 02:39:10,920 INFO] Step 30550/50000; acc:  94.11; ppl:  1.27; xent: 0.24; lr: 0.00010; 10101/9512 tok/s;   6362 sec
[2021-04-23 02:39:20,964 INFO] Step 30600/50000; acc:  94.20; ppl:  1.26; xent: 0.23; lr: 0.00010; 10087/9526 tok/s;   6372 sec
[2021-04-23 02:39:25,898 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 02:39:31,093 INFO] Step 30650/50000; acc:  94.18; ppl:  1.26; xent: 0.23; lr: 0.00010; 10121/9547 tok/s;   6382 sec
[2021-04-23 02:39:41,192 INFO] Step 30700/50000; acc:  94.23; ppl:  1.27; xent: 0.24; lr: 0.00010; 10012/9388 tok/s;   6392 sec
[2021-04-23 02:39:51,151 INFO] Step 30750/50000; acc:  94.13; ppl:  1.26; xent: 0.23; lr: 0.00010; 10293/9758 tok/s;   6402 sec
[2021-04-23 02:40:01,279 INFO] Step 30800/50000; acc:  94.08; ppl:  1.27; xent: 0.24; lr: 0.00010; 9921/9420 tok/s;   6412 sec
[2021-04-23 02:40:11,548 INFO] Step 30850/50000; acc:  94.21; ppl:  1.26; xent: 0.23; lr: 0.00010; 10080/9499 tok/s;   6423 sec
[2021-04-23 02:40:21,362 INFO] Step 30900/50000; acc:  93.98; ppl:  1.27; xent: 0.24; lr: 0.00010; 10021/9520 tok/s;   6433 sec
[2021-04-23 02:40:31,705 INFO] Step 30950/50000; acc:  94.44; ppl:  1.25; xent: 0.22; lr: 0.00010; 10051/9478 tok/s;   6443 sec
[2021-04-23 02:40:41,791 INFO] Step 31000/50000; acc:  94.30; ppl:  1.26; xent: 0.23; lr: 0.00010; 10146/9553 tok/s;   6453 sec
[2021-04-23 02:40:51,843 INFO] Step 31050/50000; acc:  94.07; ppl:  1.27; xent: 0.24; lr: 0.00010; 9984/9427 tok/s;   6463 sec
[2021-04-23 02:41:02,033 INFO] Step 31100/50000; acc:  94.33; ppl:  1.25; xent: 0.23; lr: 0.00010; 10159/9551 tok/s;   6473 sec
[2021-04-23 02:41:12,096 INFO] Step 31150/50000; acc:  94.16; ppl:  1.26; xent: 0.23; lr: 0.00010; 9935/9420 tok/s;   6483 sec
[2021-04-23 02:41:22,127 INFO] Step 31200/50000; acc:  94.40; ppl:  1.25; xent: 0.23; lr: 0.00010; 10311/9695 tok/s;   6493 sec
[2021-04-23 02:41:32,105 INFO] Step 31250/50000; acc:  94.26; ppl:  1.26; xent: 0.23; lr: 0.00010; 10005/9429 tok/s;   6503 sec
[2021-04-23 02:41:42,273 INFO] Step 31300/50000; acc:  94.33; ppl:  1.25; xent: 0.23; lr: 0.00010; 10184/9513 tok/s;   6513 sec
[2021-04-23 02:41:52,110 INFO] Step 31350/50000; acc:  94.13; ppl:  1.26; xent: 0.23; lr: 0.00010; 10204/9645 tok/s;   6523 sec
[2021-04-23 02:41:54,220 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 02:42:02,246 INFO] Step 31400/50000; acc:  94.33; ppl:  1.26; xent: 0.23; lr: 0.00010; 10071/9517 tok/s;   6533 sec
[2021-04-23 02:42:12,286 INFO] Step 31450/50000; acc:  94.32; ppl:  1.26; xent: 0.23; lr: 0.00010; 10164/9552 tok/s;   6543 sec
[2021-04-23 02:42:22,166 INFO] Step 31500/50000; acc:  94.10; ppl:  1.27; xent: 0.24; lr: 0.00010; 10124/9590 tok/s;   6553 sec
[2021-04-23 02:42:32,284 INFO] Step 31550/50000; acc:  94.26; ppl:  1.26; xent: 0.23; lr: 0.00010; 10162/9609 tok/s;   6563 sec
[2021-04-23 02:42:42,345 INFO] Step 31600/50000; acc:  94.14; ppl:  1.26; xent: 0.23; lr: 0.00010; 10009/9501 tok/s;   6574 sec
[2021-04-23 02:42:52,526 INFO] Step 31650/50000; acc:  94.35; ppl:  1.26; xent: 0.23; lr: 0.00010; 10155/9567 tok/s;   6584 sec
[2021-04-23 02:43:02,655 INFO] Step 31700/50000; acc:  94.44; ppl:  1.25; xent: 0.22; lr: 0.00010; 9964/9432 tok/s;   6594 sec
[2021-04-23 02:43:12,809 INFO] Step 31750/50000; acc:  94.23; ppl:  1.26; xent: 0.23; lr: 0.00010; 10095/9496 tok/s;   6604 sec
[2021-04-23 02:43:22,979 INFO] Step 31800/50000; acc:  94.18; ppl:  1.26; xent: 0.23; lr: 0.00010; 10004/9425 tok/s;   6614 sec
[2021-04-23 02:43:33,142 INFO] Step 31850/50000; acc:  94.32; ppl:  1.25; xent: 0.23; lr: 0.00010; 9909/9375 tok/s;   6624 sec
[2021-04-23 02:43:43,222 INFO] Step 31900/50000; acc:  94.34; ppl:  1.26; xent: 0.23; lr: 0.00010; 10215/9661 tok/s;   6634 sec
[2021-04-23 02:43:53,122 INFO] Step 31950/50000; acc:  94.40; ppl:  1.25; xent: 0.22; lr: 0.00010; 10166/9579 tok/s;   6644 sec
[2021-04-23 02:44:03,335 INFO] Step 32000/50000; acc:  94.49; ppl:  1.25; xent: 0.22; lr: 0.00010; 10114/9439 tok/s;   6654 sec
[2021-04-23 02:44:13,170 INFO] Step 32050/50000; acc:  94.14; ppl:  1.26; xent: 0.23; lr: 0.00010; 10056/9512 tok/s;   6664 sec
[2021-04-23 02:44:16,190 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 02:44:23,454 INFO] Step 32100/50000; acc:  94.35; ppl:  1.25; xent: 0.22; lr: 0.00010; 10179/9524 tok/s;   6675 sec
[2021-04-23 02:44:33,395 INFO] Step 32150/50000; acc:  94.27; ppl:  1.26; xent: 0.23; lr: 0.00010; 10095/9565 tok/s;   6685 sec
[2021-04-23 02:44:43,302 INFO] Step 32200/50000; acc:  94.39; ppl:  1.25; xent: 0.23; lr: 0.00010; 10202/9624 tok/s;   6694 sec
[2021-04-23 02:44:53,333 INFO] Step 32250/50000; acc:  94.19; ppl:  1.26; xent: 0.23; lr: 0.00010; 10163/9631 tok/s;   6704 sec
[2021-04-23 02:45:03,354 INFO] Step 32300/50000; acc:  94.23; ppl:  1.26; xent: 0.23; lr: 0.00010; 10018/9540 tok/s;   6715 sec
[2021-04-23 02:45:13,480 INFO] Step 32350/50000; acc:  94.30; ppl:  1.26; xent: 0.23; lr: 0.00010; 10108/9478 tok/s;   6725 sec
[2021-04-23 02:45:23,580 INFO] Step 32400/50000; acc:  94.36; ppl:  1.26; xent: 0.23; lr: 0.00010; 10089/9621 tok/s;   6735 sec
[2021-04-23 02:45:33,852 INFO] Step 32450/50000; acc:  94.46; ppl:  1.25; xent: 0.22; lr: 0.00010; 10242/9556 tok/s;   6745 sec
[2021-04-23 02:45:43,780 INFO] Step 32500/50000; acc:  94.08; ppl:  1.26; xent: 0.23; lr: 0.00010; 9949/9446 tok/s;   6755 sec
[2021-04-23 02:45:53,948 INFO] Step 32550/50000; acc:  94.41; ppl:  1.25; xent: 0.22; lr: 0.00010; 10109/9562 tok/s;   6765 sec
[2021-04-23 02:46:04,038 INFO] Step 32600/50000; acc:  94.40; ppl:  1.25; xent: 0.22; lr: 0.00010; 10052/9450 tok/s;   6775 sec
[2021-04-23 02:46:13,979 INFO] Step 32650/50000; acc:  94.35; ppl:  1.25; xent: 0.22; lr: 0.00010; 10126/9591 tok/s;   6785 sec
[2021-04-23 02:46:24,194 INFO] Step 32700/50000; acc:  94.49; ppl:  1.24; xent: 0.22; lr: 0.00010; 10175/9516 tok/s;   6795 sec
[2021-04-23 02:46:34,084 INFO] Step 32750/50000; acc:  94.34; ppl:  1.25; xent: 0.23; lr: 0.00010; 10037/9438 tok/s;   6805 sec
[2021-04-23 02:46:44,290 INFO] Step 32800/50000; acc:  94.25; ppl:  1.26; xent: 0.23; lr: 0.00010; 10143/9531 tok/s;   6815 sec
[2021-04-23 02:46:44,300 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 02:46:54,242 INFO] Step 32850/50000; acc:  94.26; ppl:  1.26; xent: 0.23; lr: 0.00010; 9979/9438 tok/s;   6825 sec
[2021-04-23 02:47:04,565 INFO] Step 32900/50000; acc:  94.40; ppl:  1.25; xent: 0.22; lr: 0.00010; 10130/9528 tok/s;   6836 sec
[2021-04-23 02:47:14,349 INFO] Step 32950/50000; acc:  94.14; ppl:  1.26; xent: 0.23; lr: 0.00010; 10128/9685 tok/s;   6846 sec
[2021-04-23 02:47:24,559 INFO] Step 33000/50000; acc:  94.22; ppl:  1.26; xent: 0.23; lr: 0.00010; 9974/9357 tok/s;   6856 sec
[2021-04-23 02:47:34,601 INFO] Step 33050/50000; acc:  94.43; ppl:  1.25; xent: 0.22; lr: 0.00010; 10172/9657 tok/s;   6866 sec
[2021-04-23 02:47:44,614 INFO] Step 33100/50000; acc:  94.29; ppl:  1.26; xent: 0.23; lr: 0.00010; 10003/9470 tok/s;   6876 sec
[2021-04-23 02:47:54,948 INFO] Step 33150/50000; acc:  94.47; ppl:  1.25; xent: 0.22; lr: 0.00010; 10094/9486 tok/s;   6886 sec
[2021-04-23 02:48:05,053 INFO] Step 33200/50000; acc:  94.19; ppl:  1.26; xent: 0.23; lr: 0.00010; 10077/9500 tok/s;   6896 sec
[2021-04-23 02:48:15,408 INFO] Step 33250/50000; acc:  94.38; ppl:  1.25; xent: 0.22; lr: 0.00010; 9959/9357 tok/s;   6907 sec
[2021-04-23 02:48:25,390 INFO] Step 33300/50000; acc:  94.31; ppl:  1.25; xent: 0.22; lr: 0.00010; 9969/9471 tok/s;   6917 sec
[2021-04-23 02:48:35,622 INFO] Step 33350/50000; acc:  94.44; ppl:  1.25; xent: 0.22; lr: 0.00010; 10025/9453 tok/s;   6927 sec
[2021-04-23 02:48:45,608 INFO] Step 33400/50000; acc:  94.45; ppl:  1.25; xent: 0.22; lr: 0.00010; 10226/9590 tok/s;   6937 sec
[2021-04-23 02:48:55,554 INFO] Step 33450/50000; acc:  94.41; ppl:  1.25; xent: 0.22; lr: 0.00010; 10090/9524 tok/s;   6947 sec
[2021-04-23 02:49:05,708 INFO] Step 33500/50000; acc:  94.34; ppl:  1.25; xent: 0.22; lr: 0.00010; 10093/9448 tok/s;   6957 sec
[2021-04-23 02:49:12,965 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 02:49:15,709 INFO] Step 33550/50000; acc:  94.30; ppl:  1.25; xent: 0.22; lr: 0.00010; 10039/9480 tok/s;   6967 sec
[2021-04-23 02:49:25,887 INFO] Step 33600/50000; acc:  94.50; ppl:  1.25; xent: 0.22; lr: 0.00010; 10173/9568 tok/s;   6977 sec
[2021-04-23 02:49:35,747 INFO] Step 33650/50000; acc:  94.29; ppl:  1.25; xent: 0.23; lr: 0.00010; 10051/9525 tok/s;   6987 sec
[2021-04-23 02:49:45,892 INFO] Step 33700/50000; acc:  94.37; ppl:  1.25; xent: 0.22; lr: 0.00010; 10226/9702 tok/s;   6997 sec
[2021-04-23 02:49:55,893 INFO] Step 33750/50000; acc:  94.27; ppl:  1.26; xent: 0.23; lr: 0.00010; 9999/9436 tok/s;   7007 sec
[2021-04-23 02:50:05,930 INFO] Step 33800/50000; acc:  94.37; ppl:  1.25; xent: 0.22; lr: 0.00010; 10104/9605 tok/s;   7017 sec
[2021-04-23 02:50:16,165 INFO] Step 33850/50000; acc:  94.44; ppl:  1.25; xent: 0.22; lr: 0.00010; 10054/9492 tok/s;   7027 sec
[2021-04-23 02:50:26,275 INFO] Step 33900/50000; acc:  94.51; ppl:  1.24; xent: 0.22; lr: 0.00010; 10048/9491 tok/s;   7037 sec
[2021-04-23 02:50:36,567 INFO] Step 33950/50000; acc:  94.27; ppl:  1.25; xent: 0.22; lr: 0.00010; 10059/9437 tok/s;   7048 sec
[2021-04-23 02:50:46,701 INFO] Step 34000/50000; acc:  94.30; ppl:  1.25; xent: 0.22; lr: 0.00010; 9934/9412 tok/s;   7058 sec
[2021-04-23 02:50:56,980 INFO] Step 34050/50000; acc:  94.53; ppl:  1.24; xent: 0.22; lr: 0.00010; 10101/9464 tok/s;   7068 sec
[2021-04-23 02:51:06,779 INFO] Step 34100/50000; acc:  94.43; ppl:  1.25; xent: 0.22; lr: 0.00010; 10126/9617 tok/s;   7078 sec
[2021-04-23 02:51:16,883 INFO] Step 34150/50000; acc:  94.43; ppl:  1.24; xent: 0.22; lr: 0.00010; 10181/9547 tok/s;   7088 sec
[2021-04-23 02:51:26,820 INFO] Step 34200/50000; acc:  94.37; ppl:  1.25; xent: 0.22; lr: 0.00010; 10191/9512 tok/s;   7098 sec
[2021-04-23 02:51:36,789 INFO] Step 34250/50000; acc:  94.24; ppl:  1.25; xent: 0.22; lr: 0.00010; 10023/9524 tok/s;   7108 sec
[2021-04-23 02:51:41,364 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 02:51:47,018 INFO] Step 34300/50000; acc:  94.41; ppl:  1.25; xent: 0.22; lr: 0.00010; 10127/9506 tok/s;   7118 sec
[2021-04-23 02:51:56,952 INFO] Step 34350/50000; acc:  94.49; ppl:  1.25; xent: 0.22; lr: 0.00010; 10113/9518 tok/s;   7128 sec
[2021-04-23 02:52:06,991 INFO] Step 34400/50000; acc:  94.39; ppl:  1.25; xent: 0.22; lr: 0.00010; 10259/9732 tok/s;   7138 sec
[2021-04-23 02:52:17,020 INFO] Step 34450/50000; acc:  94.25; ppl:  1.25; xent: 0.23; lr: 0.00010; 9850/9385 tok/s;   7148 sec
[2021-04-23 02:52:27,270 INFO] Step 34500/50000; acc:  94.53; ppl:  1.24; xent: 0.22; lr: 0.00010; 10142/9573 tok/s;   7158 sec
[2021-04-23 02:52:37,199 INFO] Step 34550/50000; acc:  94.36; ppl:  1.25; xent: 0.22; lr: 0.00010; 10037/9488 tok/s;   7168 sec
[2021-04-23 02:52:47,500 INFO] Step 34600/50000; acc:  94.63; ppl:  1.24; xent: 0.21; lr: 0.00010; 10018/9452 tok/s;   7179 sec
[2021-04-23 02:52:57,680 INFO] Step 34650/50000; acc:  94.52; ppl:  1.24; xent: 0.22; lr: 0.00010; 10112/9502 tok/s;   7189 sec
[2021-04-23 02:53:07,834 INFO] Step 34700/50000; acc:  94.31; ppl:  1.25; xent: 0.22; lr: 0.00010; 9904/9371 tok/s;   7199 sec
[2021-04-23 02:53:18,104 INFO] Step 34750/50000; acc:  94.55; ppl:  1.24; xent: 0.21; lr: 0.00010; 10064/9477 tok/s;   7209 sec
[2021-04-23 02:53:28,244 INFO] Step 34800/50000; acc:  94.44; ppl:  1.24; xent: 0.22; lr: 0.00010; 9955/9404 tok/s;   7219 sec
[2021-04-23 02:53:38,371 INFO] Step 34850/50000; acc:  94.63; ppl:  1.24; xent: 0.21; lr: 0.00010; 10243/9617 tok/s;   7230 sec
[2021-04-23 02:53:48,274 INFO] Step 34900/50000; acc:  94.45; ppl:  1.24; xent: 0.22; lr: 0.00010; 10028/9467 tok/s;   7239 sec
[2021-04-23 02:53:58,347 INFO] Step 34950/50000; acc:  94.47; ppl:  1.24; xent: 0.22; lr: 0.00010; 10141/9495 tok/s;   7250 sec
[2021-04-23 02:54:08,344 INFO] Step 35000/50000; acc:  94.42; ppl:  1.25; xent: 0.22; lr: 0.00010; 10171/9542 tok/s;   7260 sec
[2021-04-23 02:54:08,347 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/abstract_methods/valid_fixed.txt, align=None)...
[2021-04-23 02:54:17,987 INFO] Validation perplexity: 1.33217
[2021-04-23 02:54:17,987 INFO] Validation accuracy: 93.4208
[2021-04-23 02:54:17,989 INFO] Saving checkpoint ../models/default_params/control/model_step_35000.pt
[2021-04-23 02:54:20,294 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 02:54:28,674 INFO] Step 35050/50000; acc:  94.48; ppl:  1.24; xent: 0.22; lr: 0.00010; 4948/4709 tok/s;   7280 sec
[2021-04-23 02:54:38,778 INFO] Step 35100/50000; acc:  94.49; ppl:  1.25; xent: 0.22; lr: 0.00010; 10195/9563 tok/s;   7290 sec
[2021-04-23 02:54:48,630 INFO] Step 35150/50000; acc:  94.26; ppl:  1.25; xent: 0.23; lr: 0.00010; 10089/9585 tok/s;   7300 sec
[2021-04-23 02:54:58,800 INFO] Step 35200/50000; acc:  94.49; ppl:  1.24; xent: 0.22; lr: 0.00010; 10168/9610 tok/s;   7310 sec
[2021-04-23 02:55:08,674 INFO] Step 35250/50000; acc:  94.35; ppl:  1.25; xent: 0.22; lr: 0.00010; 10008/9513 tok/s;   7320 sec
[2021-04-23 02:55:18,946 INFO] Step 35300/50000; acc:  94.57; ppl:  1.24; xent: 0.22; lr: 0.00010; 10139/9554 tok/s;   7330 sec
[2021-04-23 02:55:29,147 INFO] Step 35350/50000; acc:  94.65; ppl:  1.23; xent: 0.21; lr: 0.00010; 10010/9440 tok/s;   7340 sec
[2021-04-23 02:55:39,223 INFO] Step 35400/50000; acc:  94.47; ppl:  1.24; xent: 0.22; lr: 0.00010; 10096/9497 tok/s;   7350 sec
[2021-04-23 02:55:49,388 INFO] Step 35450/50000; acc:  94.39; ppl:  1.25; xent: 0.22; lr: 0.00010; 10074/9490 tok/s;   7361 sec
[2021-04-23 02:55:59,598 INFO] Step 35500/50000; acc:  94.49; ppl:  1.24; xent: 0.21; lr: 0.00010; 9880/9368 tok/s;   7371 sec
[2021-04-23 02:56:09,605 INFO] Step 35550/50000; acc:  94.53; ppl:  1.24; xent: 0.21; lr: 0.00010; 10255/9703 tok/s;   7381 sec
[2021-04-23 02:56:19,616 INFO] Step 35600/50000; acc:  94.55; ppl:  1.24; xent: 0.21; lr: 0.00010; 10187/9538 tok/s;   7391 sec
[2021-04-23 02:56:29,890 INFO] Step 35650/50000; acc:  94.60; ppl:  1.23; xent: 0.21; lr: 0.00010; 10043/9385 tok/s;   7401 sec
[2021-04-23 02:56:39,699 INFO] Step 35700/50000; acc:  94.36; ppl:  1.25; xent: 0.22; lr: 0.00010; 10049/9529 tok/s;   7411 sec
[2021-04-23 02:56:42,243 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 02:56:49,892 INFO] Step 35750/50000; acc:  94.51; ppl:  1.24; xent: 0.22; lr: 0.00010; 10115/9502 tok/s;   7421 sec
[2021-04-23 02:56:59,971 INFO] Step 35800/50000; acc:  94.55; ppl:  1.24; xent: 0.22; lr: 0.00010; 10089/9520 tok/s;   7431 sec
[2021-04-23 02:57:09,869 INFO] Step 35850/50000; acc:  94.44; ppl:  1.24; xent: 0.22; lr: 0.00010; 10073/9539 tok/s;   7441 sec
[2021-04-23 02:57:20,000 INFO] Step 35900/50000; acc:  94.48; ppl:  1.24; xent: 0.22; lr: 0.00010; 10168/9622 tok/s;   7451 sec
[2021-04-23 02:57:29,921 INFO] Step 35950/50000; acc:  94.38; ppl:  1.24; xent: 0.22; lr: 0.00010; 10056/9530 tok/s;   7461 sec
[2021-04-23 02:57:40,092 INFO] Step 36000/50000; acc:  94.52; ppl:  1.24; xent: 0.22; lr: 0.00010; 10097/9514 tok/s;   7471 sec
[2021-04-23 02:57:50,155 INFO] Step 36050/50000; acc:  94.53; ppl:  1.24; xent: 0.22; lr: 0.00010; 9970/9506 tok/s;   7481 sec
[2021-04-23 02:58:00,474 INFO] Step 36100/50000; acc:  94.70; ppl:  1.23; xent: 0.21; lr: 0.00010; 10234/9576 tok/s;   7492 sec
[2021-04-23 02:58:10,490 INFO] Step 36150/50000; acc:  94.29; ppl:  1.25; xent: 0.22; lr: 0.00010; 9986/9440 tok/s;   7502 sec
[2021-04-23 02:58:20,548 INFO] Step 36200/50000; acc:  94.60; ppl:  1.24; xent: 0.21; lr: 0.00010; 10138/9587 tok/s;   7512 sec
[2021-04-23 02:58:30,706 INFO] Step 36250/50000; acc:  94.53; ppl:  1.24; xent: 0.21; lr: 0.00010; 10076/9491 tok/s;   7522 sec
[2021-04-23 02:58:40,691 INFO] Step 36300/50000; acc:  94.58; ppl:  1.24; xent: 0.21; lr: 0.00010; 10088/9545 tok/s;   7532 sec
[2021-04-23 02:58:50,904 INFO] Step 36350/50000; acc:  94.67; ppl:  1.23; xent: 0.21; lr: 0.00010; 10161/9520 tok/s;   7542 sec
[2021-04-23 02:59:00,876 INFO] Step 36400/50000; acc:  94.61; ppl:  1.24; xent: 0.21; lr: 0.00010; 10049/9430 tok/s;   7552 sec
[2021-04-23 02:59:10,587 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 02:59:11,104 INFO] Step 36450/50000; acc:  94.51; ppl:  1.24; xent: 0.22; lr: 0.00010; 10127/9504 tok/s;   7562 sec
[2021-04-23 02:59:20,998 INFO] Step 36500/50000; acc:  94.55; ppl:  1.24; xent: 0.21; lr: 0.00010; 10017/9455 tok/s;   7572 sec
[2021-04-23 02:59:31,240 INFO] Step 36550/50000; acc:  94.57; ppl:  1.24; xent: 0.22; lr: 0.00010; 10057/9473 tok/s;   7582 sec
[2021-04-23 02:59:41,079 INFO] Step 36600/50000; acc:  94.45; ppl:  1.24; xent: 0.22; lr: 0.00010; 10210/9761 tok/s;   7592 sec
[2021-04-23 02:59:51,186 INFO] Step 36650/50000; acc:  94.51; ppl:  1.24; xent: 0.22; lr: 0.00010; 9934/9335 tok/s;   7602 sec
[2021-04-23 03:00:01,338 INFO] Step 36700/50000; acc:  94.59; ppl:  1.24; xent: 0.21; lr: 0.00010; 10156/9614 tok/s;   7613 sec
[2021-04-23 03:00:11,338 INFO] Step 36750/50000; acc:  94.50; ppl:  1.24; xent: 0.22; lr: 0.00010; 9978/9478 tok/s;   7623 sec
[2021-04-23 03:00:21,628 INFO] Step 36800/50000; acc:  94.70; ppl:  1.23; xent: 0.21; lr: 0.00010; 10169/9566 tok/s;   7633 sec
[2021-04-23 03:00:31,672 INFO] Step 36850/50000; acc:  94.45; ppl:  1.24; xent: 0.21; lr: 0.00010; 9975/9436 tok/s;   7643 sec
[2021-04-23 03:00:42,058 INFO] Step 36900/50000; acc:  94.51; ppl:  1.24; xent: 0.21; lr: 0.00010; 9961/9359 tok/s;   7653 sec
[2021-04-23 03:00:52,172 INFO] Step 36950/50000; acc:  94.51; ppl:  1.24; xent: 0.21; lr: 0.00010; 9962/9435 tok/s;   7663 sec
[2021-04-23 03:01:02,237 INFO] Step 37000/50000; acc:  94.62; ppl:  1.23; xent: 0.21; lr: 0.00010; 10099/9527 tok/s;   7673 sec
[2021-04-23 03:01:12,338 INFO] Step 37050/50000; acc:  94.69; ppl:  1.23; xent: 0.21; lr: 0.00010; 10181/9522 tok/s;   7684 sec
[2021-04-23 03:01:22,321 INFO] Step 37100/50000; acc:  94.59; ppl:  1.23; xent: 0.21; lr: 0.00010; 10071/9486 tok/s;   7693 sec
[2021-04-23 03:01:32,473 INFO] Step 37150/50000; acc:  94.52; ppl:  1.24; xent: 0.21; lr: 0.00010; 10077/9463 tok/s;   7704 sec
[2021-04-23 03:01:39,329 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 03:01:42,530 INFO] Step 37200/50000; acc:  94.50; ppl:  1.24; xent: 0.21; lr: 0.00010; 10102/9529 tok/s;   7714 sec
[2021-04-23 03:01:52,760 INFO] Step 37250/50000; acc:  94.68; ppl:  1.23; xent: 0.21; lr: 0.00010; 10149/9546 tok/s;   7724 sec
[2021-04-23 03:02:02,471 INFO] Step 37300/50000; acc:  94.45; ppl:  1.24; xent: 0.22; lr: 0.00010; 10149/9621 tok/s;   7734 sec
[2021-04-23 03:02:12,625 INFO] Step 37350/50000; acc:  94.47; ppl:  1.24; xent: 0.22; lr: 0.00010; 10060/9574 tok/s;   7744 sec
[2021-04-23 03:02:22,698 INFO] Step 37400/50000; acc:  94.61; ppl:  1.24; xent: 0.21; lr: 0.00010; 10063/9470 tok/s;   7754 sec
[2021-04-23 03:02:32,647 INFO] Step 37450/50000; acc:  94.51; ppl:  1.24; xent: 0.21; lr: 0.00010; 10042/9561 tok/s;   7764 sec
[2021-04-23 03:02:42,940 INFO] Step 37500/50000; acc:  94.68; ppl:  1.23; xent: 0.21; lr: 0.00010; 10103/9520 tok/s;   7774 sec
[2021-04-23 03:02:53,058 INFO] Step 37550/50000; acc:  94.68; ppl:  1.23; xent: 0.21; lr: 0.00010; 10008/9460 tok/s;   7784 sec
[2021-04-23 03:03:03,367 INFO] Step 37600/50000; acc:  94.51; ppl:  1.24; xent: 0.21; lr: 0.00010; 10067/9478 tok/s;   7795 sec
[2021-04-23 03:03:13,315 INFO] Step 37650/50000; acc:  94.55; ppl:  1.24; xent: 0.21; lr: 0.00010; 9944/9392 tok/s;   7804 sec
[2021-04-23 03:03:23,633 INFO] Step 37700/50000; acc:  94.79; ppl:  1.23; xent: 0.20; lr: 0.00010; 10117/9517 tok/s;   7815 sec
[2021-04-23 03:03:33,443 INFO] Step 37750/50000; acc:  94.65; ppl:  1.23; xent: 0.21; lr: 0.00010; 10239/9666 tok/s;   7825 sec
[2021-04-23 03:03:43,467 INFO] Step 37800/50000; acc:  94.71; ppl:  1.23; xent: 0.20; lr: 0.00010; 10181/9566 tok/s;   7835 sec
[2021-04-23 03:03:53,465 INFO] Step 37850/50000; acc:  94.61; ppl:  1.23; xent: 0.21; lr: 0.00010; 10201/9534 tok/s;   7845 sec
[2021-04-23 03:04:03,467 INFO] Step 37900/50000; acc:  94.46; ppl:  1.24; xent: 0.21; lr: 0.00010; 9999/9487 tok/s;   7855 sec
[2021-04-23 03:04:07,612 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 03:04:13,726 INFO] Step 37950/50000; acc:  94.71; ppl:  1.23; xent: 0.21; lr: 0.00010; 10089/9471 tok/s;   7865 sec
[2021-04-23 03:04:23,775 INFO] Step 38000/50000; acc:  94.73; ppl:  1.23; xent: 0.21; lr: 0.00010; 10077/9493 tok/s;   7875 sec
[2021-04-23 03:04:33,854 INFO] Step 38050/50000; acc:  94.63; ppl:  1.23; xent: 0.21; lr: 0.00010; 10243/9671 tok/s;   7885 sec
[2021-04-23 03:04:43,807 INFO] Step 38100/50000; acc:  94.47; ppl:  1.24; xent: 0.21; lr: 0.00010; 9890/9449 tok/s;   7895 sec
[2021-04-23 03:04:53,914 INFO] Step 38150/50000; acc:  94.62; ppl:  1.23; xent: 0.21; lr: 0.00010; 10133/9568 tok/s;   7905 sec
[2021-04-23 03:05:03,947 INFO] Step 38200/50000; acc:  94.51; ppl:  1.24; xent: 0.21; lr: 0.00010; 10065/9531 tok/s;   7915 sec
[2021-04-23 03:05:14,059 INFO] Step 38250/50000; acc:  94.83; ppl:  1.22; xent: 0.20; lr: 0.00010; 10079/9501 tok/s;   7925 sec
[2021-04-23 03:05:24,217 INFO] Step 38300/50000; acc:  94.62; ppl:  1.23; xent: 0.20; lr: 0.00010; 10230/9568 tok/s;   7935 sec
[2021-04-23 03:05:34,299 INFO] Step 38350/50000; acc:  94.52; ppl:  1.24; xent: 0.21; lr: 0.00010; 9909/9439 tok/s;   7945 sec
[2021-04-23 03:05:44,577 INFO] Step 38400/50000; acc:  94.65; ppl:  1.23; xent: 0.21; lr: 0.00010; 10109/9481 tok/s;   7956 sec
[2021-04-23 03:05:54,500 INFO] Step 38450/50000; acc:  94.61; ppl:  1.23; xent: 0.21; lr: 0.00010; 9991/9485 tok/s;   7966 sec
[2021-04-23 03:06:04,607 INFO] Step 38500/50000; acc:  94.79; ppl:  1.23; xent: 0.20; lr: 0.00010; 10328/9701 tok/s;   7976 sec
[2021-04-23 03:06:14,624 INFO] Step 38550/50000; acc:  94.69; ppl:  1.23; xent: 0.21; lr: 0.00010; 10020/9414 tok/s;   7986 sec
[2021-04-23 03:06:24,579 INFO] Step 38600/50000; acc:  94.61; ppl:  1.23; xent: 0.21; lr: 0.00010; 10188/9571 tok/s;   7996 sec
[2021-04-23 03:06:34,698 INFO] Step 38650/50000; acc:  94.67; ppl:  1.23; xent: 0.21; lr: 0.00010; 10119/9507 tok/s;   8006 sec
[2021-04-23 03:06:35,927 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 03:06:44,770 INFO] Step 38700/50000; acc:  94.61; ppl:  1.24; xent: 0.21; lr: 0.00010; 10014/9513 tok/s;   8016 sec
[2021-04-23 03:06:54,790 INFO] Step 38750/50000; acc:  94.81; ppl:  1.23; xent: 0.20; lr: 0.00010; 10245/9584 tok/s;   8026 sec
[2021-04-23 03:07:04,731 INFO] Step 38800/50000; acc:  94.51; ppl:  1.24; xent: 0.21; lr: 0.00010; 10115/9644 tok/s;   8036 sec
[2021-04-23 03:07:14,980 INFO] Step 38850/50000; acc:  94.65; ppl:  1.23; xent: 0.21; lr: 0.00010; 10094/9522 tok/s;   8046 sec
[2021-04-23 03:07:24,774 INFO] Step 38900/50000; acc:  94.59; ppl:  1.23; xent: 0.21; lr: 0.00010; 10058/9549 tok/s;   8056 sec
[2021-04-23 03:07:34,971 INFO] Step 38950/50000; acc:  94.67; ppl:  1.23; xent: 0.21; lr: 0.00010; 10075/9532 tok/s;   8066 sec
[2021-04-23 03:07:45,218 INFO] Step 39000/50000; acc:  94.91; ppl:  1.22; xent: 0.20; lr: 0.00010; 10083/9478 tok/s;   8076 sec
[2021-04-23 03:07:55,233 INFO] Step 39050/50000; acc:  94.54; ppl:  1.23; xent: 0.21; lr: 0.00010; 10024/9475 tok/s;   8086 sec
[2021-04-23 03:08:05,412 INFO] Step 39100/50000; acc:  94.70; ppl:  1.23; xent: 0.21; lr: 0.00010; 10149/9530 tok/s;   8097 sec
[2021-04-23 03:08:15,523 INFO] Step 39150/50000; acc:  94.64; ppl:  1.23; xent: 0.20; lr: 0.00010; 9915/9403 tok/s;   8107 sec
[2021-04-23 03:08:25,641 INFO] Step 39200/50000; acc:  94.81; ppl:  1.23; xent: 0.20; lr: 0.00010; 10194/9648 tok/s;   8117 sec
[2021-04-23 03:08:35,521 INFO] Step 39250/50000; acc:  94.78; ppl:  1.22; xent: 0.20; lr: 0.00010; 10142/9506 tok/s;   8127 sec
[2021-04-23 03:08:45,844 INFO] Step 39300/50000; acc:  94.86; ppl:  1.22; xent: 0.20; lr: 0.00010; 10068/9416 tok/s;   8137 sec
[2021-04-23 03:08:55,770 INFO] Step 39350/50000; acc:  94.55; ppl:  1.23; xent: 0.21; lr: 0.00010; 10043/9476 tok/s;   8147 sec
[2021-04-23 03:08:57,864 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 03:09:05,825 INFO] Step 39400/50000; acc:  94.69; ppl:  1.23; xent: 0.21; lr: 0.00010; 10162/9582 tok/s;   8157 sec
[2021-04-23 03:09:15,926 INFO] Step 39450/50000; acc:  94.81; ppl:  1.23; xent: 0.20; lr: 0.00010; 10151/9552 tok/s;   8167 sec
[2021-04-23 03:09:25,825 INFO] Step 39500/50000; acc:  94.60; ppl:  1.23; xent: 0.21; lr: 0.00010; 10066/9555 tok/s;   8177 sec
[2021-04-23 03:09:35,984 INFO] Step 39550/50000; acc:  94.66; ppl:  1.23; xent: 0.21; lr: 0.00010; 10135/9554 tok/s;   8187 sec
[2021-04-23 03:09:46,014 INFO] Step 39600/50000; acc:  94.63; ppl:  1.23; xent: 0.21; lr: 0.00010; 10054/9571 tok/s;   8197 sec
[2021-04-23 03:09:56,170 INFO] Step 39650/50000; acc:  94.83; ppl:  1.22; xent: 0.20; lr: 0.00010; 10129/9516 tok/s;   8207 sec
[2021-04-23 03:10:06,203 INFO] Step 39700/50000; acc:  94.72; ppl:  1.23; xent: 0.21; lr: 0.00010; 9970/9520 tok/s;   8217 sec
[2021-04-23 03:10:16,359 INFO] Step 39750/50000; acc:  94.79; ppl:  1.22; xent: 0.20; lr: 0.00010; 10217/9590 tok/s;   8228 sec
[2021-04-23 03:10:26,500 INFO] Step 39800/50000; acc:  94.59; ppl:  1.23; xent: 0.21; lr: 0.00010; 10017/9413 tok/s;   8238 sec
[2021-04-23 03:10:36,483 INFO] Step 39850/50000; acc:  94.79; ppl:  1.22; xent: 0.20; lr: 0.00010; 10056/9521 tok/s;   8248 sec
[2021-04-23 03:10:46,721 INFO] Step 39900/50000; acc:  94.81; ppl:  1.22; xent: 0.20; lr: 0.00010; 10107/9500 tok/s;   8258 sec
[2021-04-23 03:10:56,654 INFO] Step 39950/50000; acc:  94.75; ppl:  1.22; xent: 0.20; lr: 0.00010; 10096/9557 tok/s;   8268 sec
[2021-04-23 03:11:06,886 INFO] Step 40000/50000; acc:  94.96; ppl:  1.21; xent: 0.19; lr: 0.00010; 10191/9540 tok/s;   8278 sec
[2021-04-23 03:11:06,889 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/abstract_methods/valid_fixed.txt, align=None)...
[2021-04-23 03:11:16,510 INFO] Validation perplexity: 1.33545
[2021-04-23 03:11:16,510 INFO] Validation accuracy: 93.3873
[2021-04-23 03:11:16,512 INFO] Saving checkpoint ../models/default_params/control/model_step_40000.pt
[2021-04-23 03:11:27,030 INFO] Step 40050/50000; acc:  94.61; ppl:  1.23; xent: 0.21; lr: 0.00010; 4878/4599 tok/s;   8298 sec
[2021-04-23 03:11:36,415 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 03:11:37,232 INFO] Step 40100/50000; acc:  94.70; ppl:  1.23; xent: 0.20; lr: 0.00010; 10225/9609 tok/s;   8308 sec
[2021-04-23 03:11:47,236 INFO] Step 40150/50000; acc:  94.65; ppl:  1.23; xent: 0.21; lr: 0.00010; 10009/9445 tok/s;   8318 sec
[2021-04-23 03:11:57,338 INFO] Step 40200/50000; acc:  94.86; ppl:  1.23; xent: 0.20; lr: 0.00010; 10107/9490 tok/s;   8329 sec
[2021-04-23 03:12:07,290 INFO] Step 40250/50000; acc:  94.69; ppl:  1.23; xent: 0.21; lr: 0.00010; 10192/9742 tok/s;   8338 sec
[2021-04-23 03:12:17,415 INFO] Step 40300/50000; acc:  94.70; ppl:  1.23; xent: 0.21; lr: 0.00010; 9917/9379 tok/s;   8349 sec
[2021-04-23 03:12:27,483 INFO] Step 40350/50000; acc:  94.77; ppl:  1.22; xent: 0.20; lr: 0.00010; 10214/9604 tok/s;   8359 sec
[2021-04-23 03:12:37,563 INFO] Step 40400/50000; acc:  94.70; ppl:  1.23; xent: 0.20; lr: 0.00010; 10031/9541 tok/s;   8369 sec
[2021-04-23 03:12:47,900 INFO] Step 40450/50000; acc:  94.85; ppl:  1.22; xent: 0.20; lr: 0.00010; 10119/9507 tok/s;   8379 sec
[2021-04-23 03:12:57,922 INFO] Step 40500/50000; acc:  94.67; ppl:  1.22; xent: 0.20; lr: 0.00010; 9960/9432 tok/s;   8389 sec
[2021-04-23 03:13:08,153 INFO] Step 40550/50000; acc:  94.67; ppl:  1.23; xent: 0.21; lr: 0.00010; 9966/9388 tok/s;   8399 sec
[2021-04-23 03:13:18,373 INFO] Step 40600/50000; acc:  94.87; ppl:  1.22; xent: 0.20; lr: 0.00010; 9989/9405 tok/s;   8410 sec
[2021-04-23 03:13:28,368 INFO] Step 40650/50000; acc:  94.78; ppl:  1.22; xent: 0.20; lr: 0.00010; 10034/9497 tok/s;   8420 sec
[2021-04-23 03:13:38,504 INFO] Step 40700/50000; acc:  94.91; ppl:  1.21; xent: 0.19; lr: 0.00010; 10234/9564 tok/s;   8430 sec
[2021-04-23 03:13:48,437 INFO] Step 40750/50000; acc:  94.72; ppl:  1.22; xent: 0.20; lr: 0.00010; 10061/9473 tok/s;   8440 sec
[2021-04-23 03:13:58,592 INFO] Step 40800/50000; acc:  94.72; ppl:  1.23; xent: 0.20; lr: 0.00010; 10120/9529 tok/s;   8450 sec
[2021-04-23 03:14:05,005 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 03:14:08,437 INFO] Step 40850/50000; acc:  94.76; ppl:  1.22; xent: 0.20; lr: 0.00010; 10137/9572 tok/s;   8460 sec
[2021-04-23 03:14:18,793 INFO] Step 40900/50000; acc:  94.87; ppl:  1.22; xent: 0.20; lr: 0.00010; 10107/9509 tok/s;   8470 sec
[2021-04-23 03:14:28,574 INFO] Step 40950/50000; acc:  94.74; ppl:  1.22; xent: 0.20; lr: 0.00010; 10182/9633 tok/s;   8480 sec
[2021-04-23 03:14:38,633 INFO] Step 41000/50000; acc:  94.74; ppl:  1.22; xent: 0.20; lr: 0.00010; 10069/9562 tok/s;   8490 sec
[2021-04-23 03:14:48,809 INFO] Step 41050/50000; acc:  94.75; ppl:  1.22; xent: 0.20; lr: 0.00010; 10048/9510 tok/s;   8500 sec
[2021-04-23 03:14:58,782 INFO] Step 41100/50000; acc:  94.69; ppl:  1.23; xent: 0.21; lr: 0.00010; 10030/9494 tok/s;   8510 sec
[2021-04-23 03:15:09,036 INFO] Step 41150/50000; acc:  94.93; ppl:  1.22; xent: 0.20; lr: 0.00010; 10110/9519 tok/s;   8520 sec
[2021-04-23 03:15:19,184 INFO] Step 41200/50000; acc:  94.87; ppl:  1.22; xent: 0.20; lr: 0.00010; 10103/9535 tok/s;   8530 sec
[2021-04-23 03:15:29,565 INFO] Step 41250/50000; acc:  94.79; ppl:  1.22; xent: 0.20; lr: 0.00010; 9990/9377 tok/s;   8541 sec
[2021-04-23 03:15:39,517 INFO] Step 41300/50000; acc:  94.63; ppl:  1.22; xent: 0.20; lr: 0.00010; 9920/9408 tok/s;   8551 sec
[2021-04-23 03:15:49,750 INFO] Step 41350/50000; acc:  94.92; ppl:  1.21; xent: 0.19; lr: 0.00010; 10047/9473 tok/s;   8561 sec
[2021-04-23 03:15:59,670 INFO] Step 41400/50000; acc:  94.86; ppl:  1.22; xent: 0.20; lr: 0.00010; 10266/9661 tok/s;   8571 sec
[2021-04-23 03:16:09,622 INFO] Step 41450/50000; acc:  94.88; ppl:  1.22; xent: 0.20; lr: 0.00010; 10104/9533 tok/s;   8581 sec
[2021-04-23 03:16:19,730 INFO] Step 41500/50000; acc:  94.78; ppl:  1.22; xent: 0.20; lr: 0.00010; 10175/9469 tok/s;   8591 sec
[2021-04-23 03:16:29,702 INFO] Step 41550/50000; acc:  94.77; ppl:  1.22; xent: 0.20; lr: 0.00010; 9995/9528 tok/s;   8601 sec
[2021-04-23 03:16:33,456 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 03:16:39,958 INFO] Step 41600/50000; acc:  94.88; ppl:  1.22; xent: 0.20; lr: 0.00010; 10104/9473 tok/s;   8611 sec
[2021-04-23 03:16:49,855 INFO] Step 41650/50000; acc:  94.90; ppl:  1.22; xent: 0.20; lr: 0.00010; 10073/9506 tok/s;   8621 sec
[2021-04-23 03:16:59,997 INFO] Step 41700/50000; acc:  94.77; ppl:  1.22; xent: 0.20; lr: 0.00010; 10237/9680 tok/s;   8631 sec
[2021-04-23 03:17:09,966 INFO] Step 41750/50000; acc:  94.62; ppl:  1.23; xent: 0.20; lr: 0.00010; 9998/9510 tok/s;   8641 sec
[2021-04-23 03:17:20,053 INFO] Step 41800/50000; acc:  94.83; ppl:  1.22; xent: 0.20; lr: 0.00010; 10076/9531 tok/s;   8651 sec
[2021-04-23 03:17:30,128 INFO] Step 41850/50000; acc:  94.76; ppl:  1.22; xent: 0.20; lr: 0.00010; 10101/9542 tok/s;   8661 sec
[2021-04-23 03:17:40,318 INFO] Step 41900/50000; acc:  94.93; ppl:  1.22; xent: 0.20; lr: 0.00010; 10025/9475 tok/s;   8671 sec
[2021-04-23 03:17:50,432 INFO] Step 41950/50000; acc:  94.92; ppl:  1.21; xent: 0.19; lr: 0.00010; 10207/9570 tok/s;   8682 sec
[2021-04-23 03:18:00,664 INFO] Step 42000/50000; acc:  94.75; ppl:  1.22; xent: 0.20; lr: 0.00010; 9905/9386 tok/s;   8692 sec
[2021-04-23 03:18:10,927 INFO] Step 42050/50000; acc:  94.93; ppl:  1.21; xent: 0.19; lr: 0.00010; 10122/9462 tok/s;   8702 sec
[2021-04-23 03:18:20,807 INFO] Step 42100/50000; acc:  94.83; ppl:  1.22; xent: 0.20; lr: 0.00010; 9999/9525 tok/s;   8712 sec
[2021-04-23 03:18:30,825 INFO] Step 42150/50000; acc:  94.95; ppl:  1.21; xent: 0.19; lr: 0.00010; 10282/9677 tok/s;   8722 sec
[2021-04-23 03:18:40,960 INFO] Step 42200/50000; acc:  94.97; ppl:  1.21; xent: 0.19; lr: 0.00010; 10027/9388 tok/s;   8732 sec
[2021-04-23 03:18:50,863 INFO] Step 42250/50000; acc:  94.70; ppl:  1.23; xent: 0.20; lr: 0.00010; 10094/9511 tok/s;   8742 sec
[2021-04-23 03:18:55,454 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 03:19:01,020 INFO] Step 42300/50000; acc:  94.85; ppl:  1.22; xent: 0.20; lr: 0.00010; 10189/9555 tok/s;   8752 sec
[2021-04-23 03:19:11,065 INFO] Step 42350/50000; acc:  94.86; ppl:  1.22; xent: 0.20; lr: 0.00010; 9986/9503 tok/s;   8762 sec
[2021-04-23 03:19:21,092 INFO] Step 42400/50000; acc:  94.92; ppl:  1.21; xent: 0.19; lr: 0.00010; 10269/9616 tok/s;   8772 sec
[2021-04-23 03:19:30,913 INFO] Step 42450/50000; acc:  94.63; ppl:  1.23; xent: 0.21; lr: 0.00010; 10064/9622 tok/s;   8782 sec
[2021-04-23 03:19:41,135 INFO] Step 42500/50000; acc:  94.88; ppl:  1.22; xent: 0.20; lr: 0.00010; 10184/9585 tok/s;   8792 sec
[2021-04-23 03:19:51,034 INFO] Step 42550/50000; acc:  94.82; ppl:  1.22; xent: 0.20; lr: 0.00010; 10068/9513 tok/s;   8802 sec
[2021-04-23 03:20:01,193 INFO] Step 42600/50000; acc:  94.84; ppl:  1.22; xent: 0.20; lr: 0.00010; 10022/9524 tok/s;   8812 sec
[2021-04-23 03:20:11,487 INFO] Step 42650/50000; acc:  95.08; ppl:  1.21; xent: 0.19; lr: 0.00010; 10142/9533 tok/s;   8823 sec
[2021-04-23 03:20:21,538 INFO] Step 42700/50000; acc:  94.74; ppl:  1.22; xent: 0.20; lr: 0.00010; 9984/9458 tok/s;   8833 sec
[2021-04-23 03:20:31,666 INFO] Step 42750/50000; acc:  94.86; ppl:  1.22; xent: 0.19; lr: 0.00010; 10174/9541 tok/s;   8843 sec
[2021-04-23 03:20:41,830 INFO] Step 42800/50000; acc:  94.93; ppl:  1.21; xent: 0.19; lr: 0.00010; 9971/9423 tok/s;   8853 sec
[2021-04-23 03:20:52,017 INFO] Step 42850/50000; acc:  94.91; ppl:  1.21; xent: 0.19; lr: 0.00010; 10168/9598 tok/s;   8863 sec
[2021-04-23 03:21:01,821 INFO] Step 42900/50000; acc:  94.90; ppl:  1.21; xent: 0.19; lr: 0.00010; 10160/9565 tok/s;   8873 sec
[2021-04-23 03:21:11,975 INFO] Step 42950/50000; acc:  95.02; ppl:  1.21; xent: 0.19; lr: 0.00010; 10073/9456 tok/s;   8883 sec
[2021-04-23 03:21:22,013 INFO] Step 43000/50000; acc:  94.77; ppl:  1.22; xent: 0.20; lr: 0.00010; 10076/9425 tok/s;   8893 sec
[2021-04-23 03:21:23,668 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 03:21:32,035 INFO] Step 43050/50000; acc:  94.87; ppl:  1.22; xent: 0.20; lr: 0.00010; 10037/9512 tok/s;   8903 sec
[2021-04-23 03:21:42,213 INFO] Step 43100/50000; acc:  95.02; ppl:  1.21; xent: 0.19; lr: 0.00010; 10176/9581 tok/s;   8913 sec
[2021-04-23 03:21:52,062 INFO] Step 43150/50000; acc:  94.78; ppl:  1.22; xent: 0.20; lr: 0.00010; 10064/9552 tok/s;   8923 sec
[2021-04-23 03:22:02,279 INFO] Step 43200/50000; acc:  94.87; ppl:  1.22; xent: 0.19; lr: 0.00010; 10122/9541 tok/s;   8933 sec
[2021-04-23 03:22:12,215 INFO] Step 43250/50000; acc:  94.78; ppl:  1.22; xent: 0.20; lr: 0.00010; 9971/9494 tok/s;   8943 sec
[2021-04-23 03:22:22,461 INFO] Step 43300/50000; acc:  95.01; ppl:  1.21; xent: 0.19; lr: 0.00010; 10099/9509 tok/s;   8954 sec
[2021-04-23 03:22:32,599 INFO] Step 43350/50000; acc:  94.90; ppl:  1.22; xent: 0.20; lr: 0.00010; 10013/9508 tok/s;   8964 sec
[2021-04-23 03:22:42,718 INFO] Step 43400/50000; acc:  94.94; ppl:  1.21; xent: 0.19; lr: 0.00010; 10140/9531 tok/s;   8974 sec
[2021-04-23 03:22:53,059 INFO] Step 43450/50000; acc:  94.77; ppl:  1.22; xent: 0.20; lr: 0.00010; 9916/9319 tok/s;   8984 sec
[2021-04-23 03:23:03,126 INFO] Step 43500/50000; acc:  95.02; ppl:  1.21; xent: 0.19; lr: 0.00010; 9985/9470 tok/s;   8994 sec
[2021-04-23 03:23:13,262 INFO] Step 43550/50000; acc:  95.02; ppl:  1.21; xent: 0.19; lr: 0.00010; 10156/9562 tok/s;   9004 sec
[2021-04-23 03:23:23,338 INFO] Step 43600/50000; acc:  94.95; ppl:  1.21; xent: 0.19; lr: 0.00010; 10088/9515 tok/s;   9015 sec
[2021-04-23 03:23:33,625 INFO] Step 43650/50000; acc:  95.06; ppl:  1.21; xent: 0.19; lr: 0.00010; 10133/9458 tok/s;   9025 sec
[2021-04-23 03:23:43,487 INFO] Step 43700/50000; acc:  94.84; ppl:  1.22; xent: 0.20; lr: 0.00010; 9940/9420 tok/s;   9035 sec
[2021-04-23 03:23:52,415 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 03:23:53,610 INFO] Step 43750/50000; acc:  94.86; ppl:  1.22; xent: 0.19; lr: 0.00010; 10148/9544 tok/s;   9045 sec
[2021-04-23 03:24:03,723 INFO] Step 43800/50000; acc:  94.96; ppl:  1.21; xent: 0.19; lr: 0.00010; 10037/9433 tok/s;   9055 sec
[2021-04-23 03:24:13,733 INFO] Step 43850/50000; acc:  94.94; ppl:  1.22; xent: 0.20; lr: 0.00010; 10046/9476 tok/s;   9065 sec
[2021-04-23 03:24:23,732 INFO] Step 43900/50000; acc:  94.91; ppl:  1.21; xent: 0.19; lr: 0.00010; 10246/9767 tok/s;   9075 sec
[2021-04-23 03:24:33,848 INFO] Step 43950/50000; acc:  94.82; ppl:  1.22; xent: 0.20; lr: 0.00010; 9882/9342 tok/s;   9085 sec
[2021-04-23 03:24:44,020 INFO] Step 44000/50000; acc:  94.99; ppl:  1.21; xent: 0.19; lr: 0.00010; 10143/9553 tok/s;   9095 sec
[2021-04-23 03:24:53,989 INFO] Step 44050/50000; acc:  94.95; ppl:  1.21; xent: 0.19; lr: 0.00010; 9970/9498 tok/s;   9105 sec
[2021-04-23 03:25:04,373 INFO] Step 44100/50000; acc:  95.10; ppl:  1.21; xent: 0.19; lr: 0.00010; 10139/9517 tok/s;   9116 sec
[2021-04-23 03:25:14,460 INFO] Step 44150/50000; acc:  94.86; ppl:  1.21; xent: 0.19; lr: 0.00010; 10000/9448 tok/s;   9126 sec
[2021-04-23 03:25:24,643 INFO] Step 44200/50000; acc:  94.92; ppl:  1.21; xent: 0.19; lr: 0.00010; 9947/9377 tok/s;   9136 sec
[2021-04-23 03:25:34,943 INFO] Step 44250/50000; acc:  95.08; ppl:  1.20; xent: 0.19; lr: 0.00010; 9994/9384 tok/s;   9146 sec
[2021-04-23 03:25:44,924 INFO] Step 44300/50000; acc:  94.90; ppl:  1.21; xent: 0.19; lr: 0.00010; 10055/9544 tok/s;   9156 sec
[2021-04-23 03:25:55,052 INFO] Step 44350/50000; acc:  95.09; ppl:  1.20; xent: 0.19; lr: 0.00010; 10218/9555 tok/s;   9166 sec
[2021-04-23 03:26:05,049 INFO] Step 44400/50000; acc:  94.96; ppl:  1.21; xent: 0.19; lr: 0.00010; 10101/9490 tok/s;   9176 sec
[2021-04-23 03:26:15,264 INFO] Step 44450/50000; acc:  94.88; ppl:  1.21; xent: 0.19; lr: 0.00010; 10079/9495 tok/s;   9186 sec
[2021-04-23 03:26:21,149 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 03:26:25,105 INFO] Step 44500/50000; acc:  94.92; ppl:  1.21; xent: 0.19; lr: 0.00010; 10095/9557 tok/s;   9196 sec
[2021-04-23 03:26:35,402 INFO] Step 44550/50000; acc:  95.02; ppl:  1.21; xent: 0.19; lr: 0.00010; 10029/9405 tok/s;   9207 sec
[2021-04-23 03:26:45,263 INFO] Step 44600/50000; acc:  94.92; ppl:  1.21; xent: 0.19; lr: 0.00010; 10222/9680 tok/s;   9216 sec
[2021-04-23 03:26:55,227 INFO] Step 44650/50000; acc:  94.83; ppl:  1.22; xent: 0.20; lr: 0.00010; 10026/9568 tok/s;   9226 sec
[2021-04-23 03:27:05,428 INFO] Step 44700/50000; acc:  95.06; ppl:  1.21; xent: 0.19; lr: 0.00010; 10115/9502 tok/s;   9237 sec
[2021-04-23 03:27:15,406 INFO] Step 44750/50000; acc:  94.92; ppl:  1.21; xent: 0.19; lr: 0.00010; 9960/9460 tok/s;   9247 sec
[2021-04-23 03:27:25,698 INFO] Step 44800/50000; acc:  95.03; ppl:  1.21; xent: 0.19; lr: 0.00010; 10147/9563 tok/s;   9257 sec
[2021-04-23 03:27:35,707 INFO] Step 44850/50000; acc:  94.99; ppl:  1.21; xent: 0.19; lr: 0.00010; 10047/9510 tok/s;   9267 sec
[2021-04-23 03:27:46,145 INFO] Step 44900/50000; acc:  94.94; ppl:  1.21; xent: 0.19; lr: 0.00010; 9991/9348 tok/s;   9277 sec
[2021-04-23 03:27:56,110 INFO] Step 44950/50000; acc:  94.98; ppl:  1.21; xent: 0.19; lr: 0.00010; 10036/9513 tok/s;   9287 sec
[2021-04-23 03:28:06,338 INFO] Step 45000/50000; acc:  95.09; ppl:  1.20; xent: 0.19; lr: 0.00010; 9969/9377 tok/s;   9298 sec
[2021-04-23 03:28:06,340 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/abstract_methods/valid_fixed.txt, align=None)...
[2021-04-23 03:28:15,964 INFO] Validation perplexity: 1.34278
[2021-04-23 03:28:15,964 INFO] Validation accuracy: 93.3538
[2021-04-23 03:28:15,966 INFO] Saving checkpoint ../models/default_params/control/model_step_45000.pt
[2021-04-23 03:28:26,581 INFO] Step 45050/50000; acc:  95.07; ppl:  1.21; xent: 0.19; lr: 0.00010; 5058/4774 tok/s;   9318 sec
[2021-04-23 03:28:36,612 INFO] Step 45100/50000; acc:  95.10; ppl:  1.20; xent: 0.18; lr: 0.00010; 10058/9468 tok/s;   9328 sec
[2021-04-23 03:28:46,695 INFO] Step 45150/50000; acc:  95.08; ppl:  1.21; xent: 0.19; lr: 0.00010; 10170/9474 tok/s;   9338 sec
[2021-04-23 03:28:56,711 INFO] Step 45200/50000; acc:  94.82; ppl:  1.21; xent: 0.19; lr: 0.00010; 10082/9572 tok/s;   9348 sec
[2021-04-23 03:28:59,946 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 03:29:06,975 INFO] Step 45250/50000; acc:  95.09; ppl:  1.21; xent: 0.19; lr: 0.00010; 10109/9481 tok/s;   9358 sec
[2021-04-23 03:29:16,806 INFO] Step 45300/50000; acc:  95.07; ppl:  1.21; xent: 0.19; lr: 0.00010; 10088/9570 tok/s;   9368 sec
[2021-04-23 03:29:26,841 INFO] Step 45350/50000; acc:  94.98; ppl:  1.21; xent: 0.19; lr: 0.00010; 10181/9636 tok/s;   9378 sec
[2021-04-23 03:29:36,850 INFO] Step 45400/50000; acc:  94.96; ppl:  1.21; xent: 0.19; lr: 0.00010; 10091/9553 tok/s;   9388 sec
[2021-04-23 03:29:46,924 INFO] Step 45450/50000; acc:  94.93; ppl:  1.21; xent: 0.19; lr: 0.00010; 9952/9493 tok/s;   9398 sec
[2021-04-23 03:29:57,034 INFO] Step 45500/50000; acc:  95.10; ppl:  1.21; xent: 0.19; lr: 0.00010; 10166/9543 tok/s;   9408 sec
[2021-04-23 03:30:07,205 INFO] Step 45550/50000; acc:  95.17; ppl:  1.20; xent: 0.18; lr: 0.00010; 10016/9465 tok/s;   9418 sec
[2021-04-23 03:30:17,343 INFO] Step 45600/50000; acc:  95.08; ppl:  1.21; xent: 0.19; lr: 0.00010; 10214/9581 tok/s;   9429 sec
[2021-04-23 03:30:27,420 INFO] Step 45650/50000; acc:  95.02; ppl:  1.21; xent: 0.19; lr: 0.00010; 9875/9391 tok/s;   9439 sec
[2021-04-23 03:30:37,746 INFO] Step 45700/50000; acc:  95.22; ppl:  1.20; xent: 0.18; lr: 0.00010; 10124/9473 tok/s;   9449 sec
[2021-04-23 03:30:47,768 INFO] Step 45750/50000; acc:  95.05; ppl:  1.21; xent: 0.19; lr: 0.00010; 9967/9459 tok/s;   9459 sec
[2021-04-23 03:30:57,653 INFO] Step 45800/50000; acc:  95.12; ppl:  1.20; xent: 0.18; lr: 0.00010; 10335/9723 tok/s;   9469 sec
[2021-04-23 03:31:07,876 INFO] Step 45850/50000; acc:  95.14; ppl:  1.20; xent: 0.18; lr: 0.00010; 10022/9358 tok/s;   9479 sec
[2021-04-23 03:31:17,777 INFO] Step 45900/50000; acc:  94.90; ppl:  1.21; xent: 0.19; lr: 0.00010; 10095/9559 tok/s;   9489 sec
[2021-04-23 03:31:21,969 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 03:31:27,983 INFO] Step 45950/50000; acc:  95.06; ppl:  1.20; xent: 0.19; lr: 0.00010; 10133/9477 tok/s;   9499 sec
[2021-04-23 03:31:38,062 INFO] Step 46000/50000; acc:  95.00; ppl:  1.21; xent: 0.19; lr: 0.00010; 10055/9533 tok/s;   9509 sec
[2021-04-23 03:31:48,187 INFO] Step 46050/50000; acc:  95.17; ppl:  1.20; xent: 0.18; lr: 0.00010; 10194/9576 tok/s;   9519 sec
[2021-04-23 03:31:57,985 INFO] Step 46100/50000; acc:  94.89; ppl:  1.21; xent: 0.19; lr: 0.00010; 10037/9603 tok/s;   9529 sec
[2021-04-23 03:32:08,146 INFO] Step 46150/50000; acc:  95.02; ppl:  1.21; xent: 0.19; lr: 0.00010; 10107/9538 tok/s;   9539 sec
[2021-04-23 03:32:18,164 INFO] Step 46200/50000; acc:  95.06; ppl:  1.21; xent: 0.19; lr: 0.00010; 10066/9488 tok/s;   9549 sec
[2021-04-23 03:32:28,232 INFO] Step 46250/50000; acc:  95.02; ppl:  1.21; xent: 0.19; lr: 0.00010; 9991/9511 tok/s;   9559 sec
[2021-04-23 03:32:38,523 INFO] Step 46300/50000; acc:  95.23; ppl:  1.20; xent: 0.18; lr: 0.00010; 10217/9570 tok/s;   9570 sec
[2021-04-23 03:32:48,589 INFO] Step 46350/50000; acc:  94.98; ppl:  1.21; xent: 0.19; lr: 0.00010; 9937/9426 tok/s;   9580 sec
[2021-04-23 03:32:58,776 INFO] Step 46400/50000; acc:  95.10; ppl:  1.20; xent: 0.19; lr: 0.00010; 10162/9544 tok/s;   9590 sec
[2021-04-23 03:33:08,796 INFO] Step 46450/50000; acc:  95.14; ppl:  1.20; xent: 0.18; lr: 0.00010; 9919/9391 tok/s;   9600 sec
[2021-04-23 03:33:19,049 INFO] Step 46500/50000; acc:  95.12; ppl:  1.20; xent: 0.18; lr: 0.00010; 10168/9604 tok/s;   9610 sec
[2021-04-23 03:33:28,950 INFO] Step 46550/50000; acc:  95.16; ppl:  1.20; xent: 0.18; lr: 0.00010; 10181/9559 tok/s;   9620 sec
[2021-04-23 03:33:39,019 INFO] Step 46600/50000; acc:  95.18; ppl:  1.20; xent: 0.18; lr: 0.00010; 10065/9454 tok/s;   9630 sec
[2021-04-23 03:33:49,107 INFO] Step 46650/50000; acc:  95.02; ppl:  1.21; xent: 0.19; lr: 0.00010; 10124/9472 tok/s;   9640 sec
[2021-04-23 03:33:50,349 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 03:33:59,134 INFO] Step 46700/50000; acc:  95.02; ppl:  1.21; xent: 0.19; lr: 0.00010; 10032/9509 tok/s;   9650 sec
[2021-04-23 03:34:09,294 INFO] Step 46750/50000; acc:  95.22; ppl:  1.20; xent: 0.18; lr: 0.00010; 10179/9541 tok/s;   9660 sec
[2021-04-23 03:34:19,210 INFO] Step 46800/50000; acc:  94.99; ppl:  1.21; xent: 0.19; lr: 0.00010; 10087/9629 tok/s;   9670 sec
[2021-04-23 03:34:29,480 INFO] Step 46850/50000; acc:  95.12; ppl:  1.20; xent: 0.18; lr: 0.00010; 10092/9474 tok/s;   9681 sec
[2021-04-23 03:34:39,288 INFO] Step 46900/50000; acc:  95.00; ppl:  1.21; xent: 0.19; lr: 0.00010; 10068/9602 tok/s;   9690 sec
[2021-04-23 03:34:49,466 INFO] Step 46950/50000; acc:  95.08; ppl:  1.21; xent: 0.19; lr: 0.00010; 10014/9453 tok/s;   9701 sec
[2021-04-23 03:34:59,717 INFO] Step 47000/50000; acc:  95.19; ppl:  1.20; xent: 0.18; lr: 0.00010; 10037/9473 tok/s;   9711 sec
[2021-04-23 03:35:09,733 INFO] Step 47050/50000; acc:  95.15; ppl:  1.20; xent: 0.18; lr: 0.00010; 10098/9553 tok/s;   9721 sec
[2021-04-23 03:35:20,108 INFO] Step 47100/50000; acc:  95.06; ppl:  1.20; xent: 0.19; lr: 0.00010; 9961/9332 tok/s;   9731 sec
[2021-04-23 03:35:30,148 INFO] Step 47150/50000; acc:  95.18; ppl:  1.20; xent: 0.18; lr: 0.00010; 9981/9445 tok/s;   9741 sec
[2021-04-23 03:35:40,278 INFO] Step 47200/50000; acc:  95.23; ppl:  1.20; xent: 0.18; lr: 0.00010; 10202/9629 tok/s;   9751 sec
[2021-04-23 03:35:50,174 INFO] Step 47250/50000; acc:  95.14; ppl:  1.20; xent: 0.18; lr: 0.00010; 10088/9543 tok/s;   9761 sec
[2021-04-23 03:36:00,441 INFO] Step 47300/50000; acc:  95.26; ppl:  1.20; xent: 0.18; lr: 0.00010; 10188/9537 tok/s;   9772 sec
[2021-04-23 03:36:10,379 INFO] Step 47350/50000; acc:  95.08; ppl:  1.21; xent: 0.19; lr: 0.00010; 9990/9415 tok/s;   9782 sec
[2021-04-23 03:36:18,879 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 03:36:20,418 INFO] Step 47400/50000; acc:  95.05; ppl:  1.20; xent: 0.19; lr: 0.00010; 10164/9559 tok/s;   9792 sec
[2021-04-23 03:36:30,606 INFO] Step 47450/50000; acc:  95.20; ppl:  1.20; xent: 0.18; lr: 0.00010; 10051/9431 tok/s;   9802 sec
[2021-04-23 03:36:40,603 INFO] Step 47500/50000; acc:  95.16; ppl:  1.20; xent: 0.18; lr: 0.00010; 10047/9484 tok/s;   9812 sec
[2021-04-23 03:36:50,616 INFO] Step 47550/50000; acc:  95.10; ppl:  1.20; xent: 0.19; lr: 0.00010; 10221/9740 tok/s;   9822 sec
[2021-04-23 03:37:00,791 INFO] Step 47600/50000; acc:  95.12; ppl:  1.20; xent: 0.19; lr: 0.00010; 9946/9391 tok/s;   9832 sec
[2021-04-23 03:37:10,987 INFO] Step 47650/50000; acc:  95.05; ppl:  1.20; xent: 0.18; lr: 0.00010; 10120/9540 tok/s;   9842 sec
[2021-04-23 03:37:20,909 INFO] Step 47700/50000; acc:  95.06; ppl:  1.21; xent: 0.19; lr: 0.00010; 9992/9515 tok/s;   9852 sec
[2021-04-23 03:37:31,183 INFO] Step 47750/50000; acc:  95.22; ppl:  1.20; xent: 0.18; lr: 0.00010; 10120/9503 tok/s;   9862 sec
[2021-04-23 03:37:41,324 INFO] Step 47800/50000; acc:  95.02; ppl:  1.20; xent: 0.18; lr: 0.00010; 10048/9472 tok/s;   9872 sec
[2021-04-23 03:37:51,481 INFO] Step 47850/50000; acc:  95.00; ppl:  1.20; xent: 0.19; lr: 0.00010; 9844/9359 tok/s;   9883 sec
[2021-04-23 03:38:01,747 INFO] Step 47900/50000; acc:  95.33; ppl:  1.19; xent: 0.17; lr: 0.00010; 10109/9434 tok/s;   9893 sec
[2021-04-23 03:38:11,735 INFO] Step 47950/50000; acc:  95.22; ppl:  1.20; xent: 0.18; lr: 0.00010; 10004/9545 tok/s;   9903 sec
[2021-04-23 03:38:21,861 INFO] Step 48000/50000; acc:  95.30; ppl:  1.19; xent: 0.18; lr: 0.00010; 10251/9561 tok/s;   9913 sec
[2021-04-23 03:38:31,726 INFO] Step 48050/50000; acc:  95.07; ppl:  1.20; xent: 0.19; lr: 0.00010; 10059/9472 tok/s;   9923 sec
[2021-04-23 03:38:41,956 INFO] Step 48100/50000; acc:  95.16; ppl:  1.20; xent: 0.18; lr: 0.00010; 10130/9543 tok/s;   9933 sec
[2021-04-23 03:38:47,544 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 03:38:51,929 INFO] Step 48150/50000; acc:  95.17; ppl:  1.20; xent: 0.18; lr: 0.00010; 10070/9503 tok/s;   9943 sec
[2021-04-23 03:39:02,128 INFO] Step 48200/50000; acc:  95.15; ppl:  1.20; xent: 0.18; lr: 0.00010; 10061/9436 tok/s;   9953 sec
[2021-04-23 03:39:12,065 INFO] Step 48250/50000; acc:  95.12; ppl:  1.20; xent: 0.18; lr: 0.00010; 10221/9679 tok/s;   9963 sec
[2021-04-23 03:39:22,095 INFO] Step 48300/50000; acc:  95.11; ppl:  1.20; xent: 0.19; lr: 0.00010; 9957/9476 tok/s;   9973 sec
[2021-04-23 03:39:32,315 INFO] Step 48350/50000; acc:  95.16; ppl:  1.20; xent: 0.18; lr: 0.00010; 10069/9505 tok/s;   9983 sec
[2021-04-23 03:39:42,298 INFO] Step 48400/50000; acc:  95.13; ppl:  1.20; xent: 0.18; lr: 0.00010; 10069/9550 tok/s;   9993 sec
[2021-04-23 03:39:52,605 INFO] Step 48450/50000; acc:  95.25; ppl:  1.20; xent: 0.18; lr: 0.00010; 10154/9538 tok/s;  10004 sec
[2021-04-23 03:40:02,550 INFO] Step 48500/50000; acc:  95.23; ppl:  1.19; xent: 0.18; lr: 0.00010; 10072/9526 tok/s;  10014 sec
[2021-04-23 03:40:12,870 INFO] Step 48550/50000; acc:  95.10; ppl:  1.20; xent: 0.18; lr: 0.00010; 9960/9357 tok/s;  10024 sec
[2021-04-23 03:40:22,879 INFO] Step 48600/50000; acc:  95.23; ppl:  1.19; xent: 0.18; lr: 0.00010; 10127/9572 tok/s;  10034 sec
[2021-04-23 03:40:32,998 INFO] Step 48650/50000; acc:  95.23; ppl:  1.20; xent: 0.18; lr: 0.00010; 9928/9361 tok/s;  10044 sec
[2021-04-23 03:40:43,064 INFO] Step 48700/50000; acc:  95.33; ppl:  1.20; xent: 0.18; lr: 0.00010; 10271/9693 tok/s;  10054 sec
[2021-04-23 03:40:53,041 INFO] Step 48750/50000; acc:  95.22; ppl:  1.19; xent: 0.18; lr: 0.00010; 10060/9479 tok/s;  10064 sec
[2021-04-23 03:41:03,137 INFO] Step 48800/50000; acc:  95.18; ppl:  1.20; xent: 0.18; lr: 0.00010; 10176/9478 tok/s;  10074 sec
[2021-04-23 03:41:12,973 INFO] Step 48850/50000; acc:  95.03; ppl:  1.20; xent: 0.18; lr: 0.00010; 10117/9609 tok/s;  10084 sec
[2021-04-23 03:41:15,935 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 03:41:23,235 INFO] Step 48900/50000; acc:  95.28; ppl:  1.19; xent: 0.18; lr: 0.00010; 10174/9580 tok/s;  10094 sec
[2021-04-23 03:41:33,113 INFO] Step 48950/50000; acc:  95.13; ppl:  1.20; xent: 0.18; lr: 0.00010; 10137/9564 tok/s;  10104 sec
[2021-04-23 03:41:43,055 INFO] Step 49000/50000; acc:  95.11; ppl:  1.20; xent: 0.18; lr: 0.00010; 10195/9643 tok/s;  10114 sec
[2021-04-23 03:41:53,123 INFO] Step 49050/50000; acc:  95.17; ppl:  1.20; xent: 0.18; lr: 0.00010; 10134/9589 tok/s;  10124 sec
[2021-04-23 03:42:03,157 INFO] Step 49100/50000; acc:  95.17; ppl:  1.20; xent: 0.18; lr: 0.00010; 9978/9515 tok/s;  10134 sec
[2021-04-23 03:42:13,250 INFO] Step 49150/50000; acc:  95.28; ppl:  1.20; xent: 0.18; lr: 0.00010; 10174/9538 tok/s;  10144 sec
[2021-04-23 03:42:23,524 INFO] Step 49200/50000; acc:  95.34; ppl:  1.19; xent: 0.17; lr: 0.00010; 10024/9486 tok/s;  10155 sec
[2021-04-23 03:42:33,721 INFO] Step 49250/50000; acc:  95.27; ppl:  1.19; xent: 0.18; lr: 0.00010; 10164/9521 tok/s;  10165 sec
[2021-04-23 03:42:43,731 INFO] Step 49300/50000; acc:  95.11; ppl:  1.20; xent: 0.18; lr: 0.00010; 9913/9413 tok/s;  10175 sec
[2021-04-23 03:42:54,028 INFO] Step 49350/50000; acc:  95.38; ppl:  1.19; xent: 0.17; lr: 0.00010; 10003/9401 tok/s;  10185 sec
[2021-04-23 03:43:04,070 INFO] Step 49400/50000; acc:  95.32; ppl:  1.19; xent: 0.18; lr: 0.00010; 10065/9544 tok/s;  10195 sec
[2021-04-23 03:43:13,928 INFO] Step 49450/50000; acc:  95.24; ppl:  1.19; xent: 0.18; lr: 0.00010; 10233/9638 tok/s;  10205 sec
[2021-04-23 03:43:24,181 INFO] Step 49500/50000; acc:  95.41; ppl:  1.19; xent: 0.17; lr: 0.00010; 10090/9392 tok/s;  10215 sec
[2021-04-23 03:43:34,035 INFO] Step 49550/50000; acc:  95.07; ppl:  1.20; xent: 0.18; lr: 0.00010; 10080/9581 tok/s;  10225 sec
[2021-04-23 03:43:37,832 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-23 03:43:44,247 INFO] Step 49600/50000; acc:  95.26; ppl:  1.20; xent: 0.18; lr: 0.00010; 10190/9512 tok/s;  10235 sec
[2021-04-23 03:43:54,259 INFO] Step 49650/50000; acc:  95.23; ppl:  1.20; xent: 0.18; lr: 0.00010; 9931/9452 tok/s;  10245 sec
[2021-04-23 03:44:04,298 INFO] Step 49700/50000; acc:  95.40; ppl:  1.19; xent: 0.17; lr: 0.00010; 10313/9665 tok/s;  10255 sec
[2021-04-23 03:44:14,173 INFO] Step 49750/50000; acc:  95.04; ppl:  1.20; xent: 0.19; lr: 0.00010; 10102/9624 tok/s;  10265 sec
[2021-04-23 03:44:24,277 INFO] Step 49800/50000; acc:  95.35; ppl:  1.19; xent: 0.18; lr: 0.00010; 10082/9537 tok/s;  10275 sec
[2021-04-23 03:44:34,380 INFO] Step 49850/50000; acc:  95.30; ppl:  1.19; xent: 0.18; lr: 0.00010; 10063/9489 tok/s;  10286 sec
[2021-04-23 03:44:44,453 INFO] Step 49900/50000; acc:  95.20; ppl:  1.20; xent: 0.18; lr: 0.00010; 10017/9517 tok/s;  10296 sec
[2021-04-23 03:44:54,687 INFO] Step 49950/50000; acc:  95.48; ppl:  1.19; xent: 0.17; lr: 0.00010; 10238/9626 tok/s;  10306 sec
[2021-04-23 03:45:04,801 INFO] Step 50000/50000; acc:  95.08; ppl:  1.20; xent: 0.18; lr: 0.00005; 9993/9425 tok/s;  10316 sec
[2021-04-23 03:45:04,804 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/abstract_methods/valid_fixed.txt, align=None)...
[2021-04-23 03:45:14,442 INFO] Validation perplexity: 1.3484
[2021-04-23 03:45:14,442 INFO] Validation accuracy: 93.21
[2021-04-23 03:45:14,444 INFO] Saving checkpoint ../models/default_params/control/model_step_50000.pt

Train on basic condensed EditOperations:

modelDefaultBasic = HephaestusModel(MODEL_DEFAULT_BASIC)
modelDefaultBasic.train(
    DATA_SMALL_METHODS_TRAIN_BUGGY,
    DATA_SMALL_OPS_GENERAL_BASIC_TRAIN,
    DATA_SMALL_METHODS_VALID_BUGGY,
    DATA_SMALL_OPS_GENERAL_BASIC_VALID
)
[2021-04-24 05:10:12,077 INFO] Counter vocab from -1 samples.
[2021-04-24 05:10:12,077 INFO] n_sample=-1: Build vocab on full datasets.
[2021-04-24 05:10:12,088 INFO] corpus_1's transforms: TransformPipe()
[2021-04-24 05:10:12,089 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 05:10:13,192 INFO] Counters src:429
[2021-04-24 05:10:13,192 INFO] Counters tgt:438
[2021-04-24 05:10:13,192 WARNING] path ../models/default_params/basic_ops/save_data.vocab.src exists, may overwrite...
[2021-04-24 05:10:13,194 WARNING] path ../models/default_params/basic_ops/save_data.vocab.tgt exists, may overwrite...
[2021-04-24 05:10:13,907 INFO] Parsed 2 corpora from -data.
[2021-04-24 05:10:13,907 INFO] Get special vocabs from Transforms: {'src': set(), 'tgt': set()}.
[2021-04-24 05:10:13,907 INFO] Loading vocab from text file...
[2021-04-24 05:10:13,907 INFO] Loading src vocabulary from ../models/default_params/basic_ops/save_data.vocab.src
[2021-04-24 05:10:13,908 INFO] Loaded src vocab has 429 tokens.
[2021-04-24 05:10:13,909 INFO] Loading tgt vocabulary from ../models/default_params/basic_ops/save_data.vocab.tgt
[2021-04-24 05:10:13,910 INFO] Loaded tgt vocab has 438 tokens.
[2021-04-24 05:10:13,910 INFO] Building fields with vocab in counters...
[2021-04-24 05:10:13,910 INFO]  * tgt vocab size: 442.
[2021-04-24 05:10:13,911 INFO]  * src vocab size: 431.
[2021-04-24 05:10:13,911 INFO]  * src vocab size = 431
[2021-04-24 05:10:13,911 INFO]  * tgt vocab size = 442
[2021-04-24 05:10:13,913 INFO] Building model...
[2021-04-24 05:10:16,454 INFO] NMTModel(
  (encoder): RNNEncoder(
    (embeddings): Embeddings(
      (make_embedding): Sequential(
        (emb_luts): Elementwise(
          (0): Embedding(431, 512, padding_idx=1)
        )
      )
    )
    (rnn): LSTM(512, 256, num_layers=2, dropout=0.2)
  )
  (decoder): InputFeedRNNDecoder(
    (embeddings): Embeddings(
      (make_embedding): Sequential(
        (emb_luts): Elementwise(
          (0): Embedding(442, 512, padding_idx=1)
        )
      )
    )
    (dropout): Dropout(p=0.2, inplace=False)
    (rnn): StackedLSTM(
      (dropout): Dropout(p=0.2, inplace=False)
      (layers): ModuleList(
        (0): LSTMCell(768, 256)
        (1): LSTMCell(256, 256)
      )
    )
    (attn): GlobalAttention(
      (linear_context): Linear(in_features=256, out_features=256, bias=False)
      (linear_query): Linear(in_features=256, out_features=256, bias=True)
      (v): Linear(in_features=256, out_features=1, bias=False)
      (linear_out): Linear(in_features=512, out_features=256, bias=True)
    )
  )
  (generator): Sequential(
    (0): Linear(in_features=256, out_features=442, bias=True)
    (1): Cast()
    (2): LogSoftmax(dim=-1)
  )
)
[2021-04-24 05:10:16,454 INFO] encoder: 1535488
[2021-04-24 05:10:16,454 INFO] decoder: 2179770
[2021-04-24 05:10:16,454 INFO] * number of parameters: 3715258
[2021-04-24 05:10:16,455 INFO] Starting training on GPU: [0]
[2021-04-24 05:10:16,455 INFO] Start training loop and validate every 5000 steps...
[2021-04-24 05:10:16,456 INFO] corpus_1's transforms: TransformPipe()
[2021-04-24 05:10:16,456 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 05:10:35,880 INFO] Step 50/50000; acc:  18.06; ppl: 109.78; xent: 4.70; lr: 0.00010; 5207/6538 tok/s;     19 sec
[2021-04-24 05:10:55,350 INFO] Step 100/50000; acc:  21.37; ppl: 25.83; xent: 3.25; lr: 0.00010; 5322/6602 tok/s;     39 sec
[2021-04-24 05:11:14,274 INFO] Step 150/50000; acc:  34.82; ppl: 19.52; xent: 2.97; lr: 0.00010; 5285/6620 tok/s;     58 sec
[2021-04-24 05:11:34,396 INFO] Step 200/50000; acc:  54.52; ppl:  9.51; xent: 2.25; lr: 0.00010; 4998/6257 tok/s;     78 sec
[2021-04-24 05:11:54,024 INFO] Step 250/50000; acc:  56.98; ppl:  6.77; xent: 1.91; lr: 0.00010; 5201/6462 tok/s;     98 sec
[2021-04-24 05:12:13,841 INFO] Step 300/50000; acc:  57.50; ppl:  5.80; xent: 1.76; lr: 0.00010; 5147/6426 tok/s;    117 sec
[2021-04-24 05:12:33,169 INFO] Step 350/50000; acc:  57.92; ppl:  5.45; xent: 1.70; lr: 0.00010; 5325/6422 tok/s;    137 sec
[2021-04-24 05:12:53,031 INFO] Step 400/50000; acc:  58.22; ppl:  5.23; xent: 1.66; lr: 0.00010; 5083/6485 tok/s;    157 sec
[2021-04-24 05:13:12,738 INFO] Step 450/50000; acc:  59.45; ppl:  4.84; xent: 1.58; lr: 0.00010; 5229/6567 tok/s;    176 sec
[2021-04-24 05:13:32,529 INFO] Step 500/50000; acc:  60.29; ppl:  4.54; xent: 1.51; lr: 0.00010; 5103/6549 tok/s;    196 sec
[2021-04-24 05:13:52,040 INFO] Step 550/50000; acc:  61.94; ppl:  4.23; xent: 1.44; lr: 0.00010; 5187/6341 tok/s;    216 sec
[2021-04-24 05:14:11,368 INFO] Step 600/50000; acc:  64.61; ppl:  3.85; xent: 1.35; lr: 0.00010; 5354/6544 tok/s;    235 sec
[2021-04-24 05:14:30,701 INFO] Step 650/50000; acc:  66.59; ppl:  3.53; xent: 1.26; lr: 0.00010; 5228/6684 tok/s;    254 sec
[2021-04-24 05:14:49,873 INFO] Step 700/50000; acc:  67.95; ppl:  3.33; xent: 1.20; lr: 0.00010; 5304/6590 tok/s;    273 sec
[2021-04-24 05:14:51,432 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 05:15:09,100 INFO] Step 750/50000; acc:  69.44; ppl:  3.15; xent: 1.15; lr: 0.00010; 5230/6765 tok/s;    293 sec
[2021-04-24 05:15:28,361 INFO] Step 800/50000; acc:  70.84; ppl:  2.98; xent: 1.09; lr: 0.00010; 5310/6569 tok/s;    312 sec
[2021-04-24 05:15:47,470 INFO] Step 850/50000; acc:  71.31; ppl:  2.92; xent: 1.07; lr: 0.00010; 5224/6586 tok/s;    331 sec
[2021-04-24 05:16:07,449 INFO] Step 900/50000; acc:  72.96; ppl:  2.73; xent: 1.00; lr: 0.00010; 5219/6419 tok/s;    351 sec
[2021-04-24 05:16:27,219 INFO] Step 950/50000; acc:  74.36; ppl:  2.61; xent: 0.96; lr: 0.00010; 5071/6397 tok/s;    371 sec
[2021-04-24 05:16:46,703 INFO] Step 1000/50000; acc:  75.59; ppl:  2.51; xent: 0.92; lr: 0.00010; 5136/6572 tok/s;    390 sec
[2021-04-24 05:17:06,318 INFO] Step 1050/50000; acc:  76.27; ppl:  2.48; xent: 0.91; lr: 0.00010; 5291/6304 tok/s;    410 sec
[2021-04-24 05:17:25,070 INFO] Step 1100/50000; acc:  77.89; ppl:  2.34; xent: 0.85; lr: 0.00010; 5476/6714 tok/s;    429 sec
[2021-04-24 05:17:45,419 INFO] Step 1150/50000; acc:  78.92; ppl:  2.26; xent: 0.82; lr: 0.00010; 4963/6286 tok/s;    449 sec
[2021-04-24 05:18:04,952 INFO] Step 1200/50000; acc:  79.56; ppl:  2.23; xent: 0.80; lr: 0.00010; 5186/6628 tok/s;    468 sec
[2021-04-24 05:18:24,313 INFO] Step 1250/50000; acc:  80.27; ppl:  2.17; xent: 0.77; lr: 0.00010; 5319/6598 tok/s;    488 sec
[2021-04-24 05:18:43,608 INFO] Step 1300/50000; acc:  80.55; ppl:  2.15; xent: 0.77; lr: 0.00010; 5252/6445 tok/s;    507 sec
[2021-04-24 05:19:02,692 INFO] Step 1350/50000; acc:  81.85; ppl:  2.06; xent: 0.72; lr: 0.00010; 5330/6794 tok/s;    526 sec
[2021-04-24 05:19:22,151 INFO] Step 1400/50000; acc:  82.24; ppl:  2.04; xent: 0.71; lr: 0.00010; 5235/6543 tok/s;    546 sec
[2021-04-24 05:19:37,436 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 05:19:41,555 INFO] Step 1450/50000; acc:  82.92; ppl:  1.98; xent: 0.68; lr: 0.00010; 5241/6642 tok/s;    565 sec
[2021-04-24 05:20:00,685 INFO] Step 1500/50000; acc:  83.00; ppl:  2.00; xent: 0.69; lr: 0.00010; 5330/6587 tok/s;    584 sec
[2021-04-24 05:20:19,912 INFO] Step 1550/50000; acc:  82.93; ppl:  2.00; xent: 0.69; lr: 0.00010; 5232/6608 tok/s;    603 sec
[2021-04-24 05:20:39,069 INFO] Step 1600/50000; acc:  82.87; ppl:  2.00; xent: 0.69; lr: 0.00010; 5269/6501 tok/s;    623 sec
[2021-04-24 05:20:59,494 INFO] Step 1650/50000; acc:  84.33; ppl:  1.91; xent: 0.65; lr: 0.00010; 4932/6354 tok/s;    643 sec
[2021-04-24 05:21:18,987 INFO] Step 1700/50000; acc:  84.02; ppl:  1.92; xent: 0.65; lr: 0.00010; 5333/6664 tok/s;    663 sec
[2021-04-24 05:21:38,870 INFO] Step 1750/50000; acc:  84.10; ppl:  1.92; xent: 0.65; lr: 0.00010; 5069/6328 tok/s;    682 sec
[2021-04-24 05:21:57,505 INFO] Step 1800/50000; acc:  84.52; ppl:  1.89; xent: 0.64; lr: 0.00010; 5454/6553 tok/s;    701 sec
[2021-04-24 05:22:17,392 INFO] Step 1850/50000; acc:  84.62; ppl:  1.88; xent: 0.63; lr: 0.00010; 5193/6453 tok/s;    721 sec
[2021-04-24 05:22:37,414 INFO] Step 1900/50000; acc:  85.29; ppl:  1.83; xent: 0.60; lr: 0.00010; 5059/6408 tok/s;    741 sec
[2021-04-24 05:22:57,874 INFO] Step 1950/50000; acc:  84.69; ppl:  1.88; xent: 0.63; lr: 0.00010; 4959/6286 tok/s;    761 sec
[2021-04-24 05:23:17,063 INFO] Step 2000/50000; acc:  84.85; ppl:  1.86; xent: 0.62; lr: 0.00010; 5267/6449 tok/s;    781 sec
[2021-04-24 05:23:36,754 INFO] Step 2050/50000; acc:  85.41; ppl:  1.82; xent: 0.60; lr: 0.00010; 5263/6546 tok/s;    800 sec
[2021-04-24 05:23:55,492 INFO] Step 2100/50000; acc:  85.45; ppl:  1.82; xent: 0.60; lr: 0.00010; 5365/6768 tok/s;    819 sec
[2021-04-24 05:24:14,977 INFO] Step 2150/50000; acc:  85.71; ppl:  1.80; xent: 0.59; lr: 0.00010; 5160/6605 tok/s;    839 sec
[2021-04-24 05:24:25,049 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 05:24:34,273 INFO] Step 2200/50000; acc:  85.66; ppl:  1.80; xent: 0.59; lr: 0.00010; 5345/6702 tok/s;    858 sec
[2021-04-24 05:24:53,721 INFO] Step 2250/50000; acc:  85.83; ppl:  1.80; xent: 0.59; lr: 0.00010; 5251/6612 tok/s;    877 sec
[2021-04-24 05:25:12,553 INFO] Step 2300/50000; acc:  85.29; ppl:  1.82; xent: 0.60; lr: 0.00010; 5387/6642 tok/s;    896 sec
[2021-04-24 05:25:32,312 INFO] Step 2350/50000; acc:  85.42; ppl:  1.82; xent: 0.60; lr: 0.00010; 5057/6371 tok/s;    916 sec
[2021-04-24 05:25:52,344 INFO] Step 2400/50000; acc:  85.81; ppl:  1.79; xent: 0.58; lr: 0.00010; 5074/6354 tok/s;    936 sec
[2021-04-24 05:26:11,454 INFO] Step 2450/50000; acc:  85.97; ppl:  1.78; xent: 0.58; lr: 0.00010; 5246/6650 tok/s;    955 sec
[2021-04-24 05:26:31,592 INFO] Step 2500/50000; acc:  86.29; ppl:  1.77; xent: 0.57; lr: 0.00010; 5230/6422 tok/s;    975 sec
[2021-04-24 05:26:50,520 INFO] Step 2550/50000; acc:  86.04; ppl:  1.77; xent: 0.57; lr: 0.00010; 5365/6477 tok/s;    994 sec
[2021-04-24 05:27:10,844 INFO] Step 2600/50000; acc:  86.00; ppl:  1.78; xent: 0.58; lr: 0.00010; 4956/6243 tok/s;   1014 sec
[2021-04-24 05:27:30,660 INFO] Step 2650/50000; acc:  86.47; ppl:  1.75; xent: 0.56; lr: 0.00010; 5181/6564 tok/s;   1034 sec
[2021-04-24 05:27:50,718 INFO] Step 2700/50000; acc:  86.19; ppl:  1.77; xent: 0.57; lr: 0.00010; 5073/6437 tok/s;   1054 sec
[2021-04-24 05:28:10,186 INFO] Step 2750/50000; acc:  86.03; ppl:  1.77; xent: 0.57; lr: 0.00010; 5185/6360 tok/s;   1074 sec
[2021-04-24 05:28:29,234 INFO] Step 2800/50000; acc:  86.54; ppl:  1.74; xent: 0.55; lr: 0.00010; 5341/6654 tok/s;   1093 sec
[2021-04-24 05:28:48,691 INFO] Step 2850/50000; acc:  87.00; ppl:  1.72; xent: 0.54; lr: 0.00010; 5266/6800 tok/s;   1112 sec
[2021-04-24 05:29:07,240 INFO] Step 2900/50000; acc:  86.04; ppl:  1.77; xent: 0.57; lr: 0.00010; 5432/6650 tok/s;   1131 sec
[2021-04-24 05:29:12,056 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 05:29:27,207 INFO] Step 2950/50000; acc:  86.89; ppl:  1.72; xent: 0.54; lr: 0.00010; 5080/6587 tok/s;   1151 sec
[2021-04-24 05:29:46,193 INFO] Step 3000/50000; acc:  86.60; ppl:  1.73; xent: 0.55; lr: 0.00010; 5409/6709 tok/s;   1170 sec
[2021-04-24 05:30:05,396 INFO] Step 3050/50000; acc:  85.89; ppl:  1.77; xent: 0.57; lr: 0.00010; 5268/6487 tok/s;   1189 sec
[2021-04-24 05:30:25,391 INFO] Step 3100/50000; acc:  86.69; ppl:  1.73; xent: 0.55; lr: 0.00010; 5086/6392 tok/s;   1209 sec
[2021-04-24 05:30:44,882 INFO] Step 3150/50000; acc:  86.58; ppl:  1.73; xent: 0.55; lr: 0.00010; 5137/6467 tok/s;   1228 sec
[2021-04-24 05:31:04,197 INFO] Step 3200/50000; acc:  86.55; ppl:  1.73; xent: 0.55; lr: 0.00010; 5250/6528 tok/s;   1248 sec
[2021-04-24 05:31:23,441 INFO] Step 3250/50000; acc:  86.86; ppl:  1.71; xent: 0.54; lr: 0.00010; 5336/6584 tok/s;   1267 sec
[2021-04-24 05:31:43,170 INFO] Step 3300/50000; acc:  86.86; ppl:  1.72; xent: 0.54; lr: 0.00010; 5286/6469 tok/s;   1287 sec
[2021-04-24 05:32:02,625 INFO] Step 3350/50000; acc:  86.87; ppl:  1.71; xent: 0.54; lr: 0.00010; 5174/6453 tok/s;   1306 sec
[2021-04-24 05:32:22,768 INFO] Step 3400/50000; acc:  87.12; ppl:  1.70; xent: 0.53; lr: 0.00010; 5012/6543 tok/s;   1326 sec
[2021-04-24 05:32:42,292 INFO] Step 3450/50000; acc:  86.64; ppl:  1.73; xent: 0.55; lr: 0.00010; 5240/6411 tok/s;   1346 sec
[2021-04-24 05:33:01,384 INFO] Step 3500/50000; acc:  87.11; ppl:  1.70; xent: 0.53; lr: 0.00010; 5359/6559 tok/s;   1365 sec
[2021-04-24 05:33:20,637 INFO] Step 3550/50000; acc:  87.45; ppl:  1.68; xent: 0.52; lr: 0.00010; 5238/6673 tok/s;   1384 sec
[2021-04-24 05:33:39,775 INFO] Step 3600/50000; acc:  87.17; ppl:  1.69; xent: 0.52; lr: 0.00010; 5268/6715 tok/s;   1403 sec
[2021-04-24 05:33:46,219 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 05:33:59,255 INFO] Step 3650/50000; acc:  87.51; ppl:  1.67; xent: 0.51; lr: 0.00010; 5311/6691 tok/s;   1423 sec
[2021-04-24 05:34:18,316 INFO] Step 3700/50000; acc:  87.18; ppl:  1.70; xent: 0.53; lr: 0.00010; 5303/6631 tok/s;   1442 sec
[2021-04-24 05:34:37,717 INFO] Step 3750/50000; acc:  86.96; ppl:  1.70; xent: 0.53; lr: 0.00010; 5189/6502 tok/s;   1461 sec
[2021-04-24 05:34:57,028 INFO] Step 3800/50000; acc:  86.85; ppl:  1.71; xent: 0.53; lr: 0.00010; 5289/6539 tok/s;   1481 sec
[2021-04-24 05:35:17,184 INFO] Step 3850/50000; acc:  87.42; ppl:  1.68; xent: 0.52; lr: 0.00010; 5040/6319 tok/s;   1501 sec
[2021-04-24 05:35:36,743 INFO] Step 3900/50000; acc:  87.42; ppl:  1.68; xent: 0.52; lr: 0.00010; 5185/6471 tok/s;   1520 sec
[2021-04-24 05:35:56,500 INFO] Step 3950/50000; acc:  87.37; ppl:  1.68; xent: 0.52; lr: 0.00010; 5099/6351 tok/s;   1540 sec
[2021-04-24 05:36:15,451 INFO] Step 4000/50000; acc:  87.48; ppl:  1.67; xent: 0.52; lr: 0.00010; 5472/6607 tok/s;   1559 sec
[2021-04-24 05:36:34,997 INFO] Step 4050/50000; acc:  87.35; ppl:  1.68; xent: 0.52; lr: 0.00010; 5153/6587 tok/s;   1579 sec
[2021-04-24 05:36:54,749 INFO] Step 4100/50000; acc:  87.82; ppl:  1.65; xent: 0.50; lr: 0.00010; 5280/6575 tok/s;   1598 sec
[2021-04-24 05:37:14,638 INFO] Step 4150/50000; acc:  87.42; ppl:  1.67; xent: 0.51; lr: 0.00010; 5049/6404 tok/s;   1618 sec
[2021-04-24 05:37:34,607 INFO] Step 4200/50000; acc:  87.32; ppl:  1.68; xent: 0.52; lr: 0.00010; 5048/6243 tok/s;   1638 sec
[2021-04-24 05:37:53,740 INFO] Step 4250/50000; acc:  87.98; ppl:  1.64; xent: 0.50; lr: 0.00010; 5399/6661 tok/s;   1657 sec
[2021-04-24 05:38:13,166 INFO] Step 4300/50000; acc:  88.02; ppl:  1.63; xent: 0.49; lr: 0.00010; 5212/6627 tok/s;   1677 sec
[2021-04-24 05:38:32,354 INFO] Step 4350/50000; acc:  87.61; ppl:  1.65; xent: 0.50; lr: 0.00010; 5247/6650 tok/s;   1696 sec
[2021-04-24 05:38:33,109 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 05:38:51,481 INFO] Step 4400/50000; acc:  87.86; ppl:  1.65; xent: 0.50; lr: 0.00010; 5295/6746 tok/s;   1715 sec
[2021-04-24 05:39:11,206 INFO] Step 4450/50000; acc:  87.98; ppl:  1.64; xent: 0.50; lr: 0.00010; 5253/6537 tok/s;   1735 sec
[2021-04-24 05:39:29,972 INFO] Step 4500/50000; acc:  87.03; ppl:  1.69; xent: 0.52; lr: 0.00010; 5312/6595 tok/s;   1754 sec
[2021-04-24 05:39:50,556 INFO] Step 4550/50000; acc:  87.78; ppl:  1.64; xent: 0.49; lr: 0.00010; 4913/6120 tok/s;   1774 sec
[2021-04-24 05:40:10,156 INFO] Step 4600/50000; acc:  87.71; ppl:  1.65; xent: 0.50; lr: 0.00010; 5236/6536 tok/s;   1794 sec
[2021-04-24 05:40:30,474 INFO] Step 4650/50000; acc:  88.06; ppl:  1.63; xent: 0.49; lr: 0.00010; 4965/6388 tok/s;   1814 sec
[2021-04-24 05:40:49,559 INFO] Step 4700/50000; acc:  87.42; ppl:  1.67; xent: 0.51; lr: 0.00010; 5419/6363 tok/s;   1833 sec
[2021-04-24 05:41:08,624 INFO] Step 4750/50000; acc:  87.89; ppl:  1.63; xent: 0.49; lr: 0.00010; 5311/6669 tok/s;   1852 sec
[2021-04-24 05:41:29,079 INFO] Step 4800/50000; acc:  88.02; ppl:  1.63; xent: 0.49; lr: 0.00010; 4972/6215 tok/s;   1873 sec
[2021-04-24 05:41:48,878 INFO] Step 4850/50000; acc:  87.87; ppl:  1.63; xent: 0.49; lr: 0.00010; 5101/6527 tok/s;   1892 sec
[2021-04-24 05:42:08,819 INFO] Step 4900/50000; acc:  88.11; ppl:  1.63; xent: 0.49; lr: 0.00010; 5226/6434 tok/s;   1912 sec
[2021-04-24 05:42:27,921 INFO] Step 4950/50000; acc:  87.71; ppl:  1.64; xent: 0.50; lr: 0.00010; 5289/6523 tok/s;   1931 sec
[2021-04-24 05:42:47,201 INFO] Step 5000/50000; acc:  88.29; ppl:  1.61; xent: 0.48; lr: 0.00010; 5235/6669 tok/s;   1951 sec
[2021-04-24 05:42:47,202 INFO] valid's transforms: TransformPipe()
[2021-04-24 05:42:47,203 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/basic/valid.txt, align=None)...
[2021-04-24 05:43:15,422 INFO] Validation perplexity: 1.59687
[2021-04-24 05:43:15,422 INFO] Validation accuracy: 88.4028
[2021-04-24 05:43:15,426 INFO] Saving checkpoint ../models/default_params/basic_ops/model_step_5000.pt
[2021-04-24 05:43:34,970 INFO] Step 5050/50000; acc:  88.22; ppl:  1.62; xent: 0.48; lr: 0.00010; 2129/2692 tok/s;   1999 sec
[2021-04-24 05:43:49,406 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 05:43:54,389 INFO] Step 5100/50000; acc:  88.38; ppl:  1.60; xent: 0.47; lr: 0.00010; 5256/6639 tok/s;   2018 sec
[2021-04-24 05:44:13,769 INFO] Step 5150/50000; acc:  88.01; ppl:  1.63; xent: 0.49; lr: 0.00010; 5201/6490 tok/s;   2037 sec
[2021-04-24 05:44:32,898 INFO] Step 5200/50000; acc:  87.85; ppl:  1.64; xent: 0.49; lr: 0.00010; 5292/6644 tok/s;   2056 sec
[2021-04-24 05:44:52,465 INFO] Step 5250/50000; acc:  87.85; ppl:  1.63; xent: 0.49; lr: 0.00010; 5229/6468 tok/s;   2076 sec
[2021-04-24 05:45:12,476 INFO] Step 5300/50000; acc:  88.12; ppl:  1.61; xent: 0.48; lr: 0.00010; 5041/6473 tok/s;   2096 sec
[2021-04-24 05:45:32,203 INFO] Step 5350/50000; acc:  88.08; ppl:  1.61; xent: 0.48; lr: 0.00010; 5112/6385 tok/s;   2116 sec
[2021-04-24 05:45:52,330 INFO] Step 5400/50000; acc:  88.10; ppl:  1.62; xent: 0.48; lr: 0.00010; 5123/6311 tok/s;   2136 sec
[2021-04-24 05:46:11,310 INFO] Step 5450/50000; acc:  88.17; ppl:  1.61; xent: 0.48; lr: 0.00010; 5413/6517 tok/s;   2155 sec
[2021-04-24 05:46:31,159 INFO] Step 5500/50000; acc:  88.11; ppl:  1.62; xent: 0.48; lr: 0.00010; 5167/6443 tok/s;   2175 sec
[2021-04-24 05:46:51,330 INFO] Step 5550/50000; acc:  88.34; ppl:  1.60; xent: 0.47; lr: 0.00010; 4957/6353 tok/s;   2195 sec
[2021-04-24 05:47:11,391 INFO] Step 5600/50000; acc:  88.29; ppl:  1.60; xent: 0.47; lr: 0.00010; 5101/6382 tok/s;   2215 sec
[2021-04-24 05:47:30,830 INFO] Step 5650/50000; acc:  88.07; ppl:  1.62; xent: 0.48; lr: 0.00010; 5177/6364 tok/s;   2234 sec
[2021-04-24 05:47:50,616 INFO] Step 5700/50000; acc:  88.51; ppl:  1.59; xent: 0.46; lr: 0.00010; 5286/6563 tok/s;   2254 sec
[2021-04-24 05:48:09,128 INFO] Step 5750/50000; acc:  88.53; ppl:  1.59; xent: 0.46; lr: 0.00010; 5417/6966 tok/s;   2273 sec
[2021-04-24 05:48:28,527 INFO] Step 5800/50000; acc:  88.42; ppl:  1.59; xent: 0.47; lr: 0.00010; 5165/6497 tok/s;   2292 sec
[2021-04-24 05:48:37,715 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 05:48:47,979 INFO] Step 5850/50000; acc:  88.30; ppl:  1.60; xent: 0.47; lr: 0.00010; 5284/6626 tok/s;   2312 sec
[2021-04-24 05:49:07,549 INFO] Step 5900/50000; acc:  88.76; ppl:  1.58; xent: 0.46; lr: 0.00010; 5243/6620 tok/s;   2331 sec
[2021-04-24 05:49:26,250 INFO] Step 5950/50000; acc:  87.91; ppl:  1.62; xent: 0.48; lr: 0.00010; 5353/6570 tok/s;   2350 sec
[2021-04-24 05:49:46,218 INFO] Step 6000/50000; acc:  88.12; ppl:  1.61; xent: 0.48; lr: 0.00010; 5047/6312 tok/s;   2370 sec
[2021-04-24 05:50:06,455 INFO] Step 6050/50000; acc:  88.56; ppl:  1.58; xent: 0.46; lr: 0.00010; 5082/6466 tok/s;   2390 sec
[2021-04-24 05:50:25,445 INFO] Step 6100/50000; acc:  88.39; ppl:  1.59; xent: 0.46; lr: 0.00010; 5275/6645 tok/s;   2409 sec
[2021-04-24 05:50:44,989 INFO] Step 6150/50000; acc:  88.48; ppl:  1.59; xent: 0.46; lr: 0.00010; 5243/6433 tok/s;   2429 sec
[2021-04-24 05:51:04,184 INFO] Step 6200/50000; acc:  88.40; ppl:  1.59; xent: 0.46; lr: 0.00010; 5404/6490 tok/s;   2448 sec
[2021-04-24 05:51:24,443 INFO] Step 6250/50000; acc:  88.56; ppl:  1.59; xent: 0.46; lr: 0.00010; 5016/6341 tok/s;   2468 sec
[2021-04-24 05:51:44,060 INFO] Step 6300/50000; acc:  88.68; ppl:  1.57; xent: 0.45; lr: 0.00010; 5210/6560 tok/s;   2488 sec
[2021-04-24 05:52:03,589 INFO] Step 6350/50000; acc:  88.44; ppl:  1.59; xent: 0.46; lr: 0.00010; 5133/6610 tok/s;   2507 sec
[2021-04-24 05:52:22,775 INFO] Step 6400/50000; acc:  88.32; ppl:  1.60; xent: 0.47; lr: 0.00010; 5317/6490 tok/s;   2526 sec
[2021-04-24 05:52:41,986 INFO] Step 6450/50000; acc:  88.63; ppl:  1.57; xent: 0.45; lr: 0.00010; 5268/6545 tok/s;   2546 sec
[2021-04-24 05:53:01,453 INFO] Step 6500/50000; acc:  89.18; ppl:  1.55; xent: 0.44; lr: 0.00010; 5317/6885 tok/s;   2565 sec
[2021-04-24 05:53:19,859 INFO] Step 6550/50000; acc:  88.36; ppl:  1.59; xent: 0.46; lr: 0.00010; 5476/6767 tok/s;   2583 sec
[2021-04-24 05:53:23,848 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 05:53:39,488 INFO] Step 6600/50000; acc:  88.86; ppl:  1.57; xent: 0.45; lr: 0.00010; 5134/6574 tok/s;   2603 sec
[2021-04-24 05:53:58,726 INFO] Step 6650/50000; acc:  88.52; ppl:  1.58; xent: 0.46; lr: 0.00010; 5319/6578 tok/s;   2622 sec
[2021-04-24 05:54:17,884 INFO] Step 6700/50000; acc:  88.27; ppl:  1.60; xent: 0.47; lr: 0.00010; 5292/6514 tok/s;   2641 sec
[2021-04-24 05:54:37,942 INFO] Step 6750/50000; acc:  88.55; ppl:  1.58; xent: 0.46; lr: 0.00010; 5024/6398 tok/s;   2661 sec
[2021-04-24 05:54:57,501 INFO] Step 6800/50000; acc:  88.63; ppl:  1.58; xent: 0.45; lr: 0.00010; 5155/6433 tok/s;   2681 sec
[2021-04-24 05:55:16,906 INFO] Step 6850/50000; acc:  88.85; ppl:  1.57; xent: 0.45; lr: 0.00010; 5293/6613 tok/s;   2700 sec
[2021-04-24 05:55:36,387 INFO] Step 6900/50000; acc:  88.58; ppl:  1.58; xent: 0.46; lr: 0.00010; 5272/6404 tok/s;   2720 sec
[2021-04-24 05:55:55,995 INFO] Step 6950/50000; acc:  88.64; ppl:  1.58; xent: 0.45; lr: 0.00010; 5165/6461 tok/s;   2740 sec
[2021-04-24 05:56:15,566 INFO] Step 7000/50000; acc:  88.72; ppl:  1.57; xent: 0.45; lr: 0.00010; 5254/6459 tok/s;   2759 sec
[2021-04-24 05:56:35,795 INFO] Step 7050/50000; acc:  88.94; ppl:  1.56; xent: 0.44; lr: 0.00010; 5040/6581 tok/s;   2779 sec
[2021-04-24 05:56:55,223 INFO] Step 7100/50000; acc:  88.33; ppl:  1.59; xent: 0.46; lr: 0.00010; 5232/6364 tok/s;   2799 sec
[2021-04-24 05:57:14,161 INFO] Step 7150/50000; acc:  88.74; ppl:  1.56; xent: 0.45; lr: 0.00010; 5337/6666 tok/s;   2818 sec
[2021-04-24 05:57:33,489 INFO] Step 7200/50000; acc:  89.15; ppl:  1.54; xent: 0.43; lr: 0.00010; 5260/6647 tok/s;   2837 sec
[2021-04-24 05:57:52,892 INFO] Step 7250/50000; acc:  88.86; ppl:  1.56; xent: 0.44; lr: 0.00010; 5168/6570 tok/s;   2856 sec
[2021-04-24 05:57:58,389 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 05:58:12,331 INFO] Step 7300/50000; acc:  89.14; ppl:  1.54; xent: 0.43; lr: 0.00010; 5392/6830 tok/s;   2876 sec
[2021-04-24 05:58:31,380 INFO] Step 7350/50000; acc:  88.74; ppl:  1.57; xent: 0.45; lr: 0.00010; 5285/6545 tok/s;   2895 sec
[2021-04-24 05:58:50,457 INFO] Step 7400/50000; acc:  88.58; ppl:  1.58; xent: 0.46; lr: 0.00010; 5241/6586 tok/s;   2914 sec
[2021-04-24 05:59:09,728 INFO] Step 7450/50000; acc:  88.39; ppl:  1.58; xent: 0.46; lr: 0.00010; 5296/6474 tok/s;   2933 sec
[2021-04-24 05:59:29,877 INFO] Step 7500/50000; acc:  89.00; ppl:  1.55; xent: 0.44; lr: 0.00010; 5056/6442 tok/s;   2953 sec
[2021-04-24 05:59:49,372 INFO] Step 7550/50000; acc:  88.81; ppl:  1.56; xent: 0.45; lr: 0.00010; 5138/6438 tok/s;   2973 sec
[2021-04-24 06:00:09,048 INFO] Step 7600/50000; acc:  88.83; ppl:  1.56; xent: 0.45; lr: 0.00010; 5173/6410 tok/s;   2993 sec
[2021-04-24 06:00:28,185 INFO] Step 7650/50000; acc:  88.92; ppl:  1.56; xent: 0.44; lr: 0.00010; 5480/6601 tok/s;   3012 sec
[2021-04-24 06:00:47,709 INFO] Step 7700/50000; acc:  88.68; ppl:  1.56; xent: 0.45; lr: 0.00010; 5156/6547 tok/s;   3031 sec
[2021-04-24 06:01:07,028 INFO] Step 7750/50000; acc:  89.14; ppl:  1.54; xent: 0.43; lr: 0.00010; 5243/6579 tok/s;   3051 sec
[2021-04-24 06:01:27,227 INFO] Step 7800/50000; acc:  88.83; ppl:  1.56; xent: 0.45; lr: 0.00010; 5081/6388 tok/s;   3071 sec
[2021-04-24 06:01:46,887 INFO] Step 7850/50000; acc:  88.77; ppl:  1.56; xent: 0.45; lr: 0.00010; 5181/6400 tok/s;   3090 sec
[2021-04-24 06:02:06,141 INFO] Step 7900/50000; acc:  89.24; ppl:  1.54; xent: 0.43; lr: 0.00010; 5339/6685 tok/s;   3110 sec
[2021-04-24 06:02:25,230 INFO] Step 7950/50000; acc:  89.15; ppl:  1.54; xent: 0.43; lr: 0.00010; 5216/6648 tok/s;   3129 sec
[2021-04-24 06:02:44,591 INFO] Step 8000/50000; acc:  89.08; ppl:  1.54; xent: 0.43; lr: 0.00010; 5262/6637 tok/s;   3148 sec
[2021-04-24 06:02:44,604 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 06:03:03,747 INFO] Step 8050/50000; acc:  89.07; ppl:  1.55; xent: 0.44; lr: 0.00010; 5259/6648 tok/s;   3167 sec
[2021-04-24 06:03:23,493 INFO] Step 8100/50000; acc:  89.08; ppl:  1.55; xent: 0.44; lr: 0.00010; 5303/6566 tok/s;   3187 sec
[2021-04-24 06:03:42,252 INFO] Step 8150/50000; acc:  88.33; ppl:  1.58; xent: 0.46; lr: 0.00010; 5297/6550 tok/s;   3206 sec
[2021-04-24 06:04:02,608 INFO] Step 8200/50000; acc:  89.18; ppl:  1.53; xent: 0.43; lr: 0.00010; 4948/6309 tok/s;   3226 sec
[2021-04-24 06:04:22,144 INFO] Step 8250/50000; acc:  88.86; ppl:  1.56; xent: 0.44; lr: 0.00010; 5246/6476 tok/s;   3246 sec
[2021-04-24 06:04:42,527 INFO] Step 8300/50000; acc:  89.24; ppl:  1.54; xent: 0.43; lr: 0.00010; 4967/6376 tok/s;   3266 sec
[2021-04-24 06:05:01,731 INFO] Step 8350/50000; acc:  88.76; ppl:  1.56; xent: 0.45; lr: 0.00010; 5319/6252 tok/s;   3285 sec
[2021-04-24 06:05:21,041 INFO] Step 8400/50000; acc:  88.93; ppl:  1.55; xent: 0.44; lr: 0.00010; 5284/6658 tok/s;   3305 sec
[2021-04-24 06:05:41,557 INFO] Step 8450/50000; acc:  89.41; ppl:  1.52; xent: 0.42; lr: 0.00010; 5006/6309 tok/s;   3325 sec
[2021-04-24 06:06:01,393 INFO] Step 8500/50000; acc:  88.85; ppl:  1.55; xent: 0.44; lr: 0.00010; 5105/6420 tok/s;   3345 sec
[2021-04-24 06:06:21,050 INFO] Step 8550/50000; acc:  89.16; ppl:  1.54; xent: 0.43; lr: 0.00010; 5147/6370 tok/s;   3365 sec
[2021-04-24 06:06:40,484 INFO] Step 8600/50000; acc:  89.10; ppl:  1.54; xent: 0.43; lr: 0.00010; 5316/6600 tok/s;   3384 sec
[2021-04-24 06:06:59,595 INFO] Step 8650/50000; acc:  89.40; ppl:  1.53; xent: 0.42; lr: 0.00010; 5316/6716 tok/s;   3403 sec
[2021-04-24 06:07:18,986 INFO] Step 8700/50000; acc:  89.49; ppl:  1.52; xent: 0.42; lr: 0.00010; 5221/6623 tok/s;   3423 sec
[2021-04-24 06:07:32,445 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 06:07:38,101 INFO] Step 8750/50000; acc:  89.23; ppl:  1.53; xent: 0.42; lr: 0.00010; 5275/6728 tok/s;   3442 sec
[2021-04-24 06:07:57,487 INFO] Step 8800/50000; acc:  89.18; ppl:  1.54; xent: 0.43; lr: 0.00010; 5250/6514 tok/s;   3461 sec
[2021-04-24 06:08:16,245 INFO] Step 8850/50000; acc:  88.98; ppl:  1.55; xent: 0.44; lr: 0.00010; 5365/6719 tok/s;   3480 sec
[2021-04-24 06:08:35,940 INFO] Step 8900/50000; acc:  89.00; ppl:  1.54; xent: 0.43; lr: 0.00010; 5261/6501 tok/s;   3499 sec
[2021-04-24 06:08:55,890 INFO] Step 8950/50000; acc:  89.08; ppl:  1.53; xent: 0.43; lr: 0.00010; 5037/6402 tok/s;   3519 sec
[2021-04-24 06:09:15,416 INFO] Step 9000/50000; acc:  89.02; ppl:  1.54; xent: 0.43; lr: 0.00010; 5134/6433 tok/s;   3539 sec
[2021-04-24 06:09:35,535 INFO] Step 9050/50000; acc:  89.22; ppl:  1.54; xent: 0.43; lr: 0.00010; 5123/6327 tok/s;   3559 sec
[2021-04-24 06:09:54,563 INFO] Step 9100/50000; acc:  89.06; ppl:  1.54; xent: 0.43; lr: 0.00010; 5418/6464 tok/s;   3578 sec
[2021-04-24 06:10:14,388 INFO] Step 9150/50000; acc:  89.14; ppl:  1.54; xent: 0.43; lr: 0.00010; 5110/6465 tok/s;   3598 sec
[2021-04-24 06:10:34,616 INFO] Step 9200/50000; acc:  89.51; ppl:  1.52; xent: 0.42; lr: 0.00010; 4982/6422 tok/s;   3618 sec
[2021-04-24 06:10:54,418 INFO] Step 9250/50000; acc:  89.24; ppl:  1.53; xent: 0.42; lr: 0.00010; 5222/6475 tok/s;   3638 sec
[2021-04-24 06:11:13,833 INFO] Step 9300/50000; acc:  89.06; ppl:  1.54; xent: 0.43; lr: 0.00010; 5191/6436 tok/s;   3657 sec
[2021-04-24 06:11:33,062 INFO] Step 9350/50000; acc:  89.43; ppl:  1.52; xent: 0.42; lr: 0.00010; 5287/6534 tok/s;   3677 sec
[2021-04-24 06:11:51,834 INFO] Step 9400/50000; acc:  89.45; ppl:  1.52; xent: 0.42; lr: 0.00010; 5463/6989 tok/s;   3695 sec
[2021-04-24 06:12:11,411 INFO] Step 9450/50000; acc:  89.39; ppl:  1.52; xent: 0.42; lr: 0.00010; 5163/6476 tok/s;   3715 sec
[2021-04-24 06:12:19,755 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 06:12:30,878 INFO] Step 9500/50000; acc:  89.45; ppl:  1.52; xent: 0.42; lr: 0.00010; 5250/6623 tok/s;   3734 sec
[2021-04-24 06:12:50,347 INFO] Step 9550/50000; acc:  89.44; ppl:  1.52; xent: 0.42; lr: 0.00010; 5188/6630 tok/s;   3754 sec
[2021-04-24 06:13:09,112 INFO] Step 9600/50000; acc:  88.69; ppl:  1.56; xent: 0.44; lr: 0.00010; 5400/6506 tok/s;   3773 sec
[2021-04-24 06:13:29,119 INFO] Step 9650/50000; acc:  89.12; ppl:  1.53; xent: 0.43; lr: 0.00010; 5008/6316 tok/s;   3793 sec
[2021-04-24 06:13:49,537 INFO] Step 9700/50000; acc:  89.54; ppl:  1.51; xent: 0.41; lr: 0.00010; 5094/6454 tok/s;   3813 sec
[2021-04-24 06:14:08,920 INFO] Step 9750/50000; acc:  89.17; ppl:  1.53; xent: 0.43; lr: 0.00010; 5156/6522 tok/s;   3832 sec
[2021-04-24 06:14:28,348 INFO] Step 9800/50000; acc:  89.46; ppl:  1.52; xent: 0.42; lr: 0.00010; 5250/6431 tok/s;   3852 sec
[2021-04-24 06:14:47,519 INFO] Step 9850/50000; acc:  89.22; ppl:  1.53; xent: 0.43; lr: 0.00010; 5391/6447 tok/s;   3871 sec
[2021-04-24 06:15:07,861 INFO] Step 9900/50000; acc:  89.32; ppl:  1.52; xent: 0.42; lr: 0.00010; 5006/6345 tok/s;   3891 sec
[2021-04-24 06:15:27,330 INFO] Step 9950/50000; acc:  89.58; ppl:  1.50; xent: 0.41; lr: 0.00010; 5195/6584 tok/s;   3911 sec
[2021-04-24 06:15:47,053 INFO] Step 10000/50000; acc:  89.18; ppl:  1.53; xent: 0.43; lr: 0.00010; 5126/6510 tok/s;   3931 sec
[2021-04-24 06:15:47,055 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/basic/valid.txt, align=None)...
[2021-04-24 06:16:15,271 INFO] Validation perplexity: 1.50777
[2021-04-24 06:16:15,271 INFO] Validation accuracy: 89.5905
[2021-04-24 06:16:15,275 INFO] Saving checkpoint ../models/default_params/basic_ops/model_step_10000.pt
[2021-04-24 06:16:34,538 INFO] Step 10050/50000; acc:  89.33; ppl:  1.52; xent: 0.42; lr: 0.00010; 2172/2670 tok/s;   3978 sec
[2021-04-24 06:16:53,391 INFO] Step 10100/50000; acc:  89.45; ppl:  1.51; xent: 0.41; lr: 0.00010; 5369/6618 tok/s;   3997 sec
[2021-04-24 06:17:12,597 INFO] Step 10150/50000; acc:  89.85; ppl:  1.50; xent: 0.40; lr: 0.00010; 5238/6851 tok/s;   4016 sec
[2021-04-24 06:17:31,411 INFO] Step 10200/50000; acc:  89.19; ppl:  1.53; xent: 0.42; lr: 0.00010; 5475/6738 tok/s;   4035 sec
[2021-04-24 06:17:34,494 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 06:17:50,948 INFO] Step 10250/50000; acc:  89.54; ppl:  1.52; xent: 0.42; lr: 0.00010; 5211/6579 tok/s;   4054 sec
[2021-04-24 06:18:10,320 INFO] Step 10300/50000; acc:  89.42; ppl:  1.52; xent: 0.42; lr: 0.00010; 5249/6551 tok/s;   4074 sec
[2021-04-24 06:18:28,956 INFO] Step 10350/50000; acc:  88.95; ppl:  1.54; xent: 0.43; lr: 0.00010; 5366/6706 tok/s;   4093 sec
[2021-04-24 06:18:49,036 INFO] Step 10400/50000; acc:  89.42; ppl:  1.51; xent: 0.41; lr: 0.00010; 5058/6388 tok/s;   4113 sec
[2021-04-24 06:19:08,735 INFO] Step 10450/50000; acc:  89.34; ppl:  1.52; xent: 0.42; lr: 0.00010; 5095/6420 tok/s;   4132 sec
[2021-04-24 06:19:28,359 INFO] Step 10500/50000; acc:  89.55; ppl:  1.51; xent: 0.41; lr: 0.00010; 5305/6540 tok/s;   4152 sec
[2021-04-24 06:19:47,811 INFO] Step 10550/50000; acc:  89.25; ppl:  1.52; xent: 0.42; lr: 0.00010; 5268/6386 tok/s;   4171 sec
[2021-04-24 06:20:07,421 INFO] Step 10600/50000; acc:  89.45; ppl:  1.51; xent: 0.41; lr: 0.00010; 5125/6428 tok/s;   4191 sec
[2021-04-24 06:20:27,299 INFO] Step 10650/50000; acc:  89.48; ppl:  1.51; xent: 0.41; lr: 0.00010; 5172/6406 tok/s;   4211 sec
[2021-04-24 06:20:47,435 INFO] Step 10700/50000; acc:  89.62; ppl:  1.50; xent: 0.41; lr: 0.00010; 5070/6561 tok/s;   4231 sec
[2021-04-24 06:21:06,774 INFO] Step 10750/50000; acc:  89.14; ppl:  1.53; xent: 0.43; lr: 0.00010; 5194/6380 tok/s;   4250 sec
[2021-04-24 06:21:25,738 INFO] Step 10800/50000; acc:  89.68; ppl:  1.50; xent: 0.40; lr: 0.00010; 5382/6672 tok/s;   4269 sec
[2021-04-24 06:21:45,367 INFO] Step 10850/50000; acc:  90.01; ppl:  1.49; xent: 0.40; lr: 0.00010; 5238/6670 tok/s;   4289 sec
[2021-04-24 06:22:04,548 INFO] Step 10900/50000; acc:  89.40; ppl:  1.51; xent: 0.42; lr: 0.00010; 5236/6560 tok/s;   4308 sec
[2021-04-24 06:22:09,328 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 06:22:24,060 INFO] Step 10950/50000; acc:  89.74; ppl:  1.50; xent: 0.40; lr: 0.00010; 5210/6663 tok/s;   4328 sec
[2021-04-24 06:22:43,170 INFO] Step 11000/50000; acc:  89.66; ppl:  1.51; xent: 0.41; lr: 0.00010; 5389/6722 tok/s;   4347 sec
[2021-04-24 06:23:02,373 INFO] Step 11050/50000; acc:  89.20; ppl:  1.53; xent: 0.42; lr: 0.00010; 5246/6488 tok/s;   4366 sec
[2021-04-24 06:23:21,834 INFO] Step 11100/50000; acc:  89.18; ppl:  1.52; xent: 0.42; lr: 0.00010; 5224/6428 tok/s;   4385 sec
[2021-04-24 06:23:41,675 INFO] Step 11150/50000; acc:  89.56; ppl:  1.50; xent: 0.41; lr: 0.00010; 5060/6432 tok/s;   4405 sec
[2021-04-24 06:24:01,347 INFO] Step 11200/50000; acc:  89.59; ppl:  1.50; xent: 0.41; lr: 0.00010; 5139/6489 tok/s;   4425 sec
[2021-04-24 06:24:20,691 INFO] Step 11250/50000; acc:  89.47; ppl:  1.51; xent: 0.41; lr: 0.00010; 5246/6473 tok/s;   4444 sec
[2021-04-24 06:24:39,683 INFO] Step 11300/50000; acc:  89.62; ppl:  1.50; xent: 0.41; lr: 0.00010; 5578/6690 tok/s;   4463 sec
[2021-04-24 06:24:59,476 INFO] Step 11350/50000; acc:  89.51; ppl:  1.51; xent: 0.41; lr: 0.00010; 5064/6540 tok/s;   4483 sec
[2021-04-24 06:25:19,062 INFO] Step 11400/50000; acc:  89.61; ppl:  1.50; xent: 0.41; lr: 0.00010; 5151/6383 tok/s;   4503 sec
[2021-04-24 06:25:38,980 INFO] Step 11450/50000; acc:  89.60; ppl:  1.51; xent: 0.41; lr: 0.00010; 5144/6471 tok/s;   4523 sec
[2021-04-24 06:25:58,704 INFO] Step 11500/50000; acc:  89.54; ppl:  1.50; xent: 0.41; lr: 0.00010; 5178/6451 tok/s;   4542 sec
[2021-04-24 06:26:17,598 INFO] Step 11550/50000; acc:  89.76; ppl:  1.50; xent: 0.40; lr: 0.00010; 5377/6690 tok/s;   4561 sec
[2021-04-24 06:26:36,674 INFO] Step 11600/50000; acc:  89.98; ppl:  1.48; xent: 0.39; lr: 0.00010; 5263/6735 tok/s;   4580 sec
[2021-04-24 06:26:55,196 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 06:26:56,176 INFO] Step 11650/50000; acc:  89.76; ppl:  1.48; xent: 0.39; lr: 0.00010; 5290/6668 tok/s;   4600 sec
[2021-04-24 06:27:15,537 INFO] Step 11700/50000; acc:  89.55; ppl:  1.51; xent: 0.41; lr: 0.00010; 5199/6529 tok/s;   4619 sec
[2021-04-24 06:27:35,042 INFO] Step 11750/50000; acc:  89.45; ppl:  1.52; xent: 0.42; lr: 0.00010; 5212/6484 tok/s;   4639 sec
[2021-04-24 06:27:54,101 INFO] Step 11800/50000; acc:  89.15; ppl:  1.53; xent: 0.42; lr: 0.00010; 5334/6521 tok/s;   4658 sec
[2021-04-24 06:28:14,582 INFO] Step 11850/50000; acc:  89.70; ppl:  1.49; xent: 0.40; lr: 0.00010; 4958/6308 tok/s;   4678 sec
[2021-04-24 06:28:33,981 INFO] Step 11900/50000; acc:  89.68; ppl:  1.50; xent: 0.40; lr: 0.00010; 5261/6552 tok/s;   4698 sec
[2021-04-24 06:28:54,311 INFO] Step 11950/50000; acc:  89.67; ppl:  1.50; xent: 0.40; lr: 0.00010; 4918/6353 tok/s;   4718 sec
[2021-04-24 06:29:13,438 INFO] Step 12000/50000; acc:  89.44; ppl:  1.51; xent: 0.41; lr: 0.00010; 5393/6321 tok/s;   4737 sec
[2021-04-24 06:29:32,735 INFO] Step 12050/50000; acc:  89.52; ppl:  1.50; xent: 0.41; lr: 0.00010; 5258/6611 tok/s;   4756 sec
[2021-04-24 06:29:53,031 INFO] Step 12100/50000; acc:  90.00; ppl:  1.48; xent: 0.39; lr: 0.00010; 5110/6413 tok/s;   4777 sec
[2021-04-24 06:30:12,723 INFO] Step 12150/50000; acc:  89.44; ppl:  1.51; xent: 0.41; lr: 0.00010; 5133/6437 tok/s;   4796 sec
[2021-04-24 06:30:32,729 INFO] Step 12200/50000; acc:  89.68; ppl:  1.50; xent: 0.40; lr: 0.00010; 5026/6312 tok/s;   4816 sec
[2021-04-24 06:30:52,271 INFO] Step 12250/50000; acc:  89.68; ppl:  1.49; xent: 0.40; lr: 0.00010; 5284/6475 tok/s;   4836 sec
[2021-04-24 06:31:11,245 INFO] Step 12300/50000; acc:  90.03; ppl:  1.48; xent: 0.39; lr: 0.00010; 5360/6807 tok/s;   4855 sec
[2021-04-24 06:31:30,564 INFO] Step 12350/50000; acc:  89.92; ppl:  1.48; xent: 0.39; lr: 0.00010; 5178/6603 tok/s;   4874 sec
[2021-04-24 06:31:43,460 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 06:31:49,849 INFO] Step 12400/50000; acc:  89.76; ppl:  1.49; xent: 0.40; lr: 0.00010; 5271/6684 tok/s;   4893 sec
[2021-04-24 06:32:09,576 INFO] Step 12450/50000; acc:  89.87; ppl:  1.49; xent: 0.40; lr: 0.00010; 5232/6535 tok/s;   4913 sec
[2021-04-24 06:32:28,067 INFO] Step 12500/50000; acc:  89.38; ppl:  1.51; xent: 0.41; lr: 0.00010; 5439/6708 tok/s;   4932 sec
[2021-04-24 06:32:47,763 INFO] Step 12550/50000; acc:  89.46; ppl:  1.51; xent: 0.41; lr: 0.00010; 5114/6392 tok/s;   4951 sec
[2021-04-24 06:33:07,803 INFO] Step 12600/50000; acc:  89.75; ppl:  1.49; xent: 0.40; lr: 0.00010; 5116/6475 tok/s;   4971 sec
[2021-04-24 06:33:27,383 INFO] Step 12650/50000; acc:  89.63; ppl:  1.50; xent: 0.40; lr: 0.00010; 5171/6460 tok/s;   4991 sec
[2021-04-24 06:33:47,509 INFO] Step 12700/50000; acc:  89.72; ppl:  1.49; xent: 0.40; lr: 0.00010; 5102/6298 tok/s;   5011 sec
[2021-04-24 06:34:06,332 INFO] Step 12750/50000; acc:  89.54; ppl:  1.50; xent: 0.40; lr: 0.00010; 5391/6477 tok/s;   5030 sec
[2021-04-24 06:34:26,291 INFO] Step 12800/50000; acc:  89.70; ppl:  1.49; xent: 0.40; lr: 0.00010; 5125/6464 tok/s;   5050 sec
[2021-04-24 06:34:46,433 INFO] Step 12850/50000; acc:  90.09; ppl:  1.47; xent: 0.39; lr: 0.00010; 4981/6464 tok/s;   5070 sec
[2021-04-24 06:35:06,380 INFO] Step 12900/50000; acc:  89.72; ppl:  1.49; xent: 0.40; lr: 0.00010; 5235/6423 tok/s;   5090 sec
[2021-04-24 06:35:25,486 INFO] Step 12950/50000; acc:  89.58; ppl:  1.50; xent: 0.40; lr: 0.00010; 5274/6545 tok/s;   5109 sec
[2021-04-24 06:35:44,779 INFO] Step 13000/50000; acc:  89.92; ppl:  1.48; xent: 0.39; lr: 0.00010; 5232/6513 tok/s;   5128 sec
[2021-04-24 06:36:03,537 INFO] Step 13050/50000; acc:  90.11; ppl:  1.47; xent: 0.39; lr: 0.00010; 5457/6994 tok/s;   5147 sec
[2021-04-24 06:36:22,886 INFO] Step 13100/50000; acc:  89.74; ppl:  1.48; xent: 0.39; lr: 0.00010; 5241/6601 tok/s;   5166 sec
[2021-04-24 06:36:30,424 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 06:36:42,375 INFO] Step 13150/50000; acc:  89.84; ppl:  1.49; xent: 0.40; lr: 0.00010; 5180/6521 tok/s;   5186 sec
[2021-04-24 06:37:01,880 INFO] Step 13200/50000; acc:  89.98; ppl:  1.48; xent: 0.39; lr: 0.00010; 5217/6649 tok/s;   5205 sec
[2021-04-24 06:37:20,503 INFO] Step 13250/50000; acc:  89.46; ppl:  1.51; xent: 0.41; lr: 0.00010; 5513/6675 tok/s;   5224 sec
[2021-04-24 06:37:40,449 INFO] Step 13300/50000; acc:  89.65; ppl:  1.49; xent: 0.40; lr: 0.00010; 5026/6285 tok/s;   5244 sec
[2021-04-24 06:38:00,684 INFO] Step 13350/50000; acc:  89.79; ppl:  1.49; xent: 0.40; lr: 0.00010; 4991/6375 tok/s;   5264 sec
[2021-04-24 06:38:19,971 INFO] Step 13400/50000; acc:  89.73; ppl:  1.49; xent: 0.40; lr: 0.00010; 5300/6599 tok/s;   5284 sec
[2021-04-24 06:38:39,597 INFO] Step 13450/50000; acc:  89.91; ppl:  1.48; xent: 0.39; lr: 0.00010; 5245/6418 tok/s;   5303 sec
[2021-04-24 06:38:59,007 INFO] Step 13500/50000; acc:  89.71; ppl:  1.49; xent: 0.40; lr: 0.00010; 5287/6445 tok/s;   5323 sec
[2021-04-24 06:39:18,764 INFO] Step 13550/50000; acc:  89.68; ppl:  1.49; xent: 0.40; lr: 0.00010; 5088/6341 tok/s;   5342 sec
[2021-04-24 06:39:38,681 INFO] Step 13600/50000; acc:  90.10; ppl:  1.46; xent: 0.38; lr: 0.00010; 5130/6559 tok/s;   5362 sec
[2021-04-24 06:39:58,279 INFO] Step 13650/50000; acc:  89.78; ppl:  1.49; xent: 0.40; lr: 0.00010; 5129/6504 tok/s;   5382 sec
[2021-04-24 06:40:18,090 INFO] Step 13700/50000; acc:  89.92; ppl:  1.48; xent: 0.39; lr: 0.00010; 5275/6484 tok/s;   5402 sec
[2021-04-24 06:40:36,980 INFO] Step 13750/50000; acc:  90.10; ppl:  1.47; xent: 0.38; lr: 0.00010; 5331/6680 tok/s;   5421 sec
[2021-04-24 06:40:55,973 INFO] Step 13800/50000; acc:  90.12; ppl:  1.47; xent: 0.39; lr: 0.00010; 5276/6733 tok/s;   5440 sec
[2021-04-24 06:41:15,124 INFO] Step 13850/50000; acc:  89.81; ppl:  1.48; xent: 0.39; lr: 0.00010; 5365/6682 tok/s;   5459 sec
[2021-04-24 06:41:17,503 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 06:41:34,718 INFO] Step 13900/50000; acc:  90.14; ppl:  1.47; xent: 0.39; lr: 0.00010; 5211/6618 tok/s;   5478 sec
[2021-04-24 06:41:53,964 INFO] Step 13950/50000; acc:  89.70; ppl:  1.49; xent: 0.40; lr: 0.00010; 5218/6483 tok/s;   5498 sec
[2021-04-24 06:42:13,046 INFO] Step 14000/50000; acc:  89.51; ppl:  1.50; xent: 0.40; lr: 0.00010; 5279/6611 tok/s;   5517 sec
[2021-04-24 06:42:33,384 INFO] Step 14050/50000; acc:  90.14; ppl:  1.47; xent: 0.38; lr: 0.00010; 5066/6404 tok/s;   5537 sec
[2021-04-24 06:42:52,681 INFO] Step 14100/50000; acc:  89.51; ppl:  1.50; xent: 0.40; lr: 0.00010; 5194/6421 tok/s;   5556 sec
[2021-04-24 06:43:12,500 INFO] Step 14150/50000; acc:  90.03; ppl:  1.47; xent: 0.39; lr: 0.00010; 5106/6421 tok/s;   5576 sec
[2021-04-24 06:43:32,091 INFO] Step 14200/50000; acc:  89.84; ppl:  1.48; xent: 0.39; lr: 0.00010; 5347/6454 tok/s;   5596 sec
[2021-04-24 06:43:51,746 INFO] Step 14250/50000; acc:  89.86; ppl:  1.48; xent: 0.39; lr: 0.00010; 5161/6430 tok/s;   5615 sec
[2021-04-24 06:44:11,521 INFO] Step 14300/50000; acc:  89.98; ppl:  1.48; xent: 0.39; lr: 0.00010; 5167/6397 tok/s;   5635 sec
[2021-04-24 06:44:31,668 INFO] Step 14350/50000; acc:  89.96; ppl:  1.48; xent: 0.39; lr: 0.00010; 4998/6541 tok/s;   5655 sec
[2021-04-24 06:44:50,800 INFO] Step 14400/50000; acc:  89.68; ppl:  1.49; xent: 0.40; lr: 0.00010; 5306/6456 tok/s;   5674 sec
[2021-04-24 06:45:10,136 INFO] Step 14450/50000; acc:  90.16; ppl:  1.46; xent: 0.38; lr: 0.00010; 5254/6527 tok/s;   5694 sec
[2021-04-24 06:45:29,530 INFO] Step 14500/50000; acc:  90.42; ppl:  1.45; xent: 0.37; lr: 0.00010; 5350/6777 tok/s;   5713 sec
[2021-04-24 06:45:48,596 INFO] Step 14550/50000; acc:  89.86; ppl:  1.48; xent: 0.39; lr: 0.00010; 5253/6594 tok/s;   5732 sec
[2021-04-24 06:45:52,668 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 06:46:07,924 INFO] Step 14600/50000; acc:  90.11; ppl:  1.46; xent: 0.38; lr: 0.00010; 5227/6670 tok/s;   5751 sec
[2021-04-24 06:46:27,464 INFO] Step 14650/50000; acc:  90.09; ppl:  1.47; xent: 0.39; lr: 0.00010; 5264/6579 tok/s;   5771 sec
[2021-04-24 06:46:46,515 INFO] Step 14700/50000; acc:  89.62; ppl:  1.49; xent: 0.40; lr: 0.00010; 5307/6578 tok/s;   5790 sec
[2021-04-24 06:47:06,384 INFO] Step 14750/50000; acc:  89.63; ppl:  1.49; xent: 0.40; lr: 0.00010; 5063/6297 tok/s;   5810 sec
[2021-04-24 06:47:26,214 INFO] Step 14800/50000; acc:  90.08; ppl:  1.46; xent: 0.38; lr: 0.00010; 5098/6474 tok/s;   5830 sec
[2021-04-24 06:47:46,125 INFO] Step 14850/50000; acc:  90.08; ppl:  1.47; xent: 0.39; lr: 0.00010; 5136/6504 tok/s;   5850 sec
[2021-04-24 06:48:05,595 INFO] Step 14900/50000; acc:  89.66; ppl:  1.49; xent: 0.40; lr: 0.00010; 5226/6271 tok/s;   5869 sec
[2021-04-24 06:48:24,358 INFO] Step 14950/50000; acc:  89.92; ppl:  1.48; xent: 0.39; lr: 0.00010; 5472/6704 tok/s;   5888 sec
[2021-04-24 06:48:44,748 INFO] Step 15000/50000; acc:  89.99; ppl:  1.47; xent: 0.39; lr: 0.00010; 5028/6420 tok/s;   5908 sec
[2021-04-24 06:48:44,749 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/basic/valid.txt, align=None)...
[2021-04-24 06:49:12,958 INFO] Validation perplexity: 1.47403
[2021-04-24 06:49:12,958 INFO] Validation accuracy: 90.0341
[2021-04-24 06:49:12,962 INFO] Saving checkpoint ../models/default_params/basic_ops/model_step_15000.pt
[2021-04-24 06:49:32,459 INFO] Step 15050/50000; acc:  90.17; ppl:  1.46; xent: 0.38; lr: 0.00010; 2133/2670 tok/s;   5956 sec
[2021-04-24 06:49:52,141 INFO] Step 15100/50000; acc:  89.89; ppl:  1.48; xent: 0.39; lr: 0.00010; 5183/6463 tok/s;   5976 sec
[2021-04-24 06:50:11,659 INFO] Step 15150/50000; acc:  89.87; ppl:  1.47; xent: 0.39; lr: 0.00010; 5160/6457 tok/s;   5995 sec
[2021-04-24 06:50:30,606 INFO] Step 15200/50000; acc:  90.26; ppl:  1.46; xent: 0.38; lr: 0.00010; 5416/6741 tok/s;   6014 sec
[2021-04-24 06:50:49,698 INFO] Step 15250/50000; acc:  90.23; ppl:  1.46; xent: 0.38; lr: 0.00010; 5224/6679 tok/s;   6033 sec
[2021-04-24 06:51:07,693 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 06:51:09,449 INFO] Step 15300/50000; acc:  90.20; ppl:  1.45; xent: 0.37; lr: 0.00010; 5283/6596 tok/s;   6053 sec
[2021-04-24 06:51:28,631 INFO] Step 15350/50000; acc:  89.98; ppl:  1.47; xent: 0.39; lr: 0.00010; 5236/6561 tok/s;   6072 sec
[2021-04-24 06:51:48,116 INFO] Step 15400/50000; acc:  89.86; ppl:  1.48; xent: 0.39; lr: 0.00010; 5191/6535 tok/s;   6092 sec
[2021-04-24 06:52:07,168 INFO] Step 15450/50000; acc:  89.66; ppl:  1.49; xent: 0.40; lr: 0.00010; 5329/6489 tok/s;   6111 sec
[2021-04-24 06:52:27,482 INFO] Step 15500/50000; acc:  90.13; ppl:  1.46; xent: 0.38; lr: 0.00010; 5008/6443 tok/s;   6131 sec
[2021-04-24 06:52:46,755 INFO] Step 15550/50000; acc:  89.96; ppl:  1.47; xent: 0.38; lr: 0.00010; 5229/6493 tok/s;   6150 sec
[2021-04-24 06:53:06,966 INFO] Step 15600/50000; acc:  90.15; ppl:  1.46; xent: 0.38; lr: 0.00010; 4997/6366 tok/s;   6171 sec
[2021-04-24 06:53:26,175 INFO] Step 15650/50000; acc:  90.03; ppl:  1.47; xent: 0.38; lr: 0.00010; 5431/6498 tok/s;   6190 sec
[2021-04-24 06:53:45,488 INFO] Step 15700/50000; acc:  89.69; ppl:  1.48; xent: 0.39; lr: 0.00010; 5261/6486 tok/s;   6209 sec
[2021-04-24 06:54:05,642 INFO] Step 15750/50000; acc:  90.38; ppl:  1.45; xent: 0.37; lr: 0.00010; 4988/6388 tok/s;   6229 sec
[2021-04-24 06:54:25,725 INFO] Step 15800/50000; acc:  89.99; ppl:  1.47; xent: 0.38; lr: 0.00010; 5149/6466 tok/s;   6249 sec
[2021-04-24 06:54:45,464 INFO] Step 15850/50000; acc:  90.00; ppl:  1.47; xent: 0.39; lr: 0.00010; 5133/6342 tok/s;   6269 sec
[2021-04-24 06:55:05,201 INFO] Step 15900/50000; acc:  90.17; ppl:  1.46; xent: 0.38; lr: 0.00010; 5198/6406 tok/s;   6289 sec
[2021-04-24 06:55:23,656 INFO] Step 15950/50000; acc:  90.17; ppl:  1.46; xent: 0.38; lr: 0.00010; 5443/6862 tok/s;   6307 sec
[2021-04-24 06:55:43,004 INFO] Step 16000/50000; acc:  90.33; ppl:  1.45; xent: 0.37; lr: 0.00010; 5224/6646 tok/s;   6327 sec
[2021-04-24 06:55:55,386 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 06:56:02,317 INFO] Step 16050/50000; acc:  90.16; ppl:  1.46; xent: 0.38; lr: 0.00010; 5238/6671 tok/s;   6346 sec
[2021-04-24 06:56:22,068 INFO] Step 16100/50000; acc:  90.42; ppl:  1.45; xent: 0.37; lr: 0.00010; 5292/6615 tok/s;   6366 sec
[2021-04-24 06:56:40,525 INFO] Step 16150/50000; acc:  89.71; ppl:  1.48; xent: 0.39; lr: 0.00010; 5423/6689 tok/s;   6384 sec
[2021-04-24 06:57:00,638 INFO] Step 16200/50000; acc:  89.85; ppl:  1.47; xent: 0.39; lr: 0.00010; 4981/6271 tok/s;   6404 sec
[2021-04-24 06:57:20,699 INFO] Step 16250/50000; acc:  90.06; ppl:  1.46; xent: 0.38; lr: 0.00010; 5105/6452 tok/s;   6424 sec
[2021-04-24 06:57:40,050 INFO] Step 16300/50000; acc:  90.00; ppl:  1.47; xent: 0.38; lr: 0.00010; 5239/6522 tok/s;   6444 sec
[2021-04-24 06:58:00,101 INFO] Step 16350/50000; acc:  90.11; ppl:  1.46; xent: 0.38; lr: 0.00010; 5066/6286 tok/s;   6464 sec
[2021-04-24 06:58:19,073 INFO] Step 16400/50000; acc:  90.05; ppl:  1.47; xent: 0.38; lr: 0.00010; 5399/6473 tok/s;   6483 sec
[2021-04-24 06:58:39,099 INFO] Step 16450/50000; acc:  90.15; ppl:  1.46; xent: 0.38; lr: 0.00010; 5164/6556 tok/s;   6503 sec
[2021-04-24 06:58:59,059 INFO] Step 16500/50000; acc:  90.40; ppl:  1.44; xent: 0.37; lr: 0.00010; 5031/6461 tok/s;   6523 sec
[2021-04-24 06:59:18,487 INFO] Step 16550/50000; acc:  90.03; ppl:  1.47; xent: 0.38; lr: 0.00010; 5218/6463 tok/s;   6542 sec
[2021-04-24 06:59:37,662 INFO] Step 16600/50000; acc:  89.91; ppl:  1.47; xent: 0.39; lr: 0.00010; 5370/6587 tok/s;   6561 sec
[2021-04-24 06:59:57,037 INFO] Step 16650/50000; acc:  90.44; ppl:  1.44; xent: 0.37; lr: 0.00010; 5258/6515 tok/s;   6581 sec
[2021-04-24 07:00:15,759 INFO] Step 16700/50000; acc:  90.60; ppl:  1.44; xent: 0.37; lr: 0.00010; 5436/7046 tok/s;   6599 sec
[2021-04-24 07:00:35,052 INFO] Step 16750/50000; acc:  90.01; ppl:  1.46; xent: 0.38; lr: 0.00010; 5181/6536 tok/s;   6619 sec
[2021-04-24 07:00:41,945 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 07:00:54,807 INFO] Step 16800/50000; acc:  90.29; ppl:  1.45; xent: 0.37; lr: 0.00010; 5172/6500 tok/s;   6638 sec
[2021-04-24 07:01:13,946 INFO] Step 16850/50000; acc:  90.23; ppl:  1.45; xent: 0.37; lr: 0.00010; 5274/6710 tok/s;   6657 sec
[2021-04-24 07:01:32,786 INFO] Step 16900/50000; acc:  89.76; ppl:  1.48; xent: 0.39; lr: 0.00010; 5511/6612 tok/s;   6676 sec
[2021-04-24 07:01:52,652 INFO] Step 16950/50000; acc:  90.16; ppl:  1.46; xent: 0.38; lr: 0.00010; 5032/6366 tok/s;   6696 sec
[2021-04-24 07:02:12,617 INFO] Step 17000/50000; acc:  90.07; ppl:  1.46; xent: 0.38; lr: 0.00010; 5031/6399 tok/s;   6716 sec
[2021-04-24 07:02:31,869 INFO] Step 17050/50000; acc:  90.18; ppl:  1.46; xent: 0.38; lr: 0.00010; 5305/6608 tok/s;   6735 sec
[2021-04-24 07:02:51,365 INFO] Step 17100/50000; acc:  90.24; ppl:  1.45; xent: 0.37; lr: 0.00010; 5298/6494 tok/s;   6755 sec
[2021-04-24 07:03:10,710 INFO] Step 17150/50000; acc:  90.17; ppl:  1.46; xent: 0.38; lr: 0.00010; 5240/6417 tok/s;   6774 sec
[2021-04-24 07:03:30,426 INFO] Step 17200/50000; acc:  90.09; ppl:  1.47; xent: 0.38; lr: 0.00010; 5137/6411 tok/s;   6794 sec
[2021-04-24 07:03:50,422 INFO] Step 17250/50000; acc:  90.50; ppl:  1.44; xent: 0.36; lr: 0.00010; 5175/6623 tok/s;   6814 sec
[2021-04-24 07:04:10,258 INFO] Step 17300/50000; acc:  90.12; ppl:  1.46; xent: 0.38; lr: 0.00010; 5068/6369 tok/s;   6834 sec
[2021-04-24 07:04:29,504 INFO] Step 17350/50000; acc:  90.05; ppl:  1.46; xent: 0.38; lr: 0.00010; 5273/6465 tok/s;   6853 sec
[2021-04-24 07:04:48,627 INFO] Step 17400/50000; acc:  90.60; ppl:  1.44; xent: 0.36; lr: 0.00010; 5380/6811 tok/s;   6872 sec
[2021-04-24 07:05:07,811 INFO] Step 17450/50000; acc:  90.50; ppl:  1.45; xent: 0.37; lr: 0.00010; 5277/6638 tok/s;   6891 sec
[2021-04-24 07:05:26,879 INFO] Step 17500/50000; acc:  90.23; ppl:  1.45; xent: 0.37; lr: 0.00010; 5358/6727 tok/s;   6910 sec
[2021-04-24 07:05:28,387 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 07:05:46,477 INFO] Step 17550/50000; acc:  90.28; ppl:  1.46; xent: 0.38; lr: 0.00010; 5144/6549 tok/s;   6930 sec
[2021-04-24 07:06:05,538 INFO] Step 17600/50000; acc:  90.26; ppl:  1.45; xent: 0.37; lr: 0.00010; 5319/6571 tok/s;   6949 sec
[2021-04-24 07:06:24,817 INFO] Step 17650/50000; acc:  89.77; ppl:  1.47; xent: 0.39; lr: 0.00010; 5198/6540 tok/s;   6968 sec
[2021-04-24 07:06:44,950 INFO] Step 17700/50000; acc:  90.47; ppl:  1.44; xent: 0.37; lr: 0.00010; 5169/6451 tok/s;   6988 sec
[2021-04-24 07:07:04,435 INFO] Step 17750/50000; acc:  89.92; ppl:  1.47; xent: 0.38; lr: 0.00010; 5133/6413 tok/s;   7008 sec
[2021-04-24 07:07:24,306 INFO] Step 17800/50000; acc:  90.36; ppl:  1.45; xent: 0.37; lr: 0.00010; 5069/6342 tok/s;   7028 sec
[2021-04-24 07:07:43,848 INFO] Step 17850/50000; acc:  90.21; ppl:  1.46; xent: 0.38; lr: 0.00010; 5352/6460 tok/s;   7047 sec
[2021-04-24 07:08:03,350 INFO] Step 17900/50000; acc:  90.28; ppl:  1.45; xent: 0.37; lr: 0.00010; 5213/6523 tok/s;   7067 sec
[2021-04-24 07:08:23,035 INFO] Step 17950/50000; acc:  90.43; ppl:  1.44; xent: 0.37; lr: 0.00010; 5122/6430 tok/s;   7087 sec
[2021-04-24 07:08:43,104 INFO] Step 18000/50000; acc:  90.35; ppl:  1.45; xent: 0.37; lr: 0.00010; 5059/6542 tok/s;   7107 sec
[2021-04-24 07:09:02,763 INFO] Step 18050/50000; acc:  90.11; ppl:  1.46; xent: 0.38; lr: 0.00010; 5230/6425 tok/s;   7126 sec
[2021-04-24 07:09:22,003 INFO] Step 18100/50000; acc:  90.48; ppl:  1.44; xent: 0.36; lr: 0.00010; 5287/6478 tok/s;   7146 sec
[2021-04-24 07:09:41,428 INFO] Step 18150/50000; acc:  90.68; ppl:  1.43; xent: 0.36; lr: 0.00010; 5190/6682 tok/s;   7165 sec
[2021-04-24 07:10:00,651 INFO] Step 18200/50000; acc:  90.10; ppl:  1.46; xent: 0.38; lr: 0.00010; 5324/6577 tok/s;   7184 sec
[2021-04-24 07:10:03,911 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 07:10:19,925 INFO] Step 18250/50000; acc:  90.55; ppl:  1.44; xent: 0.36; lr: 0.00010; 5283/6724 tok/s;   7203 sec
[2021-04-24 07:10:39,432 INFO] Step 18300/50000; acc:  90.42; ppl:  1.45; xent: 0.37; lr: 0.00010; 5250/6587 tok/s;   7223 sec
[2021-04-24 07:10:58,214 INFO] Step 18350/50000; acc:  89.87; ppl:  1.47; xent: 0.39; lr: 0.00010; 5299/6621 tok/s;   7242 sec
[2021-04-24 07:11:18,071 INFO] Step 18400/50000; acc:  90.15; ppl:  1.46; xent: 0.38; lr: 0.00010; 5125/6377 tok/s;   7262 sec
[2021-04-24 07:11:38,010 INFO] Step 18450/50000; acc:  90.36; ppl:  1.44; xent: 0.37; lr: 0.00010; 5035/6402 tok/s;   7282 sec
[2021-04-24 07:11:57,687 INFO] Step 18500/50000; acc:  90.46; ppl:  1.44; xent: 0.37; lr: 0.00010; 5267/6617 tok/s;   7301 sec
[2021-04-24 07:12:17,359 INFO] Step 18550/50000; acc:  89.97; ppl:  1.46; xent: 0.38; lr: 0.00010; 5160/6131 tok/s;   7321 sec
[2021-04-24 07:12:36,451 INFO] Step 18600/50000; acc:  90.26; ppl:  1.45; xent: 0.37; lr: 0.00010; 5332/6639 tok/s;   7340 sec
[2021-04-24 07:12:56,425 INFO] Step 18650/50000; acc:  90.41; ppl:  1.44; xent: 0.37; lr: 0.00010; 5135/6484 tok/s;   7360 sec
[2021-04-24 07:13:15,921 INFO] Step 18700/50000; acc:  90.49; ppl:  1.44; xent: 0.36; lr: 0.00010; 5224/6601 tok/s;   7379 sec
[2021-04-24 07:13:35,659 INFO] Step 18750/50000; acc:  90.28; ppl:  1.45; xent: 0.37; lr: 0.00010; 5117/6381 tok/s;   7399 sec
[2021-04-24 07:13:55,482 INFO] Step 18800/50000; acc:  90.26; ppl:  1.45; xent: 0.37; lr: 0.00010; 5125/6390 tok/s;   7419 sec
[2021-04-24 07:14:14,620 INFO] Step 18850/50000; acc:  90.69; ppl:  1.43; xent: 0.36; lr: 0.00010; 5425/6785 tok/s;   7438 sec
[2021-04-24 07:14:33,939 INFO] Step 18900/50000; acc:  90.57; ppl:  1.44; xent: 0.36; lr: 0.00010; 5167/6538 tok/s;   7457 sec
[2021-04-24 07:14:50,671 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 07:14:53,151 INFO] Step 18950/50000; acc:  90.55; ppl:  1.43; xent: 0.36; lr: 0.00010; 5273/6648 tok/s;   7477 sec
[2021-04-24 07:15:12,181 INFO] Step 19000/50000; acc:  90.28; ppl:  1.45; xent: 0.37; lr: 0.00010; 5388/6662 tok/s;   7496 sec
[2021-04-24 07:15:31,636 INFO] Step 19050/50000; acc:  90.37; ppl:  1.45; xent: 0.37; lr: 0.00010; 5243/6605 tok/s;   7515 sec
[2021-04-24 07:15:50,929 INFO] Step 19100/50000; acc:  89.98; ppl:  1.46; xent: 0.38; lr: 0.00010; 5246/6465 tok/s;   7534 sec
[2021-04-24 07:16:11,512 INFO] Step 19150/50000; acc:  90.35; ppl:  1.44; xent: 0.37; lr: 0.00010; 4871/6253 tok/s;   7555 sec
[2021-04-24 07:16:30,724 INFO] Step 19200/50000; acc:  90.33; ppl:  1.45; xent: 0.37; lr: 0.00010; 5293/6585 tok/s;   7574 sec
[2021-04-24 07:16:50,758 INFO] Step 19250/50000; acc:  90.41; ppl:  1.44; xent: 0.36; lr: 0.00010; 5029/6369 tok/s;   7594 sec
[2021-04-24 07:17:10,290 INFO] Step 19300/50000; acc:  90.26; ppl:  1.45; xent: 0.37; lr: 0.00010; 5391/6400 tok/s;   7614 sec
[2021-04-24 07:17:29,761 INFO] Step 19350/50000; acc:  90.16; ppl:  1.45; xent: 0.37; lr: 0.00010; 5204/6490 tok/s;   7633 sec
[2021-04-24 07:17:49,949 INFO] Step 19400/50000; acc:  90.71; ppl:  1.43; xent: 0.36; lr: 0.00010; 4951/6326 tok/s;   7653 sec
[2021-04-24 07:18:10,451 INFO] Step 19450/50000; acc:  90.35; ppl:  1.44; xent: 0.37; lr: 0.00010; 5035/6293 tok/s;   7674 sec
[2021-04-24 07:18:29,687 INFO] Step 19500/50000; acc:  90.31; ppl:  1.44; xent: 0.37; lr: 0.00010; 5283/6520 tok/s;   7693 sec
[2021-04-24 07:18:49,367 INFO] Step 19550/50000; acc:  90.45; ppl:  1.44; xent: 0.36; lr: 0.00010; 5146/6453 tok/s;   7713 sec
[2021-04-24 07:19:07,990 INFO] Step 19600/50000; acc:  90.67; ppl:  1.43; xent: 0.36; lr: 0.00010; 5442/6885 tok/s;   7732 sec
[2021-04-24 07:19:27,479 INFO] Step 19650/50000; acc:  90.64; ppl:  1.43; xent: 0.36; lr: 0.00010; 5249/6599 tok/s;   7751 sec
[2021-04-24 07:19:38,880 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 07:19:46,648 INFO] Step 19700/50000; acc:  90.39; ppl:  1.44; xent: 0.37; lr: 0.00010; 5278/6704 tok/s;   7770 sec
[2021-04-24 07:20:06,201 INFO] Step 19750/50000; acc:  90.58; ppl:  1.44; xent: 0.36; lr: 0.00010; 5204/6522 tok/s;   7790 sec
[2021-04-24 07:20:24,745 INFO] Step 19800/50000; acc:  90.20; ppl:  1.45; xent: 0.37; lr: 0.00010; 5507/6802 tok/s;   7808 sec
[2021-04-24 07:20:44,911 INFO] Step 19850/50000; acc:  90.28; ppl:  1.45; xent: 0.37; lr: 0.00010; 5008/6337 tok/s;   7828 sec
[2021-04-24 07:21:04,980 INFO] Step 19900/50000; acc:  90.32; ppl:  1.44; xent: 0.37; lr: 0.00010; 5084/6335 tok/s;   7849 sec
[2021-04-24 07:21:23,969 INFO] Step 19950/50000; acc:  90.35; ppl:  1.44; xent: 0.37; lr: 0.00010; 5264/6634 tok/s;   7868 sec
[2021-04-24 07:21:44,253 INFO] Step 20000/50000; acc:  90.40; ppl:  1.44; xent: 0.37; lr: 0.00010; 5055/6233 tok/s;   7888 sec
[2021-04-24 07:21:44,255 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/basic/valid.txt, align=None)...
[2021-04-24 07:22:12,418 INFO] Validation perplexity: 1.45439
[2021-04-24 07:22:12,419 INFO] Validation accuracy: 90.2611
[2021-04-24 07:22:12,423 INFO] Saving checkpoint ../models/default_params/basic_ops/model_step_20000.pt
[2021-04-24 07:22:31,546 INFO] Step 20050/50000; acc:  90.47; ppl:  1.44; xent: 0.36; lr: 0.00010; 2157/2616 tok/s;   7935 sec
[2021-04-24 07:22:51,844 INFO] Step 20100/50000; acc:  90.43; ppl:  1.44; xent: 0.36; lr: 0.00010; 5141/6433 tok/s;   7955 sec
[2021-04-24 07:23:11,849 INFO] Step 20150/50000; acc:  90.66; ppl:  1.42; xent: 0.35; lr: 0.00010; 5017/6449 tok/s;   7975 sec
[2021-04-24 07:23:31,598 INFO] Step 20200/50000; acc:  90.44; ppl:  1.44; xent: 0.36; lr: 0.00010; 5103/6377 tok/s;   7995 sec
[2021-04-24 07:23:50,975 INFO] Step 20250/50000; acc:  90.34; ppl:  1.45; xent: 0.37; lr: 0.00010; 5307/6508 tok/s;   8015 sec
[2021-04-24 07:24:10,414 INFO] Step 20300/50000; acc:  90.69; ppl:  1.42; xent: 0.35; lr: 0.00010; 5252/6528 tok/s;   8034 sec
[2021-04-24 07:24:29,071 INFO] Step 20350/50000; acc:  90.80; ppl:  1.42; xent: 0.35; lr: 0.00010; 5377/6972 tok/s;   8053 sec
[2021-04-24 07:24:48,139 INFO] Step 20400/50000; acc:  90.39; ppl:  1.44; xent: 0.36; lr: 0.00010; 5300/6595 tok/s;   8072 sec
[2021-04-24 07:24:54,339 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 07:25:08,046 INFO] Step 20450/50000; acc:  90.79; ppl:  1.42; xent: 0.35; lr: 0.00010; 5186/6598 tok/s;   8092 sec
[2021-04-24 07:25:26,780 INFO] Step 20500/50000; acc:  90.55; ppl:  1.44; xent: 0.36; lr: 0.00010; 5392/6829 tok/s;   8110 sec
[2021-04-24 07:25:45,619 INFO] Step 20550/50000; acc:  89.98; ppl:  1.46; xent: 0.38; lr: 0.00010; 5354/6474 tok/s;   8129 sec
[2021-04-24 07:26:05,696 INFO] Step 20600/50000; acc:  90.45; ppl:  1.44; xent: 0.36; lr: 0.00010; 5093/6401 tok/s;   8149 sec
[2021-04-24 07:26:25,764 INFO] Step 20650/50000; acc:  90.52; ppl:  1.43; xent: 0.36; lr: 0.00010; 5048/6401 tok/s;   8169 sec
[2021-04-24 07:26:44,966 INFO] Step 20700/50000; acc:  90.37; ppl:  1.44; xent: 0.36; lr: 0.00010; 5294/6533 tok/s;   8189 sec
[2021-04-24 07:27:04,200 INFO] Step 20750/50000; acc:  90.52; ppl:  1.43; xent: 0.36; lr: 0.00010; 5304/6565 tok/s;   8208 sec
[2021-04-24 07:27:23,574 INFO] Step 20800/50000; acc:  90.43; ppl:  1.43; xent: 0.36; lr: 0.00010; 5267/6449 tok/s;   8227 sec
[2021-04-24 07:27:43,089 INFO] Step 20850/50000; acc:  90.42; ppl:  1.44; xent: 0.36; lr: 0.00010; 5175/6504 tok/s;   8247 sec
[2021-04-24 07:28:03,083 INFO] Step 20900/50000; acc:  90.89; ppl:  1.42; xent: 0.35; lr: 0.00010; 5228/6647 tok/s;   8267 sec
[2021-04-24 07:28:22,657 INFO] Step 20950/50000; acc:  90.29; ppl:  1.44; xent: 0.37; lr: 0.00010; 5119/6391 tok/s;   8286 sec
[2021-04-24 07:28:42,031 INFO] Step 21000/50000; acc:  90.56; ppl:  1.43; xent: 0.36; lr: 0.00010; 5219/6447 tok/s;   8306 sec
[2021-04-24 07:29:01,292 INFO] Step 21050/50000; acc:  90.97; ppl:  1.41; xent: 0.35; lr: 0.00010; 5331/6737 tok/s;   8325 sec
[2021-04-24 07:29:20,368 INFO] Step 21100/50000; acc:  90.67; ppl:  1.43; xent: 0.36; lr: 0.00010; 5316/6675 tok/s;   8344 sec
[2021-04-24 07:29:28,329 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 07:29:39,575 INFO] Step 21150/50000; acc:  90.58; ppl:  1.43; xent: 0.36; lr: 0.00010; 5263/6661 tok/s;   8363 sec
[2021-04-24 07:29:59,296 INFO] Step 21200/50000; acc:  90.68; ppl:  1.43; xent: 0.36; lr: 0.00010; 5151/6467 tok/s;   8383 sec
[2021-04-24 07:30:18,570 INFO] Step 21250/50000; acc:  90.53; ppl:  1.44; xent: 0.36; lr: 0.00010; 5319/6638 tok/s;   8402 sec
[2021-04-24 07:30:37,575 INFO] Step 21300/50000; acc:  90.05; ppl:  1.45; xent: 0.37; lr: 0.00010; 5278/6595 tok/s;   8421 sec
[2021-04-24 07:30:57,761 INFO] Step 21350/50000; acc:  90.69; ppl:  1.42; xent: 0.35; lr: 0.00010; 5011/6329 tok/s;   8441 sec
[2021-04-24 07:31:17,182 INFO] Step 21400/50000; acc:  90.41; ppl:  1.43; xent: 0.36; lr: 0.00010; 5265/6528 tok/s;   8461 sec
[2021-04-24 07:31:36,862 INFO] Step 21450/50000; acc:  90.64; ppl:  1.43; xent: 0.36; lr: 0.00010; 5163/6393 tok/s;   8480 sec
[2021-04-24 07:31:56,359 INFO] Step 21500/50000; acc:  90.36; ppl:  1.44; xent: 0.36; lr: 0.00010; 5342/6445 tok/s;   8500 sec
[2021-04-24 07:32:15,609 INFO] Step 21550/50000; acc:  90.49; ppl:  1.43; xent: 0.36; lr: 0.00010; 5205/6642 tok/s;   8519 sec
[2021-04-24 07:32:35,453 INFO] Step 21600/50000; acc:  90.78; ppl:  1.42; xent: 0.35; lr: 0.00010; 5132/6379 tok/s;   8539 sec
[2021-04-24 07:32:55,372 INFO] Step 21650/50000; acc:  90.67; ppl:  1.43; xent: 0.35; lr: 0.00010; 5067/6595 tok/s;   8559 sec
[2021-04-24 07:33:15,037 INFO] Step 21700/50000; acc:  90.50; ppl:  1.44; xent: 0.36; lr: 0.00010; 5304/6433 tok/s;   8579 sec
[2021-04-24 07:33:34,109 INFO] Step 21750/50000; acc:  90.69; ppl:  1.42; xent: 0.35; lr: 0.00010; 5303/6473 tok/s;   8598 sec
[2021-04-24 07:33:53,517 INFO] Step 21800/50000; acc:  90.99; ppl:  1.41; xent: 0.34; lr: 0.00010; 5164/6706 tok/s;   8617 sec
[2021-04-24 07:34:12,510 INFO] Step 21850/50000; acc:  90.42; ppl:  1.44; xent: 0.36; lr: 0.00010; 5378/6655 tok/s;   8636 sec
[2021-04-24 07:34:15,029 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 07:34:31,904 INFO] Step 21900/50000; acc:  90.85; ppl:  1.41; xent: 0.35; lr: 0.00010; 5258/6747 tok/s;   8655 sec
[2021-04-24 07:34:51,399 INFO] Step 21950/50000; acc:  90.59; ppl:  1.43; xent: 0.36; lr: 0.00010; 5197/6493 tok/s;   8675 sec
[2021-04-24 07:35:10,397 INFO] Step 22000/50000; acc:  90.22; ppl:  1.44; xent: 0.37; lr: 0.00010; 5283/6611 tok/s;   8694 sec
[2021-04-24 07:35:30,303 INFO] Step 22050/50000; acc:  90.60; ppl:  1.43; xent: 0.36; lr: 0.00010; 5175/6435 tok/s;   8714 sec
[2021-04-24 07:35:50,335 INFO] Step 22100/50000; acc:  90.55; ppl:  1.43; xent: 0.36; lr: 0.00010; 5019/6283 tok/s;   8734 sec
[2021-04-24 07:36:09,899 INFO] Step 22150/50000; acc:  90.76; ppl:  1.42; xent: 0.35; lr: 0.00010; 5137/6587 tok/s;   8753 sec
[2021-04-24 07:36:29,542 INFO] Step 22200/50000; acc:  90.32; ppl:  1.44; xent: 0.36; lr: 0.00010; 5299/6269 tok/s;   8773 sec
[2021-04-24 07:36:48,650 INFO] Step 22250/50000; acc:  90.52; ppl:  1.43; xent: 0.36; lr: 0.00010; 5362/6599 tok/s;   8792 sec
[2021-04-24 07:37:08,662 INFO] Step 22300/50000; acc:  90.84; ppl:  1.42; xent: 0.35; lr: 0.00010; 5106/6410 tok/s;   8812 sec
[2021-04-24 07:37:28,198 INFO] Step 22350/50000; acc:  90.66; ppl:  1.42; xent: 0.35; lr: 0.00010; 5140/6658 tok/s;   8832 sec
[2021-04-24 07:37:47,434 INFO] Step 22400/50000; acc:  90.57; ppl:  1.43; xent: 0.36; lr: 0.00010; 5292/6470 tok/s;   8851 sec
[2021-04-24 07:38:07,115 INFO] Step 22450/50000; acc:  90.60; ppl:  1.43; xent: 0.35; lr: 0.00010; 5145/6404 tok/s;   8871 sec
[2021-04-24 07:38:26,480 INFO] Step 22500/50000; acc:  91.12; ppl:  1.41; xent: 0.34; lr: 0.00010; 5411/6814 tok/s;   8890 sec
[2021-04-24 07:38:45,941 INFO] Step 22550/50000; acc:  90.86; ppl:  1.41; xent: 0.35; lr: 0.00010; 5120/6526 tok/s;   8909 sec
[2021-04-24 07:39:01,912 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 07:39:05,194 INFO] Step 22600/50000; acc:  90.80; ppl:  1.41; xent: 0.35; lr: 0.00010; 5233/6545 tok/s;   8929 sec
[2021-04-24 07:39:24,368 INFO] Step 22650/50000; acc:  90.58; ppl:  1.43; xent: 0.36; lr: 0.00010; 5347/6611 tok/s;   8948 sec
[2021-04-24 07:39:43,892 INFO] Step 22700/50000; acc:  90.60; ppl:  1.43; xent: 0.36; lr: 0.00010; 5224/6601 tok/s;   8967 sec
[2021-04-24 07:40:03,063 INFO] Step 22750/50000; acc:  90.23; ppl:  1.44; xent: 0.36; lr: 0.00010; 5216/6435 tok/s;   8987 sec
[2021-04-24 07:40:23,695 INFO] Step 22800/50000; acc:  90.76; ppl:  1.42; xent: 0.35; lr: 0.00010; 4903/6305 tok/s;   9007 sec
[2021-04-24 07:40:43,057 INFO] Step 22850/50000; acc:  90.70; ppl:  1.42; xent: 0.35; lr: 0.00010; 5312/6622 tok/s;   9027 sec
[2021-04-24 07:41:03,079 INFO] Step 22900/50000; acc:  90.75; ppl:  1.42; xent: 0.35; lr: 0.00010; 5041/6381 tok/s;   9047 sec
[2021-04-24 07:41:21,761 INFO] Step 22950/50000; acc:  90.57; ppl:  1.43; xent: 0.36; lr: 0.00010; 5475/6478 tok/s;   9065 sec
[2021-04-24 07:41:41,644 INFO] Step 23000/50000; acc:  90.56; ppl:  1.43; xent: 0.36; lr: 0.00010; 5203/6497 tok/s;   9085 sec
[2021-04-24 07:42:01,800 INFO] Step 23050/50000; acc:  91.01; ppl:  1.41; xent: 0.34; lr: 0.00010; 5013/6395 tok/s;   9105 sec
[2021-04-24 07:42:22,524 INFO] Step 23100/50000; acc:  90.54; ppl:  1.43; xent: 0.35; lr: 0.00010; 4953/6183 tok/s;   9126 sec
[2021-04-24 07:42:41,459 INFO] Step 23150/50000; acc:  90.67; ppl:  1.42; xent: 0.35; lr: 0.00010; 5294/6555 tok/s;   9145 sec
[2021-04-24 07:43:01,146 INFO] Step 23200/50000; acc:  90.71; ppl:  1.42; xent: 0.35; lr: 0.00010; 5200/6479 tok/s;   9165 sec
[2021-04-24 07:43:19,773 INFO] Step 23250/50000; acc:  90.84; ppl:  1.41; xent: 0.35; lr: 0.00010; 5400/6799 tok/s;   9183 sec
[2021-04-24 07:43:39,559 INFO] Step 23300/50000; acc:  91.01; ppl:  1.41; xent: 0.34; lr: 0.00010; 5236/6656 tok/s;   9203 sec
[2021-04-24 07:43:50,129 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 07:43:58,575 INFO] Step 23350/50000; acc:  90.70; ppl:  1.42; xent: 0.35; lr: 0.00010; 5301/6703 tok/s;   9222 sec
[2021-04-24 07:44:18,134 INFO] Step 23400/50000; acc:  90.85; ppl:  1.42; xent: 0.35; lr: 0.00010; 5178/6526 tok/s;   9242 sec
[2021-04-24 07:44:36,817 INFO] Step 23450/50000; acc:  90.45; ppl:  1.43; xent: 0.36; lr: 0.00010; 5456/6739 tok/s;   9260 sec
[2021-04-24 07:44:56,804 INFO] Step 23500/50000; acc:  90.43; ppl:  1.43; xent: 0.36; lr: 0.00010; 5068/6320 tok/s;   9280 sec
[2021-04-24 07:45:16,862 INFO] Step 23550/50000; acc:  90.64; ppl:  1.42; xent: 0.35; lr: 0.00010; 5021/6338 tok/s;   9300 sec
[2021-04-24 07:45:36,030 INFO] Step 23600/50000; acc:  90.68; ppl:  1.42; xent: 0.35; lr: 0.00010; 5256/6592 tok/s;   9320 sec
[2021-04-24 07:45:56,115 INFO] Step 23650/50000; acc:  90.82; ppl:  1.41; xent: 0.35; lr: 0.00010; 5179/6409 tok/s;   9340 sec
[2021-04-24 07:46:14,944 INFO] Step 23700/50000; acc:  90.70; ppl:  1.42; xent: 0.35; lr: 0.00010; 5413/6509 tok/s;   9358 sec
[2021-04-24 07:46:35,223 INFO] Step 23750/50000; acc:  90.67; ppl:  1.42; xent: 0.35; lr: 0.00010; 4997/6315 tok/s;   9379 sec
[2021-04-24 07:46:55,113 INFO] Step 23800/50000; acc:  91.02; ppl:  1.40; xent: 0.34; lr: 0.00010; 5165/6575 tok/s;   9399 sec
[2021-04-24 07:47:15,154 INFO] Step 23850/50000; acc:  90.67; ppl:  1.42; xent: 0.35; lr: 0.00010; 5067/6366 tok/s;   9419 sec
[2021-04-24 07:47:34,637 INFO] Step 23900/50000; acc:  90.56; ppl:  1.43; xent: 0.35; lr: 0.00010; 5245/6392 tok/s;   9438 sec
[2021-04-24 07:47:53,864 INFO] Step 23950/50000; acc:  90.92; ppl:  1.40; xent: 0.34; lr: 0.00010; 5245/6584 tok/s;   9457 sec
[2021-04-24 07:48:12,965 INFO] Step 24000/50000; acc:  91.12; ppl:  1.40; xent: 0.34; lr: 0.00010; 5299/6858 tok/s;   9477 sec
[2021-04-24 07:48:31,764 INFO] Step 24050/50000; acc:  90.59; ppl:  1.42; xent: 0.35; lr: 0.00010; 5358/6620 tok/s;   9495 sec
[2021-04-24 07:48:37,348 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 07:48:51,767 INFO] Step 24100/50000; acc:  91.24; ppl:  1.40; xent: 0.34; lr: 0.00010; 5220/6680 tok/s;   9515 sec
[2021-04-24 07:49:10,563 INFO] Step 24150/50000; acc:  90.63; ppl:  1.42; xent: 0.35; lr: 0.00010; 5349/6729 tok/s;   9534 sec
[2021-04-24 07:49:29,770 INFO] Step 24200/50000; acc:  90.33; ppl:  1.43; xent: 0.36; lr: 0.00010; 5224/6381 tok/s;   9553 sec
[2021-04-24 07:49:49,702 INFO] Step 24250/50000; acc:  90.69; ppl:  1.42; xent: 0.35; lr: 0.00010; 5123/6393 tok/s;   9573 sec
[2021-04-24 07:50:09,894 INFO] Step 24300/50000; acc:  90.82; ppl:  1.41; xent: 0.34; lr: 0.00010; 5032/6369 tok/s;   9593 sec
[2021-04-24 07:50:29,058 INFO] Step 24350/50000; acc:  90.62; ppl:  1.42; xent: 0.35; lr: 0.00010; 5239/6520 tok/s;   9613 sec
[2021-04-24 07:50:48,484 INFO] Step 24400/50000; acc:  90.78; ppl:  1.41; xent: 0.35; lr: 0.00010; 5305/6514 tok/s;   9632 sec
[2021-04-24 07:51:07,965 INFO] Step 24450/50000; acc:  90.90; ppl:  1.41; xent: 0.34; lr: 0.00010; 5296/6555 tok/s;   9652 sec
[2021-04-24 07:51:27,443 INFO] Step 24500/50000; acc:  90.76; ppl:  1.42; xent: 0.35; lr: 0.00010; 5185/6438 tok/s;   9671 sec
[2021-04-24 07:51:47,326 INFO] Step 24550/50000; acc:  91.09; ppl:  1.40; xent: 0.34; lr: 0.00010; 5105/6596 tok/s;   9691 sec
[2021-04-24 07:52:06,953 INFO] Step 24600/50000; acc:  90.62; ppl:  1.42; xent: 0.35; lr: 0.00010; 5219/6429 tok/s;   9710 sec
[2021-04-24 07:52:26,160 INFO] Step 24650/50000; acc:  90.89; ppl:  1.41; xent: 0.34; lr: 0.00010; 5313/6531 tok/s;   9730 sec
[2021-04-24 07:52:45,404 INFO] Step 24700/50000; acc:  91.16; ppl:  1.40; xent: 0.33; lr: 0.00010; 5309/6691 tok/s;   9749 sec
[2021-04-24 07:53:04,514 INFO] Step 24750/50000; acc:  90.98; ppl:  1.41; xent: 0.34; lr: 0.00010; 5227/6733 tok/s;   9768 sec
[2021-04-24 07:53:11,556 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 07:53:23,649 INFO] Step 24800/50000; acc:  90.83; ppl:  1.41; xent: 0.34; lr: 0.00010; 5345/6717 tok/s;   9787 sec
[2021-04-24 07:53:42,843 INFO] Step 24850/50000; acc:  90.87; ppl:  1.41; xent: 0.34; lr: 0.00010; 5258/6595 tok/s;   9806 sec
[2021-04-24 07:54:02,483 INFO] Step 24900/50000; acc:  90.72; ppl:  1.42; xent: 0.35; lr: 0.00010; 5283/6544 tok/s;   9826 sec
[2021-04-24 07:54:21,556 INFO] Step 24950/50000; acc:  90.42; ppl:  1.43; xent: 0.36; lr: 0.00010; 5238/6531 tok/s;   9845 sec
[2021-04-24 07:54:41,791 INFO] Step 25000/50000; acc:  90.89; ppl:  1.41; xent: 0.34; lr: 0.00010; 4977/6258 tok/s;   9865 sec
[2021-04-24 07:54:41,793 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/basic/valid.txt, align=None)...
[2021-04-24 07:55:10,144 INFO] Validation perplexity: 1.44689
[2021-04-24 07:55:10,144 INFO] Validation accuracy: 90.4373
[2021-04-24 07:55:10,148 INFO] Saving checkpoint ../models/default_params/basic_ops/model_step_25000.pt
[2021-04-24 07:55:29,908 INFO] Step 25050/50000; acc:  90.78; ppl:  1.41; xent: 0.35; lr: 0.00010; 2118/2631 tok/s;   9913 sec
[2021-04-24 07:55:49,794 INFO] Step 25100/50000; acc:  90.94; ppl:  1.41; xent: 0.34; lr: 0.00010; 5129/6409 tok/s;   9933 sec
[2021-04-24 07:56:09,067 INFO] Step 25150/50000; acc:  90.69; ppl:  1.42; xent: 0.35; lr: 0.00010; 5333/6425 tok/s;   9953 sec
[2021-04-24 07:56:28,681 INFO] Step 25200/50000; acc:  90.82; ppl:  1.41; xent: 0.35; lr: 0.00010; 5160/6563 tok/s;   9972 sec
[2021-04-24 07:56:48,362 INFO] Step 25250/50000; acc:  91.12; ppl:  1.40; xent: 0.33; lr: 0.00010; 5242/6587 tok/s;   9992 sec
[2021-04-24 07:57:08,230 INFO] Step 25300/50000; acc:  90.84; ppl:  1.41; xent: 0.34; lr: 0.00010; 5072/6478 tok/s;  10012 sec
[2021-04-24 07:57:28,095 INFO] Step 25350/50000; acc:  90.69; ppl:  1.42; xent: 0.35; lr: 0.00010; 5099/6272 tok/s;  10032 sec
[2021-04-24 07:57:47,329 INFO] Step 25400/50000; acc:  91.05; ppl:  1.40; xent: 0.34; lr: 0.00010; 5380/6584 tok/s;  10051 sec
[2021-04-24 07:58:06,655 INFO] Step 25450/50000; acc:  91.29; ppl:  1.39; xent: 0.33; lr: 0.00010; 5228/6651 tok/s;  10070 sec
[2021-04-24 07:58:25,875 INFO] Step 25500/50000; acc:  90.86; ppl:  1.41; xent: 0.34; lr: 0.00010; 5302/6659 tok/s;  10089 sec
[2021-04-24 07:58:27,437 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 07:58:45,018 INFO] Step 25550/50000; acc:  91.03; ppl:  1.40; xent: 0.34; lr: 0.00010; 5244/6743 tok/s;  10109 sec
[2021-04-24 07:59:04,326 INFO] Step 25600/50000; acc:  90.90; ppl:  1.41; xent: 0.34; lr: 0.00010; 5301/6596 tok/s;  10128 sec
[2021-04-24 07:59:23,207 INFO] Step 25650/50000; acc:  90.35; ppl:  1.43; xent: 0.36; lr: 0.00010; 5280/6592 tok/s;  10147 sec
[2021-04-24 07:59:43,441 INFO] Step 25700/50000; acc:  90.91; ppl:  1.40; xent: 0.34; lr: 0.00010; 5148/6368 tok/s;  10167 sec
[2021-04-24 08:00:03,252 INFO] Step 25750/50000; acc:  90.77; ppl:  1.41; xent: 0.35; lr: 0.00010; 5067/6355 tok/s;  10187 sec
[2021-04-24 08:00:23,548 INFO] Step 25800/50000; acc:  91.07; ppl:  1.40; xent: 0.33; lr: 0.00010; 4925/6368 tok/s;  10207 sec
[2021-04-24 08:00:42,761 INFO] Step 25850/50000; acc:  90.63; ppl:  1.42; xent: 0.35; lr: 0.00010; 5408/6372 tok/s;  10226 sec
[2021-04-24 08:01:01,777 INFO] Step 25900/50000; acc:  90.74; ppl:  1.41; xent: 0.34; lr: 0.00010; 5398/6668 tok/s;  10245 sec
[2021-04-24 08:01:22,299 INFO] Step 25950/50000; acc:  91.04; ppl:  1.40; xent: 0.33; lr: 0.00010; 4912/6194 tok/s;  10266 sec
[2021-04-24 08:01:42,028 INFO] Step 26000/50000; acc:  90.92; ppl:  1.40; xent: 0.34; lr: 0.00010; 5145/6589 tok/s;  10286 sec
[2021-04-24 08:02:01,805 INFO] Step 26050/50000; acc:  91.02; ppl:  1.40; xent: 0.34; lr: 0.00010; 5210/6465 tok/s;  10305 sec
[2021-04-24 08:02:21,058 INFO] Step 26100/50000; acc:  90.79; ppl:  1.41; xent: 0.34; lr: 0.00010; 5261/6437 tok/s;  10325 sec
[2021-04-24 08:02:40,283 INFO] Step 26150/50000; acc:  91.25; ppl:  1.39; xent: 0.33; lr: 0.00010; 5279/6743 tok/s;  10344 sec
[2021-04-24 08:02:59,680 INFO] Step 26200/50000; acc:  91.06; ppl:  1.40; xent: 0.34; lr: 0.00010; 5254/6576 tok/s;  10363 sec
[2021-04-24 08:03:14,911 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 08:03:19,093 INFO] Step 26250/50000; acc:  91.22; ppl:  1.39; xent: 0.33; lr: 0.00010; 5243/6647 tok/s;  10383 sec
[2021-04-24 08:03:38,233 INFO] Step 26300/50000; acc:  91.01; ppl:  1.40; xent: 0.34; lr: 0.00010; 5331/6635 tok/s;  10402 sec
[2021-04-24 08:03:57,583 INFO] Step 26350/50000; acc:  90.71; ppl:  1.42; xent: 0.35; lr: 0.00010; 5190/6511 tok/s;  10421 sec
[2021-04-24 08:04:17,084 INFO] Step 26400/50000; acc:  90.48; ppl:  1.42; xent: 0.35; lr: 0.00010; 5183/6389 tok/s;  10441 sec
[2021-04-24 08:04:37,181 INFO] Step 26450/50000; acc:  91.16; ppl:  1.39; xent: 0.33; lr: 0.00010; 5015/6510 tok/s;  10461 sec
[2021-04-24 08:04:57,028 INFO] Step 26500/50000; acc:  90.89; ppl:  1.40; xent: 0.34; lr: 0.00010; 5235/6495 tok/s;  10481 sec
[2021-04-24 08:05:17,120 INFO] Step 26550/50000; acc:  90.82; ppl:  1.41; xent: 0.34; lr: 0.00010; 5016/6257 tok/s;  10501 sec
[2021-04-24 08:05:35,820 INFO] Step 26600/50000; acc:  90.99; ppl:  1.40; xent: 0.34; lr: 0.00010; 5450/6539 tok/s;  10519 sec
[2021-04-24 08:05:55,632 INFO] Step 26650/50000; acc:  90.77; ppl:  1.41; xent: 0.34; lr: 0.00010; 5201/6463 tok/s;  10539 sec
[2021-04-24 08:06:15,895 INFO] Step 26700/50000; acc:  91.19; ppl:  1.39; xent: 0.33; lr: 0.00010; 5007/6316 tok/s;  10559 sec
[2021-04-24 08:06:35,980 INFO] Step 26750/50000; acc:  90.98; ppl:  1.40; xent: 0.34; lr: 0.00010; 5044/6410 tok/s;  10580 sec
[2021-04-24 08:06:55,333 INFO] Step 26800/50000; acc:  90.76; ppl:  1.41; xent: 0.34; lr: 0.00010; 5225/6397 tok/s;  10599 sec
[2021-04-24 08:07:14,947 INFO] Step 26850/50000; acc:  91.12; ppl:  1.39; xent: 0.33; lr: 0.00010; 5275/6586 tok/s;  10618 sec
[2021-04-24 08:07:33,512 INFO] Step 26900/50000; acc:  91.20; ppl:  1.39; xent: 0.33; lr: 0.00010; 5420/6863 tok/s;  10637 sec
[2021-04-24 08:07:52,817 INFO] Step 26950/50000; acc:  91.15; ppl:  1.39; xent: 0.33; lr: 0.00010; 5213/6633 tok/s;  10656 sec
[2021-04-24 08:08:02,875 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 08:08:12,195 INFO] Step 27000/50000; acc:  90.96; ppl:  1.40; xent: 0.34; lr: 0.00010; 5315/6654 tok/s;  10676 sec
[2021-04-24 08:08:31,979 INFO] Step 27050/50000; acc:  91.17; ppl:  1.40; xent: 0.34; lr: 0.00010; 5176/6523 tok/s;  10696 sec
[2021-04-24 08:08:50,843 INFO] Step 27100/50000; acc:  90.70; ppl:  1.41; xent: 0.35; lr: 0.00010; 5369/6606 tok/s;  10714 sec
[2021-04-24 08:09:10,682 INFO] Step 27150/50000; acc:  90.73; ppl:  1.41; xent: 0.34; lr: 0.00010; 5034/6354 tok/s;  10734 sec
[2021-04-24 08:09:30,850 INFO] Step 27200/50000; acc:  91.00; ppl:  1.40; xent: 0.34; lr: 0.00010; 5039/6331 tok/s;  10754 sec
[2021-04-24 08:09:49,794 INFO] Step 27250/50000; acc:  90.96; ppl:  1.40; xent: 0.33; lr: 0.00010; 5288/6721 tok/s;  10773 sec
[2021-04-24 08:10:09,769 INFO] Step 27300/50000; acc:  91.14; ppl:  1.40; xent: 0.33; lr: 0.00010; 5277/6433 tok/s;  10793 sec
[2021-04-24 08:10:28,542 INFO] Step 27350/50000; acc:  90.89; ppl:  1.40; xent: 0.34; lr: 0.00010; 5411/6527 tok/s;  10812 sec
[2021-04-24 08:10:48,939 INFO] Step 27400/50000; acc:  90.99; ppl:  1.40; xent: 0.34; lr: 0.00010; 4944/6260 tok/s;  10832 sec
[2021-04-24 08:11:08,545 INFO] Step 27450/50000; acc:  91.23; ppl:  1.39; xent: 0.33; lr: 0.00010; 5230/6584 tok/s;  10852 sec
[2021-04-24 08:11:28,171 INFO] Step 27500/50000; acc:  91.04; ppl:  1.40; xent: 0.34; lr: 0.00010; 5181/6617 tok/s;  10872 sec
[2021-04-24 08:11:47,346 INFO] Step 27550/50000; acc:  90.83; ppl:  1.41; xent: 0.34; lr: 0.00010; 5268/6426 tok/s;  10891 sec
[2021-04-24 08:12:06,477 INFO] Step 27600/50000; acc:  91.16; ppl:  1.39; xent: 0.33; lr: 0.00010; 5317/6606 tok/s;  10910 sec
[2021-04-24 08:12:25,986 INFO] Step 27650/50000; acc:  91.49; ppl:  1.38; xent: 0.32; lr: 0.00010; 5248/6820 tok/s;  10930 sec
[2021-04-24 08:12:44,307 INFO] Step 27700/50000; acc:  90.71; ppl:  1.41; xent: 0.34; lr: 0.00010; 5512/6789 tok/s;  10948 sec
[2021-04-24 08:12:49,119 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 08:13:04,024 INFO] Step 27750/50000; acc:  91.40; ppl:  1.38; xent: 0.32; lr: 0.00010; 5142/6602 tok/s;  10968 sec
[2021-04-24 08:13:23,213 INFO] Step 27800/50000; acc:  91.02; ppl:  1.40; xent: 0.34; lr: 0.00010; 5346/6658 tok/s;  10987 sec
[2021-04-24 08:13:42,454 INFO] Step 27850/50000; acc:  90.65; ppl:  1.41; xent: 0.35; lr: 0.00010; 5255/6454 tok/s;  11006 sec
[2021-04-24 08:14:02,377 INFO] Step 27900/50000; acc:  90.96; ppl:  1.40; xent: 0.33; lr: 0.00010; 5114/6422 tok/s;  11026 sec
[2021-04-24 08:14:22,121 INFO] Step 27950/50000; acc:  90.94; ppl:  1.40; xent: 0.33; lr: 0.00010; 5066/6397 tok/s;  11046 sec
[2021-04-24 08:14:41,461 INFO] Step 28000/50000; acc:  91.02; ppl:  1.40; xent: 0.34; lr: 0.00010; 5245/6515 tok/s;  11065 sec
[2021-04-24 08:15:00,904 INFO] Step 28050/50000; acc:  91.05; ppl:  1.39; xent: 0.33; lr: 0.00010; 5279/6538 tok/s;  11084 sec
[2021-04-24 08:15:20,696 INFO] Step 28100/50000; acc:  91.08; ppl:  1.40; xent: 0.33; lr: 0.00010; 5271/6483 tok/s;  11104 sec
[2021-04-24 08:15:40,201 INFO] Step 28150/50000; acc:  91.02; ppl:  1.40; xent: 0.34; lr: 0.00010; 5161/6373 tok/s;  11124 sec
[2021-04-24 08:16:00,312 INFO] Step 28200/50000; acc:  91.34; ppl:  1.38; xent: 0.32; lr: 0.00010; 5024/6586 tok/s;  11144 sec
[2021-04-24 08:16:19,931 INFO] Step 28250/50000; acc:  90.80; ppl:  1.41; xent: 0.34; lr: 0.00010; 5207/6341 tok/s;  11163 sec
[2021-04-24 08:16:39,020 INFO] Step 28300/50000; acc:  91.30; ppl:  1.38; xent: 0.33; lr: 0.00010; 5367/6595 tok/s;  11183 sec
[2021-04-24 08:16:58,266 INFO] Step 28350/50000; acc:  91.42; ppl:  1.38; xent: 0.32; lr: 0.00010; 5238/6653 tok/s;  11202 sec
[2021-04-24 08:17:17,627 INFO] Step 28400/50000; acc:  91.14; ppl:  1.39; xent: 0.33; lr: 0.00010; 5198/6627 tok/s;  11221 sec
[2021-04-24 08:17:23,904 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 08:17:37,103 INFO] Step 28450/50000; acc:  91.27; ppl:  1.38; xent: 0.32; lr: 0.00010; 5325/6740 tok/s;  11241 sec
[2021-04-24 08:17:56,153 INFO] Step 28500/50000; acc:  91.10; ppl:  1.40; xent: 0.33; lr: 0.00010; 5296/6540 tok/s;  11260 sec
[2021-04-24 08:18:15,281 INFO] Step 28550/50000; acc:  90.93; ppl:  1.40; xent: 0.34; lr: 0.00010; 5260/6629 tok/s;  11279 sec
[2021-04-24 08:18:34,365 INFO] Step 28600/50000; acc:  90.76; ppl:  1.41; xent: 0.34; lr: 0.00010; 5356/6563 tok/s;  11298 sec
[2021-04-24 08:18:54,710 INFO] Step 28650/50000; acc:  91.21; ppl:  1.39; xent: 0.33; lr: 0.00010; 4997/6344 tok/s;  11318 sec
[2021-04-24 08:19:14,364 INFO] Step 28700/50000; acc:  91.07; ppl:  1.39; xent: 0.33; lr: 0.00010; 5159/6433 tok/s;  11338 sec
[2021-04-24 08:19:33,934 INFO] Step 28750/50000; acc:  91.12; ppl:  1.39; xent: 0.33; lr: 0.00010; 5154/6427 tok/s;  11357 sec
[2021-04-24 08:19:52,988 INFO] Step 28800/50000; acc:  90.91; ppl:  1.40; xent: 0.34; lr: 0.00010; 5439/6558 tok/s;  11377 sec
[2021-04-24 08:20:12,607 INFO] Step 28850/50000; acc:  91.12; ppl:  1.39; xent: 0.33; lr: 0.00010; 5129/6566 tok/s;  11396 sec
[2021-04-24 08:20:32,262 INFO] Step 28900/50000; acc:  91.40; ppl:  1.38; xent: 0.32; lr: 0.00010; 5307/6580 tok/s;  11416 sec
[2021-04-24 08:20:52,126 INFO] Step 28950/50000; acc:  91.14; ppl:  1.39; xent: 0.33; lr: 0.00010; 5057/6427 tok/s;  11436 sec
[2021-04-24 08:21:11,901 INFO] Step 29000/50000; acc:  91.06; ppl:  1.39; xent: 0.33; lr: 0.00010; 5102/6281 tok/s;  11455 sec
[2021-04-24 08:21:31,059 INFO] Step 29050/50000; acc:  91.47; ppl:  1.38; xent: 0.32; lr: 0.00010; 5394/6740 tok/s;  11475 sec
[2021-04-24 08:21:50,448 INFO] Step 29100/50000; acc:  91.31; ppl:  1.38; xent: 0.32; lr: 0.00010; 5209/6619 tok/s;  11494 sec
[2021-04-24 08:22:09,689 INFO] Step 29150/50000; acc:  91.11; ppl:  1.39; xent: 0.33; lr: 0.00010; 5243/6604 tok/s;  11513 sec
[2021-04-24 08:22:10,423 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 08:22:28,869 INFO] Step 29200/50000; acc:  91.33; ppl:  1.38; xent: 0.32; lr: 0.00010; 5278/6666 tok/s;  11532 sec
[2021-04-24 08:22:48,648 INFO] Step 29250/50000; acc:  91.16; ppl:  1.39; xent: 0.33; lr: 0.00010; 5238/6548 tok/s;  11552 sec
[2021-04-24 08:23:07,436 INFO] Step 29300/50000; acc:  90.59; ppl:  1.41; xent: 0.35; lr: 0.00010; 5304/6543 tok/s;  11571 sec
[2021-04-24 08:23:27,862 INFO] Step 29350/50000; acc:  91.26; ppl:  1.38; xent: 0.32; lr: 0.00010; 4958/6290 tok/s;  11591 sec
[2021-04-24 08:23:47,374 INFO] Step 29400/50000; acc:  91.11; ppl:  1.39; xent: 0.33; lr: 0.00010; 5256/6560 tok/s;  11611 sec
[2021-04-24 08:24:07,710 INFO] Step 29450/50000; acc:  91.19; ppl:  1.39; xent: 0.33; lr: 0.00010; 4962/6328 tok/s;  11631 sec
[2021-04-24 08:24:26,935 INFO] Step 29500/50000; acc:  90.93; ppl:  1.40; xent: 0.34; lr: 0.00010; 5380/6300 tok/s;  11650 sec
[2021-04-24 08:24:45,951 INFO] Step 29550/50000; acc:  91.05; ppl:  1.39; xent: 0.33; lr: 0.00010; 5321/6719 tok/s;  11669 sec
[2021-04-24 08:25:06,343 INFO] Step 29600/50000; acc:  91.34; ppl:  1.38; xent: 0.32; lr: 0.00010; 4976/6202 tok/s;  11690 sec
[2021-04-24 08:25:26,077 INFO] Step 29650/50000; acc:  91.20; ppl:  1.39; xent: 0.33; lr: 0.00010; 5128/6567 tok/s;  11710 sec
[2021-04-24 08:25:46,321 INFO] Step 29700/50000; acc:  91.28; ppl:  1.39; xent: 0.33; lr: 0.00010; 5149/6318 tok/s;  11730 sec
[2021-04-24 08:26:05,406 INFO] Step 29750/50000; acc:  91.13; ppl:  1.39; xent: 0.33; lr: 0.00010; 5295/6582 tok/s;  11749 sec
[2021-04-24 08:26:24,645 INFO] Step 29800/50000; acc:  91.46; ppl:  1.37; xent: 0.32; lr: 0.00010; 5236/6650 tok/s;  11768 sec
[2021-04-24 08:26:43,856 INFO] Step 29850/50000; acc:  91.34; ppl:  1.38; xent: 0.32; lr: 0.00010; 5297/6690 tok/s;  11787 sec
[2021-04-24 08:26:58,233 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 08:27:03,083 INFO] Step 29900/50000; acc:  91.32; ppl:  1.38; xent: 0.32; lr: 0.00010; 5316/6751 tok/s;  11807 sec
[2021-04-24 08:27:22,258 INFO] Step 29950/50000; acc:  91.29; ppl:  1.39; xent: 0.33; lr: 0.00010; 5260/6539 tok/s;  11826 sec
[2021-04-24 08:27:41,312 INFO] Step 30000/50000; acc:  90.94; ppl:  1.40; xent: 0.34; lr: 0.00010; 5309/6639 tok/s;  11845 sec
[2021-04-24 08:27:41,314 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/basic/valid.txt, align=None)...
[2021-04-24 08:28:09,670 INFO] Validation perplexity: 1.44318
[2021-04-24 08:28:09,671 INFO] Validation accuracy: 90.5087
[2021-04-24 08:28:09,675 INFO] Saving checkpoint ../models/default_params/basic_ops/model_step_30000.pt
[2021-04-24 08:28:29,196 INFO] Step 30050/50000; acc:  90.94; ppl:  1.40; xent: 0.34; lr: 0.00010; 2139/2635 tok/s;  11893 sec
[2021-04-24 08:28:49,264 INFO] Step 30100/50000; acc:  91.30; ppl:  1.38; xent: 0.32; lr: 0.00010; 5016/6474 tok/s;  11913 sec
[2021-04-24 08:29:08,877 INFO] Step 30150/50000; acc:  91.13; ppl:  1.39; xent: 0.33; lr: 0.00010; 5144/6395 tok/s;  11932 sec
[2021-04-24 08:29:28,831 INFO] Step 30200/50000; acc:  91.08; ppl:  1.39; xent: 0.33; lr: 0.00010; 5173/6354 tok/s;  11952 sec
[2021-04-24 08:29:48,012 INFO] Step 30250/50000; acc:  91.20; ppl:  1.39; xent: 0.33; lr: 0.00010; 5361/6408 tok/s;  11972 sec
[2021-04-24 08:30:07,863 INFO] Step 30300/50000; acc:  91.13; ppl:  1.39; xent: 0.33; lr: 0.00010; 5162/6527 tok/s;  11991 sec
[2021-04-24 08:30:28,018 INFO] Step 30350/50000; acc:  91.43; ppl:  1.38; xent: 0.32; lr: 0.00010; 4959/6379 tok/s;  12012 sec
[2021-04-24 08:30:47,772 INFO] Step 30400/50000; acc:  91.31; ppl:  1.38; xent: 0.32; lr: 0.00010; 5172/6464 tok/s;  12031 sec
[2021-04-24 08:31:07,080 INFO] Step 30450/50000; acc:  91.07; ppl:  1.39; xent: 0.33; lr: 0.00010; 5219/6400 tok/s;  12051 sec
[2021-04-24 08:31:26,789 INFO] Step 30500/50000; acc:  91.44; ppl:  1.37; xent: 0.32; lr: 0.00010; 5311/6595 tok/s;  12070 sec
[2021-04-24 08:31:45,340 INFO] Step 30550/50000; acc:  91.45; ppl:  1.38; xent: 0.32; lr: 0.00010; 5407/6968 tok/s;  12089 sec
[2021-04-24 08:32:04,859 INFO] Step 30600/50000; acc:  91.24; ppl:  1.38; xent: 0.32; lr: 0.00010; 5130/6454 tok/s;  12108 sec
[2021-04-24 08:32:14,027 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 08:32:24,317 INFO] Step 30650/50000; acc:  91.27; ppl:  1.38; xent: 0.32; lr: 0.00010; 5280/6621 tok/s;  12128 sec
[2021-04-24 08:32:43,899 INFO] Step 30700/50000; acc:  91.39; ppl:  1.38; xent: 0.32; lr: 0.00010; 5236/6632 tok/s;  12147 sec
[2021-04-24 08:33:02,746 INFO] Step 30750/50000; acc:  90.78; ppl:  1.40; xent: 0.34; lr: 0.00010; 5322/6488 tok/s;  12166 sec
[2021-04-24 08:33:22,683 INFO] Step 30800/50000; acc:  91.03; ppl:  1.39; xent: 0.33; lr: 0.00010; 5048/6350 tok/s;  12186 sec
[2021-04-24 08:33:42,985 INFO] Step 30850/50000; acc:  91.35; ppl:  1.38; xent: 0.32; lr: 0.00010; 5069/6436 tok/s;  12207 sec
[2021-04-24 08:34:02,375 INFO] Step 30900/50000; acc:  91.17; ppl:  1.39; xent: 0.33; lr: 0.00010; 5167/6518 tok/s;  12226 sec
[2021-04-24 08:34:21,920 INFO] Step 30950/50000; acc:  91.31; ppl:  1.38; xent: 0.32; lr: 0.00010; 5242/6410 tok/s;  12245 sec
[2021-04-24 08:34:41,086 INFO] Step 31000/50000; acc:  91.18; ppl:  1.39; xent: 0.33; lr: 0.00010; 5408/6521 tok/s;  12265 sec
[2021-04-24 08:35:01,295 INFO] Step 31050/50000; acc:  91.33; ppl:  1.38; xent: 0.32; lr: 0.00010; 5025/6334 tok/s;  12285 sec
[2021-04-24 08:35:20,772 INFO] Step 31100/50000; acc:  91.47; ppl:  1.37; xent: 0.31; lr: 0.00010; 5250/6617 tok/s;  12304 sec
[2021-04-24 08:35:40,269 INFO] Step 31150/50000; acc:  91.27; ppl:  1.38; xent: 0.32; lr: 0.00010; 5145/6594 tok/s;  12324 sec
[2021-04-24 08:35:59,448 INFO] Step 31200/50000; acc:  91.14; ppl:  1.39; xent: 0.33; lr: 0.00010; 5312/6494 tok/s;  12343 sec
[2021-04-24 08:36:18,462 INFO] Step 31250/50000; acc:  91.45; ppl:  1.37; xent: 0.32; lr: 0.00010; 5322/6622 tok/s;  12362 sec
[2021-04-24 08:36:38,305 INFO] Step 31300/50000; acc:  91.78; ppl:  1.36; xent: 0.31; lr: 0.00010; 5223/6746 tok/s;  12382 sec
[2021-04-24 08:36:56,683 INFO] Step 31350/50000; acc:  91.01; ppl:  1.39; xent: 0.33; lr: 0.00010; 5482/6741 tok/s;  12400 sec
[2021-04-24 08:37:00,674 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 08:37:16,386 INFO] Step 31400/50000; acc:  91.59; ppl:  1.37; xent: 0.32; lr: 0.00010; 5122/6573 tok/s;  12420 sec
[2021-04-24 08:37:35,605 INFO] Step 31450/50000; acc:  91.13; ppl:  1.39; xent: 0.33; lr: 0.00010; 5320/6614 tok/s;  12439 sec
[2021-04-24 08:37:54,645 INFO] Step 31500/50000; acc:  90.94; ppl:  1.40; xent: 0.33; lr: 0.00010; 5323/6592 tok/s;  12458 sec
[2021-04-24 08:38:14,865 INFO] Step 31550/50000; acc:  91.26; ppl:  1.38; xent: 0.32; lr: 0.00010; 4980/6298 tok/s;  12478 sec
[2021-04-24 08:38:34,445 INFO] Step 31600/50000; acc:  91.22; ppl:  1.38; xent: 0.32; lr: 0.00010; 5150/6452 tok/s;  12498 sec
[2021-04-24 08:38:54,021 INFO] Step 31650/50000; acc:  91.31; ppl:  1.38; xent: 0.32; lr: 0.00010; 5253/6555 tok/s;  12518 sec
[2021-04-24 08:39:13,665 INFO] Step 31700/50000; acc:  91.14; ppl:  1.38; xent: 0.33; lr: 0.00010; 5228/6306 tok/s;  12537 sec
[2021-04-24 08:39:33,259 INFO] Step 31750/50000; acc:  91.32; ppl:  1.38; xent: 0.32; lr: 0.00010; 5161/6466 tok/s;  12557 sec
[2021-04-24 08:39:53,356 INFO] Step 31800/50000; acc:  91.47; ppl:  1.37; xent: 0.32; lr: 0.00010; 5123/6387 tok/s;  12577 sec
[2021-04-24 08:40:13,609 INFO] Step 31850/50000; acc:  91.45; ppl:  1.37; xent: 0.32; lr: 0.00010; 5031/6488 tok/s;  12597 sec
[2021-04-24 08:40:33,131 INFO] Step 31900/50000; acc:  91.06; ppl:  1.39; xent: 0.33; lr: 0.00010; 5209/6332 tok/s;  12617 sec
[2021-04-24 08:40:52,049 INFO] Step 31950/50000; acc:  91.56; ppl:  1.37; xent: 0.31; lr: 0.00010; 5347/6686 tok/s;  12636 sec
[2021-04-24 08:41:11,325 INFO] Step 32000/50000; acc:  91.59; ppl:  1.36; xent: 0.31; lr: 0.00010; 5272/6671 tok/s;  12655 sec
[2021-04-24 08:41:30,579 INFO] Step 32050/50000; acc:  91.41; ppl:  1.37; xent: 0.32; lr: 0.00010; 5210/6640 tok/s;  12674 sec
[2021-04-24 08:41:36,132 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 08:41:50,124 INFO] Step 32100/50000; acc:  91.59; ppl:  1.37; xent: 0.31; lr: 0.00010; 5356/6721 tok/s;  12694 sec
[2021-04-24 08:42:09,180 INFO] Step 32150/50000; acc:  91.38; ppl:  1.38; xent: 0.32; lr: 0.00010; 5286/6616 tok/s;  12713 sec
[2021-04-24 08:42:28,288 INFO] Step 32200/50000; acc:  91.15; ppl:  1.39; xent: 0.33; lr: 0.00010; 5230/6533 tok/s;  12732 sec
[2021-04-24 08:42:47,619 INFO] Step 32250/50000; acc:  91.04; ppl:  1.39; xent: 0.33; lr: 0.00010; 5285/6529 tok/s;  12751 sec
[2021-04-24 08:43:07,749 INFO] Step 32300/50000; acc:  91.46; ppl:  1.37; xent: 0.32; lr: 0.00010; 5057/6375 tok/s;  12771 sec
[2021-04-24 08:43:27,296 INFO] Step 32350/50000; acc:  91.31; ppl:  1.38; xent: 0.32; lr: 0.00010; 5123/6443 tok/s;  12791 sec
[2021-04-24 08:43:46,876 INFO] Step 32400/50000; acc:  91.27; ppl:  1.38; xent: 0.32; lr: 0.00010; 5205/6401 tok/s;  12810 sec
[2021-04-24 08:44:05,832 INFO] Step 32450/50000; acc:  91.33; ppl:  1.38; xent: 0.32; lr: 0.00010; 5530/6702 tok/s;  12829 sec
[2021-04-24 08:44:25,585 INFO] Step 32500/50000; acc:  91.27; ppl:  1.38; xent: 0.32; lr: 0.00010; 5089/6535 tok/s;  12849 sec
[2021-04-24 08:44:44,944 INFO] Step 32550/50000; acc:  91.55; ppl:  1.37; xent: 0.31; lr: 0.00010; 5236/6500 tok/s;  12868 sec
[2021-04-24 08:45:05,029 INFO] Step 32600/50000; acc:  91.36; ppl:  1.37; xent: 0.32; lr: 0.00010; 5114/6442 tok/s;  12889 sec
[2021-04-24 08:45:24,695 INFO] Step 32650/50000; acc:  91.39; ppl:  1.38; xent: 0.32; lr: 0.00010; 5179/6393 tok/s;  12908 sec
[2021-04-24 08:45:43,826 INFO] Step 32700/50000; acc:  91.76; ppl:  1.36; xent: 0.31; lr: 0.00010; 5367/6698 tok/s;  12927 sec
[2021-04-24 08:46:02,935 INFO] Step 32750/50000; acc:  91.58; ppl:  1.36; xent: 0.31; lr: 0.00010; 5217/6700 tok/s;  12946 sec
[2021-04-24 08:46:22,208 INFO] Step 32800/50000; acc:  91.43; ppl:  1.37; xent: 0.31; lr: 0.00010; 5288/6635 tok/s;  12966 sec
[2021-04-24 08:46:22,225 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 08:46:41,668 INFO] Step 32850/50000; acc:  91.50; ppl:  1.37; xent: 0.31; lr: 0.00010; 5172/6554 tok/s;  12985 sec
[2021-04-24 08:47:01,410 INFO] Step 32900/50000; acc:  91.37; ppl:  1.38; xent: 0.32; lr: 0.00010; 5305/6530 tok/s;  13005 sec
[2021-04-24 08:47:20,182 INFO] Step 32950/50000; acc:  90.93; ppl:  1.40; xent: 0.33; lr: 0.00010; 5296/6552 tok/s;  13024 sec
[2021-04-24 08:47:40,563 INFO] Step 33000/50000; acc:  91.47; ppl:  1.37; xent: 0.31; lr: 0.00010; 4936/6317 tok/s;  13044 sec
[2021-04-24 08:48:00,154 INFO] Step 33050/50000; acc:  91.39; ppl:  1.37; xent: 0.32; lr: 0.00010; 5232/6469 tok/s;  13064 sec
[2021-04-24 08:48:20,480 INFO] Step 33100/50000; acc:  91.44; ppl:  1.37; xent: 0.32; lr: 0.00010; 4991/6381 tok/s;  13084 sec
[2021-04-24 08:48:39,604 INFO] Step 33150/50000; acc:  91.21; ppl:  1.38; xent: 0.32; lr: 0.00010; 5342/6289 tok/s;  13103 sec
[2021-04-24 08:48:58,786 INFO] Step 33200/50000; acc:  91.23; ppl:  1.38; xent: 0.32; lr: 0.00010; 5314/6682 tok/s;  13122 sec
[2021-04-24 08:49:18,913 INFO] Step 33250/50000; acc:  91.77; ppl:  1.36; xent: 0.31; lr: 0.00010; 5097/6450 tok/s;  13142 sec
[2021-04-24 08:49:38,683 INFO] Step 33300/50000; acc:  91.35; ppl:  1.38; xent: 0.32; lr: 0.00010; 5126/6399 tok/s;  13162 sec
[2021-04-24 08:49:58,792 INFO] Step 33350/50000; acc:  91.47; ppl:  1.37; xent: 0.31; lr: 0.00010; 5030/6301 tok/s;  13182 sec
[2021-04-24 08:50:18,038 INFO] Step 33400/50000; acc:  91.48; ppl:  1.37; xent: 0.31; lr: 0.00010; 5371/6615 tok/s;  13202 sec
[2021-04-24 08:50:37,184 INFO] Step 33450/50000; acc:  91.71; ppl:  1.36; xent: 0.31; lr: 0.00010; 5303/6713 tok/s;  13221 sec
[2021-04-24 08:50:56,391 INFO] Step 33500/50000; acc:  91.60; ppl:  1.36; xent: 0.31; lr: 0.00010; 5269/6665 tok/s;  13240 sec
[2021-04-24 08:51:10,049 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 08:51:15,657 INFO] Step 33550/50000; acc:  91.53; ppl:  1.37; xent: 0.31; lr: 0.00010; 5232/6697 tok/s;  13259 sec
[2021-04-24 08:51:35,293 INFO] Step 33600/50000; acc:  91.53; ppl:  1.37; xent: 0.32; lr: 0.00010; 5191/6437 tok/s;  13279 sec
[2021-04-24 08:51:53,907 INFO] Step 33650/50000; acc:  91.15; ppl:  1.38; xent: 0.33; lr: 0.00010; 5401/6737 tok/s;  13297 sec
[2021-04-24 08:52:13,642 INFO] Step 33700/50000; acc:  91.19; ppl:  1.38; xent: 0.32; lr: 0.00010; 5255/6481 tok/s;  13317 sec
[2021-04-24 08:52:33,551 INFO] Step 33750/50000; acc:  91.51; ppl:  1.37; xent: 0.31; lr: 0.00010; 5040/6448 tok/s;  13337 sec
[2021-04-24 08:52:53,149 INFO] Step 33800/50000; acc:  91.30; ppl:  1.38; xent: 0.32; lr: 0.00010; 5119/6426 tok/s;  13357 sec
[2021-04-24 08:53:13,219 INFO] Step 33850/50000; acc:  91.43; ppl:  1.37; xent: 0.32; lr: 0.00010; 5139/6323 tok/s;  13377 sec
[2021-04-24 08:53:32,140 INFO] Step 33900/50000; acc:  91.47; ppl:  1.37; xent: 0.32; lr: 0.00010; 5441/6468 tok/s;  13396 sec
[2021-04-24 08:53:52,043 INFO] Step 33950/50000; acc:  91.39; ppl:  1.37; xent: 0.32; lr: 0.00010; 5092/6453 tok/s;  13416 sec
[2021-04-24 08:54:12,215 INFO] Step 34000/50000; acc:  91.79; ppl:  1.36; xent: 0.31; lr: 0.00010; 4997/6455 tok/s;  13436 sec
[2021-04-24 08:54:31,905 INFO] Step 34050/50000; acc:  91.43; ppl:  1.37; xent: 0.31; lr: 0.00010; 5246/6504 tok/s;  13455 sec
[2021-04-24 08:54:51,259 INFO] Step 34100/50000; acc:  91.35; ppl:  1.37; xent: 0.32; lr: 0.00010; 5218/6464 tok/s;  13475 sec
[2021-04-24 08:55:10,410 INFO] Step 34150/50000; acc:  91.70; ppl:  1.36; xent: 0.31; lr: 0.00010; 5301/6581 tok/s;  13494 sec
[2021-04-24 08:55:29,178 INFO] Step 34200/50000; acc:  91.63; ppl:  1.36; xent: 0.31; lr: 0.00010; 5464/6975 tok/s;  13513 sec
[2021-04-24 08:55:48,591 INFO] Step 34250/50000; acc:  91.60; ppl:  1.36; xent: 0.31; lr: 0.00010; 5206/6554 tok/s;  13532 sec
[2021-04-24 08:55:56,889 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 08:56:08,157 INFO] Step 34300/50000; acc:  91.58; ppl:  1.36; xent: 0.31; lr: 0.00010; 5226/6589 tok/s;  13552 sec
[2021-04-24 08:56:27,358 INFO] Step 34350/50000; acc:  91.65; ppl:  1.36; xent: 0.31; lr: 0.00010; 5257/6716 tok/s;  13571 sec
[2021-04-24 08:56:45,946 INFO] Step 34400/50000; acc:  90.95; ppl:  1.39; xent: 0.33; lr: 0.00010; 5453/6554 tok/s;  13589 sec
[2021-04-24 08:57:05,943 INFO] Step 34450/50000; acc:  91.45; ppl:  1.37; xent: 0.31; lr: 0.00010; 5011/6312 tok/s;  13609 sec
[2021-04-24 08:57:26,067 INFO] Step 34500/50000; acc:  91.59; ppl:  1.36; xent: 0.31; lr: 0.00010; 5168/6548 tok/s;  13630 sec
[2021-04-24 08:57:45,436 INFO] Step 34550/50000; acc:  91.33; ppl:  1.37; xent: 0.31; lr: 0.00010; 5163/6497 tok/s;  13649 sec
[2021-04-24 08:58:04,953 INFO] Step 34600/50000; acc:  91.58; ppl:  1.36; xent: 0.31; lr: 0.00010; 5224/6434 tok/s;  13668 sec
[2021-04-24 08:58:24,134 INFO] Step 34650/50000; acc:  91.48; ppl:  1.37; xent: 0.31; lr: 0.00010; 5383/6500 tok/s;  13688 sec
[2021-04-24 08:58:44,310 INFO] Step 34700/50000; acc:  91.51; ppl:  1.37; xent: 0.31; lr: 0.00010; 5054/6301 tok/s;  13708 sec
[2021-04-24 08:59:04,067 INFO] Step 34750/50000; acc:  91.79; ppl:  1.35; xent: 0.30; lr: 0.00010; 5120/6554 tok/s;  13728 sec
[2021-04-24 08:59:23,748 INFO] Step 34800/50000; acc:  91.40; ppl:  1.37; xent: 0.32; lr: 0.00010; 5132/6469 tok/s;  13747 sec
[2021-04-24 08:59:43,281 INFO] Step 34850/50000; acc:  91.58; ppl:  1.36; xent: 0.31; lr: 0.00010; 5282/6516 tok/s;  13767 sec
[2021-04-24 09:00:02,556 INFO] Step 34900/50000; acc:  91.81; ppl:  1.35; xent: 0.30; lr: 0.00010; 5247/6582 tok/s;  13786 sec
[2021-04-24 09:00:21,581 INFO] Step 34950/50000; acc:  91.81; ppl:  1.36; xent: 0.30; lr: 0.00010; 5298/6802 tok/s;  13805 sec
[2021-04-24 09:00:40,783 INFO] Step 35000/50000; acc:  91.36; ppl:  1.37; xent: 0.31; lr: 0.00010; 5359/6623 tok/s;  13824 sec
[2021-04-24 09:00:40,784 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/basic/valid.txt, align=None)...
[2021-04-24 09:01:08,948 INFO] Validation perplexity: 1.44539
[2021-04-24 09:01:08,948 INFO] Validation accuracy: 90.5702
[2021-04-24 09:01:08,952 INFO] Saving checkpoint ../models/default_params/basic_ops/model_step_35000.pt
[2021-04-24 09:01:12,082 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 09:01:28,701 INFO] Step 35050/50000; acc:  91.72; ppl:  1.36; xent: 0.30; lr: 0.00010; 2125/2687 tok/s;  13872 sec
[2021-04-24 09:01:47,877 INFO] Step 35100/50000; acc:  91.44; ppl:  1.37; xent: 0.32; lr: 0.00010; 5304/6585 tok/s;  13891 sec
[2021-04-24 09:02:06,972 INFO] Step 35150/50000; acc:  91.28; ppl:  1.38; xent: 0.32; lr: 0.00010; 5229/6594 tok/s;  13911 sec
[2021-04-24 09:02:27,128 INFO] Step 35200/50000; acc:  91.57; ppl:  1.36; xent: 0.31; lr: 0.00010; 5050/6334 tok/s;  13931 sec
[2021-04-24 09:02:46,640 INFO] Step 35250/50000; acc:  91.39; ppl:  1.37; xent: 0.31; lr: 0.00010; 5137/6464 tok/s;  13950 sec
[2021-04-24 09:03:06,205 INFO] Step 35300/50000; acc:  91.65; ppl:  1.36; xent: 0.31; lr: 0.00010; 5325/6553 tok/s;  13970 sec
[2021-04-24 09:03:25,834 INFO] Step 35350/50000; acc:  91.47; ppl:  1.37; xent: 0.31; lr: 0.00010; 5220/6355 tok/s;  13989 sec
[2021-04-24 09:03:45,406 INFO] Step 35400/50000; acc:  91.57; ppl:  1.36; xent: 0.31; lr: 0.00010; 5140/6428 tok/s;  14009 sec
[2021-04-24 09:04:05,129 INFO] Step 35450/50000; acc:  91.66; ppl:  1.36; xent: 0.31; lr: 0.00010; 5205/6441 tok/s;  14029 sec
[2021-04-24 09:04:25,457 INFO] Step 35500/50000; acc:  91.69; ppl:  1.36; xent: 0.31; lr: 0.00010; 5026/6530 tok/s;  14049 sec
[2021-04-24 09:04:44,524 INFO] Step 35550/50000; acc:  91.38; ppl:  1.37; xent: 0.31; lr: 0.00010; 5271/6461 tok/s;  14068 sec
[2021-04-24 09:05:03,825 INFO] Step 35600/50000; acc:  91.87; ppl:  1.35; xent: 0.30; lr: 0.00010; 5290/6561 tok/s;  14087 sec
[2021-04-24 09:05:23,276 INFO] Step 35650/50000; acc:  91.90; ppl:  1.35; xent: 0.30; lr: 0.00010; 5275/6742 tok/s;  14107 sec
[2021-04-24 09:05:42,409 INFO] Step 35700/50000; acc:  91.44; ppl:  1.37; xent: 0.31; lr: 0.00010; 5248/6549 tok/s;  14126 sec
[2021-04-24 09:05:47,215 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 09:06:02,062 INFO] Step 35750/50000; acc:  91.76; ppl:  1.35; xent: 0.30; lr: 0.00010; 5173/6609 tok/s;  14146 sec
[2021-04-24 09:06:21,309 INFO] Step 35800/50000; acc:  91.71; ppl:  1.36; xent: 0.31; lr: 0.00010; 5348/6705 tok/s;  14165 sec
[2021-04-24 09:06:40,639 INFO] Step 35850/50000; acc:  91.36; ppl:  1.37; xent: 0.32; lr: 0.00010; 5218/6441 tok/s;  14184 sec
[2021-04-24 09:07:00,234 INFO] Step 35900/50000; acc:  91.21; ppl:  1.37; xent: 0.32; lr: 0.00010; 5194/6368 tok/s;  14204 sec
[2021-04-24 09:07:20,190 INFO] Step 35950/50000; acc:  91.73; ppl:  1.35; xent: 0.30; lr: 0.00010; 5024/6464 tok/s;  14224 sec
[2021-04-24 09:07:39,906 INFO] Step 36000/50000; acc:  91.50; ppl:  1.36; xent: 0.31; lr: 0.00010; 5124/6411 tok/s;  14243 sec
[2021-04-24 09:07:59,385 INFO] Step 36050/50000; acc:  91.48; ppl:  1.37; xent: 0.31; lr: 0.00010; 5220/6394 tok/s;  14263 sec
[2021-04-24 09:08:18,650 INFO] Step 36100/50000; acc:  91.63; ppl:  1.36; xent: 0.31; lr: 0.00010; 5488/6644 tok/s;  14282 sec
[2021-04-24 09:08:38,561 INFO] Step 36150/50000; acc:  91.59; ppl:  1.36; xent: 0.31; lr: 0.00010; 5038/6472 tok/s;  14302 sec
[2021-04-24 09:08:58,026 INFO] Step 36200/50000; acc:  91.84; ppl:  1.35; xent: 0.30; lr: 0.00010; 5182/6475 tok/s;  14322 sec
[2021-04-24 09:09:17,911 INFO] Step 36250/50000; acc:  91.56; ppl:  1.36; xent: 0.31; lr: 0.00010; 5158/6492 tok/s;  14341 sec
[2021-04-24 09:09:37,551 INFO] Step 36300/50000; acc:  91.65; ppl:  1.36; xent: 0.31; lr: 0.00010; 5199/6404 tok/s;  14361 sec
[2021-04-24 09:09:56,630 INFO] Step 36350/50000; acc:  91.85; ppl:  1.35; xent: 0.30; lr: 0.00010; 5326/6633 tok/s;  14380 sec
[2021-04-24 09:10:15,841 INFO] Step 36400/50000; acc:  91.83; ppl:  1.35; xent: 0.30; lr: 0.00010; 5221/6716 tok/s;  14399 sec
[2021-04-24 09:10:34,265 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 09:10:35,282 INFO] Step 36450/50000; acc:  91.75; ppl:  1.35; xent: 0.30; lr: 0.00010; 5302/6685 tok/s;  14419 sec
[2021-04-24 09:10:54,541 INFO] Step 36500/50000; acc:  91.65; ppl:  1.36; xent: 0.31; lr: 0.00010; 5233/6528 tok/s;  14438 sec
[2021-04-24 09:11:14,134 INFO] Step 36550/50000; acc:  91.55; ppl:  1.37; xent: 0.31; lr: 0.00010; 5192/6486 tok/s;  14458 sec
[2021-04-24 09:11:33,127 INFO] Step 36600/50000; acc:  91.24; ppl:  1.38; xent: 0.32; lr: 0.00010; 5353/6544 tok/s;  14477 sec
[2021-04-24 09:11:53,507 INFO] Step 36650/50000; acc:  91.72; ppl:  1.35; xent: 0.30; lr: 0.00010; 4980/6377 tok/s;  14497 sec
[2021-04-24 09:12:12,697 INFO] Step 36700/50000; acc:  91.61; ppl:  1.36; xent: 0.31; lr: 0.00010; 5314/6570 tok/s;  14516 sec
[2021-04-24 09:12:33,032 INFO] Step 36750/50000; acc:  91.68; ppl:  1.35; xent: 0.30; lr: 0.00010; 4923/6339 tok/s;  14537 sec
[2021-04-24 09:12:51,983 INFO] Step 36800/50000; acc:  91.39; ppl:  1.36; xent: 0.31; lr: 0.00010; 5438/6402 tok/s;  14556 sec
[2021-04-24 09:13:11,519 INFO] Step 36850/50000; acc:  91.51; ppl:  1.36; xent: 0.31; lr: 0.00010; 5200/6541 tok/s;  14575 sec
[2021-04-24 09:13:31,752 INFO] Step 36900/50000; acc:  91.87; ppl:  1.35; xent: 0.30; lr: 0.00010; 5115/6438 tok/s;  14595 sec
[2021-04-24 09:13:51,859 INFO] Step 36950/50000; acc:  91.59; ppl:  1.36; xent: 0.31; lr: 0.00010; 5030/6323 tok/s;  14615 sec
[2021-04-24 09:14:11,717 INFO] Step 37000/50000; acc:  91.79; ppl:  1.35; xent: 0.30; lr: 0.00010; 5060/6322 tok/s;  14635 sec
[2021-04-24 09:14:31,501 INFO] Step 37050/50000; acc:  91.69; ppl:  1.35; xent: 0.30; lr: 0.00010; 5211/6402 tok/s;  14655 sec
[2021-04-24 09:14:50,252 INFO] Step 37100/50000; acc:  91.79; ppl:  1.35; xent: 0.30; lr: 0.00010; 5436/6840 tok/s;  14674 sec
[2021-04-24 09:15:09,634 INFO] Step 37150/50000; acc:  91.81; ppl:  1.35; xent: 0.30; lr: 0.00010; 5163/6615 tok/s;  14693 sec
[2021-04-24 09:15:22,673 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 09:15:28,963 INFO] Step 37200/50000; acc:  91.74; ppl:  1.35; xent: 0.30; lr: 0.00010; 5263/6669 tok/s;  14713 sec
[2021-04-24 09:15:48,731 INFO] Step 37250/50000; acc:  91.92; ppl:  1.35; xent: 0.30; lr: 0.00010; 5225/6549 tok/s;  14732 sec
[2021-04-24 09:16:07,256 INFO] Step 37300/50000; acc:  91.40; ppl:  1.37; xent: 0.31; lr: 0.00010; 5418/6681 tok/s;  14751 sec
[2021-04-24 09:16:27,340 INFO] Step 37350/50000; acc:  91.57; ppl:  1.36; xent: 0.31; lr: 0.00010; 5014/6302 tok/s;  14771 sec
[2021-04-24 09:16:47,554 INFO] Step 37400/50000; acc:  91.64; ppl:  1.35; xent: 0.30; lr: 0.00010; 5075/6412 tok/s;  14791 sec
[2021-04-24 09:17:06,910 INFO] Step 37450/50000; acc:  91.67; ppl:  1.36; xent: 0.31; lr: 0.00010; 5225/6488 tok/s;  14810 sec
[2021-04-24 09:17:27,068 INFO] Step 37500/50000; acc:  91.68; ppl:  1.35; xent: 0.30; lr: 0.00010; 5097/6300 tok/s;  14831 sec
[2021-04-24 09:17:45,757 INFO] Step 37550/50000; acc:  91.67; ppl:  1.36; xent: 0.30; lr: 0.00010; 5440/6586 tok/s;  14849 sec
[2021-04-24 09:18:05,891 INFO] Step 37600/50000; acc:  91.65; ppl:  1.36; xent: 0.30; lr: 0.00010; 5074/6358 tok/s;  14869 sec
[2021-04-24 09:18:26,041 INFO] Step 37650/50000; acc:  92.03; ppl:  1.34; xent: 0.29; lr: 0.00010; 4979/6498 tok/s;  14890 sec
[2021-04-24 09:18:45,725 INFO] Step 37700/50000; acc:  91.70; ppl:  1.36; xent: 0.30; lr: 0.00010; 5305/6488 tok/s;  14909 sec
[2021-04-24 09:19:04,882 INFO] Step 37750/50000; acc:  91.56; ppl:  1.36; xent: 0.31; lr: 0.00010; 5261/6529 tok/s;  14928 sec
[2021-04-24 09:19:23,938 INFO] Step 37800/50000; acc:  91.97; ppl:  1.34; xent: 0.29; lr: 0.00010; 5297/6543 tok/s;  14947 sec
[2021-04-24 09:19:42,824 INFO] Step 37850/50000; acc:  92.02; ppl:  1.34; xent: 0.29; lr: 0.00010; 5416/7004 tok/s;  14966 sec
[2021-04-24 09:20:02,231 INFO] Step 37900/50000; acc:  91.74; ppl:  1.35; xent: 0.30; lr: 0.00010; 5222/6541 tok/s;  14986 sec
[2021-04-24 09:20:09,706 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 09:20:21,724 INFO] Step 37950/50000; acc:  91.82; ppl:  1.35; xent: 0.30; lr: 0.00010; 5196/6565 tok/s;  15005 sec
[2021-04-24 09:20:41,169 INFO] Step 38000/50000; acc:  91.83; ppl:  1.35; xent: 0.30; lr: 0.00010; 5214/6592 tok/s;  15025 sec
[2021-04-24 09:20:59,842 INFO] Step 38050/50000; acc:  91.38; ppl:  1.37; xent: 0.31; lr: 0.00010; 5501/6691 tok/s;  15043 sec
[2021-04-24 09:21:19,789 INFO] Step 38100/50000; acc:  91.58; ppl:  1.36; xent: 0.30; lr: 0.00010; 5025/6324 tok/s;  15063 sec
[2021-04-24 09:21:39,760 INFO] Step 38150/50000; acc:  91.72; ppl:  1.35; xent: 0.30; lr: 0.00010; 5058/6417 tok/s;  15083 sec
[2021-04-24 09:21:58,907 INFO] Step 38200/50000; acc:  91.62; ppl:  1.35; xent: 0.30; lr: 0.00010; 5340/6685 tok/s;  15102 sec
[2021-04-24 09:22:18,303 INFO] Step 38250/50000; acc:  91.84; ppl:  1.35; xent: 0.30; lr: 0.00010; 5309/6487 tok/s;  15122 sec
[2021-04-24 09:22:37,528 INFO] Step 38300/50000; acc:  91.68; ppl:  1.35; xent: 0.30; lr: 0.00010; 5342/6479 tok/s;  15141 sec
[2021-04-24 09:22:57,274 INFO] Step 38350/50000; acc:  91.50; ppl:  1.36; xent: 0.31; lr: 0.00010; 5087/6355 tok/s;  15161 sec
[2021-04-24 09:23:17,082 INFO] Step 38400/50000; acc:  92.00; ppl:  1.33; xent: 0.29; lr: 0.00010; 5160/6584 tok/s;  15181 sec
[2021-04-24 09:23:36,892 INFO] Step 38450/50000; acc:  91.76; ppl:  1.35; xent: 0.30; lr: 0.00010; 5071/6463 tok/s;  15200 sec
[2021-04-24 09:23:56,321 INFO] Step 38500/50000; acc:  91.84; ppl:  1.35; xent: 0.30; lr: 0.00010; 5379/6560 tok/s;  15220 sec
[2021-04-24 09:24:15,347 INFO] Step 38550/50000; acc:  92.08; ppl:  1.34; xent: 0.29; lr: 0.00010; 5291/6710 tok/s;  15239 sec
[2021-04-24 09:24:34,559 INFO] Step 38600/50000; acc:  91.91; ppl:  1.34; xent: 0.30; lr: 0.00010; 5220/6619 tok/s;  15258 sec
[2021-04-24 09:24:53,716 INFO] Step 38650/50000; acc:  91.69; ppl:  1.35; xent: 0.30; lr: 0.00010; 5359/6686 tok/s;  15277 sec
[2021-04-24 09:24:56,087 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 09:25:13,512 INFO] Step 38700/50000; acc:  91.94; ppl:  1.35; xent: 0.30; lr: 0.00010; 5165/6539 tok/s;  15297 sec
[2021-04-24 09:25:32,482 INFO] Step 38750/50000; acc:  91.71; ppl:  1.35; xent: 0.30; lr: 0.00010; 5295/6578 tok/s;  15316 sec
[2021-04-24 09:25:51,663 INFO] Step 38800/50000; acc:  91.45; ppl:  1.36; xent: 0.31; lr: 0.00010; 5249/6596 tok/s;  15335 sec
[2021-04-24 09:26:11,903 INFO] Step 38850/50000; acc:  91.95; ppl:  1.34; xent: 0.29; lr: 0.00010; 5083/6399 tok/s;  15355 sec
[2021-04-24 09:26:31,426 INFO] Step 38900/50000; acc:  91.40; ppl:  1.36; xent: 0.31; lr: 0.00010; 5139/6361 tok/s;  15375 sec
[2021-04-24 09:26:51,115 INFO] Step 38950/50000; acc:  91.93; ppl:  1.34; xent: 0.29; lr: 0.00010; 5142/6438 tok/s;  15395 sec
[2021-04-24 09:27:10,805 INFO] Step 39000/50000; acc:  91.76; ppl:  1.35; xent: 0.30; lr: 0.00010; 5317/6474 tok/s;  15414 sec
[2021-04-24 09:27:29,930 INFO] Step 39050/50000; acc:  91.82; ppl:  1.35; xent: 0.30; lr: 0.00010; 5307/6598 tok/s;  15433 sec
[2021-04-24 09:27:49,596 INFO] Step 39100/50000; acc:  91.99; ppl:  1.34; xent: 0.29; lr: 0.00010; 5192/6435 tok/s;  15453 sec
[2021-04-24 09:28:09,764 INFO] Step 39150/50000; acc:  91.90; ppl:  1.34; xent: 0.29; lr: 0.00010; 4991/6494 tok/s;  15473 sec
[2021-04-24 09:28:29,283 INFO] Step 39200/50000; acc:  91.67; ppl:  1.35; xent: 0.30; lr: 0.00010; 5202/6356 tok/s;  15493 sec
[2021-04-24 09:28:48,918 INFO] Step 39250/50000; acc:  92.15; ppl:  1.33; xent: 0.29; lr: 0.00010; 5177/6442 tok/s;  15512 sec
[2021-04-24 09:29:08,106 INFO] Step 39300/50000; acc:  92.11; ppl:  1.33; xent: 0.29; lr: 0.00010; 5415/6862 tok/s;  15532 sec
[2021-04-24 09:29:27,139 INFO] Step 39350/50000; acc:  91.72; ppl:  1.35; xent: 0.30; lr: 0.00010; 5258/6548 tok/s;  15551 sec
[2021-04-24 09:29:31,204 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 09:29:46,467 INFO] Step 39400/50000; acc:  92.06; ppl:  1.34; xent: 0.29; lr: 0.00010; 5224/6724 tok/s;  15570 sec
[2021-04-24 09:30:05,953 INFO] Step 39450/50000; acc:  91.90; ppl:  1.35; xent: 0.30; lr: 0.00010; 5275/6587 tok/s;  15589 sec
[2021-04-24 09:30:24,855 INFO] Step 39500/50000; acc:  91.61; ppl:  1.36; xent: 0.31; lr: 0.00010; 5345/6602 tok/s;  15608 sec
[2021-04-24 09:30:44,627 INFO] Step 39550/50000; acc:  91.64; ppl:  1.36; xent: 0.30; lr: 0.00010; 5097/6357 tok/s;  15628 sec
[2021-04-24 09:31:04,563 INFO] Step 39600/50000; acc:  91.94; ppl:  1.34; xent: 0.29; lr: 0.00010; 5067/6452 tok/s;  15648 sec
[2021-04-24 09:31:24,182 INFO] Step 39650/50000; acc:  91.80; ppl:  1.35; xent: 0.30; lr: 0.00010; 5214/6577 tok/s;  15668 sec
[2021-04-24 09:31:43,713 INFO] Step 39700/50000; acc:  91.66; ppl:  1.35; xent: 0.30; lr: 0.00010; 5212/6223 tok/s;  15687 sec
[2021-04-24 09:32:02,716 INFO] Step 39750/50000; acc:  91.78; ppl:  1.35; xent: 0.30; lr: 0.00010; 5390/6642 tok/s;  15706 sec
[2021-04-24 09:32:22,962 INFO] Step 39800/50000; acc:  91.87; ppl:  1.34; xent: 0.29; lr: 0.00010; 5075/6474 tok/s;  15727 sec
[2021-04-24 09:32:42,476 INFO] Step 39850/50000; acc:  92.02; ppl:  1.34; xent: 0.29; lr: 0.00010; 5205/6520 tok/s;  15746 sec
[2021-04-24 09:33:02,224 INFO] Step 39900/50000; acc:  91.81; ppl:  1.35; xent: 0.30; lr: 0.00010; 5176/6444 tok/s;  15766 sec
[2021-04-24 09:33:21,995 INFO] Step 39950/50000; acc:  91.80; ppl:  1.34; xent: 0.30; lr: 0.00010; 5095/6378 tok/s;  15786 sec
[2021-04-24 09:33:40,987 INFO] Step 40000/50000; acc:  92.18; ppl:  1.33; xent: 0.29; lr: 0.00010; 5406/6696 tok/s;  15805 sec
[2021-04-24 09:33:40,988 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/basic/valid.txt, align=None)...
[2021-04-24 09:34:09,213 INFO] Validation perplexity: 1.45077
[2021-04-24 09:34:09,213 INFO] Validation accuracy: 90.4812
[2021-04-24 09:34:09,217 INFO] Saving checkpoint ../models/default_params/basic_ops/model_step_40000.pt
[2021-04-24 09:34:28,609 INFO] Step 40050/50000; acc:  92.05; ppl:  1.33; xent: 0.29; lr: 0.00010; 2092/2693 tok/s;  15852 sec
[2021-04-24 09:34:46,410 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 09:34:48,148 INFO] Step 40100/50000; acc:  91.99; ppl:  1.33; xent: 0.29; lr: 0.00010; 5343/6651 tok/s;  15872 sec
[2021-04-24 09:35:07,079 INFO] Step 40150/50000; acc:  91.83; ppl:  1.35; xent: 0.30; lr: 0.00010; 5302/6625 tok/s;  15891 sec
[2021-04-24 09:35:26,783 INFO] Step 40200/50000; acc:  91.77; ppl:  1.35; xent: 0.30; lr: 0.00010; 5129/6476 tok/s;  15910 sec
[2021-04-24 09:35:45,883 INFO] Step 40250/50000; acc:  91.53; ppl:  1.36; xent: 0.31; lr: 0.00010; 5323/6495 tok/s;  15929 sec
[2021-04-24 09:36:06,546 INFO] Step 40300/50000; acc:  91.92; ppl:  1.34; xent: 0.29; lr: 0.00010; 4919/6335 tok/s;  15950 sec
[2021-04-24 09:36:25,610 INFO] Step 40350/50000; acc:  91.81; ppl:  1.35; xent: 0.30; lr: 0.00010; 5287/6551 tok/s;  15969 sec
[2021-04-24 09:36:45,642 INFO] Step 40400/50000; acc:  92.00; ppl:  1.34; xent: 0.29; lr: 0.00010; 5049/6430 tok/s;  15989 sec
[2021-04-24 09:37:04,878 INFO] Step 40450/50000; acc:  91.88; ppl:  1.34; xent: 0.29; lr: 0.00010; 5415/6479 tok/s;  16008 sec
[2021-04-24 09:37:24,172 INFO] Step 40500/50000; acc:  91.59; ppl:  1.35; xent: 0.30; lr: 0.00010; 5266/6498 tok/s;  16028 sec
[2021-04-24 09:37:44,570 INFO] Step 40550/50000; acc:  92.20; ppl:  1.33; xent: 0.29; lr: 0.00010; 4928/6320 tok/s;  16048 sec
[2021-04-24 09:38:04,967 INFO] Step 40600/50000; acc:  91.87; ppl:  1.34; xent: 0.29; lr: 0.00010; 5069/6315 tok/s;  16069 sec
[2021-04-24 09:38:24,542 INFO] Step 40650/50000; acc:  91.94; ppl:  1.34; xent: 0.29; lr: 0.00010; 5180/6402 tok/s;  16088 sec
[2021-04-24 09:38:44,139 INFO] Step 40700/50000; acc:  92.00; ppl:  1.33; xent: 0.29; lr: 0.00010; 5230/6515 tok/s;  16108 sec
[2021-04-24 09:39:02,823 INFO] Step 40750/50000; acc:  92.11; ppl:  1.33; xent: 0.29; lr: 0.00010; 5379/6821 tok/s;  16126 sec
[2021-04-24 09:39:22,239 INFO] Step 40800/50000; acc:  91.97; ppl:  1.34; xent: 0.29; lr: 0.00010; 5202/6575 tok/s;  16146 sec
[2021-04-24 09:39:34,584 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 09:39:41,538 INFO] Step 40850/50000; acc:  92.04; ppl:  1.33; xent: 0.29; lr: 0.00010; 5244/6655 tok/s;  16165 sec
[2021-04-24 09:40:01,354 INFO] Step 40900/50000; acc:  92.03; ppl:  1.34; xent: 0.29; lr: 0.00010; 5287/6584 tok/s;  16185 sec
[2021-04-24 09:40:19,675 INFO] Step 40950/50000; acc:  91.74; ppl:  1.35; xent: 0.30; lr: 0.00010; 5452/6798 tok/s;  16203 sec
[2021-04-24 09:40:39,782 INFO] Step 41000/50000; acc:  91.75; ppl:  1.35; xent: 0.30; lr: 0.00010; 4980/6275 tok/s;  16223 sec
[2021-04-24 09:40:59,752 INFO] Step 41050/50000; acc:  91.96; ppl:  1.34; xent: 0.29; lr: 0.00010; 5135/6411 tok/s;  16243 sec
[2021-04-24 09:41:19,029 INFO] Step 41100/50000; acc:  91.93; ppl:  1.34; xent: 0.29; lr: 0.00010; 5258/6582 tok/s;  16263 sec
[2021-04-24 09:41:39,103 INFO] Step 41150/50000; acc:  91.90; ppl:  1.34; xent: 0.29; lr: 0.00010; 5055/6236 tok/s;  16283 sec
[2021-04-24 09:41:58,137 INFO] Step 41200/50000; acc:  91.89; ppl:  1.34; xent: 0.29; lr: 0.00010; 5391/6541 tok/s;  16302 sec
[2021-04-24 09:42:18,183 INFO] Step 41250/50000; acc:  91.98; ppl:  1.34; xent: 0.29; lr: 0.00010; 5150/6485 tok/s;  16322 sec
[2021-04-24 09:42:38,207 INFO] Step 41300/50000; acc:  92.29; ppl:  1.32; xent: 0.28; lr: 0.00010; 5020/6461 tok/s;  16342 sec
[2021-04-24 09:42:57,903 INFO] Step 41350/50000; acc:  91.84; ppl:  1.34; xent: 0.30; lr: 0.00010; 5146/6371 tok/s;  16361 sec
[2021-04-24 09:43:17,146 INFO] Step 41400/50000; acc:  91.80; ppl:  1.34; xent: 0.30; lr: 0.00010; 5355/6559 tok/s;  16381 sec
[2021-04-24 09:43:36,462 INFO] Step 41450/50000; acc:  92.21; ppl:  1.33; xent: 0.28; lr: 0.00010; 5271/6535 tok/s;  16400 sec
[2021-04-24 09:43:55,614 INFO] Step 41500/50000; acc:  92.27; ppl:  1.33; xent: 0.28; lr: 0.00010; 5307/6887 tok/s;  16419 sec
[2021-04-24 09:44:14,656 INFO] Step 41550/50000; acc:  91.81; ppl:  1.34; xent: 0.29; lr: 0.00010; 5256/6593 tok/s;  16438 sec
[2021-04-24 09:44:21,407 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 09:44:34,214 INFO] Step 41600/50000; acc:  92.14; ppl:  1.33; xent: 0.29; lr: 0.00010; 5214/6594 tok/s;  16458 sec
[2021-04-24 09:44:52,935 INFO] Step 41650/50000; acc:  92.12; ppl:  1.33; xent: 0.29; lr: 0.00010; 5401/6856 tok/s;  16476 sec
[2021-04-24 09:45:12,020 INFO] Step 41700/50000; acc:  91.59; ppl:  1.36; xent: 0.30; lr: 0.00010; 5440/6565 tok/s;  16496 sec
[2021-04-24 09:45:32,004 INFO] Step 41750/50000; acc:  91.93; ppl:  1.34; xent: 0.29; lr: 0.00010; 5005/6326 tok/s;  16516 sec
[2021-04-24 09:45:51,889 INFO] Step 41800/50000; acc:  91.97; ppl:  1.34; xent: 0.29; lr: 0.00010; 5054/6414 tok/s;  16535 sec
[2021-04-24 09:46:11,026 INFO] Step 41850/50000; acc:  91.99; ppl:  1.34; xent: 0.29; lr: 0.00010; 5334/6625 tok/s;  16555 sec
[2021-04-24 09:46:30,424 INFO] Step 41900/50000; acc:  92.10; ppl:  1.33; xent: 0.29; lr: 0.00010; 5328/6538 tok/s;  16574 sec
[2021-04-24 09:46:49,693 INFO] Step 41950/50000; acc:  91.98; ppl:  1.34; xent: 0.29; lr: 0.00010; 5251/6447 tok/s;  16593 sec
[2021-04-24 09:47:09,199 INFO] Step 42000/50000; acc:  91.88; ppl:  1.34; xent: 0.29; lr: 0.00010; 5205/6498 tok/s;  16613 sec
[2021-04-24 09:47:29,077 INFO] Step 42050/50000; acc:  92.26; ppl:  1.33; xent: 0.28; lr: 0.00010; 5198/6629 tok/s;  16633 sec
[2021-04-24 09:47:48,803 INFO] Step 42100/50000; acc:  92.00; ppl:  1.33; xent: 0.29; lr: 0.00010; 5093/6415 tok/s;  16652 sec
[2021-04-24 09:48:08,077 INFO] Step 42150/50000; acc:  91.98; ppl:  1.33; xent: 0.29; lr: 0.00010; 5272/6459 tok/s;  16672 sec
[2021-04-24 09:48:27,358 INFO] Step 42200/50000; acc:  92.39; ppl:  1.32; xent: 0.28; lr: 0.00010; 5338/6739 tok/s;  16691 sec
[2021-04-24 09:48:46,685 INFO] Step 42250/50000; acc:  92.15; ppl:  1.33; xent: 0.29; lr: 0.00010; 5231/6614 tok/s;  16710 sec
[2021-04-24 09:48:55,308 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 09:49:05,910 INFO] Step 42300/50000; acc:  91.97; ppl:  1.33; xent: 0.29; lr: 0.00010; 5322/6641 tok/s;  16729 sec
[2021-04-24 09:49:25,560 INFO] Step 42350/50000; acc:  92.15; ppl:  1.33; xent: 0.29; lr: 0.00010; 5124/6539 tok/s;  16749 sec
[2021-04-24 09:49:44,459 INFO] Step 42400/50000; acc:  91.98; ppl:  1.34; xent: 0.29; lr: 0.00010; 5363/6636 tok/s;  16768 sec
[2021-04-24 09:50:03,567 INFO] Step 42450/50000; acc:  91.64; ppl:  1.35; xent: 0.30; lr: 0.00010; 5245/6572 tok/s;  16787 sec
[2021-04-24 09:50:23,840 INFO] Step 42500/50000; acc:  92.21; ppl:  1.33; xent: 0.28; lr: 0.00010; 5138/6436 tok/s;  16807 sec
[2021-04-24 09:50:43,198 INFO] Step 42550/50000; acc:  91.75; ppl:  1.34; xent: 0.30; lr: 0.00010; 5168/6445 tok/s;  16827 sec
[2021-04-24 09:51:02,865 INFO] Step 42600/50000; acc:  92.18; ppl:  1.33; xent: 0.28; lr: 0.00010; 5117/6410 tok/s;  16846 sec
[2021-04-24 09:51:22,297 INFO] Step 42650/50000; acc:  91.82; ppl:  1.34; xent: 0.29; lr: 0.00010; 5387/6469 tok/s;  16866 sec
[2021-04-24 09:51:41,640 INFO] Step 42700/50000; acc:  92.08; ppl:  1.34; xent: 0.29; lr: 0.00010; 5255/6624 tok/s;  16885 sec
[2021-04-24 09:52:01,424 INFO] Step 42750/50000; acc:  92.18; ppl:  1.32; xent: 0.28; lr: 0.00010; 5101/6374 tok/s;  16905 sec
[2021-04-24 09:52:21,302 INFO] Step 42800/50000; acc:  92.18; ppl:  1.33; xent: 0.29; lr: 0.00010; 5103/6590 tok/s;  16925 sec
[2021-04-24 09:52:41,090 INFO] Step 42850/50000; acc:  91.98; ppl:  1.33; xent: 0.29; lr: 0.00010; 5206/6404 tok/s;  16945 sec
[2021-04-24 09:53:00,217 INFO] Step 42900/50000; acc:  92.30; ppl:  1.32; xent: 0.28; lr: 0.00010; 5306/6473 tok/s;  16964 sec
[2021-04-24 09:53:19,519 INFO] Step 42950/50000; acc:  92.35; ppl:  1.32; xent: 0.28; lr: 0.00010; 5225/6747 tok/s;  16983 sec
[2021-04-24 09:53:38,706 INFO] Step 43000/50000; acc:  91.93; ppl:  1.34; xent: 0.29; lr: 0.00010; 5334/6596 tok/s;  17002 sec
[2021-04-24 09:53:41,843 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 09:53:57,956 INFO] Step 43050/50000; acc:  92.32; ppl:  1.32; xent: 0.28; lr: 0.00010; 5283/6762 tok/s;  17022 sec
[2021-04-24 09:54:17,707 INFO] Step 43100/50000; acc:  92.06; ppl:  1.34; xent: 0.29; lr: 0.00010; 5185/6502 tok/s;  17041 sec
[2021-04-24 09:54:36,415 INFO] Step 43150/50000; acc:  91.80; ppl:  1.34; xent: 0.29; lr: 0.00010; 5325/6652 tok/s;  17060 sec
[2021-04-24 09:54:56,124 INFO] Step 43200/50000; acc:  91.86; ppl:  1.34; xent: 0.29; lr: 0.00010; 5162/6395 tok/s;  17080 sec
[2021-04-24 09:55:16,245 INFO] Step 43250/50000; acc:  92.07; ppl:  1.33; xent: 0.28; lr: 0.00010; 4994/6364 tok/s;  17100 sec
[2021-04-24 09:55:35,950 INFO] Step 43300/50000; acc:  92.19; ppl:  1.33; xent: 0.28; lr: 0.00010; 5254/6612 tok/s;  17119 sec
[2021-04-24 09:55:55,741 INFO] Step 43350/50000; acc:  91.82; ppl:  1.34; xent: 0.29; lr: 0.00010; 5145/6080 tok/s;  17139 sec
[2021-04-24 09:56:14,895 INFO] Step 43400/50000; acc:  92.10; ppl:  1.33; xent: 0.28; lr: 0.00010; 5301/6625 tok/s;  17158 sec
[2021-04-24 09:56:35,157 INFO] Step 43450/50000; acc:  92.09; ppl:  1.33; xent: 0.29; lr: 0.00010; 5069/6357 tok/s;  17179 sec
[2021-04-24 09:56:54,817 INFO] Step 43500/50000; acc:  92.25; ppl:  1.32; xent: 0.28; lr: 0.00010; 5179/6607 tok/s;  17198 sec
[2021-04-24 09:57:14,128 INFO] Step 43550/50000; acc:  92.07; ppl:  1.33; xent: 0.29; lr: 0.00010; 5221/6482 tok/s;  17218 sec
[2021-04-24 09:57:33,686 INFO] Step 43600/50000; acc:  92.13; ppl:  1.33; xent: 0.28; lr: 0.00010; 5202/6432 tok/s;  17237 sec
[2021-04-24 09:57:53,087 INFO] Step 43650/50000; acc:  92.34; ppl:  1.32; xent: 0.28; lr: 0.00010; 5344/6766 tok/s;  17257 sec
[2021-04-24 09:58:12,476 INFO] Step 43700/50000; acc:  92.21; ppl:  1.32; xent: 0.28; lr: 0.00010; 5152/6512 tok/s;  17276 sec
[2021-04-24 09:58:29,363 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 09:58:31,901 INFO] Step 43750/50000; acc:  92.27; ppl:  1.32; xent: 0.28; lr: 0.00010; 5215/6544 tok/s;  17295 sec
[2021-04-24 09:58:50,924 INFO] Step 43800/50000; acc:  92.15; ppl:  1.33; xent: 0.28; lr: 0.00010; 5398/6692 tok/s;  17314 sec
[2021-04-24 09:59:10,578 INFO] Step 43850/50000; acc:  91.95; ppl:  1.34; xent: 0.29; lr: 0.00010; 5181/6528 tok/s;  17334 sec
[2021-04-24 09:59:29,521 INFO] Step 43900/50000; acc:  91.83; ppl:  1.34; xent: 0.29; lr: 0.00010; 5341/6564 tok/s;  17353 sec
[2021-04-24 09:59:50,380 INFO] Step 43950/50000; acc:  92.13; ppl:  1.33; xent: 0.28; lr: 0.00010; 4809/6223 tok/s;  17374 sec
[2021-04-24 10:00:09,449 INFO] Step 44000/50000; acc:  92.05; ppl:  1.33; xent: 0.29; lr: 0.00010; 5330/6577 tok/s;  17393 sec
[2021-04-24 10:00:29,578 INFO] Step 44050/50000; acc:  92.17; ppl:  1.33; xent: 0.28; lr: 0.00010; 5008/6414 tok/s;  17413 sec
[2021-04-24 10:00:48,679 INFO] Step 44100/50000; acc:  92.06; ppl:  1.33; xent: 0.28; lr: 0.00010; 5516/6451 tok/s;  17432 sec
[2021-04-24 10:01:08,314 INFO] Step 44150/50000; acc:  91.99; ppl:  1.33; xent: 0.29; lr: 0.00010; 5153/6502 tok/s;  17452 sec
[2021-04-24 10:01:28,334 INFO] Step 44200/50000; acc:  92.32; ppl:  1.32; xent: 0.28; lr: 0.00010; 5003/6334 tok/s;  17472 sec
[2021-04-24 10:01:48,948 INFO] Step 44250/50000; acc:  92.26; ppl:  1.32; xent: 0.28; lr: 0.00010; 5005/6274 tok/s;  17492 sec
[2021-04-24 10:02:08,282 INFO] Step 44300/50000; acc:  92.19; ppl:  1.32; xent: 0.28; lr: 0.00010; 5258/6500 tok/s;  17512 sec
[2021-04-24 10:02:27,959 INFO] Step 44350/50000; acc:  92.24; ppl:  1.32; xent: 0.28; lr: 0.00010; 5151/6447 tok/s;  17532 sec
[2021-04-24 10:02:46,548 INFO] Step 44400/50000; acc:  92.33; ppl:  1.32; xent: 0.28; lr: 0.00010; 5445/6876 tok/s;  17550 sec
[2021-04-24 10:03:06,113 INFO] Step 44450/50000; acc:  92.33; ppl:  1.32; xent: 0.28; lr: 0.00010; 5230/6623 tok/s;  17570 sec
[2021-04-24 10:03:17,544 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 10:03:25,279 INFO] Step 44500/50000; acc:  92.15; ppl:  1.33; xent: 0.28; lr: 0.00010; 5276/6691 tok/s;  17589 sec
[2021-04-24 10:03:44,995 INFO] Step 44550/50000; acc:  92.27; ppl:  1.32; xent: 0.28; lr: 0.00010; 5170/6468 tok/s;  17609 sec
[2021-04-24 10:04:03,741 INFO] Step 44600/50000; acc:  91.97; ppl:  1.33; xent: 0.29; lr: 0.00010; 5440/6711 tok/s;  17627 sec
[2021-04-24 10:04:23,739 INFO] Step 44650/50000; acc:  92.02; ppl:  1.33; xent: 0.29; lr: 0.00010; 5053/6353 tok/s;  17647 sec
[2021-04-24 10:04:43,828 INFO] Step 44700/50000; acc:  92.09; ppl:  1.33; xent: 0.28; lr: 0.00010; 5078/6343 tok/s;  17667 sec
[2021-04-24 10:05:02,895 INFO] Step 44750/50000; acc:  92.09; ppl:  1.33; xent: 0.28; lr: 0.00010; 5234/6611 tok/s;  17686 sec
[2021-04-24 10:05:22,885 INFO] Step 44800/50000; acc:  92.19; ppl:  1.32; xent: 0.28; lr: 0.00010; 5142/6347 tok/s;  17706 sec
[2021-04-24 10:05:42,005 INFO] Step 44850/50000; acc:  92.12; ppl:  1.33; xent: 0.28; lr: 0.00010; 5333/6442 tok/s;  17726 sec
[2021-04-24 10:06:02,253 INFO] Step 44900/50000; acc:  92.15; ppl:  1.33; xent: 0.28; lr: 0.00010; 5152/6457 tok/s;  17746 sec
[2021-04-24 10:06:22,046 INFO] Step 44950/50000; acc:  92.41; ppl:  1.31; xent: 0.27; lr: 0.00010; 5071/6550 tok/s;  17766 sec
[2021-04-24 10:06:42,005 INFO] Step 45000/50000; acc:  92.26; ppl:  1.32; xent: 0.28; lr: 0.00010; 5047/6322 tok/s;  17786 sec
[2021-04-24 10:06:42,007 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/basic/valid.txt, align=None)...
[2021-04-24 10:07:10,264 INFO] Validation perplexity: 1.46336
[2021-04-24 10:07:10,265 INFO] Validation accuracy: 90.3797
[2021-04-24 10:07:10,269 INFO] Saving checkpoint ../models/default_params/basic_ops/model_step_45000.pt
[2021-04-24 10:07:29,421 INFO] Step 45050/50000; acc:  92.10; ppl:  1.33; xent: 0.29; lr: 0.00010; 2165/2633 tok/s;  17833 sec
[2021-04-24 10:07:48,919 INFO] Step 45100/50000; acc:  92.48; ppl:  1.31; xent: 0.27; lr: 0.00010; 5246/6527 tok/s;  17852 sec
[2021-04-24 10:08:08,135 INFO] Step 45150/50000; acc:  92.46; ppl:  1.31; xent: 0.27; lr: 0.00010; 5223/6805 tok/s;  17872 sec
[2021-04-24 10:08:26,972 INFO] Step 45200/50000; acc:  92.04; ppl:  1.33; xent: 0.29; lr: 0.00010; 5367/6606 tok/s;  17891 sec
[2021-04-24 10:08:33,057 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 10:08:46,763 INFO] Step 45250/50000; acc:  92.46; ppl:  1.31; xent: 0.27; lr: 0.00010; 5216/6688 tok/s;  17910 sec
[2021-04-24 10:09:05,639 INFO] Step 45300/50000; acc:  92.21; ppl:  1.33; xent: 0.28; lr: 0.00010; 5347/6778 tok/s;  17929 sec
[2021-04-24 10:09:24,497 INFO] Step 45350/50000; acc:  91.78; ppl:  1.34; xent: 0.29; lr: 0.00010; 5345/6466 tok/s;  17948 sec
[2021-04-24 10:09:44,555 INFO] Step 45400/50000; acc:  92.12; ppl:  1.33; xent: 0.28; lr: 0.00010; 5096/6378 tok/s;  17968 sec
[2021-04-24 10:10:04,783 INFO] Step 45450/50000; acc:  92.24; ppl:  1.32; xent: 0.28; lr: 0.00010; 5016/6331 tok/s;  17988 sec
[2021-04-24 10:10:23,882 INFO] Step 45500/50000; acc:  92.10; ppl:  1.33; xent: 0.28; lr: 0.00010; 5317/6598 tok/s;  18007 sec
[2021-04-24 10:10:43,358 INFO] Step 45550/50000; acc:  92.25; ppl:  1.32; xent: 0.28; lr: 0.00010; 5247/6470 tok/s;  18027 sec
[2021-04-24 10:11:02,618 INFO] Step 45600/50000; acc:  92.18; ppl:  1.32; xent: 0.28; lr: 0.00010; 5296/6534 tok/s;  18046 sec
[2021-04-24 10:11:22,211 INFO] Step 45650/50000; acc:  92.13; ppl:  1.33; xent: 0.28; lr: 0.00010; 5151/6426 tok/s;  18066 sec
[2021-04-24 10:11:42,229 INFO] Step 45700/50000; acc:  92.58; ppl:  1.31; xent: 0.27; lr: 0.00010; 5225/6674 tok/s;  18086 sec
[2021-04-24 10:12:01,843 INFO] Step 45750/50000; acc:  92.14; ppl:  1.33; xent: 0.28; lr: 0.00010; 5106/6348 tok/s;  18105 sec
[2021-04-24 10:12:21,062 INFO] Step 45800/50000; acc:  92.27; ppl:  1.32; xent: 0.28; lr: 0.00010; 5259/6497 tok/s;  18125 sec
[2021-04-24 10:12:40,310 INFO] Step 45850/50000; acc:  92.55; ppl:  1.31; xent: 0.27; lr: 0.00010; 5337/6756 tok/s;  18144 sec
[2021-04-24 10:12:59,581 INFO] Step 45900/50000; acc:  92.38; ppl:  1.32; xent: 0.28; lr: 0.00010; 5254/6647 tok/s;  18163 sec
[2021-04-24 10:13:07,326 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 10:13:18,677 INFO] Step 45950/50000; acc:  92.26; ppl:  1.32; xent: 0.28; lr: 0.00010; 5304/6709 tok/s;  18182 sec
[2021-04-24 10:13:38,055 INFO] Step 46000/50000; acc:  92.40; ppl:  1.32; xent: 0.28; lr: 0.00010; 5233/6556 tok/s;  18202 sec
[2021-04-24 10:13:57,426 INFO] Step 46050/50000; acc:  92.21; ppl:  1.33; xent: 0.28; lr: 0.00010; 5303/6602 tok/s;  18221 sec
[2021-04-24 10:14:16,542 INFO] Step 46100/50000; acc:  91.94; ppl:  1.34; xent: 0.29; lr: 0.00010; 5239/6523 tok/s;  18240 sec
[2021-04-24 10:14:36,861 INFO] Step 46150/50000; acc:  92.38; ppl:  1.32; xent: 0.28; lr: 0.00010; 4985/6298 tok/s;  18260 sec
[2021-04-24 10:14:56,744 INFO] Step 46200/50000; acc:  92.17; ppl:  1.33; xent: 0.28; lr: 0.00010; 5136/6404 tok/s;  18280 sec
[2021-04-24 10:15:16,540 INFO] Step 46250/50000; acc:  92.39; ppl:  1.32; xent: 0.27; lr: 0.00010; 5136/6358 tok/s;  18300 sec
[2021-04-24 10:15:35,694 INFO] Step 46300/50000; acc:  92.02; ppl:  1.33; xent: 0.28; lr: 0.00010; 5427/6539 tok/s;  18319 sec
[2021-04-24 10:15:54,891 INFO] Step 46350/50000; acc:  92.17; ppl:  1.32; xent: 0.28; lr: 0.00010; 5233/6623 tok/s;  18338 sec
[2021-04-24 10:16:14,603 INFO] Step 46400/50000; acc:  92.47; ppl:  1.31; xent: 0.27; lr: 0.00010; 5171/6458 tok/s;  18358 sec
[2021-04-24 10:16:34,595 INFO] Step 46450/50000; acc:  92.34; ppl:  1.32; xent: 0.28; lr: 0.00010; 5040/6503 tok/s;  18378 sec
[2021-04-24 10:16:54,615 INFO] Step 46500/50000; acc:  92.25; ppl:  1.32; xent: 0.28; lr: 0.00010; 5213/6375 tok/s;  18398 sec
[2021-04-24 10:17:13,697 INFO] Step 46550/50000; acc:  92.45; ppl:  1.31; xent: 0.27; lr: 0.00010; 5300/6492 tok/s;  18417 sec
[2021-04-24 10:17:33,091 INFO] Step 46600/50000; acc:  92.66; ppl:  1.30; xent: 0.26; lr: 0.00010; 5164/6670 tok/s;  18437 sec
[2021-04-24 10:17:52,249 INFO] Step 46650/50000; acc:  92.21; ppl:  1.32; xent: 0.28; lr: 0.00010; 5343/6710 tok/s;  18456 sec
[2021-04-24 10:17:54,774 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 10:18:11,685 INFO] Step 46700/50000; acc:  92.47; ppl:  1.31; xent: 0.27; lr: 0.00010; 5242/6661 tok/s;  18475 sec
[2021-04-24 10:18:31,033 INFO] Step 46750/50000; acc:  92.30; ppl:  1.32; xent: 0.28; lr: 0.00010; 5239/6550 tok/s;  18495 sec
[2021-04-24 10:18:50,023 INFO] Step 46800/50000; acc:  91.97; ppl:  1.33; xent: 0.29; lr: 0.00010; 5278/6549 tok/s;  18514 sec
[2021-04-24 10:19:10,293 INFO] Step 46850/50000; acc:  92.29; ppl:  1.32; xent: 0.28; lr: 0.00010; 5081/6380 tok/s;  18534 sec
[2021-04-24 10:19:30,181 INFO] Step 46900/50000; acc:  92.07; ppl:  1.32; xent: 0.28; lr: 0.00010; 5060/6281 tok/s;  18554 sec
[2021-04-24 10:19:50,210 INFO] Step 46950/50000; acc:  92.46; ppl:  1.31; xent: 0.27; lr: 0.00010; 5015/6474 tok/s;  18574 sec
[2021-04-24 10:20:09,481 INFO] Step 47000/50000; acc:  92.06; ppl:  1.33; xent: 0.28; lr: 0.00010; 5404/6373 tok/s;  18593 sec
[2021-04-24 10:20:28,637 INFO] Step 47050/50000; acc:  92.25; ppl:  1.32; xent: 0.28; lr: 0.00010; 5343/6583 tok/s;  18612 sec
[2021-04-24 10:20:49,068 INFO] Step 47100/50000; acc:  92.44; ppl:  1.31; xent: 0.27; lr: 0.00010; 4997/6262 tok/s;  18633 sec
[2021-04-24 10:21:08,465 INFO] Step 47150/50000; acc:  92.47; ppl:  1.31; xent: 0.27; lr: 0.00010; 5186/6716 tok/s;  18652 sec
[2021-04-24 10:21:28,220 INFO] Step 47200/50000; acc:  92.31; ppl:  1.32; xent: 0.28; lr: 0.00010; 5151/6321 tok/s;  18672 sec
[2021-04-24 10:21:47,831 INFO] Step 47250/50000; acc:  92.39; ppl:  1.31; xent: 0.27; lr: 0.00010; 5164/6424 tok/s;  18691 sec
[2021-04-24 10:22:07,221 INFO] Step 47300/50000; acc:  92.67; ppl:  1.30; xent: 0.27; lr: 0.00010; 5394/6815 tok/s;  18711 sec
[2021-04-24 10:22:26,581 INFO] Step 47350/50000; acc:  92.49; ppl:  1.31; xent: 0.27; lr: 0.00010; 5148/6531 tok/s;  18730 sec
[2021-04-24 10:22:42,558 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 10:22:45,871 INFO] Step 47400/50000; acc:  92.41; ppl:  1.31; xent: 0.27; lr: 0.00010; 5230/6565 tok/s;  18749 sec
[2021-04-24 10:23:05,017 INFO] Step 47450/50000; acc:  92.33; ppl:  1.32; xent: 0.28; lr: 0.00010; 5358/6656 tok/s;  18769 sec
[2021-04-24 10:23:24,566 INFO] Step 47500/50000; acc:  92.15; ppl:  1.33; xent: 0.28; lr: 0.00010; 5211/6534 tok/s;  18788 sec
[2021-04-24 10:23:44,049 INFO] Step 47550/50000; acc:  92.21; ppl:  1.32; xent: 0.28; lr: 0.00010; 5140/6334 tok/s;  18808 sec
[2021-04-24 10:24:04,239 INFO] Step 47600/50000; acc:  92.35; ppl:  1.31; xent: 0.27; lr: 0.00010; 5011/6480 tok/s;  18828 sec
[2021-04-24 10:24:24,057 INFO] Step 47650/50000; acc:  92.24; ppl:  1.32; xent: 0.28; lr: 0.00010; 5187/6441 tok/s;  18848 sec
[2021-04-24 10:24:44,265 INFO] Step 47700/50000; acc:  92.34; ppl:  1.32; xent: 0.27; lr: 0.00010; 4995/6292 tok/s;  18868 sec
[2021-04-24 10:25:02,902 INFO] Step 47750/50000; acc:  92.39; ppl:  1.31; xent: 0.27; lr: 0.00010; 5503/6514 tok/s;  18886 sec
[2021-04-24 10:25:22,486 INFO] Step 47800/50000; acc:  92.15; ppl:  1.32; xent: 0.28; lr: 0.00010; 5267/6584 tok/s;  18906 sec
[2021-04-24 10:25:42,953 INFO] Step 47850/50000; acc:  92.64; ppl:  1.30; xent: 0.26; lr: 0.00010; 4945/6288 tok/s;  18926 sec
[2021-04-24 10:26:03,241 INFO] Step 47900/50000; acc:  92.47; ppl:  1.31; xent: 0.27; lr: 0.00010; 5051/6359 tok/s;  18947 sec
[2021-04-24 10:26:22,428 INFO] Step 47950/50000; acc:  92.40; ppl:  1.31; xent: 0.27; lr: 0.00010; 5229/6445 tok/s;  18966 sec
[2021-04-24 10:26:41,906 INFO] Step 48000/50000; acc:  92.45; ppl:  1.31; xent: 0.27; lr: 0.00010; 5248/6556 tok/s;  18985 sec
[2021-04-24 10:27:00,504 INFO] Step 48050/50000; acc:  92.53; ppl:  1.31; xent: 0.27; lr: 0.00010; 5413/6843 tok/s;  19004 sec
[2021-04-24 10:27:20,147 INFO] Step 48100/50000; acc:  92.52; ppl:  1.31; xent: 0.27; lr: 0.00010; 5277/6662 tok/s;  19024 sec
[2021-04-24 10:27:30,641 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 10:27:39,136 INFO] Step 48150/50000; acc:  92.38; ppl:  1.31; xent: 0.27; lr: 0.00010; 5303/6714 tok/s;  19043 sec
[2021-04-24 10:27:58,814 INFO] Step 48200/50000; acc:  92.49; ppl:  1.31; xent: 0.27; lr: 0.00010; 5159/6506 tok/s;  19062 sec
[2021-04-24 10:28:17,675 INFO] Step 48250/50000; acc:  92.23; ppl:  1.32; xent: 0.28; lr: 0.00010; 5397/6637 tok/s;  19081 sec
[2021-04-24 10:28:37,604 INFO] Step 48300/50000; acc:  92.23; ppl:  1.32; xent: 0.28; lr: 0.00010; 5078/6370 tok/s;  19101 sec
[2021-04-24 10:28:57,970 INFO] Step 48350/50000; acc:  92.43; ppl:  1.31; xent: 0.27; lr: 0.00010; 4947/6260 tok/s;  19122 sec
[2021-04-24 10:29:16,905 INFO] Step 48400/50000; acc:  92.31; ppl:  1.32; xent: 0.28; lr: 0.00010; 5316/6675 tok/s;  19140 sec
[2021-04-24 10:29:36,884 INFO] Step 48450/50000; acc:  92.55; ppl:  1.31; xent: 0.27; lr: 0.00010; 5210/6392 tok/s;  19160 sec
[2021-04-24 10:29:55,685 INFO] Step 48500/50000; acc:  92.30; ppl:  1.31; xent: 0.27; lr: 0.00010; 5424/6543 tok/s;  19179 sec
[2021-04-24 10:30:16,077 INFO] Step 48550/50000; acc:  92.32; ppl:  1.32; xent: 0.27; lr: 0.00010; 4973/6295 tok/s;  19200 sec
[2021-04-24 10:30:35,675 INFO] Step 48600/50000; acc:  92.62; ppl:  1.30; xent: 0.26; lr: 0.00010; 5238/6640 tok/s;  19219 sec
[2021-04-24 10:30:55,322 INFO] Step 48650/50000; acc:  92.42; ppl:  1.31; xent: 0.27; lr: 0.00010; 5166/6535 tok/s;  19239 sec
[2021-04-24 10:31:14,539 INFO] Step 48700/50000; acc:  92.29; ppl:  1.32; xent: 0.28; lr: 0.00010; 5319/6435 tok/s;  19258 sec
[2021-04-24 10:31:33,753 INFO] Step 48750/50000; acc:  92.59; ppl:  1.30; xent: 0.26; lr: 0.00010; 5247/6582 tok/s;  19277 sec
[2021-04-24 10:31:52,920 INFO] Step 48800/50000; acc:  92.65; ppl:  1.30; xent: 0.26; lr: 0.00010; 5275/6851 tok/s;  19296 sec
[2021-04-24 10:32:11,597 INFO] Step 48850/50000; acc:  92.24; ppl:  1.32; xent: 0.27; lr: 0.00010; 5405/6736 tok/s;  19315 sec
[2021-04-24 10:32:17,055 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 10:32:31,410 INFO] Step 48900/50000; acc:  92.70; ppl:  1.30; xent: 0.26; lr: 0.00010; 5271/6678 tok/s;  19335 sec
[2021-04-24 10:32:50,364 INFO] Step 48950/50000; acc:  92.38; ppl:  1.32; xent: 0.27; lr: 0.00010; 5300/6665 tok/s;  19354 sec
[2021-04-24 10:33:09,485 INFO] Step 49000/50000; acc:  92.10; ppl:  1.32; xent: 0.28; lr: 0.00010; 5243/6408 tok/s;  19373 sec
[2021-04-24 10:33:29,316 INFO] Step 49050/50000; acc:  92.36; ppl:  1.31; xent: 0.27; lr: 0.00010; 5157/6431 tok/s;  19393 sec
[2021-04-24 10:33:49,474 INFO] Step 49100/50000; acc:  92.50; ppl:  1.31; xent: 0.27; lr: 0.00010; 5035/6404 tok/s;  19413 sec
[2021-04-24 10:34:08,699 INFO] Step 49150/50000; acc:  92.31; ppl:  1.32; xent: 0.27; lr: 0.00010; 5227/6487 tok/s;  19432 sec
[2021-04-24 10:34:28,112 INFO] Step 49200/50000; acc:  92.41; ppl:  1.31; xent: 0.27; lr: 0.00010; 5307/6518 tok/s;  19452 sec
[2021-04-24 10:34:47,556 INFO] Step 49250/50000; acc:  92.48; ppl:  1.31; xent: 0.27; lr: 0.00010; 5307/6608 tok/s;  19471 sec
[2021-04-24 10:35:07,267 INFO] Step 49300/50000; acc:  92.43; ppl:  1.31; xent: 0.27; lr: 0.00010; 5126/6320 tok/s;  19491 sec
[2021-04-24 10:35:27,485 INFO] Step 49350/50000; acc:  92.68; ppl:  1.30; xent: 0.26; lr: 0.00010; 5023/6516 tok/s;  19511 sec
[2021-04-24 10:35:47,004 INFO] Step 49400/50000; acc:  92.28; ppl:  1.32; xent: 0.27; lr: 0.00010; 5240/6451 tok/s;  19531 sec
[2021-04-24 10:36:06,124 INFO] Step 49450/50000; acc:  92.59; ppl:  1.30; xent: 0.26; lr: 0.00010; 5342/6543 tok/s;  19550 sec
[2021-04-24 10:36:25,446 INFO] Step 49500/50000; acc:  92.77; ppl:  1.30; xent: 0.26; lr: 0.00010; 5290/6682 tok/s;  19569 sec
[2021-04-24 10:36:44,864 INFO] Step 49550/50000; acc:  92.57; ppl:  1.30; xent: 0.27; lr: 0.00010; 5135/6580 tok/s;  19588 sec
[2021-04-24 10:36:51,875 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-24 10:37:04,081 INFO] Step 49600/50000; acc:  92.50; ppl:  1.31; xent: 0.27; lr: 0.00010; 5332/6740 tok/s;  19608 sec
[2021-04-24 10:37:23,404 INFO] Step 49650/50000; acc:  92.56; ppl:  1.31; xent: 0.27; lr: 0.00010; 5219/6472 tok/s;  19627 sec
[2021-04-24 10:37:42,804 INFO] Step 49700/50000; acc:  92.42; ppl:  1.32; xent: 0.27; lr: 0.00010; 5337/6663 tok/s;  19646 sec
[2021-04-24 10:38:01,733 INFO] Step 49750/50000; acc:  92.09; ppl:  1.32; xent: 0.28; lr: 0.00010; 5289/6535 tok/s;  19665 sec
[2021-04-24 10:38:22,102 INFO] Step 49800/50000; acc:  92.66; ppl:  1.30; xent: 0.26; lr: 0.00010; 4944/6294 tok/s;  19686 sec
[2021-04-24 10:38:41,880 INFO] Step 49850/50000; acc:  92.37; ppl:  1.31; xent: 0.27; lr: 0.00010; 5155/6385 tok/s;  19705 sec
[2021-04-24 10:39:01,649 INFO] Step 49900/50000; acc:  92.55; ppl:  1.30; xent: 0.27; lr: 0.00010; 5165/6461 tok/s;  19725 sec
[2021-04-24 10:39:20,804 INFO] Step 49950/50000; acc:  92.37; ppl:  1.31; xent: 0.27; lr: 0.00010; 5362/6467 tok/s;  19744 sec
[2021-04-24 10:39:40,413 INFO] Step 50000/50000; acc:  92.48; ppl:  1.31; xent: 0.27; lr: 0.00005; 5159/6591 tok/s;  19764 sec
[2021-04-24 10:39:40,415 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/basic/valid.txt, align=None)...
[2021-04-24 10:40:08,515 INFO] Validation perplexity: 1.46823
[2021-04-24 10:40:08,515 INFO] Validation accuracy: 90.4511
[2021-04-24 10:40:08,519 INFO] Saving checkpoint ../models/default_params/basic_ops/model_step_50000.pt

Train on strictly condensed EditOperations:

modelDefaultStrict = HephaestusModel(MODEL_DEFAULT_STRICT)
modelDefaultStrict.train(
    DATA_SMALL_METHODS_TRAIN_BUGGY,
    DATA_SMALL_OPS_GENERAL_STRICT_TRAIN,
    DATA_SMALL_METHODS_VALID_BUGGY,
    DATA_SMALL_OPS_GENERAL_STRICT_VALID
)
[2021-04-23 03:45:17,313 INFO] Counter vocab from -1 samples.
[2021-04-23 03:45:17,313 INFO] n_sample=-1: Build vocab on full datasets.
[2021-04-23 03:45:17,318 INFO] corpus_1's transforms: TransformPipe()
[2021-04-23 03:45:17,319 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 03:45:17,920 INFO] Counters src:429
[2021-04-23 03:45:17,920 INFO] Counters tgt:444
[2021-04-23 03:45:17,920 WARNING] path ../models/default_params/strict_ops/save_data.vocab.src exists, may overwrite...
[2021-04-23 03:45:17,922 WARNING] path ../models/default_params/strict_ops/save_data.vocab.tgt exists, may overwrite...
[2021-04-23 03:45:18,564 INFO] Parsed 2 corpora from -data.
[2021-04-23 03:45:18,565 INFO] Get special vocabs from Transforms: {'src': set(), 'tgt': set()}.
[2021-04-23 03:45:18,565 INFO] Loading vocab from text file...
[2021-04-23 03:45:18,565 INFO] Loading src vocabulary from ../models/default_params/strict_ops/save_data.vocab.src
[2021-04-23 03:45:18,567 INFO] Loaded src vocab has 429 tokens.
[2021-04-23 03:45:18,567 INFO] Loading tgt vocabulary from ../models/default_params/strict_ops/save_data.vocab.tgt
[2021-04-23 03:45:18,568 INFO] Loaded tgt vocab has 444 tokens.
[2021-04-23 03:45:18,569 INFO] Building fields with vocab in counters...
[2021-04-23 03:45:18,569 INFO]  * tgt vocab size: 448.
[2021-04-23 03:45:18,570 INFO]  * src vocab size: 431.
[2021-04-23 03:45:18,570 INFO]  * src vocab size = 431
[2021-04-23 03:45:18,570 INFO]  * tgt vocab size = 448
[2021-04-23 03:45:18,571 INFO] Building model...
[2021-04-23 03:45:19,710 INFO] NMTModel(
  (encoder): RNNEncoder(
    (embeddings): Embeddings(
      (make_embedding): Sequential(
        (emb_luts): Elementwise(
          (0): Embedding(431, 512, padding_idx=1)
        )
      )
    )
    (rnn): LSTM(512, 256, num_layers=2, dropout=0.2)
  )
  (decoder): InputFeedRNNDecoder(
    (embeddings): Embeddings(
      (make_embedding): Sequential(
        (emb_luts): Elementwise(
          (0): Embedding(448, 512, padding_idx=1)
        )
      )
    )
    (dropout): Dropout(p=0.2, inplace=False)
    (rnn): StackedLSTM(
      (dropout): Dropout(p=0.2, inplace=False)
      (layers): ModuleList(
        (0): LSTMCell(768, 256)
        (1): LSTMCell(256, 256)
      )
    )
    (attn): GlobalAttention(
      (linear_context): Linear(in_features=256, out_features=256, bias=False)
      (linear_query): Linear(in_features=256, out_features=256, bias=True)
      (v): Linear(in_features=256, out_features=1, bias=False)
      (linear_out): Linear(in_features=512, out_features=256, bias=True)
    )
  )
  (generator): Sequential(
    (0): Linear(in_features=256, out_features=448, bias=True)
    (1): Cast()
    (2): LogSoftmax(dim=-1)
  )
)
[2021-04-23 03:45:19,710 INFO] encoder: 1535488
[2021-04-23 03:45:19,710 INFO] decoder: 2184384
[2021-04-23 03:45:19,710 INFO] * number of parameters: 3719872
[2021-04-23 03:45:19,711 INFO] Starting training on GPU: [0]
[2021-04-23 03:45:19,711 INFO] Start training loop and validate every 5000 steps...
[2021-04-23 03:45:19,712 INFO] corpus_1's transforms: TransformPipe()
[2021-04-23 03:45:19,712 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 03:45:30,415 INFO] Step 50/50000; acc:  17.67; ppl: 150.77; xent: 5.02; lr: 0.00010; 9403/3978 tok/s;     11 sec
[2021-04-23 03:45:41,158 INFO] Step 100/50000; acc:  25.13; ppl: 35.43; xent: 3.57; lr: 0.00010; 9375/3954 tok/s;     21 sec
[2021-04-23 03:45:52,431 INFO] Step 150/50000; acc:  29.05; ppl: 25.91; xent: 3.25; lr: 0.00010; 9071/3897 tok/s;     33 sec
[2021-04-23 03:46:03,153 INFO] Step 200/50000; acc:  41.06; ppl: 17.10; xent: 2.84; lr: 0.00010; 9525/3999 tok/s;     43 sec
[2021-04-23 03:46:13,265 INFO] Step 250/50000; acc:  45.54; ppl: 10.78; xent: 2.38; lr: 0.00010; 10066/4157 tok/s;     54 sec
[2021-04-23 03:46:23,765 INFO] Step 300/50000; acc:  45.68; ppl:  9.80; xent: 2.28; lr: 0.00010; 9532/4026 tok/s;     64 sec
[2021-04-23 03:46:34,984 INFO] Step 350/50000; acc:  46.15; ppl:  9.28; xent: 2.23; lr: 0.00010; 9322/3831 tok/s;     75 sec
[2021-04-23 03:46:45,877 INFO] Step 400/50000; acc:  46.47; ppl:  8.87; xent: 2.18; lr: 0.00010; 9139/3894 tok/s;     86 sec
[2021-04-23 03:46:56,697 INFO] Step 450/50000; acc:  46.79; ppl:  8.72; xent: 2.17; lr: 0.00010; 9560/3950 tok/s;     97 sec
[2021-04-23 03:47:07,338 INFO] Step 500/50000; acc:  46.94; ppl:  8.60; xent: 2.15; lr: 0.00010; 9654/4029 tok/s;    108 sec
[2021-04-23 03:47:18,090 INFO] Step 550/50000; acc:  47.40; ppl:  8.47; xent: 2.14; lr: 0.00010; 9272/3992 tok/s;    118 sec
[2021-04-23 03:47:29,496 INFO] Step 600/50000; acc:  48.51; ppl:  8.05; xent: 2.09; lr: 0.00010; 9058/3778 tok/s;    130 sec
[2021-04-23 03:47:39,745 INFO] Step 650/50000; acc:  49.03; ppl:  7.91; xent: 2.07; lr: 0.00010; 9814/4042 tok/s;    140 sec
[2021-04-23 03:47:50,478 INFO] Step 700/50000; acc:  49.63; ppl:  7.68; xent: 2.04; lr: 0.00010; 9716/4032 tok/s;    151 sec
[2021-04-23 03:47:51,154 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 03:48:00,896 INFO] Step 750/50000; acc:  49.63; ppl:  7.65; xent: 2.04; lr: 0.00010; 9358/4168 tok/s;    161 sec
[2021-04-23 03:48:11,625 INFO] Step 800/50000; acc:  50.52; ppl:  7.37; xent: 2.00; lr: 0.00010; 9723/3889 tok/s;    172 sec
[2021-04-23 03:48:22,688 INFO] Step 850/50000; acc:  50.28; ppl:  7.24; xent: 1.98; lr: 0.00010; 8976/3918 tok/s;    183 sec
[2021-04-23 03:48:33,706 INFO] Step 900/50000; acc:  50.75; ppl:  7.06; xent: 1.95; lr: 0.00010; 9195/3951 tok/s;    194 sec
[2021-04-23 03:48:43,912 INFO] Step 950/50000; acc:  51.09; ppl:  6.94; xent: 1.94; lr: 0.00010; 10044/4137 tok/s;    204 sec
[2021-04-23 03:48:54,461 INFO] Step 1000/50000; acc:  51.75; ppl:  6.65; xent: 1.90; lr: 0.00010; 9643/4051 tok/s;    215 sec
[2021-04-23 03:49:05,186 INFO] Step 1050/50000; acc:  51.86; ppl:  6.59; xent: 1.88; lr: 0.00010; 9655/3922 tok/s;    225 sec
[2021-04-23 03:49:16,499 INFO] Step 1100/50000; acc:  52.13; ppl:  6.47; xent: 1.87; lr: 0.00010; 8889/3829 tok/s;    237 sec
[2021-04-23 03:49:27,320 INFO] Step 1150/50000; acc:  52.72; ppl:  6.27; xent: 1.84; lr: 0.00010; 9496/3936 tok/s;    248 sec
[2021-04-23 03:49:38,247 INFO] Step 1200/50000; acc:  52.55; ppl:  6.22; xent: 1.83; lr: 0.00010; 9132/3914 tok/s;    259 sec
[2021-04-23 03:49:48,585 INFO] Step 1250/50000; acc:  52.81; ppl:  6.25; xent: 1.83; lr: 0.00010; 10000/4130 tok/s;    269 sec
[2021-04-23 03:49:59,550 INFO] Step 1300/50000; acc:  53.30; ppl:  5.95; xent: 1.78; lr: 0.00010; 9401/3874 tok/s;    280 sec
[2021-04-23 03:50:10,048 INFO] Step 1350/50000; acc:  53.47; ppl:  5.92; xent: 1.78; lr: 0.00010; 9545/4059 tok/s;    290 sec
[2021-04-23 03:50:20,494 INFO] Step 1400/50000; acc:  54.08; ppl:  5.76; xent: 1.75; lr: 0.00010; 9729/4060 tok/s;    301 sec
[2021-04-23 03:50:28,967 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 03:50:31,149 INFO] Step 1450/50000; acc:  54.08; ppl:  5.68; xent: 1.74; lr: 0.00010; 9517/4039 tok/s;    311 sec
[2021-04-23 03:50:42,000 INFO] Step 1500/50000; acc:  54.33; ppl:  5.65; xent: 1.73; lr: 0.00010; 9626/3951 tok/s;    322 sec
[2021-04-23 03:50:52,528 INFO] Step 1550/50000; acc:  54.54; ppl:  5.55; xent: 1.71; lr: 0.00010; 9265/4008 tok/s;    333 sec
[2021-04-23 03:51:03,917 INFO] Step 1600/50000; acc:  54.28; ppl:  5.60; xent: 1.72; lr: 0.00010; 9048/3860 tok/s;    344 sec
[2021-04-23 03:51:14,367 INFO] Step 1650/50000; acc:  54.64; ppl:  5.49; xent: 1.70; lr: 0.00010; 9587/4088 tok/s;    355 sec
[2021-04-23 03:51:25,026 INFO] Step 1700/50000; acc:  55.18; ppl:  5.38; xent: 1.68; lr: 0.00010; 9480/4006 tok/s;    365 sec
[2021-04-23 03:51:35,684 INFO] Step 1750/50000; acc:  55.26; ppl:  5.33; xent: 1.67; lr: 0.00010; 9666/4018 tok/s;    376 sec
[2021-04-23 03:51:46,443 INFO] Step 1800/50000; acc:  55.75; ppl:  5.26; xent: 1.66; lr: 0.00010; 9605/3866 tok/s;    387 sec
[2021-04-23 03:51:57,894 INFO] Step 1850/50000; acc:  55.47; ppl:  5.26; xent: 1.66; lr: 0.00010; 8996/3825 tok/s;    398 sec
[2021-04-23 03:52:08,024 INFO] Step 1900/50000; acc:  56.11; ppl:  5.11; xent: 1.63; lr: 0.00010; 9788/4148 tok/s;    408 sec
[2021-04-23 03:52:19,121 INFO] Step 1950/50000; acc:  55.95; ppl:  5.18; xent: 1.64; lr: 0.00010; 9302/3894 tok/s;    419 sec
[2021-04-23 03:52:29,881 INFO] Step 2000/50000; acc:  56.06; ppl:  5.17; xent: 1.64; lr: 0.00010; 9250/3934 tok/s;    430 sec
[2021-04-23 03:52:41,251 INFO] Step 2050/50000; acc:  56.11; ppl:  5.06; xent: 1.62; lr: 0.00010; 9155/3756 tok/s;    442 sec
[2021-04-23 03:52:51,510 INFO] Step 2100/50000; acc:  56.82; ppl:  4.98; xent: 1.61; lr: 0.00010; 9957/4141 tok/s;    452 sec
[2021-04-23 03:53:02,186 INFO] Step 2150/50000; acc:  56.79; ppl:  4.91; xent: 1.59; lr: 0.00010; 9294/4016 tok/s;    462 sec
[2021-04-23 03:53:07,741 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 03:53:12,894 INFO] Step 2200/50000; acc:  57.12; ppl:  4.90; xent: 1.59; lr: 0.00010; 9618/3993 tok/s;    473 sec
[2021-04-23 03:53:23,634 INFO] Step 2250/50000; acc:  57.33; ppl:  4.88; xent: 1.59; lr: 0.00010; 9462/3945 tok/s;    484 sec
[2021-04-23 03:53:35,491 INFO] Step 2300/50000; acc:  57.07; ppl:  4.88; xent: 1.59; lr: 0.00010; 8774/3661 tok/s;    496 sec
[2021-04-23 03:53:45,737 INFO] Step 2350/50000; acc:  57.47; ppl:  4.79; xent: 1.57; lr: 0.00010; 9452/4219 tok/s;    506 sec
[2021-04-23 03:53:56,773 INFO] Step 2400/50000; acc:  57.24; ppl:  4.86; xent: 1.58; lr: 0.00010; 9397/3884 tok/s;    517 sec
[2021-04-23 03:54:06,841 INFO] Step 2450/50000; acc:  57.93; ppl:  4.74; xent: 1.56; lr: 0.00010; 9903/4223 tok/s;    527 sec
[2021-04-23 03:54:17,496 INFO] Step 2500/50000; acc:  57.91; ppl:  4.76; xent: 1.56; lr: 0.00010; 9613/3967 tok/s;    538 sec
[2021-04-23 03:54:28,770 INFO] Step 2550/50000; acc:  57.95; ppl:  4.71; xent: 1.55; lr: 0.00010; 9205/3769 tok/s;    549 sec
[2021-04-23 03:54:39,947 INFO] Step 2600/50000; acc:  58.00; ppl:  4.69; xent: 1.54; lr: 0.00010; 9158/3891 tok/s;    560 sec
[2021-04-23 03:54:50,637 INFO] Step 2650/50000; acc:  58.05; ppl:  4.65; xent: 1.54; lr: 0.00010; 9583/3916 tok/s;    571 sec
[2021-04-23 03:55:01,134 INFO] Step 2700/50000; acc:  58.58; ppl:  4.65; xent: 1.54; lr: 0.00010; 9494/4059 tok/s;    581 sec
[2021-04-23 03:55:12,474 INFO] Step 2750/50000; acc:  58.50; ppl:  4.66; xent: 1.54; lr: 0.00010; 9054/3809 tok/s;    593 sec
[2021-04-23 03:55:23,072 INFO] Step 2800/50000; acc:  58.91; ppl:  4.48; xent: 1.50; lr: 0.00010; 9453/3949 tok/s;    603 sec
[2021-04-23 03:55:33,526 INFO] Step 2850/50000; acc:  58.88; ppl:  4.56; xent: 1.52; lr: 0.00010; 9847/4126 tok/s;    614 sec
[2021-04-23 03:55:44,426 INFO] Step 2900/50000; acc:  59.38; ppl:  4.47; xent: 1.50; lr: 0.00010; 9399/3961 tok/s;    625 sec
[2021-04-23 03:55:46,801 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 03:55:54,825 INFO] Step 2950/50000; acc:  59.42; ppl:  4.48; xent: 1.50; lr: 0.00010; 9619/4087 tok/s;    635 sec
[2021-04-23 03:56:05,584 INFO] Step 3000/50000; acc:  59.10; ppl:  4.51; xent: 1.51; lr: 0.00010; 9529/3941 tok/s;    646 sec
[2021-04-23 03:56:16,927 INFO] Step 3050/50000; acc:  59.29; ppl:  4.42; xent: 1.49; lr: 0.00010; 8879/3873 tok/s;    657 sec
[2021-04-23 03:56:27,560 INFO] Step 3100/50000; acc:  59.60; ppl:  4.45; xent: 1.49; lr: 0.00010; 9802/4049 tok/s;    668 sec
[2021-04-23 03:56:37,970 INFO] Step 3150/50000; acc:  60.06; ppl:  4.39; xent: 1.48; lr: 0.00010; 9309/4072 tok/s;    678 sec
[2021-04-23 03:56:48,583 INFO] Step 3200/50000; acc:  59.63; ppl:  4.38; xent: 1.48; lr: 0.00010; 9762/4010 tok/s;    689 sec
[2021-04-23 03:56:59,138 INFO] Step 3250/50000; acc:  59.85; ppl:  4.37; xent: 1.48; lr: 0.00010; 9684/3968 tok/s;    699 sec
[2021-04-23 03:57:10,474 INFO] Step 3300/50000; acc:  60.38; ppl:  4.30; xent: 1.46; lr: 0.00010; 8938/3778 tok/s;    711 sec
[2021-04-23 03:57:21,252 INFO] Step 3350/50000; acc:  59.82; ppl:  4.38; xent: 1.48; lr: 0.00010; 9564/3944 tok/s;    722 sec
[2021-04-23 03:57:31,838 INFO] Step 3400/50000; acc:  59.97; ppl:  4.34; xent: 1.47; lr: 0.00010; 9676/4071 tok/s;    732 sec
[2021-04-23 03:57:42,614 INFO] Step 3450/50000; acc:  60.04; ppl:  4.41; xent: 1.48; lr: 0.00010; 9463/3984 tok/s;    743 sec
[2021-04-23 03:57:53,749 INFO] Step 3500/50000; acc:  60.73; ppl:  4.24; xent: 1.44; lr: 0.00010; 9005/3827 tok/s;    754 sec
[2021-04-23 03:58:03,921 INFO] Step 3550/50000; acc:  60.68; ppl:  4.23; xent: 1.44; lr: 0.00010; 10088/4106 tok/s;    764 sec
[2021-04-23 03:58:14,535 INFO] Step 3600/50000; acc:  60.98; ppl:  4.18; xent: 1.43; lr: 0.00010; 9356/4024 tok/s;    775 sec
[2021-04-23 03:58:18,031 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 03:58:25,476 INFO] Step 3650/50000; acc:  60.38; ppl:  4.26; xent: 1.45; lr: 0.00010; 9498/3981 tok/s;    786 sec
[2021-04-23 03:58:36,421 INFO] Step 3700/50000; acc:  61.09; ppl:  4.16; xent: 1.42; lr: 0.00010; 9392/3877 tok/s;    797 sec
[2021-04-23 03:58:47,099 INFO] Step 3750/50000; acc:  60.46; ppl:  4.25; xent: 1.45; lr: 0.00010; 9290/4024 tok/s;    807 sec
[2021-04-23 03:58:58,341 INFO] Step 3800/50000; acc:  60.78; ppl:  4.18; xent: 1.43; lr: 0.00010; 9073/3904 tok/s;    819 sec
[2021-04-23 03:59:08,848 INFO] Step 3850/50000; acc:  60.47; ppl:  4.22; xent: 1.44; lr: 0.00010; 9619/4048 tok/s;    829 sec
[2021-04-23 03:59:19,210 INFO] Step 3900/50000; acc:  61.30; ppl:  4.12; xent: 1.42; lr: 0.00010; 10038/4084 tok/s;    839 sec
[2021-04-23 03:59:29,283 INFO] Step 3950/50000; acc:  61.07; ppl:  4.12; xent: 1.42; lr: 0.00010; 9704/4190 tok/s;    850 sec
[2021-04-23 03:59:40,492 INFO] Step 4000/50000; acc:  60.73; ppl:  4.18; xent: 1.43; lr: 0.00010; 9429/3822 tok/s;    861 sec
[2021-04-23 03:59:51,178 INFO] Step 4050/50000; acc:  61.32; ppl:  4.10; xent: 1.41; lr: 0.00010; 9382/3976 tok/s;    871 sec
[2021-04-23 04:00:02,102 INFO] Step 4100/50000; acc:  61.31; ppl:  4.10; xent: 1.41; lr: 0.00010; 9276/3911 tok/s;    882 sec
[2021-04-23 04:00:12,835 INFO] Step 4150/50000; acc:  61.00; ppl:  4.18; xent: 1.43; lr: 0.00010; 9575/3992 tok/s;    893 sec
[2021-04-23 04:00:23,703 INFO] Step 4200/50000; acc:  61.37; ppl:  4.15; xent: 1.42; lr: 0.00010; 9420/3923 tok/s;    904 sec
[2021-04-23 04:00:34,955 INFO] Step 4250/50000; acc:  61.69; ppl:  4.06; xent: 1.40; lr: 0.00010; 9154/3814 tok/s;    915 sec
[2021-04-23 04:00:45,191 INFO] Step 4300/50000; acc:  62.00; ppl:  4.00; xent: 1.39; lr: 0.00010; 9672/4095 tok/s;    925 sec
[2021-04-23 04:00:55,792 INFO] Step 4350/50000; acc:  62.20; ppl:  3.99; xent: 1.38; lr: 0.00010; 9678/4047 tok/s;    936 sec
[2021-04-23 04:00:56,282 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 04:01:06,612 INFO] Step 4400/50000; acc:  62.01; ppl:  4.01; xent: 1.39; lr: 0.00010; 9220/4024 tok/s;    947 sec
[2021-04-23 04:01:17,356 INFO] Step 4450/50000; acc:  61.80; ppl:  4.07; xent: 1.40; lr: 0.00010; 9679/3919 tok/s;    958 sec
[2021-04-23 04:01:28,797 INFO] Step 4500/50000; acc:  61.65; ppl:  4.01; xent: 1.39; lr: 0.00010; 8870/3774 tok/s;    969 sec
[2021-04-23 04:01:39,380 INFO] Step 4550/50000; acc:  62.29; ppl:  4.00; xent: 1.39; lr: 0.00010; 9413/4103 tok/s;    980 sec
[2021-04-23 04:01:49,737 INFO] Step 4600/50000; acc:  61.97; ppl:  4.02; xent: 1.39; lr: 0.00010; 9891/4103 tok/s;    990 sec
[2021-04-23 04:02:00,337 INFO] Step 4650/50000; acc:  62.38; ppl:  3.96; xent: 1.38; lr: 0.00010; 9482/4043 tok/s;   1001 sec
[2021-04-23 04:02:11,457 INFO] Step 4700/50000; acc:  61.87; ppl:  4.03; xent: 1.39; lr: 0.00010; 9533/3783 tok/s;   1012 sec
[2021-04-23 04:02:22,388 INFO] Step 4750/50000; acc:  62.26; ppl:  3.92; xent: 1.37; lr: 0.00010; 8982/3917 tok/s;   1023 sec
[2021-04-23 04:02:33,107 INFO] Step 4800/50000; acc:  62.34; ppl:  3.97; xent: 1.38; lr: 0.00010; 9677/3984 tok/s;   1033 sec
[2021-04-23 04:02:44,100 INFO] Step 4850/50000; acc:  62.37; ppl:  3.94; xent: 1.37; lr: 0.00010; 9146/3872 tok/s;   1044 sec
[2021-04-23 04:02:54,459 INFO] Step 4900/50000; acc:  62.25; ppl:  4.01; xent: 1.39; lr: 0.00010; 9762/4141 tok/s;   1055 sec
[2021-04-23 04:03:05,566 INFO] Step 4950/50000; acc:  62.60; ppl:  3.90; xent: 1.36; lr: 0.00010; 9305/3848 tok/s;   1066 sec
[2021-04-23 04:03:16,174 INFO] Step 5000/50000; acc:  62.59; ppl:  3.91; xent: 1.36; lr: 0.00010; 9673/4016 tok/s;   1076 sec
[2021-04-23 04:03:16,175 INFO] valid's transforms: TransformPipe()
[2021-04-23 04:03:16,184 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/strict/valid.txt, align=None)...
[2021-04-23 04:03:24,945 INFO] Validation perplexity: 3.72794
[2021-04-23 04:03:24,945 INFO] Validation accuracy: 64.0765
[2021-04-23 04:03:24,947 INFO] Saving checkpoint ../models/default_params/strict_ops/model_step_5000.pt
[2021-04-23 04:03:35,899 INFO] Step 5050/50000; acc:  62.76; ppl:  3.89; xent: 1.36; lr: 0.00010; 5136/2156 tok/s;   1096 sec
[2021-04-23 04:03:43,733 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 04:03:46,422 INFO] Step 5100/50000; acc:  62.60; ppl:  3.88; xent: 1.36; lr: 0.00010; 9512/4087 tok/s;   1107 sec
[2021-04-23 04:03:57,116 INFO] Step 5150/50000; acc:  62.94; ppl:  3.87; xent: 1.35; lr: 0.00010; 9595/3965 tok/s;   1117 sec
[2021-04-23 04:04:07,814 INFO] Step 5200/50000; acc:  62.84; ppl:  3.88; xent: 1.35; lr: 0.00010; 9317/3951 tok/s;   1128 sec
[2021-04-23 04:04:19,325 INFO] Step 5250/50000; acc:  62.62; ppl:  3.90; xent: 1.36; lr: 0.00010; 8932/3834 tok/s;   1140 sec
[2021-04-23 04:04:30,088 INFO] Step 5300/50000; acc:  62.40; ppl:  3.93; xent: 1.37; lr: 0.00010; 9522/4013 tok/s;   1150 sec
[2021-04-23 04:04:40,375 INFO] Step 5350/50000; acc:  63.15; ppl:  3.82; xent: 1.34; lr: 0.00010; 9666/4108 tok/s;   1161 sec
[2021-04-23 04:04:50,889 INFO] Step 5400/50000; acc:  62.78; ppl:  3.86; xent: 1.35; lr: 0.00010; 9795/4074 tok/s;   1171 sec
[2021-04-23 04:05:01,706 INFO] Step 5450/50000; acc:  63.06; ppl:  3.82; xent: 1.34; lr: 0.00010; 9458/3865 tok/s;   1182 sec
[2021-04-23 04:05:13,297 INFO] Step 5500/50000; acc:  62.79; ppl:  3.88; xent: 1.36; lr: 0.00010; 9067/3767 tok/s;   1194 sec
[2021-04-23 04:05:23,140 INFO] Step 5550/50000; acc:  63.34; ppl:  3.77; xent: 1.33; lr: 0.00010; 9843/4225 tok/s;   1203 sec
[2021-04-23 04:05:34,288 INFO] Step 5600/50000; acc:  62.76; ppl:  3.86; xent: 1.35; lr: 0.00010; 9363/3901 tok/s;   1215 sec
[2021-04-23 04:05:45,313 INFO] Step 5650/50000; acc:  63.22; ppl:  3.82; xent: 1.34; lr: 0.00010; 9081/3854 tok/s;   1226 sec
[2021-04-23 04:05:56,160 INFO] Step 5700/50000; acc:  63.30; ppl:  3.77; xent: 1.33; lr: 0.00010; 9370/3899 tok/s;   1236 sec
[2021-04-23 04:06:06,426 INFO] Step 5750/50000; acc:  63.47; ppl:  3.76; xent: 1.32; lr: 0.00010; 9985/4168 tok/s;   1247 sec
[2021-04-23 04:06:17,395 INFO] Step 5800/50000; acc:  63.49; ppl:  3.76; xent: 1.32; lr: 0.00010; 9296/3907 tok/s;   1258 sec
[2021-04-23 04:06:22,374 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 04:06:28,231 INFO] Step 5850/50000; acc:  63.37; ppl:  3.79; xent: 1.33; lr: 0.00010; 9458/3967 tok/s;   1269 sec
[2021-04-23 04:06:38,737 INFO] Step 5900/50000; acc:  64.16; ppl:  3.71; xent: 1.31; lr: 0.00010; 9555/4007 tok/s;   1279 sec
[2021-04-23 04:06:50,413 INFO] Step 5950/50000; acc:  63.55; ppl:  3.77; xent: 1.33; lr: 0.00010; 8733/3707 tok/s;   1291 sec
[2021-04-23 04:07:01,034 INFO] Step 6000/50000; acc:  63.53; ppl:  3.73; xent: 1.32; lr: 0.00010; 9343/4106 tok/s;   1301 sec
[2021-04-23 04:07:12,035 INFO] Step 6050/50000; acc:  62.97; ppl:  3.83; xent: 1.34; lr: 0.00010; 9391/3895 tok/s;   1312 sec
[2021-04-23 04:07:22,221 INFO] Step 6100/50000; acc:  63.54; ppl:  3.71; xent: 1.31; lr: 0.00010; 9998/4148 tok/s;   1323 sec
[2021-04-23 04:07:32,841 INFO] Step 6150/50000; acc:  63.56; ppl:  3.74; xent: 1.32; lr: 0.00010; 9519/4004 tok/s;   1333 sec
[2021-04-23 04:07:43,890 INFO] Step 6200/50000; acc:  63.54; ppl:  3.72; xent: 1.32; lr: 0.00010; 9368/3850 tok/s;   1344 sec
[2021-04-23 04:07:55,090 INFO] Step 6250/50000; acc:  63.43; ppl:  3.76; xent: 1.32; lr: 0.00010; 9038/3830 tok/s;   1355 sec
[2021-04-23 04:08:05,814 INFO] Step 6300/50000; acc:  63.58; ppl:  3.74; xent: 1.32; lr: 0.00010; 9768/3998 tok/s;   1366 sec
[2021-04-23 04:08:16,078 INFO] Step 6350/50000; acc:  64.11; ppl:  3.68; xent: 1.30; lr: 0.00010; 9463/4142 tok/s;   1376 sec
[2021-04-23 04:08:27,227 INFO] Step 6400/50000; acc:  63.41; ppl:  3.76; xent: 1.32; lr: 0.00010; 9336/3816 tok/s;   1388 sec
[2021-04-23 04:08:37,910 INFO] Step 6450/50000; acc:  64.38; ppl:  3.62; xent: 1.29; lr: 0.00010; 9421/3958 tok/s;   1398 sec
[2021-04-23 04:08:48,285 INFO] Step 6500/50000; acc:  63.85; ppl:  3.66; xent: 1.30; lr: 0.00010; 9693/4109 tok/s;   1409 sec
[2021-04-23 04:08:59,309 INFO] Step 6550/50000; acc:  63.83; ppl:  3.68; xent: 1.30; lr: 0.00010; 9349/3924 tok/s;   1420 sec
[2021-04-23 04:09:01,266 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 04:09:09,661 INFO] Step 6600/50000; acc:  64.42; ppl:  3.66; xent: 1.30; lr: 0.00010; 9898/4111 tok/s;   1430 sec
[2021-04-23 04:09:20,510 INFO] Step 6650/50000; acc:  63.98; ppl:  3.69; xent: 1.31; lr: 0.00010; 9405/3910 tok/s;   1441 sec
[2021-04-23 04:09:31,685 INFO] Step 6700/50000; acc:  64.02; ppl:  3.63; xent: 1.29; lr: 0.00010; 8882/3935 tok/s;   1452 sec
[2021-04-23 04:09:42,287 INFO] Step 6750/50000; acc:  64.08; ppl:  3.67; xent: 1.30; lr: 0.00010; 9677/4018 tok/s;   1463 sec
[2021-04-23 04:09:53,016 INFO] Step 6800/50000; acc:  64.46; ppl:  3.64; xent: 1.29; lr: 0.00010; 9241/4000 tok/s;   1473 sec
[2021-04-23 04:10:03,620 INFO] Step 6850/50000; acc:  64.26; ppl:  3.63; xent: 1.29; lr: 0.00010; 9741/3977 tok/s;   1484 sec
[2021-04-23 04:10:14,651 INFO] Step 6900/50000; acc:  63.74; ppl:  3.69; xent: 1.30; lr: 0.00010; 9463/3843 tok/s;   1495 sec
[2021-04-23 04:10:25,237 INFO] Step 6950/50000; acc:  64.57; ppl:  3.56; xent: 1.27; lr: 0.00010; 9435/4015 tok/s;   1506 sec
[2021-04-23 04:10:36,211 INFO] Step 7000/50000; acc:  64.03; ppl:  3.66; xent: 1.30; lr: 0.00010; 9366/3886 tok/s;   1516 sec
[2021-04-23 04:10:46,786 INFO] Step 7050/50000; acc:  64.30; ppl:  3.62; xent: 1.29; lr: 0.00010; 9592/4086 tok/s;   1527 sec
[2021-04-23 04:10:57,788 INFO] Step 7100/50000; acc:  64.10; ppl:  3.71; xent: 1.31; lr: 0.00010; 9465/3912 tok/s;   1538 sec
[2021-04-23 04:11:08,488 INFO] Step 7150/50000; acc:  65.08; ppl:  3.50; xent: 1.25; lr: 0.00010; 9162/3934 tok/s;   1549 sec
[2021-04-23 04:11:18,846 INFO] Step 7200/50000; acc:  64.36; ppl:  3.62; xent: 1.29; lr: 0.00010; 10013/4078 tok/s;   1559 sec
[2021-04-23 04:11:29,785 INFO] Step 7250/50000; acc:  64.97; ppl:  3.53; xent: 1.26; lr: 0.00010; 9119/3896 tok/s;   1570 sec
[2021-04-23 04:11:32,763 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 04:11:40,475 INFO] Step 7300/50000; acc:  64.70; ppl:  3.59; xent: 1.28; lr: 0.00010; 9533/4068 tok/s;   1581 sec
[2021-04-23 04:11:51,053 INFO] Step 7350/50000; acc:  64.96; ppl:  3.55; xent: 1.27; lr: 0.00010; 9732/3998 tok/s;   1591 sec
[2021-04-23 04:12:02,080 INFO] Step 7400/50000; acc:  64.25; ppl:  3.63; xent: 1.29; lr: 0.00010; 9212/3910 tok/s;   1602 sec
[2021-04-23 04:12:13,331 INFO] Step 7450/50000; acc:  64.64; ppl:  3.54; xent: 1.27; lr: 0.00010; 9046/3904 tok/s;   1614 sec
[2021-04-23 04:12:23,660 INFO] Step 7500/50000; acc:  64.43; ppl:  3.60; xent: 1.28; lr: 0.00010; 9652/4110 tok/s;   1624 sec
[2021-04-23 04:12:33,840 INFO] Step 7550/50000; acc:  64.98; ppl:  3.54; xent: 1.26; lr: 0.00010; 10023/4130 tok/s;   1634 sec
[2021-04-23 04:12:44,259 INFO] Step 7600/50000; acc:  64.93; ppl:  3.52; xent: 1.26; lr: 0.00010; 9631/4101 tok/s;   1645 sec
[2021-04-23 04:12:55,846 INFO] Step 7650/50000; acc:  64.44; ppl:  3.58; xent: 1.28; lr: 0.00010; 9080/3702 tok/s;   1656 sec
[2021-04-23 04:13:06,459 INFO] Step 7700/50000; acc:  64.91; ppl:  3.54; xent: 1.26; lr: 0.00010; 9648/3961 tok/s;   1667 sec
[2021-04-23 04:13:17,213 INFO] Step 7750/50000; acc:  64.87; ppl:  3.51; xent: 1.26; lr: 0.00010; 9287/3980 tok/s;   1678 sec
[2021-04-23 04:13:28,015 INFO] Step 7800/50000; acc:  64.57; ppl:  3.60; xent: 1.28; lr: 0.00010; 9491/3986 tok/s;   1688 sec
[2021-04-23 04:13:38,815 INFO] Step 7850/50000; acc:  64.88; ppl:  3.55; xent: 1.27; lr: 0.00010; 9389/3935 tok/s;   1699 sec
[2021-04-23 04:13:50,201 INFO] Step 7900/50000; acc:  64.92; ppl:  3.52; xent: 1.26; lr: 0.00010; 9254/3806 tok/s;   1710 sec
[2021-04-23 04:14:00,111 INFO] Step 7950/50000; acc:  65.53; ppl:  3.42; xent: 1.23; lr: 0.00010; 9714/4170 tok/s;   1720 sec
[2021-04-23 04:14:11,010 INFO] Step 8000/50000; acc:  65.01; ppl:  3.53; xent: 1.26; lr: 0.00010; 9552/3966 tok/s;   1731 sec
[2021-04-23 04:14:11,018 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 04:14:21,925 INFO] Step 8050/50000; acc:  65.31; ppl:  3.47; xent: 1.24; lr: 0.00010; 9184/3970 tok/s;   1742 sec
[2021-04-23 04:14:32,466 INFO] Step 8100/50000; acc:  65.02; ppl:  3.52; xent: 1.26; lr: 0.00010; 9649/3985 tok/s;   1753 sec
[2021-04-23 04:14:43,969 INFO] Step 8150/50000; acc:  65.00; ppl:  3.50; xent: 1.25; lr: 0.00010; 8846/3748 tok/s;   1764 sec
[2021-04-23 04:14:54,607 INFO] Step 8200/50000; acc:  65.06; ppl:  3.52; xent: 1.26; lr: 0.00010; 9610/4109 tok/s;   1775 sec
[2021-04-23 04:15:05,267 INFO] Step 8250/50000; acc:  64.95; ppl:  3.51; xent: 1.26; lr: 0.00010; 9589/3979 tok/s;   1786 sec
[2021-04-23 04:15:15,769 INFO] Step 8300/50000; acc:  65.41; ppl:  3.45; xent: 1.24; lr: 0.00010; 9442/4079 tok/s;   1796 sec
[2021-04-23 04:15:26,712 INFO] Step 8350/50000; acc:  64.97; ppl:  3.51; xent: 1.25; lr: 0.00010; 9497/3812 tok/s;   1807 sec
[2021-04-23 04:15:37,816 INFO] Step 8400/50000; acc:  65.22; ppl:  3.45; xent: 1.24; lr: 0.00010; 9057/3916 tok/s;   1818 sec
[2021-04-23 04:15:48,371 INFO] Step 8450/50000; acc:  65.30; ppl:  3.45; xent: 1.24; lr: 0.00010; 9767/3997 tok/s;   1829 sec
[2021-04-23 04:15:59,601 INFO] Step 8500/50000; acc:  65.10; ppl:  3.49; xent: 1.25; lr: 0.00010; 9175/3826 tok/s;   1840 sec
[2021-04-23 04:16:09,921 INFO] Step 8550/50000; acc:  65.49; ppl:  3.50; xent: 1.25; lr: 0.00010; 9649/4141 tok/s;   1850 sec
[2021-04-23 04:16:20,978 INFO] Step 8600/50000; acc:  65.66; ppl:  3.42; xent: 1.23; lr: 0.00010; 9335/3856 tok/s;   1861 sec
[2021-04-23 04:16:31,529 INFO] Step 8650/50000; acc:  65.72; ppl:  3.42; xent: 1.23; lr: 0.00010; 9584/4024 tok/s;   1872 sec
[2021-04-23 04:16:42,022 INFO] Step 8700/50000; acc:  65.56; ppl:  3.45; xent: 1.24; lr: 0.00010; 9890/4070 tok/s;   1882 sec
[2021-04-23 04:16:49,554 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 04:16:52,524 INFO] Step 8750/50000; acc:  65.79; ppl:  3.40; xent: 1.22; lr: 0.00010; 9307/4085 tok/s;   1893 sec
[2021-04-23 04:17:03,376 INFO] Step 8800/50000; acc:  65.88; ppl:  3.42; xent: 1.23; lr: 0.00010; 9576/3912 tok/s;   1904 sec
[2021-04-23 04:17:14,254 INFO] Step 8850/50000; acc:  65.18; ppl:  3.44; xent: 1.24; lr: 0.00010; 9205/3895 tok/s;   1915 sec
[2021-04-23 04:17:25,408 INFO] Step 8900/50000; acc:  65.50; ppl:  3.45; xent: 1.24; lr: 0.00010; 9029/3935 tok/s;   1926 sec
[2021-04-23 04:17:36,097 INFO] Step 8950/50000; acc:  64.99; ppl:  3.46; xent: 1.24; lr: 0.00010; 9593/4059 tok/s;   1936 sec
[2021-04-23 04:17:46,800 INFO] Step 9000/50000; acc:  65.79; ppl:  3.41; xent: 1.23; lr: 0.00010; 9531/3975 tok/s;   1947 sec
[2021-04-23 04:17:57,040 INFO] Step 9050/50000; acc:  65.38; ppl:  3.43; xent: 1.23; lr: 0.00010; 10049/4149 tok/s;   1957 sec
[2021-04-23 04:18:07,973 INFO] Step 9100/50000; acc:  65.95; ppl:  3.37; xent: 1.21; lr: 0.00010; 9231/3823 tok/s;   1968 sec
[2021-04-23 04:18:19,355 INFO] Step 9150/50000; acc:  65.45; ppl:  3.44; xent: 1.24; lr: 0.00010; 9056/3822 tok/s;   1980 sec
[2021-04-23 04:18:29,713 INFO] Step 9200/50000; acc:  66.09; ppl:  3.35; xent: 1.21; lr: 0.00010; 9585/4052 tok/s;   1990 sec
[2021-04-23 04:18:40,728 INFO] Step 9250/50000; acc:  65.56; ppl:  3.43; xent: 1.23; lr: 0.00010; 9425/3921 tok/s;   2001 sec
[2021-04-23 04:18:51,835 INFO] Step 9300/50000; acc:  65.53; ppl:  3.43; xent: 1.23; lr: 0.00010; 9224/3838 tok/s;   2012 sec
[2021-04-23 04:19:02,563 INFO] Step 9350/50000; acc:  65.95; ppl:  3.36; xent: 1.21; lr: 0.00010; 9346/3934 tok/s;   2023 sec
[2021-04-23 04:19:12,943 INFO] Step 9400/50000; acc:  66.08; ppl:  3.37; xent: 1.22; lr: 0.00010; 9850/4147 tok/s;   2033 sec
[2021-04-23 04:19:23,606 INFO] Step 9450/50000; acc:  65.83; ppl:  3.36; xent: 1.21; lr: 0.00010; 9454/4012 tok/s;   2044 sec
[2021-04-23 04:19:28,238 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 04:19:34,438 INFO] Step 9500/50000; acc:  65.85; ppl:  3.41; xent: 1.23; lr: 0.00010; 9669/3986 tok/s;   2055 sec
[2021-04-23 04:19:44,835 INFO] Step 9550/50000; acc:  66.83; ppl:  3.31; xent: 1.20; lr: 0.00010; 9413/4035 tok/s;   2065 sec
[2021-04-23 04:19:56,745 INFO] Step 9600/50000; acc:  65.66; ppl:  3.41; xent: 1.23; lr: 0.00010; 8685/3662 tok/s;   2077 sec
[2021-04-23 04:20:07,017 INFO] Step 9650/50000; acc:  66.15; ppl:  3.35; xent: 1.21; lr: 0.00010; 9706/4183 tok/s;   2087 sec
[2021-04-23 04:20:17,965 INFO] Step 9700/50000; acc:  65.97; ppl:  3.39; xent: 1.22; lr: 0.00010; 9225/3920 tok/s;   2098 sec
[2021-04-23 04:20:28,234 INFO] Step 9750/50000; acc:  66.05; ppl:  3.34; xent: 1.21; lr: 0.00010; 9954/4131 tok/s;   2109 sec
[2021-04-23 04:20:38,800 INFO] Step 9800/50000; acc:  65.87; ppl:  3.38; xent: 1.22; lr: 0.00010; 9816/3999 tok/s;   2119 sec
[2021-04-23 04:20:50,114 INFO] Step 9850/50000; acc:  66.04; ppl:  3.35; xent: 1.21; lr: 0.00010; 9108/3799 tok/s;   2130 sec
[2021-04-23 04:21:01,292 INFO] Step 9900/50000; acc:  65.96; ppl:  3.35; xent: 1.21; lr: 0.00010; 8921/3849 tok/s;   2142 sec
[2021-04-23 04:21:11,714 INFO] Step 9950/50000; acc:  66.55; ppl:  3.31; xent: 1.20; lr: 0.00010; 9874/4052 tok/s;   2152 sec
[2021-04-23 04:21:22,208 INFO] Step 10000/50000; acc:  66.39; ppl:  3.35; xent: 1.21; lr: 0.00010; 9485/4089 tok/s;   2162 sec
[2021-04-23 04:21:22,211 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/strict/valid.txt, align=None)...
[2021-04-23 04:21:30,959 INFO] Validation perplexity: 3.27928
[2021-04-23 04:21:30,959 INFO] Validation accuracy: 67.0223
[2021-04-23 04:21:30,961 INFO] Saving checkpoint ../models/default_params/strict_ops/model_step_10000.pt
[2021-04-23 04:21:42,783 INFO] Step 10050/50000; acc:  66.05; ppl:  3.36; xent: 1.21; lr: 0.00010; 5040/2064 tok/s;   2183 sec
[2021-04-23 04:21:53,496 INFO] Step 10100/50000; acc:  66.33; ppl:  3.29; xent: 1.19; lr: 0.00010; 9596/3974 tok/s;   2194 sec
[2021-04-23 04:22:03,839 INFO] Step 10150/50000; acc:  66.77; ppl:  3.29; xent: 1.19; lr: 0.00010; 9589/4078 tok/s;   2204 sec
[2021-04-23 04:22:14,599 INFO] Step 10200/50000; acc:  66.37; ppl:  3.32; xent: 1.20; lr: 0.00010; 9561/4039 tok/s;   2215 sec
[2021-04-23 04:22:16,231 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 04:22:24,931 INFO] Step 10250/50000; acc:  66.48; ppl:  3.30; xent: 1.19; lr: 0.00010; 9815/4105 tok/s;   2225 sec
[2021-04-23 04:22:36,123 INFO] Step 10300/50000; acc:  66.04; ppl:  3.37; xent: 1.22; lr: 0.00010; 9311/3836 tok/s;   2236 sec
[2021-04-23 04:22:46,861 INFO] Step 10350/50000; acc:  66.61; ppl:  3.26; xent: 1.18; lr: 0.00010; 9021/4051 tok/s;   2247 sec
[2021-04-23 04:22:57,706 INFO] Step 10400/50000; acc:  65.91; ppl:  3.36; xent: 1.21; lr: 0.00010; 9562/3946 tok/s;   2258 sec
[2021-04-23 04:23:08,222 INFO] Step 10450/50000; acc:  66.73; ppl:  3.28; xent: 1.19; lr: 0.00010; 9488/4060 tok/s;   2269 sec
[2021-04-23 04:23:18,840 INFO] Step 10500/50000; acc:  66.48; ppl:  3.29; xent: 1.19; lr: 0.00010; 9530/3975 tok/s;   2279 sec
[2021-04-23 04:23:30,001 INFO] Step 10550/50000; acc:  66.14; ppl:  3.33; xent: 1.20; lr: 0.00010; 9385/3807 tok/s;   2290 sec
[2021-04-23 04:23:40,757 INFO] Step 10600/50000; acc:  66.32; ppl:  3.29; xent: 1.19; lr: 0.00010; 9498/3971 tok/s;   2301 sec
[2021-04-23 04:23:51,700 INFO] Step 10650/50000; acc:  66.41; ppl:  3.31; xent: 1.20; lr: 0.00010; 9374/3900 tok/s;   2312 sec
[2021-04-23 04:24:02,137 INFO] Step 10700/50000; acc:  66.74; ppl:  3.27; xent: 1.19; lr: 0.00010; 9578/4119 tok/s;   2322 sec
[2021-04-23 04:24:12,885 INFO] Step 10750/50000; acc:  66.36; ppl:  3.37; xent: 1.21; lr: 0.00010; 9508/3985 tok/s;   2333 sec
[2021-04-23 04:24:23,770 INFO] Step 10800/50000; acc:  66.96; ppl:  3.21; xent: 1.17; lr: 0.00010; 9236/3874 tok/s;   2344 sec
[2021-04-23 04:24:34,115 INFO] Step 10850/50000; acc:  66.84; ppl:  3.28; xent: 1.19; lr: 0.00010; 9984/4075 tok/s;   2354 sec
[2021-04-23 04:24:44,900 INFO] Step 10900/50000; acc:  66.87; ppl:  3.25; xent: 1.18; lr: 0.00010; 9460/4002 tok/s;   2365 sec
[2021-04-23 04:24:47,355 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 04:24:55,585 INFO] Step 10950/50000; acc:  67.11; ppl:  3.26; xent: 1.18; lr: 0.00010; 9386/4043 tok/s;   2376 sec
[2021-04-23 04:25:06,298 INFO] Step 11000/50000; acc:  66.97; ppl:  3.25; xent: 1.18; lr: 0.00010; 9599/3972 tok/s;   2387 sec
[2021-04-23 04:25:17,078 INFO] Step 11050/50000; acc:  66.29; ppl:  3.29; xent: 1.19; lr: 0.00010; 9308/3955 tok/s;   2397 sec
[2021-04-23 04:25:28,585 INFO] Step 11100/50000; acc:  66.92; ppl:  3.28; xent: 1.19; lr: 0.00010; 9051/3852 tok/s;   2409 sec
[2021-04-23 04:25:38,800 INFO] Step 11150/50000; acc:  66.77; ppl:  3.24; xent: 1.18; lr: 0.00010; 9515/4122 tok/s;   2419 sec
[2021-04-23 04:25:49,172 INFO] Step 11200/50000; acc:  66.66; ppl:  3.26; xent: 1.18; lr: 0.00010; 9959/4117 tok/s;   2429 sec
[2021-04-23 04:25:59,342 INFO] Step 11250/50000; acc:  66.85; ppl:  3.23; xent: 1.17; lr: 0.00010; 9936/4122 tok/s;   2440 sec
[2021-04-23 04:26:10,849 INFO] Step 11300/50000; acc:  66.61; ppl:  3.25; xent: 1.18; lr: 0.00010; 8942/3727 tok/s;   2451 sec
[2021-04-23 04:26:21,683 INFO] Step 11350/50000; acc:  66.86; ppl:  3.25; xent: 1.18; lr: 0.00010; 9467/3934 tok/s;   2462 sec
[2021-04-23 04:26:32,286 INFO] Step 11400/50000; acc:  66.94; ppl:  3.23; xent: 1.17; lr: 0.00010; 9666/3981 tok/s;   2473 sec
[2021-04-23 04:26:43,271 INFO] Step 11450/50000; acc:  66.73; ppl:  3.28; xent: 1.19; lr: 0.00010; 9304/3937 tok/s;   2484 sec
[2021-04-23 04:26:54,052 INFO] Step 11500/50000; acc:  67.17; ppl:  3.23; xent: 1.17; lr: 0.00010; 9275/3957 tok/s;   2494 sec
[2021-04-23 04:27:05,195 INFO] Step 11550/50000; acc:  67.12; ppl:  3.21; xent: 1.17; lr: 0.00010; 9276/3848 tok/s;   2505 sec
[2021-04-23 04:27:15,253 INFO] Step 11600/50000; acc:  67.38; ppl:  3.17; xent: 1.16; lr: 0.00010; 9813/4164 tok/s;   2516 sec
[2021-04-23 04:27:25,738 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 04:27:26,112 INFO] Step 11650/50000; acc:  66.87; ppl:  3.25; xent: 1.18; lr: 0.00010; 9561/3938 tok/s;   2526 sec
[2021-04-23 04:27:37,289 INFO] Step 11700/50000; acc:  67.25; ppl:  3.21; xent: 1.17; lr: 0.00010; 9154/3920 tok/s;   2538 sec
[2021-04-23 04:27:48,081 INFO] Step 11750/50000; acc:  66.81; ppl:  3.23; xent: 1.17; lr: 0.00010; 9282/3879 tok/s;   2548 sec
[2021-04-23 04:27:59,503 INFO] Step 11800/50000; acc:  67.13; ppl:  3.22; xent: 1.17; lr: 0.00010; 8893/3811 tok/s;   2560 sec
[2021-04-23 04:28:10,146 INFO] Step 11850/50000; acc:  66.81; ppl:  3.25; xent: 1.18; lr: 0.00010; 9498/4072 tok/s;   2570 sec
[2021-04-23 04:28:20,927 INFO] Step 11900/50000; acc:  66.90; ppl:  3.24; xent: 1.17; lr: 0.00010; 9700/3945 tok/s;   2581 sec
[2021-04-23 04:28:31,147 INFO] Step 11950/50000; acc:  67.39; ppl:  3.16; xent: 1.15; lr: 0.00010; 9487/4169 tok/s;   2591 sec
[2021-04-23 04:28:41,984 INFO] Step 12000/50000; acc:  66.90; ppl:  3.25; xent: 1.18; lr: 0.00010; 9707/3868 tok/s;   2602 sec
[2021-04-23 04:28:53,348 INFO] Step 12050/50000; acc:  67.11; ppl:  3.20; xent: 1.16; lr: 0.00010; 8889/3819 tok/s;   2614 sec
[2021-04-23 04:29:03,620 INFO] Step 12100/50000; acc:  67.44; ppl:  3.17; xent: 1.15; lr: 0.00010; 9806/4086 tok/s;   2624 sec
[2021-04-23 04:29:14,870 INFO] Step 12150/50000; acc:  66.95; ppl:  3.23; xent: 1.17; lr: 0.00010; 9199/3810 tok/s;   2635 sec
[2021-04-23 04:29:25,616 INFO] Step 12200/50000; acc:  67.13; ppl:  3.24; xent: 1.18; lr: 0.00010; 9494/4031 tok/s;   2646 sec
[2021-04-23 04:29:36,665 INFO] Step 12250/50000; acc:  67.48; ppl:  3.15; xent: 1.15; lr: 0.00010; 9327/3826 tok/s;   2657 sec
[2021-04-23 04:29:47,046 INFO] Step 12300/50000; acc:  67.69; ppl:  3.14; xent: 1.15; lr: 0.00010; 9570/4099 tok/s;   2667 sec
[2021-04-23 04:29:57,381 INFO] Step 12350/50000; acc:  67.45; ppl:  3.16; xent: 1.15; lr: 0.00010; 9869/4098 tok/s;   2678 sec
[2021-04-23 04:30:04,716 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 04:30:08,067 INFO] Step 12400/50000; acc:  67.18; ppl:  3.20; xent: 1.16; lr: 0.00010; 9364/4053 tok/s;   2688 sec
[2021-04-23 04:30:18,929 INFO] Step 12450/50000; acc:  67.51; ppl:  3.16; xent: 1.15; lr: 0.00010; 9551/3899 tok/s;   2699 sec
[2021-04-23 04:30:30,294 INFO] Step 12500/50000; acc:  67.00; ppl:  3.21; xent: 1.17; lr: 0.00010; 8999/3752 tok/s;   2711 sec
[2021-04-23 04:30:41,195 INFO] Step 12550/50000; acc:  67.57; ppl:  3.16; xent: 1.15; lr: 0.00010; 9106/4007 tok/s;   2721 sec
[2021-04-23 04:30:52,082 INFO] Step 12600/50000; acc:  67.21; ppl:  3.19; xent: 1.16; lr: 0.00010; 9393/3978 tok/s;   2732 sec
[2021-04-23 04:31:02,809 INFO] Step 12650/50000; acc:  67.46; ppl:  3.18; xent: 1.16; lr: 0.00010; 9405/3965 tok/s;   2743 sec
[2021-04-23 04:31:13,327 INFO] Step 12700/50000; acc:  66.97; ppl:  3.21; xent: 1.17; lr: 0.00010; 10016/4078 tok/s;   2754 sec
[2021-04-23 04:31:23,939 INFO] Step 12750/50000; acc:  68.08; ppl:  3.08; xent: 1.13; lr: 0.00010; 9262/3888 tok/s;   2764 sec
[2021-04-23 04:31:35,395 INFO] Step 12800/50000; acc:  66.90; ppl:  3.21; xent: 1.16; lr: 0.00010; 9112/3811 tok/s;   2776 sec
[2021-04-23 04:31:45,891 INFO] Step 12850/50000; acc:  67.55; ppl:  3.13; xent: 1.14; lr: 0.00010; 9520/3998 tok/s;   2786 sec
[2021-04-23 04:31:56,709 INFO] Step 12900/50000; acc:  67.41; ppl:  3.17; xent: 1.15; lr: 0.00010; 9369/3982 tok/s;   2797 sec
[2021-04-23 04:32:08,006 INFO] Step 12950/50000; acc:  67.51; ppl:  3.18; xent: 1.16; lr: 0.00010; 9117/3810 tok/s;   2808 sec
[2021-04-23 04:32:18,882 INFO] Step 13000/50000; acc:  67.75; ppl:  3.13; xent: 1.14; lr: 0.00010; 9440/3852 tok/s;   2819 sec
[2021-04-23 04:32:29,162 INFO] Step 13050/50000; acc:  67.80; ppl:  3.12; xent: 1.14; lr: 0.00010; 9914/4191 tok/s;   2829 sec
[2021-04-23 04:32:39,643 INFO] Step 13100/50000; acc:  67.73; ppl:  3.12; xent: 1.14; lr: 0.00010; 9486/4085 tok/s;   2840 sec
[2021-04-23 04:32:43,932 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 04:32:50,447 INFO] Step 13150/50000; acc:  67.66; ppl:  3.15; xent: 1.15; lr: 0.00010; 9511/3955 tok/s;   2851 sec
[2021-04-23 04:33:01,136 INFO] Step 13200/50000; acc:  67.68; ppl:  3.11; xent: 1.14; lr: 0.00010; 9371/3943 tok/s;   2861 sec
[2021-04-23 04:33:13,165 INFO] Step 13250/50000; acc:  67.41; ppl:  3.18; xent: 1.16; lr: 0.00010; 8576/3663 tok/s;   2873 sec
[2021-04-23 04:33:23,463 INFO] Step 13300/50000; acc:  67.62; ppl:  3.13; xent: 1.14; lr: 0.00010; 9903/4138 tok/s;   2884 sec
[2021-04-23 04:33:34,275 INFO] Step 13350/50000; acc:  67.34; ppl:  3.16; xent: 1.15; lr: 0.00010; 9197/3974 tok/s;   2895 sec
[2021-04-23 04:33:44,589 INFO] Step 13400/50000; acc:  67.84; ppl:  3.12; xent: 1.14; lr: 0.00010; 9897/4168 tok/s;   2905 sec
[2021-04-23 04:33:55,076 INFO] Step 13450/50000; acc:  67.56; ppl:  3.14; xent: 1.14; lr: 0.00010; 9788/3957 tok/s;   2915 sec
[2021-04-23 04:34:06,624 INFO] Step 13500/50000; acc:  67.30; ppl:  3.16; xent: 1.15; lr: 0.00010; 9105/3730 tok/s;   2927 sec
[2021-04-23 04:34:17,350 INFO] Step 13550/50000; acc:  67.78; ppl:  3.10; xent: 1.13; lr: 0.00010; 9080/4024 tok/s;   2938 sec
[2021-04-23 04:34:27,759 INFO] Step 13600/50000; acc:  67.94; ppl:  3.11; xent: 1.13; lr: 0.00010; 10014/4061 tok/s;   2948 sec
[2021-04-23 04:34:38,224 INFO] Step 13650/50000; acc:  67.98; ppl:  3.12; xent: 1.14; lr: 0.00010; 9556/4093 tok/s;   2959 sec
[2021-04-23 04:34:49,500 INFO] Step 13700/50000; acc:  67.93; ppl:  3.11; xent: 1.13; lr: 0.00010; 9003/3780 tok/s;   2970 sec
[2021-04-23 04:34:59,991 INFO] Step 13750/50000; acc:  68.09; ppl:  3.08; xent: 1.12; lr: 0.00010; 9809/4021 tok/s;   2980 sec
[2021-04-23 04:35:10,599 INFO] Step 13800/50000; acc:  67.90; ppl:  3.11; xent: 1.13; lr: 0.00010; 9610/3995 tok/s;   2991 sec
[2021-04-23 04:35:21,227 INFO] Step 13850/50000; acc:  67.73; ppl:  3.12; xent: 1.14; lr: 0.00010; 9643/4091 tok/s;   3002 sec
[2021-04-23 04:35:22,363 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 04:35:31,654 INFO] Step 13900/50000; acc:  68.56; ppl:  3.04; xent: 1.11; lr: 0.00010; 9591/4074 tok/s;   3012 sec
[2021-04-23 04:35:42,626 INFO] Step 13950/50000; acc:  67.46; ppl:  3.15; xent: 1.15; lr: 0.00010; 9318/3862 tok/s;   3023 sec
[2021-04-23 04:35:53,734 INFO] Step 14000/50000; acc:  68.26; ppl:  3.05; xent: 1.12; lr: 0.00010; 8923/3971 tok/s;   3034 sec
[2021-04-23 04:36:04,360 INFO] Step 14050/50000; acc:  67.76; ppl:  3.14; xent: 1.14; lr: 0.00010; 9745/4014 tok/s;   3045 sec
[2021-04-23 04:36:14,808 INFO] Step 14100/50000; acc:  67.76; ppl:  3.11; xent: 1.14; lr: 0.00010; 9749/4065 tok/s;   3055 sec
[2021-04-23 04:36:25,436 INFO] Step 14150/50000; acc:  68.22; ppl:  3.06; xent: 1.12; lr: 0.00010; 9398/3988 tok/s;   3066 sec
[2021-04-23 04:36:36,630 INFO] Step 14200/50000; acc:  67.92; ppl:  3.10; xent: 1.13; lr: 0.00010; 9339/3787 tok/s;   3077 sec
[2021-04-23 04:36:47,420 INFO] Step 14250/50000; acc:  68.11; ppl:  3.07; xent: 1.12; lr: 0.00010; 9366/3965 tok/s;   3088 sec
[2021-04-23 04:36:58,707 INFO] Step 14300/50000; acc:  67.73; ppl:  3.14; xent: 1.14; lr: 0.00010; 9277/3816 tok/s;   3099 sec
[2021-04-23 04:37:08,997 INFO] Step 14350/50000; acc:  68.61; ppl:  3.02; xent: 1.11; lr: 0.00010; 9484/4152 tok/s;   3109 sec
[2021-04-23 04:37:19,787 INFO] Step 14400/50000; acc:  67.59; ppl:  3.16; xent: 1.15; lr: 0.00010; 9597/3970 tok/s;   3120 sec
[2021-04-23 04:37:30,775 INFO] Step 14450/50000; acc:  68.64; ppl:  3.00; xent: 1.10; lr: 0.00010; 9201/3859 tok/s;   3131 sec
[2021-04-23 04:37:41,025 INFO] Step 14500/50000; acc:  68.35; ppl:  3.06; xent: 1.12; lr: 0.00010; 9830/4093 tok/s;   3141 sec
[2021-04-23 04:37:51,643 INFO] Step 14550/50000; acc:  68.05; ppl:  3.05; xent: 1.11; lr: 0.00010; 9647/4049 tok/s;   3152 sec
[2021-04-23 04:37:53,752 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 04:38:02,554 INFO] Step 14600/50000; acc:  68.07; ppl:  3.09; xent: 1.13; lr: 0.00010; 9415/3989 tok/s;   3163 sec
[2021-04-23 04:38:13,191 INFO] Step 14650/50000; acc:  68.20; ppl:  3.06; xent: 1.12; lr: 0.00010; 9648/3964 tok/s;   3173 sec
[2021-04-23 04:38:24,244 INFO] Step 14700/50000; acc:  67.93; ppl:  3.08; xent: 1.13; lr: 0.00010; 8950/3888 tok/s;   3185 sec
[2021-04-23 04:38:35,227 INFO] Step 14750/50000; acc:  68.43; ppl:  3.06; xent: 1.12; lr: 0.00010; 9322/3973 tok/s;   3196 sec
[2021-04-23 04:38:45,460 INFO] Step 14800/50000; acc:  68.04; ppl:  3.07; xent: 1.12; lr: 0.00010; 9718/4167 tok/s;   3206 sec
[2021-04-23 04:38:55,869 INFO] Step 14850/50000; acc:  68.00; ppl:  3.08; xent: 1.12; lr: 0.00010; 9882/4077 tok/s;   3216 sec
[2021-04-23 04:39:06,422 INFO] Step 14900/50000; acc:  68.06; ppl:  3.07; xent: 1.12; lr: 0.00010; 9806/3980 tok/s;   3227 sec
[2021-04-23 04:39:17,754 INFO] Step 14950/50000; acc:  68.32; ppl:  3.04; xent: 1.11; lr: 0.00010; 8928/3799 tok/s;   3238 sec
[2021-04-23 04:39:28,845 INFO] Step 15000/50000; acc:  68.16; ppl:  3.05; xent: 1.12; lr: 0.00010; 9236/3872 tok/s;   3249 sec
[2021-04-23 04:39:28,848 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/strict/valid.txt, align=None)...
[2021-04-23 04:39:37,598 INFO] Validation perplexity: 3.06679
[2021-04-23 04:39:37,598 INFO] Validation accuracy: 68.4305
[2021-04-23 04:39:37,600 INFO] Saving checkpoint ../models/default_params/strict_ops/model_step_15000.pt
[2021-04-23 04:39:48,702 INFO] Step 15050/50000; acc:  68.36; ppl:  3.03; xent: 1.11; lr: 0.00010; 5104/2128 tok/s;   3269 sec
[2021-04-23 04:39:59,525 INFO] Step 15100/50000; acc:  67.91; ppl:  3.10; xent: 1.13; lr: 0.00010; 9661/3979 tok/s;   3280 sec
[2021-04-23 04:40:10,248 INFO] Step 15150/50000; acc:  69.17; ppl:  2.97; xent: 1.09; lr: 0.00010; 9094/3962 tok/s;   3291 sec
[2021-04-23 04:40:21,479 INFO] Step 15200/50000; acc:  68.24; ppl:  3.05; xent: 1.12; lr: 0.00010; 9327/3813 tok/s;   3302 sec
[2021-04-23 04:40:31,413 INFO] Step 15250/50000; acc:  69.08; ppl:  2.99; xent: 1.10; lr: 0.00010; 9977/4234 tok/s;   3312 sec
[2021-04-23 04:40:41,552 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 04:40:42,205 INFO] Step 15300/50000; acc:  68.51; ppl:  3.04; xent: 1.11; lr: 0.00010; 9406/3959 tok/s;   3322 sec
[2021-04-23 04:40:53,338 INFO] Step 15350/50000; acc:  68.49; ppl:  3.03; xent: 1.11; lr: 0.00010; 9226/3914 tok/s;   3334 sec
[2021-04-23 04:41:04,009 INFO] Step 15400/50000; acc:  67.97; ppl:  3.07; xent: 1.12; lr: 0.00010; 9629/3945 tok/s;   3344 sec
[2021-04-23 04:41:15,200 INFO] Step 15450/50000; acc:  68.40; ppl:  3.03; xent: 1.11; lr: 0.00010; 9044/3887 tok/s;   3355 sec
[2021-04-23 04:41:25,867 INFO] Step 15500/50000; acc:  68.47; ppl:  3.04; xent: 1.11; lr: 0.00010; 9344/4046 tok/s;   3366 sec
[2021-04-23 04:41:36,478 INFO] Step 15550/50000; acc:  68.40; ppl:  3.03; xent: 1.11; lr: 0.00010; 9667/3975 tok/s;   3377 sec
[2021-04-23 04:41:46,910 INFO] Step 15600/50000; acc:  68.49; ppl:  3.02; xent: 1.11; lr: 0.00010; 9535/4132 tok/s;   3387 sec
[2021-04-23 04:41:57,687 INFO] Step 15650/50000; acc:  68.14; ppl:  3.05; xent: 1.12; lr: 0.00010; 9722/3859 tok/s;   3398 sec
[2021-04-23 04:42:09,263 INFO] Step 15700/50000; acc:  68.45; ppl:  3.03; xent: 1.11; lr: 0.00010; 8928/3784 tok/s;   3410 sec
[2021-04-23 04:42:19,339 INFO] Step 15750/50000; acc:  68.58; ppl:  2.99; xent: 1.09; lr: 0.00010; 9833/4167 tok/s;   3420 sec
[2021-04-23 04:42:30,302 INFO] Step 15800/50000; acc:  68.28; ppl:  3.03; xent: 1.11; lr: 0.00010; 9421/3915 tok/s;   3431 sec
[2021-04-23 04:42:41,246 INFO] Step 15850/50000; acc:  68.71; ppl:  3.04; xent: 1.11; lr: 0.00010; 9209/3949 tok/s;   3442 sec
[2021-04-23 04:42:52,507 INFO] Step 15900/50000; acc:  68.70; ppl:  2.99; xent: 1.10; lr: 0.00010; 9346/3773 tok/s;   3453 sec
[2021-04-23 04:43:02,352 INFO] Step 15950/50000; acc:  69.26; ppl:  2.94; xent: 1.08; lr: 0.00010; 9858/4277 tok/s;   3463 sec
[2021-04-23 04:43:12,953 INFO] Step 16000/50000; acc:  68.74; ppl:  3.01; xent: 1.10; lr: 0.00010; 9750/4013 tok/s;   3473 sec
[2021-04-23 04:43:20,039 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 04:43:23,751 INFO] Step 16050/50000; acc:  68.75; ppl:  3.00; xent: 1.10; lr: 0.00010; 9322/4014 tok/s;   3484 sec
[2021-04-23 04:43:34,515 INFO] Step 16100/50000; acc:  68.87; ppl:  2.98; xent: 1.09; lr: 0.00010; 9429/3921 tok/s;   3495 sec
[2021-04-23 04:43:46,029 INFO] Step 16150/50000; acc:  68.18; ppl:  3.03; xent: 1.11; lr: 0.00010; 8900/3699 tok/s;   3506 sec
[2021-04-23 04:43:56,961 INFO] Step 16200/50000; acc:  68.72; ppl:  3.02; xent: 1.11; lr: 0.00010; 9309/4026 tok/s;   3517 sec
[2021-04-23 04:44:07,861 INFO] Step 16250/50000; acc:  68.31; ppl:  3.03; xent: 1.11; lr: 0.00010; 9358/3960 tok/s;   3528 sec
[2021-04-23 04:44:18,316 INFO] Step 16300/50000; acc:  69.04; ppl:  2.98; xent: 1.09; lr: 0.00010; 9504/4077 tok/s;   3539 sec
[2021-04-23 04:44:28,934 INFO] Step 16350/50000; acc:  68.46; ppl:  3.02; xent: 1.11; lr: 0.00010; 9741/4002 tok/s;   3549 sec
[2021-04-23 04:44:39,730 INFO] Step 16400/50000; acc:  69.06; ppl:  2.93; xent: 1.08; lr: 0.00010; 9345/3850 tok/s;   3560 sec
[2021-04-23 04:44:51,286 INFO] Step 16450/50000; acc:  68.43; ppl:  3.02; xent: 1.11; lr: 0.00010; 8984/3762 tok/s;   3572 sec
[2021-04-23 04:45:02,080 INFO] Step 16500/50000; acc:  68.70; ppl:  2.98; xent: 1.09; lr: 0.00010; 9475/3919 tok/s;   3582 sec
[2021-04-23 04:45:12,499 INFO] Step 16550/50000; acc:  69.01; ppl:  2.97; xent: 1.09; lr: 0.00010; 9582/4110 tok/s;   3593 sec
[2021-04-23 04:45:23,845 INFO] Step 16600/50000; acc:  68.60; ppl:  3.00; xent: 1.10; lr: 0.00010; 9062/3801 tok/s;   3604 sec
[2021-04-23 04:45:34,544 INFO] Step 16650/50000; acc:  69.26; ppl:  2.95; xent: 1.08; lr: 0.00010; 9483/3912 tok/s;   3615 sec
[2021-04-23 04:45:44,982 INFO] Step 16700/50000; acc:  68.72; ppl:  2.99; xent: 1.10; lr: 0.00010; 9987/4143 tok/s;   3625 sec
[2021-04-23 04:45:55,460 INFO] Step 16750/50000; acc:  69.23; ppl:  2.93; xent: 1.08; lr: 0.00010; 9254/4068 tok/s;   3636 sec
[2021-04-23 04:45:59,323 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 04:46:06,066 INFO] Step 16800/50000; acc:  68.83; ppl:  2.99; xent: 1.09; lr: 0.00010; 9828/4048 tok/s;   3646 sec
[2021-04-23 04:46:16,958 INFO] Step 16850/50000; acc:  68.95; ppl:  2.96; xent: 1.09; lr: 0.00010; 9218/3871 tok/s;   3657 sec
[2021-04-23 04:46:28,697 INFO] Step 16900/50000; acc:  68.75; ppl:  2.98; xent: 1.09; lr: 0.00010; 8596/3709 tok/s;   3669 sec
[2021-04-23 04:46:39,115 INFO] Step 16950/50000; acc:  69.11; ppl:  2.97; xent: 1.09; lr: 0.00010; 9821/4161 tok/s;   3679 sec
[2021-04-23 04:46:50,080 INFO] Step 17000/50000; acc:  68.55; ppl:  2.99; xent: 1.10; lr: 0.00010; 9298/3895 tok/s;   3690 sec
[2021-04-23 04:47:00,585 INFO] Step 17050/50000; acc:  68.69; ppl:  2.97; xent: 1.09; lr: 0.00010; 9698/4078 tok/s;   3701 sec
[2021-04-23 04:47:10,868 INFO] Step 17100/50000; acc:  69.30; ppl:  2.95; xent: 1.08; lr: 0.00010; 9850/4074 tok/s;   3711 sec
[2021-04-23 04:47:22,210 INFO] Step 17150/50000; acc:  68.78; ppl:  2.98; xent: 1.09; lr: 0.00010; 9088/3747 tok/s;   3722 sec
[2021-04-23 04:47:33,139 INFO] Step 17200/50000; acc:  68.68; ppl:  2.96; xent: 1.08; lr: 0.00010; 9132/3941 tok/s;   3733 sec
[2021-04-23 04:47:43,640 INFO] Step 17250/50000; acc:  69.09; ppl:  2.96; xent: 1.09; lr: 0.00010; 9892/4043 tok/s;   3744 sec
[2021-04-23 04:47:54,338 INFO] Step 17300/50000; acc:  69.07; ppl:  2.99; xent: 1.09; lr: 0.00010; 9556/4008 tok/s;   3755 sec
[2021-04-23 04:48:05,305 INFO] Step 17350/50000; acc:  69.45; ppl:  2.92; xent: 1.07; lr: 0.00010; 9126/3867 tok/s;   3766 sec
[2021-04-23 04:48:16,044 INFO] Step 17400/50000; acc:  69.36; ppl:  2.91; xent: 1.07; lr: 0.00010; 9554/3962 tok/s;   3776 sec
[2021-04-23 04:48:26,426 INFO] Step 17450/50000; acc:  69.20; ppl:  2.94; xent: 1.08; lr: 0.00010; 9720/4082 tok/s;   3787 sec
[2021-04-23 04:48:37,430 INFO] Step 17500/50000; acc:  68.71; ppl:  3.00; xent: 1.10; lr: 0.00010; 9517/3961 tok/s;   3798 sec
[2021-04-23 04:48:38,022 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 04:48:47,716 INFO] Step 17550/50000; acc:  69.99; ppl:  2.86; xent: 1.05; lr: 0.00010; 9498/4141 tok/s;   3808 sec
[2021-04-23 04:48:58,626 INFO] Step 17600/50000; acc:  68.24; ppl:  3.02; xent: 1.11; lr: 0.00010; 9483/3874 tok/s;   3819 sec
[2021-04-23 04:49:09,843 INFO] Step 17650/50000; acc:  69.41; ppl:  2.92; xent: 1.07; lr: 0.00010; 8888/3942 tok/s;   3830 sec
[2021-04-23 04:49:20,405 INFO] Step 17700/50000; acc:  69.01; ppl:  2.95; xent: 1.08; lr: 0.00010; 9573/4008 tok/s;   3841 sec
[2021-04-23 04:49:30,739 INFO] Step 17750/50000; acc:  69.17; ppl:  2.95; xent: 1.08; lr: 0.00010; 9893/4098 tok/s;   3851 sec
[2021-04-23 04:49:41,306 INFO] Step 17800/50000; acc:  69.37; ppl:  2.93; xent: 1.07; lr: 0.00010; 9701/4005 tok/s;   3862 sec
[2021-04-23 04:49:52,459 INFO] Step 17850/50000; acc:  68.78; ppl:  2.97; xent: 1.09; lr: 0.00010; 9348/3852 tok/s;   3873 sec
[2021-04-23 04:50:03,054 INFO] Step 17900/50000; acc:  69.48; ppl:  2.90; xent: 1.06; lr: 0.00010; 9405/4015 tok/s;   3883 sec
[2021-04-23 04:50:14,151 INFO] Step 17950/50000; acc:  69.14; ppl:  2.94; xent: 1.08; lr: 0.00010; 9238/3868 tok/s;   3894 sec
[2021-04-23 04:50:24,663 INFO] Step 18000/50000; acc:  69.32; ppl:  2.93; xent: 1.07; lr: 0.00010; 9518/4063 tok/s;   3905 sec
[2021-04-23 04:50:35,363 INFO] Step 18050/50000; acc:  68.91; ppl:  2.98; xent: 1.09; lr: 0.00010; 9647/4024 tok/s;   3916 sec
[2021-04-23 04:50:46,576 INFO] Step 18100/50000; acc:  69.46; ppl:  2.89; xent: 1.06; lr: 0.00010; 9226/3776 tok/s;   3927 sec
[2021-04-23 04:50:56,747 INFO] Step 18150/50000; acc:  69.90; ppl:  2.88; xent: 1.06; lr: 0.00010; 9761/4107 tok/s;   3937 sec
[2021-04-23 04:51:07,503 INFO] Step 18200/50000; acc:  69.19; ppl:  2.92; xent: 1.07; lr: 0.00010; 9505/4042 tok/s;   3948 sec
[2021-04-23 04:51:09,160 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 04:51:18,106 INFO] Step 18250/50000; acc:  69.48; ppl:  2.92; xent: 1.07; lr: 0.00010; 9568/4064 tok/s;   3958 sec
[2021-04-23 04:51:29,030 INFO] Step 18300/50000; acc:  69.14; ppl:  2.96; xent: 1.08; lr: 0.00010; 9613/3879 tok/s;   3969 sec
[2021-04-23 04:51:39,747 INFO] Step 18350/50000; acc:  69.43; ppl:  2.90; xent: 1.06; lr: 0.00010; 8987/4006 tok/s;   3980 sec
[2021-04-23 04:51:50,796 INFO] Step 18400/50000; acc:  69.35; ppl:  2.94; xent: 1.08; lr: 0.00010; 9401/3953 tok/s;   3991 sec
[2021-04-23 04:52:00,995 INFO] Step 18450/50000; acc:  69.25; ppl:  2.90; xent: 1.07; lr: 0.00010; 9793/4147 tok/s;   4001 sec
[2021-04-23 04:52:11,298 INFO] Step 18500/50000; acc:  69.75; ppl:  2.90; xent: 1.06; lr: 0.00010; 9772/4122 tok/s;   4012 sec
[2021-04-23 04:52:21,900 INFO] Step 18550/50000; acc:  69.33; ppl:  2.93; xent: 1.07; lr: 0.00010; 9791/3966 tok/s;   4022 sec
[2021-04-23 04:52:33,478 INFO] Step 18600/50000; acc:  69.24; ppl:  2.92; xent: 1.07; lr: 0.00010; 8933/3727 tok/s;   4034 sec
[2021-04-23 04:52:44,589 INFO] Step 18650/50000; acc:  69.53; ppl:  2.91; xent: 1.07; lr: 0.00010; 9204/3868 tok/s;   4045 sec
[2021-04-23 04:52:55,091 INFO] Step 18700/50000; acc:  69.31; ppl:  2.88; xent: 1.06; lr: 0.00010; 9503/4003 tok/s;   4055 sec
[2021-04-23 04:53:05,864 INFO] Step 18750/50000; acc:  69.19; ppl:  2.94; xent: 1.08; lr: 0.00010; 9539/3990 tok/s;   4066 sec
[2021-04-23 04:53:16,746 INFO] Step 18800/50000; acc:  69.93; ppl:  2.85; xent: 1.05; lr: 0.00010; 9192/3923 tok/s;   4077 sec
[2021-04-23 04:53:27,878 INFO] Step 18850/50000; acc:  69.25; ppl:  2.91; xent: 1.07; lr: 0.00010; 9371/3837 tok/s;   4088 sec
[2021-04-23 04:53:38,110 INFO] Step 18900/50000; acc:  69.66; ppl:  2.89; xent: 1.06; lr: 0.00010; 9909/4137 tok/s;   4098 sec
[2021-04-23 04:53:47,572 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 04:53:48,844 INFO] Step 18950/50000; acc:  69.72; ppl:  2.87; xent: 1.05; lr: 0.00010; 9311/3990 tok/s;   4109 sec
[2021-04-23 04:53:59,559 INFO] Step 19000/50000; acc:  69.68; ppl:  2.86; xent: 1.05; lr: 0.00010; 9559/4022 tok/s;   4120 sec
[2021-04-23 04:54:10,453 INFO] Step 19050/50000; acc:  69.06; ppl:  2.92; xent: 1.07; lr: 0.00010; 9319/3875 tok/s;   4131 sec
[2021-04-23 04:54:22,061 INFO] Step 19100/50000; acc:  69.18; ppl:  2.92; xent: 1.07; lr: 0.00010; 8936/3800 tok/s;   4142 sec
[2021-04-23 04:54:32,251 INFO] Step 19150/50000; acc:  69.72; ppl:  2.87; xent: 1.05; lr: 0.00010; 9535/4186 tok/s;   4153 sec
[2021-04-23 04:54:42,994 INFO] Step 19200/50000; acc:  69.35; ppl:  2.91; xent: 1.07; lr: 0.00010; 9664/3972 tok/s;   4163 sec
[2021-04-23 04:54:53,397 INFO] Step 19250/50000; acc:  69.73; ppl:  2.87; xent: 1.05; lr: 0.00010; 9634/4100 tok/s;   4174 sec
[2021-04-23 04:55:03,945 INFO] Step 19300/50000; acc:  69.51; ppl:  2.88; xent: 1.06; lr: 0.00010; 9701/3935 tok/s;   4184 sec
[2021-04-23 04:55:15,606 INFO] Step 19350/50000; acc:  69.36; ppl:  2.90; xent: 1.07; lr: 0.00010; 8900/3753 tok/s;   4196 sec
[2021-04-23 04:55:25,951 INFO] Step 19400/50000; acc:  69.68; ppl:  2.87; xent: 1.05; lr: 0.00010; 9816/4080 tok/s;   4206 sec
[2021-04-23 04:55:36,947 INFO] Step 19450/50000; acc:  69.56; ppl:  2.88; xent: 1.06; lr: 0.00010; 9358/3914 tok/s;   4217 sec
[2021-04-23 04:55:47,654 INFO] Step 19500/50000; acc:  69.77; ppl:  2.88; xent: 1.06; lr: 0.00010; 9291/4016 tok/s;   4228 sec
[2021-04-23 04:55:58,565 INFO] Step 19550/50000; acc:  70.09; ppl:  2.84; xent: 1.04; lr: 0.00010; 9446/3859 tok/s;   4239 sec
[2021-04-23 04:56:08,756 INFO] Step 19600/50000; acc:  70.12; ppl:  2.84; xent: 1.04; lr: 0.00010; 9773/4158 tok/s;   4249 sec
[2021-04-23 04:56:19,660 INFO] Step 19650/50000; acc:  69.75; ppl:  2.87; xent: 1.05; lr: 0.00010; 9440/3952 tok/s;   4260 sec
[2021-04-23 04:56:26,110 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 04:56:30,590 INFO] Step 19700/50000; acc:  69.53; ppl:  2.89; xent: 1.06; lr: 0.00010; 9414/3947 tok/s;   4271 sec
[2021-04-23 04:56:41,186 INFO] Step 19750/50000; acc:  70.05; ppl:  2.85; xent: 1.05; lr: 0.00010; 9464/3977 tok/s;   4281 sec
[2021-04-23 04:56:52,482 INFO] Step 19800/50000; acc:  69.54; ppl:  2.87; xent: 1.06; lr: 0.00010; 9031/3773 tok/s;   4293 sec
[2021-04-23 04:57:03,541 INFO] Step 19850/50000; acc:  69.88; ppl:  2.87; xent: 1.05; lr: 0.00010; 9094/3976 tok/s;   4304 sec
[2021-04-23 04:57:14,365 INFO] Step 19900/50000; acc:  69.24; ppl:  2.91; xent: 1.07; lr: 0.00010; 9650/3979 tok/s;   4315 sec
[2021-04-23 04:57:24,936 INFO] Step 19950/50000; acc:  70.23; ppl:  2.83; xent: 1.04; lr: 0.00010; 9166/4056 tok/s;   4325 sec
[2021-04-23 04:57:35,310 INFO] Step 20000/50000; acc:  69.45; ppl:  2.90; xent: 1.06; lr: 0.00010; 10091/4051 tok/s;   4336 sec
[2021-04-23 04:57:35,313 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/strict/valid.txt, align=None)...
[2021-04-23 04:57:44,079 INFO] Validation perplexity: 2.96071
[2021-04-23 04:57:44,079 INFO] Validation accuracy: 69.123
[2021-04-23 04:57:44,081 INFO] Saving checkpoint ../models/default_params/strict_ops/model_step_20000.pt
[2021-04-23 04:57:55,627 INFO] Step 20050/50000; acc:  69.96; ppl:  2.84; xent: 1.04; lr: 0.00010; 4996/2068 tok/s;   4356 sec
[2021-04-23 04:58:06,848 INFO] Step 20100/50000; acc:  69.90; ppl:  2.84; xent: 1.05; lr: 0.00010; 9039/3852 tok/s;   4367 sec
[2021-04-23 04:58:17,591 INFO] Step 20150/50000; acc:  69.86; ppl:  2.86; xent: 1.05; lr: 0.00010; 9561/3914 tok/s;   4378 sec
[2021-04-23 04:58:28,452 INFO] Step 20200/50000; acc:  69.86; ppl:  2.87; xent: 1.06; lr: 0.00010; 9425/3958 tok/s;   4389 sec
[2021-04-23 04:58:39,634 INFO] Step 20250/50000; acc:  69.73; ppl:  2.86; xent: 1.05; lr: 0.00010; 9173/3847 tok/s;   4400 sec
[2021-04-23 04:58:50,358 INFO] Step 20300/50000; acc:  70.17; ppl:  2.80; xent: 1.03; lr: 0.00010; 9320/3905 tok/s;   4411 sec
[2021-04-23 04:59:00,634 INFO] Step 20350/50000; acc:  70.08; ppl:  2.85; xent: 1.05; lr: 0.00010; 9935/4198 tok/s;   4421 sec
[2021-04-23 04:59:11,450 INFO] Step 20400/50000; acc:  70.15; ppl:  2.83; xent: 1.04; lr: 0.00010; 9203/3959 tok/s;   4432 sec
[2021-04-23 04:59:14,762 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 04:59:21,924 INFO] Step 20450/50000; acc:  70.15; ppl:  2.84; xent: 1.04; lr: 0.00010; 9902/4101 tok/s;   4442 sec
[2021-04-23 04:59:32,949 INFO] Step 20500/50000; acc:  69.84; ppl:  2.86; xent: 1.05; lr: 0.00010; 9320/3809 tok/s;   4453 sec
[2021-04-23 04:59:44,427 INFO] Step 20550/50000; acc:  69.82; ppl:  2.84; xent: 1.04; lr: 0.00010; 8656/3821 tok/s;   4465 sec
[2021-04-23 04:59:54,788 INFO] Step 20600/50000; acc:  69.88; ppl:  2.85; xent: 1.05; lr: 0.00010; 9865/4169 tok/s;   4475 sec
[2021-04-23 05:00:05,661 INFO] Step 20650/50000; acc:  69.74; ppl:  2.85; xent: 1.05; lr: 0.00010; 9269/3902 tok/s;   4486 sec
[2021-04-23 05:00:16,438 INFO] Step 20700/50000; acc:  69.71; ppl:  2.87; xent: 1.05; lr: 0.00010; 9670/4027 tok/s;   4497 sec
[2021-04-23 05:00:26,445 INFO] Step 20750/50000; acc:  70.50; ppl:  2.78; xent: 1.02; lr: 0.00010; 9890/4110 tok/s;   4507 sec
[2021-04-23 05:00:37,806 INFO] Step 20800/50000; acc:  69.58; ppl:  2.86; xent: 1.05; lr: 0.00010; 9161/3808 tok/s;   4518 sec
[2021-04-23 05:00:48,512 INFO] Step 20850/50000; acc:  69.77; ppl:  2.84; xent: 1.04; lr: 0.00010; 9391/3966 tok/s;   4529 sec
[2021-04-23 05:00:58,870 INFO] Step 20900/50000; acc:  70.08; ppl:  2.82; xent: 1.04; lr: 0.00010; 9799/4113 tok/s;   4539 sec
[2021-04-23 05:01:09,657 INFO] Step 20950/50000; acc:  69.97; ppl:  2.86; xent: 1.05; lr: 0.00010; 9506/3978 tok/s;   4550 sec
[2021-04-23 05:01:20,907 INFO] Step 21000/50000; acc:  70.02; ppl:  2.81; xent: 1.03; lr: 0.00010; 9134/3769 tok/s;   4561 sec
[2021-04-23 05:01:31,484 INFO] Step 21050/50000; acc:  70.09; ppl:  2.81; xent: 1.03; lr: 0.00010; 9666/4020 tok/s;   4572 sec
[2021-04-23 05:01:42,078 INFO] Step 21100/50000; acc:  70.37; ppl:  2.80; xent: 1.03; lr: 0.00010; 9386/4047 tok/s;   4582 sec
[2021-04-23 05:01:46,350 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 05:01:52,981 INFO] Step 21150/50000; acc:  69.81; ppl:  2.85; xent: 1.05; lr: 0.00010; 9433/3938 tok/s;   4593 sec
[2021-04-23 05:02:03,527 INFO] Step 21200/50000; acc:  70.67; ppl:  2.77; xent: 1.02; lr: 0.00010; 9487/4055 tok/s;   4604 sec
[2021-04-23 05:02:14,427 INFO] Step 21250/50000; acc:  69.43; ppl:  2.89; xent: 1.06; lr: 0.00010; 9449/3922 tok/s;   4615 sec
[2021-04-23 05:02:25,775 INFO] Step 21300/50000; acc:  70.22; ppl:  2.82; xent: 1.04; lr: 0.00010; 8985/3851 tok/s;   4626 sec
[2021-04-23 05:02:36,119 INFO] Step 21350/50000; acc:  69.99; ppl:  2.83; xent: 1.04; lr: 0.00010; 9635/4127 tok/s;   4636 sec
[2021-04-23 05:02:46,259 INFO] Step 21400/50000; acc:  70.35; ppl:  2.80; xent: 1.03; lr: 0.00010; 10061/4155 tok/s;   4647 sec
[2021-04-23 05:02:56,894 INFO] Step 21450/50000; acc:  70.29; ppl:  2.81; xent: 1.03; lr: 0.00010; 9534/3986 tok/s;   4657 sec
[2021-04-23 05:03:08,339 INFO] Step 21500/50000; acc:  69.59; ppl:  2.88; xent: 1.06; lr: 0.00010; 9320/3788 tok/s;   4669 sec
[2021-04-23 05:03:18,883 INFO] Step 21550/50000; acc:  70.67; ppl:  2.76; xent: 1.01; lr: 0.00010; 9215/4000 tok/s;   4679 sec
[2021-04-23 05:03:29,851 INFO] Step 21600/50000; acc:  69.98; ppl:  2.83; xent: 1.04; lr: 0.00010; 9464/3887 tok/s;   4690 sec
[2021-04-23 05:03:40,516 INFO] Step 21650/50000; acc:  70.20; ppl:  2.81; xent: 1.03; lr: 0.00010; 9423/4025 tok/s;   4701 sec
[2021-04-23 05:03:51,124 INFO] Step 21700/50000; acc:  70.38; ppl:  2.81; xent: 1.03; lr: 0.00010; 9537/4017 tok/s;   4711 sec
[2021-04-23 05:04:02,532 INFO] Step 21750/50000; acc:  70.43; ppl:  2.77; xent: 1.02; lr: 0.00010; 9079/3762 tok/s;   4723 sec
[2021-04-23 05:04:12,846 INFO] Step 21800/50000; acc:  70.59; ppl:  2.80; xent: 1.03; lr: 0.00010; 9869/4063 tok/s;   4733 sec
[2021-04-23 05:04:23,542 INFO] Step 21850/50000; acc:  70.23; ppl:  2.81; xent: 1.03; lr: 0.00010; 9528/4036 tok/s;   4744 sec
[2021-04-23 05:04:24,686 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 05:04:34,101 INFO] Step 21900/50000; acc:  70.67; ppl:  2.76; xent: 1.02; lr: 0.00010; 9466/4082 tok/s;   4754 sec
[2021-04-23 05:04:44,903 INFO] Step 21950/50000; acc:  70.15; ppl:  2.81; xent: 1.03; lr: 0.00010; 9538/3898 tok/s;   4765 sec
[2021-04-23 05:04:55,850 INFO] Step 22000/50000; acc:  70.22; ppl:  2.79; xent: 1.03; lr: 0.00010; 9026/3935 tok/s;   4776 sec
[2021-04-23 05:05:07,011 INFO] Step 22050/50000; acc:  70.29; ppl:  2.81; xent: 1.03; lr: 0.00010; 9271/3936 tok/s;   4787 sec
[2021-04-23 05:05:17,242 INFO] Step 22100/50000; acc:  70.10; ppl:  2.82; xent: 1.04; lr: 0.00010; 9993/4131 tok/s;   4798 sec
[2021-04-23 05:05:27,606 INFO] Step 22150/50000; acc:  70.61; ppl:  2.76; xent: 1.02; lr: 0.00010; 9560/4113 tok/s;   4808 sec
[2021-04-23 05:05:38,315 INFO] Step 22200/50000; acc:  70.34; ppl:  2.79; xent: 1.03; lr: 0.00010; 9704/3908 tok/s;   4819 sec
[2021-04-23 05:05:49,867 INFO] Step 22250/50000; acc:  70.47; ppl:  2.78; xent: 1.02; lr: 0.00010; 8830/3739 tok/s;   4830 sec
[2021-04-23 05:06:00,720 INFO] Step 22300/50000; acc:  70.04; ppl:  2.82; xent: 1.04; lr: 0.00010; 9645/3975 tok/s;   4841 sec
[2021-04-23 05:06:11,423 INFO] Step 22350/50000; acc:  70.65; ppl:  2.74; xent: 1.01; lr: 0.00010; 9097/3958 tok/s;   4852 sec
[2021-04-23 05:06:22,015 INFO] Step 22400/50000; acc:  70.41; ppl:  2.83; xent: 1.04; lr: 0.00010; 9806/4018 tok/s;   4862 sec
[2021-04-23 05:06:32,534 INFO] Step 22450/50000; acc:  70.73; ppl:  2.74; xent: 1.01; lr: 0.00010; 9580/4063 tok/s;   4873 sec
[2021-04-23 05:06:43,353 INFO] Step 22500/50000; acc:  70.49; ppl:  2.77; xent: 1.02; lr: 0.00010; 9412/3935 tok/s;   4884 sec
[2021-04-23 05:06:53,805 INFO] Step 22550/50000; acc:  70.58; ppl:  2.79; xent: 1.02; lr: 0.00010; 9737/4045 tok/s;   4894 sec
[2021-04-23 05:07:02,746 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 05:07:04,632 INFO] Step 22600/50000; acc:  70.50; ppl:  2.79; xent: 1.03; lr: 0.00010; 9467/3987 tok/s;   4905 sec
[2021-04-23 05:07:15,139 INFO] Step 22650/50000; acc:  70.78; ppl:  2.74; xent: 1.01; lr: 0.00010; 9728/4068 tok/s;   4915 sec
[2021-04-23 05:07:25,945 INFO] Step 22700/50000; acc:  70.44; ppl:  2.79; xent: 1.03; lr: 0.00010; 9247/3909 tok/s;   4926 sec
[2021-04-23 05:07:37,283 INFO] Step 22750/50000; acc:  70.36; ppl:  2.79; xent: 1.03; lr: 0.00010; 8979/3887 tok/s;   4938 sec
[2021-04-23 05:07:47,759 INFO] Step 22800/50000; acc:  70.42; ppl:  2.79; xent: 1.03; lr: 0.00010; 9503/4082 tok/s;   4948 sec
[2021-04-23 05:07:58,606 INFO] Step 22850/50000; acc:  70.47; ppl:  2.79; xent: 1.03; lr: 0.00010; 9534/3950 tok/s;   4959 sec
[2021-04-23 05:08:09,220 INFO] Step 22900/50000; acc:  70.38; ppl:  2.78; xent: 1.02; lr: 0.00010; 9665/4028 tok/s;   4970 sec
[2021-04-23 05:08:19,659 INFO] Step 22950/50000; acc:  70.49; ppl:  2.76; xent: 1.01; lr: 0.00010; 9663/3932 tok/s;   4980 sec
[2021-04-23 05:08:31,254 INFO] Step 23000/50000; acc:  70.69; ppl:  2.77; xent: 1.02; lr: 0.00010; 8915/3800 tok/s;   4992 sec
[2021-04-23 05:08:41,622 INFO] Step 23050/50000; acc:  70.41; ppl:  2.77; xent: 1.02; lr: 0.00010; 9700/4063 tok/s;   5002 sec
[2021-04-23 05:08:52,762 INFO] Step 23100/50000; acc:  70.30; ppl:  2.81; xent: 1.03; lr: 0.00010; 9443/3892 tok/s;   5013 sec
[2021-04-23 05:09:03,341 INFO] Step 23150/50000; acc:  71.13; ppl:  2.73; xent: 1.01; lr: 0.00010; 9178/3996 tok/s;   5024 sec
[2021-04-23 05:09:14,641 INFO] Step 23200/50000; acc:  70.63; ppl:  2.76; xent: 1.01; lr: 0.00010; 9243/3767 tok/s;   5035 sec
[2021-04-23 05:09:24,765 INFO] Step 23250/50000; acc:  71.04; ppl:  2.72; xent: 1.00; lr: 0.00010; 9871/4215 tok/s;   5045 sec
[2021-04-23 05:09:35,558 INFO] Step 23300/50000; acc:  70.85; ppl:  2.75; xent: 1.01; lr: 0.00010; 9330/3955 tok/s;   5056 sec
[2021-04-23 05:09:41,585 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 05:09:46,347 INFO] Step 23350/50000; acc:  70.61; ppl:  2.76; xent: 1.01; lr: 0.00010; 9566/3985 tok/s;   5067 sec
[2021-04-23 05:09:57,134 INFO] Step 23400/50000; acc:  70.61; ppl:  2.77; xent: 1.02; lr: 0.00010; 9534/3916 tok/s;   5077 sec
[2021-04-23 05:10:08,526 INFO] Step 23450/50000; acc:  70.68; ppl:  2.76; xent: 1.02; lr: 0.00010; 8925/3768 tok/s;   5089 sec
[2021-04-23 05:10:19,189 INFO] Step 23500/50000; acc:  71.00; ppl:  2.73; xent: 1.01; lr: 0.00010; 9302/4103 tok/s;   5099 sec
[2021-04-23 05:10:30,049 INFO] Step 23550/50000; acc:  70.40; ppl:  2.78; xent: 1.02; lr: 0.00010; 9438/3935 tok/s;   5110 sec
[2021-04-23 05:10:40,319 INFO] Step 23600/50000; acc:  70.88; ppl:  2.73; xent: 1.01; lr: 0.00010; 9653/4156 tok/s;   5121 sec
[2021-04-23 05:10:51,035 INFO] Step 23650/50000; acc:  70.46; ppl:  2.78; xent: 1.02; lr: 0.00010; 9763/3952 tok/s;   5131 sec
[2021-04-23 05:11:02,207 INFO] Step 23700/50000; acc:  70.74; ppl:  2.75; xent: 1.01; lr: 0.00010; 9270/3770 tok/s;   5142 sec
[2021-04-23 05:11:13,095 INFO] Step 23750/50000; acc:  70.92; ppl:  2.73; xent: 1.00; lr: 0.00010; 9173/3996 tok/s;   5153 sec
[2021-04-23 05:11:23,804 INFO] Step 23800/50000; acc:  70.87; ppl:  2.75; xent: 1.01; lr: 0.00010; 9583/3936 tok/s;   5164 sec
[2021-04-23 05:11:34,496 INFO] Step 23850/50000; acc:  70.95; ppl:  2.75; xent: 1.01; lr: 0.00010; 9462/3968 tok/s;   5175 sec
[2021-04-23 05:11:46,042 INFO] Step 23900/50000; acc:  70.40; ppl:  2.78; xent: 1.02; lr: 0.00010; 9068/3784 tok/s;   5186 sec
[2021-04-23 05:11:56,388 INFO] Step 23950/50000; acc:  71.50; ppl:  2.65; xent: 0.98; lr: 0.00010; 9439/4022 tok/s;   5197 sec
[2021-04-23 05:12:06,752 INFO] Step 24000/50000; acc:  70.61; ppl:  2.77; xent: 1.02; lr: 0.00010; 9971/4148 tok/s;   5207 sec
[2021-04-23 05:12:17,685 INFO] Step 24050/50000; acc:  70.89; ppl:  2.73; xent: 1.01; lr: 0.00010; 9166/3945 tok/s;   5218 sec
[2021-04-23 05:12:20,561 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 05:12:28,013 INFO] Step 24100/50000; acc:  71.17; ppl:  2.70; xent: 0.99; lr: 0.00010; 9827/4098 tok/s;   5228 sec
[2021-04-23 05:12:39,051 INFO] Step 24150/50000; acc:  70.42; ppl:  2.77; xent: 1.02; lr: 0.00010; 9312/3859 tok/s;   5239 sec
[2021-04-23 05:12:50,550 INFO] Step 24200/50000; acc:  70.49; ppl:  2.77; xent: 1.02; lr: 0.00010; 8864/3828 tok/s;   5251 sec
[2021-04-23 05:13:01,000 INFO] Step 24250/50000; acc:  70.74; ppl:  2.73; xent: 1.00; lr: 0.00010; 9750/4104 tok/s;   5261 sec
[2021-04-23 05:13:11,620 INFO] Step 24300/50000; acc:  71.17; ppl:  2.72; xent: 1.00; lr: 0.00010; 9357/4016 tok/s;   5272 sec
[2021-04-23 05:13:22,164 INFO] Step 24350/50000; acc:  70.97; ppl:  2.74; xent: 1.01; lr: 0.00010; 9700/4041 tok/s;   5282 sec
[2021-04-23 05:13:32,569 INFO] Step 24400/50000; acc:  70.99; ppl:  2.70; xent: 0.99; lr: 0.00010; 9759/4002 tok/s;   5293 sec
[2021-04-23 05:13:44,099 INFO] Step 24450/50000; acc:  70.79; ppl:  2.74; xent: 1.01; lr: 0.00010; 8988/3733 tok/s;   5304 sec
[2021-04-23 05:13:55,129 INFO] Step 24500/50000; acc:  70.78; ppl:  2.75; xent: 1.01; lr: 0.00010; 9322/3846 tok/s;   5315 sec
[2021-04-23 05:14:05,464 INFO] Step 24550/50000; acc:  71.22; ppl:  2.70; xent: 0.99; lr: 0.00010; 9671/4142 tok/s;   5326 sec
[2021-04-23 05:14:16,177 INFO] Step 24600/50000; acc:  70.74; ppl:  2.75; xent: 1.01; lr: 0.00010; 9545/4008 tok/s;   5336 sec
[2021-04-23 05:14:27,419 INFO] Step 24650/50000; acc:  71.36; ppl:  2.69; xent: 0.99; lr: 0.00010; 9041/3794 tok/s;   5348 sec
[2021-04-23 05:14:37,984 INFO] Step 24700/50000; acc:  71.01; ppl:  2.72; xent: 1.00; lr: 0.00010; 9902/4001 tok/s;   5358 sec
[2021-04-23 05:14:48,417 INFO] Step 24750/50000; acc:  71.54; ppl:  2.66; xent: 0.98; lr: 0.00010; 9287/4061 tok/s;   5369 sec
[2021-04-23 05:14:52,284 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 05:14:59,395 INFO] Step 24800/50000; acc:  70.83; ppl:  2.73; xent: 1.01; lr: 0.00010; 9502/3974 tok/s;   5380 sec
[2021-04-23 05:15:10,138 INFO] Step 24850/50000; acc:  71.38; ppl:  2.68; xent: 0.99; lr: 0.00010; 9349/3960 tok/s;   5390 sec
[2021-04-23 05:15:20,985 INFO] Step 24900/50000; acc:  70.55; ppl:  2.75; xent: 1.01; lr: 0.00010; 9299/3928 tok/s;   5401 sec
[2021-04-23 05:15:32,301 INFO] Step 24950/50000; acc:  70.99; ppl:  2.72; xent: 1.00; lr: 0.00010; 9025/3880 tok/s;   5413 sec
[2021-04-23 05:15:43,016 INFO] Step 25000/50000; acc:  70.46; ppl:  2.76; xent: 1.01; lr: 0.00010; 9546/4012 tok/s;   5423 sec
[2021-04-23 05:15:43,018 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/strict/valid.txt, align=None)...
[2021-04-23 05:15:51,782 INFO] Validation perplexity: 2.91967
[2021-04-23 05:15:51,783 INFO] Validation accuracy: 69.8246
[2021-04-23 05:15:51,785 INFO] Saving checkpoint ../models/default_params/strict_ops/model_step_25000.pt
[2021-04-23 05:16:02,585 INFO] Step 25050/50000; acc:  71.27; ppl:  2.68; xent: 0.99; lr: 0.00010; 5195/2140 tok/s;   5443 sec
[2021-04-23 05:16:12,869 INFO] Step 25100/50000; acc:  71.36; ppl:  2.68; xent: 0.99; lr: 0.00010; 9729/4113 tok/s;   5453 sec
[2021-04-23 05:16:24,465 INFO] Step 25150/50000; acc:  70.52; ppl:  2.75; xent: 1.01; lr: 0.00010; 9008/3706 tok/s;   5465 sec
[2021-04-23 05:16:34,969 INFO] Step 25200/50000; acc:  71.12; ppl:  2.67; xent: 0.98; lr: 0.00010; 9499/4019 tok/s;   5475 sec
[2021-04-23 05:16:46,007 INFO] Step 25250/50000; acc:  71.02; ppl:  2.72; xent: 1.00; lr: 0.00010; 9383/3907 tok/s;   5486 sec
[2021-04-23 05:16:56,772 INFO] Step 25300/50000; acc:  70.98; ppl:  2.74; xent: 1.01; lr: 0.00010; 9522/3967 tok/s;   5497 sec
[2021-04-23 05:17:07,362 INFO] Step 25350/50000; acc:  71.14; ppl:  2.70; xent: 0.99; lr: 0.00010; 9421/4029 tok/s;   5508 sec
[2021-04-23 05:17:18,776 INFO] Step 25400/50000; acc:  71.35; ppl:  2.68; xent: 0.99; lr: 0.00010; 9057/3775 tok/s;   5519 sec
[2021-04-23 05:17:29,076 INFO] Step 25450/50000; acc:  71.59; ppl:  2.67; xent: 0.98; lr: 0.00010; 9758/4010 tok/s;   5529 sec
[2021-04-23 05:17:39,909 INFO] Step 25500/50000; acc:  71.23; ppl:  2.72; xent: 1.00; lr: 0.00010; 9650/4029 tok/s;   5540 sec
[2021-04-23 05:17:40,585 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 05:17:50,252 INFO] Step 25550/50000; acc:  71.80; ppl:  2.64; xent: 0.97; lr: 0.00010; 9406/4182 tok/s;   5551 sec
[2021-04-23 05:18:01,063 INFO] Step 25600/50000; acc:  70.74; ppl:  2.74; xent: 1.01; lr: 0.00010; 9657/3877 tok/s;   5561 sec
[2021-04-23 05:18:12,188 INFO] Step 25650/50000; acc:  70.94; ppl:  2.69; xent: 0.99; lr: 0.00010; 8916/3892 tok/s;   5572 sec
[2021-04-23 05:18:23,065 INFO] Step 25700/50000; acc:  71.13; ppl:  2.69; xent: 0.99; lr: 0.00010; 9305/3990 tok/s;   5583 sec
[2021-04-23 05:18:33,570 INFO] Step 25750/50000; acc:  70.77; ppl:  2.72; xent: 1.00; lr: 0.00010; 9769/4048 tok/s;   5594 sec
[2021-04-23 05:18:44,124 INFO] Step 25800/50000; acc:  71.58; ppl:  2.68; xent: 0.98; lr: 0.00010; 9627/4028 tok/s;   5604 sec
[2021-04-23 05:18:55,149 INFO] Step 25850/50000; acc:  70.90; ppl:  2.70; xent: 0.99; lr: 0.00010; 9405/3830 tok/s;   5615 sec
[2021-04-23 05:19:06,079 INFO] Step 25900/50000; acc:  71.26; ppl:  2.66; xent: 0.98; lr: 0.00010; 9195/3926 tok/s;   5626 sec
[2021-04-23 05:19:16,822 INFO] Step 25950/50000; acc:  71.32; ppl:  2.69; xent: 0.99; lr: 0.00010; 9550/3993 tok/s;   5637 sec
[2021-04-23 05:19:27,764 INFO] Step 26000/50000; acc:  71.42; ppl:  2.68; xent: 0.99; lr: 0.00010; 9139/3890 tok/s;   5648 sec
[2021-04-23 05:19:38,249 INFO] Step 26050/50000; acc:  71.10; ppl:  2.72; xent: 1.00; lr: 0.00010; 9863/4098 tok/s;   5659 sec
[2021-04-23 05:19:49,225 INFO] Step 26100/50000; acc:  71.48; ppl:  2.65; xent: 0.98; lr: 0.00010; 9388/3871 tok/s;   5670 sec
[2021-04-23 05:19:59,488 INFO] Step 26150/50000; acc:  71.71; ppl:  2.64; xent: 0.97; lr: 0.00010; 9744/4109 tok/s;   5680 sec
[2021-04-23 05:20:09,957 INFO] Step 26200/50000; acc:  71.41; ppl:  2.69; xent: 0.99; lr: 0.00010; 9710/4092 tok/s;   5690 sec
[2021-04-23 05:20:18,292 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 05:20:20,522 INFO] Step 26250/50000; acc:  71.34; ppl:  2.68; xent: 0.98; lr: 0.00010; 9606/4072 tok/s;   5701 sec
[2021-04-23 05:20:31,479 INFO] Step 26300/50000; acc:  71.26; ppl:  2.67; xent: 0.98; lr: 0.00010; 9541/3908 tok/s;   5712 sec
[2021-04-23 05:20:42,001 INFO] Step 26350/50000; acc:  71.48; ppl:  2.64; xent: 0.97; lr: 0.00010; 9250/4009 tok/s;   5722 sec
[2021-04-23 05:20:53,453 INFO] Step 26400/50000; acc:  71.15; ppl:  2.69; xent: 0.99; lr: 0.00010; 9011/3832 tok/s;   5734 sec
[2021-04-23 05:21:04,041 INFO] Step 26450/50000; acc:  71.12; ppl:  2.70; xent: 0.99; lr: 0.00010; 9466/4066 tok/s;   5744 sec
[2021-04-23 05:21:14,447 INFO] Step 26500/50000; acc:  71.65; ppl:  2.66; xent: 0.98; lr: 0.00010; 9706/4064 tok/s;   5755 sec
[2021-04-23 05:21:25,040 INFO] Step 26550/50000; acc:  71.22; ppl:  2.70; xent: 0.99; lr: 0.00010; 9727/4059 tok/s;   5765 sec
[2021-04-23 05:21:35,935 INFO] Step 26600/50000; acc:  71.30; ppl:  2.68; xent: 0.99; lr: 0.00010; 9507/3814 tok/s;   5776 sec
[2021-04-23 05:21:47,432 INFO] Step 26650/50000; acc:  71.42; ppl:  2.67; xent: 0.98; lr: 0.00010; 8940/3805 tok/s;   5788 sec
[2021-04-23 05:21:57,613 INFO] Step 26700/50000; acc:  71.66; ppl:  2.64; xent: 0.97; lr: 0.00010; 9755/4114 tok/s;   5798 sec
[2021-04-23 05:22:08,672 INFO] Step 26750/50000; acc:  71.37; ppl:  2.68; xent: 0.99; lr: 0.00010; 9322/3925 tok/s;   5809 sec
[2021-04-23 05:22:19,477 INFO] Step 26800/50000; acc:  71.64; ppl:  2.63; xent: 0.97; lr: 0.00010; 9217/3910 tok/s;   5820 sec
[2021-04-23 05:22:30,826 INFO] Step 26850/50000; acc:  71.35; ppl:  2.68; xent: 0.98; lr: 0.00010; 9157/3777 tok/s;   5831 sec
[2021-04-23 05:22:41,030 INFO] Step 26900/50000; acc:  71.54; ppl:  2.65; xent: 0.98; lr: 0.00010; 10019/4173 tok/s;   5841 sec
[2021-04-23 05:22:51,872 INFO] Step 26950/50000; acc:  71.73; ppl:  2.64; xent: 0.97; lr: 0.00010; 9161/3943 tok/s;   5852 sec
[2021-04-23 05:22:57,314 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 05:23:02,568 INFO] Step 27000/50000; acc:  71.46; ppl:  2.67; xent: 0.98; lr: 0.00010; 9619/3999 tok/s;   5863 sec
[2021-04-23 05:23:13,268 INFO] Step 27050/50000; acc:  71.56; ppl:  2.64; xent: 0.97; lr: 0.00010; 9520/3942 tok/s;   5874 sec
[2021-04-23 05:23:25,179 INFO] Step 27100/50000; acc:  71.11; ppl:  2.70; xent: 0.99; lr: 0.00010; 8722/3648 tok/s;   5885 sec
[2021-04-23 05:23:35,595 INFO] Step 27150/50000; acc:  71.89; ppl:  2.61; xent: 0.96; lr: 0.00010; 9287/4174 tok/s;   5896 sec
[2021-04-23 05:23:46,594 INFO] Step 27200/50000; acc:  71.25; ppl:  2.70; xent: 0.99; lr: 0.00010; 9429/3890 tok/s;   5907 sec
[2021-04-23 05:23:56,671 INFO] Step 27250/50000; acc:  71.87; ppl:  2.62; xent: 0.96; lr: 0.00010; 9886/4203 tok/s;   5917 sec
[2021-04-23 05:24:07,279 INFO] Step 27300/50000; acc:  71.50; ppl:  2.65; xent: 0.98; lr: 0.00010; 9665/3985 tok/s;   5928 sec
[2021-04-23 05:24:18,475 INFO] Step 27350/50000; acc:  71.60; ppl:  2.66; xent: 0.98; lr: 0.00010; 9270/3808 tok/s;   5939 sec
[2021-04-23 05:24:29,673 INFO] Step 27400/50000; acc:  71.44; ppl:  2.66; xent: 0.98; lr: 0.00010; 9153/3851 tok/s;   5950 sec
[2021-04-23 05:24:40,108 INFO] Step 27450/50000; acc:  71.55; ppl:  2.65; xent: 0.97; lr: 0.00010; 9802/4039 tok/s;   5960 sec
[2021-04-23 05:24:50,669 INFO] Step 27500/50000; acc:  71.94; ppl:  2.64; xent: 0.97; lr: 0.00010; 9432/4063 tok/s;   5971 sec
[2021-04-23 05:25:01,806 INFO] Step 27550/50000; acc:  71.53; ppl:  2.66; xent: 0.98; lr: 0.00010; 9226/3843 tok/s;   5982 sec
[2021-04-23 05:25:12,364 INFO] Step 27600/50000; acc:  72.10; ppl:  2.59; xent: 0.95; lr: 0.00010; 9484/3963 tok/s;   5993 sec
[2021-04-23 05:25:22,981 INFO] Step 27650/50000; acc:  71.72; ppl:  2.66; xent: 0.98; lr: 0.00010; 9690/4064 tok/s;   6003 sec
[2021-04-23 05:25:33,913 INFO] Step 27700/50000; acc:  71.53; ppl:  2.65; xent: 0.98; lr: 0.00010; 9392/3956 tok/s;   6014 sec
[2021-04-23 05:25:36,253 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 05:25:44,164 INFO] Step 27750/50000; acc:  72.08; ppl:  2.60; xent: 0.95; lr: 0.00010; 9755/4138 tok/s;   6024 sec
[2021-04-23 05:25:54,965 INFO] Step 27800/50000; acc:  71.28; ppl:  2.68; xent: 0.99; lr: 0.00010; 9482/3928 tok/s;   6035 sec
[2021-04-23 05:26:06,261 INFO] Step 27850/50000; acc:  71.58; ppl:  2.65; xent: 0.97; lr: 0.00010; 8913/3882 tok/s;   6047 sec
[2021-04-23 05:26:16,901 INFO] Step 27900/50000; acc:  71.48; ppl:  2.67; xent: 0.98; lr: 0.00010; 9812/4061 tok/s;   6057 sec
[2021-04-23 05:26:27,420 INFO] Step 27950/50000; acc:  72.33; ppl:  2.59; xent: 0.95; lr: 0.00010; 9206/4062 tok/s;   6068 sec
[2021-04-23 05:26:37,891 INFO] Step 28000/50000; acc:  71.59; ppl:  2.66; xent: 0.98; lr: 0.00010; 9899/4021 tok/s;   6078 sec
[2021-04-23 05:26:48,575 INFO] Step 28050/50000; acc:  71.94; ppl:  2.62; xent: 0.96; lr: 0.00010; 9560/3947 tok/s;   6089 sec
[2021-04-23 05:26:59,679 INFO] Step 28100/50000; acc:  71.74; ppl:  2.62; xent: 0.96; lr: 0.00010; 9128/3842 tok/s;   6100 sec
[2021-04-23 05:27:10,657 INFO] Step 28150/50000; acc:  71.56; ppl:  2.65; xent: 0.97; lr: 0.00010; 9391/3861 tok/s;   6111 sec
[2021-04-23 05:27:21,207 INFO] Step 28200/50000; acc:  71.65; ppl:  2.65; xent: 0.97; lr: 0.00010; 9715/4118 tok/s;   6121 sec
[2021-04-23 05:27:32,056 INFO] Step 28250/50000; acc:  71.85; ppl:  2.65; xent: 0.97; lr: 0.00010; 9389/3953 tok/s;   6132 sec
[2021-04-23 05:27:42,990 INFO] Step 28300/50000; acc:  72.33; ppl:  2.57; xent: 0.94; lr: 0.00010; 9181/3866 tok/s;   6143 sec
[2021-04-23 05:27:53,267 INFO] Step 28350/50000; acc:  71.94; ppl:  2.62; xent: 0.96; lr: 0.00010; 9982/4072 tok/s;   6154 sec
[2021-04-23 05:28:03,998 INFO] Step 28400/50000; acc:  72.14; ppl:  2.60; xent: 0.96; lr: 0.00010; 9236/3991 tok/s;   6164 sec
[2021-04-23 05:28:07,520 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 05:28:14,859 INFO] Step 28450/50000; acc:  71.49; ppl:  2.65; xent: 0.98; lr: 0.00010; 9590/4031 tok/s;   6175 sec
[2021-04-23 05:28:25,490 INFO] Step 28500/50000; acc:  72.07; ppl:  2.61; xent: 0.96; lr: 0.00010; 9653/3961 tok/s;   6186 sec
[2021-04-23 05:28:36,245 INFO] Step 28550/50000; acc:  71.53; ppl:  2.63; xent: 0.97; lr: 0.00010; 9216/4000 tok/s;   6197 sec
[2021-04-23 05:28:47,543 INFO] Step 28600/50000; acc:  71.88; ppl:  2.62; xent: 0.96; lr: 0.00010; 9037/3878 tok/s;   6208 sec
[2021-04-23 05:28:58,081 INFO] Step 28650/50000; acc:  71.62; ppl:  2.65; xent: 0.98; lr: 0.00010; 9603/4043 tok/s;   6218 sec
[2021-04-23 05:29:08,562 INFO] Step 28700/50000; acc:  72.13; ppl:  2.61; xent: 0.96; lr: 0.00010; 9917/4061 tok/s;   6229 sec
[2021-04-23 05:29:18,664 INFO] Step 28750/50000; acc:  72.41; ppl:  2.56; xent: 0.94; lr: 0.00010; 9688/4149 tok/s;   6239 sec
[2021-04-23 05:29:30,239 INFO] Step 28800/50000; acc:  71.52; ppl:  2.64; xent: 0.97; lr: 0.00010; 9126/3727 tok/s;   6251 sec
[2021-04-23 05:29:40,634 INFO] Step 28850/50000; acc:  72.27; ppl:  2.59; xent: 0.95; lr: 0.00010; 9633/4070 tok/s;   6261 sec
[2021-04-23 05:29:51,571 INFO] Step 28900/50000; acc:  72.05; ppl:  2.60; xent: 0.96; lr: 0.00010; 9269/3899 tok/s;   6272 sec
[2021-04-23 05:30:02,328 INFO] Step 28950/50000; acc:  71.94; ppl:  2.62; xent: 0.96; lr: 0.00010; 9554/3986 tok/s;   6283 sec
[2021-04-23 05:30:13,181 INFO] Step 29000/50000; acc:  71.92; ppl:  2.63; xent: 0.97; lr: 0.00010; 9443/3916 tok/s;   6293 sec
[2021-04-23 05:30:24,374 INFO] Step 29050/50000; acc:  72.16; ppl:  2.58; xent: 0.95; lr: 0.00010; 9207/3864 tok/s;   6305 sec
[2021-04-23 05:30:34,535 INFO] Step 29100/50000; acc:  72.39; ppl:  2.58; xent: 0.95; lr: 0.00010; 9719/4103 tok/s;   6315 sec
[2021-04-23 05:30:45,298 INFO] Step 29150/50000; acc:  72.04; ppl:  2.61; xent: 0.96; lr: 0.00010; 9550/3983 tok/s;   6326 sec
[2021-04-23 05:30:45,788 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 05:30:56,125 INFO] Step 29200/50000; acc:  72.37; ppl:  2.57; xent: 0.94; lr: 0.00010; 9209/4007 tok/s;   6336 sec
[2021-04-23 05:31:07,035 INFO] Step 29250/50000; acc:  71.46; ppl:  2.65; xent: 0.97; lr: 0.00010; 9530/3873 tok/s;   6347 sec
[2021-04-23 05:31:18,554 INFO] Step 29300/50000; acc:  71.83; ppl:  2.61; xent: 0.96; lr: 0.00010; 8809/3744 tok/s;   6359 sec
[2021-04-23 05:31:29,058 INFO] Step 29350/50000; acc:  72.26; ppl:  2.60; xent: 0.96; lr: 0.00010; 9495/4126 tok/s;   6369 sec
[2021-04-23 05:31:39,550 INFO] Step 29400/50000; acc:  71.75; ppl:  2.61; xent: 0.96; lr: 0.00010; 9761/4057 tok/s;   6380 sec
[2021-04-23 05:31:50,269 INFO] Step 29450/50000; acc:  72.21; ppl:  2.59; xent: 0.95; lr: 0.00010; 9377/4008 tok/s;   6391 sec
[2021-04-23 05:32:01,299 INFO] Step 29500/50000; acc:  71.30; ppl:  2.64; xent: 0.97; lr: 0.00010; 9613/3798 tok/s;   6402 sec
[2021-04-23 05:32:12,129 INFO] Step 29550/50000; acc:  72.44; ppl:  2.55; xent: 0.94; lr: 0.00010; 9058/3969 tok/s;   6412 sec
[2021-04-23 05:32:22,673 INFO] Step 29600/50000; acc:  71.96; ppl:  2.60; xent: 0.95; lr: 0.00010; 9816/4040 tok/s;   6423 sec
[2021-04-23 05:32:33,555 INFO] Step 29650/50000; acc:  72.25; ppl:  2.58; xent: 0.95; lr: 0.00010; 9260/3924 tok/s;   6434 sec
[2021-04-23 05:32:43,996 INFO] Step 29700/50000; acc:  72.14; ppl:  2.61; xent: 0.96; lr: 0.00010; 9685/4100 tok/s;   6444 sec
[2021-04-23 05:32:55,116 INFO] Step 29750/50000; acc:  72.29; ppl:  2.57; xent: 0.94; lr: 0.00010; 9298/3835 tok/s;   6455 sec
[2021-04-23 05:33:05,695 INFO] Step 29800/50000; acc:  72.29; ppl:  2.59; xent: 0.95; lr: 0.00010; 9678/4025 tok/s;   6466 sec
[2021-04-23 05:33:16,118 INFO] Step 29850/50000; acc:  72.28; ppl:  2.59; xent: 0.95; lr: 0.00010; 9726/4099 tok/s;   6476 sec
[2021-04-23 05:33:24,075 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 05:33:26,654 INFO] Step 29900/50000; acc:  72.36; ppl:  2.57; xent: 0.94; lr: 0.00010; 9508/4069 tok/s;   6487 sec
[2021-04-23 05:33:37,294 INFO] Step 29950/50000; acc:  72.09; ppl:  2.58; xent: 0.95; lr: 0.00010; 9652/3994 tok/s;   6498 sec
[2021-04-23 05:33:48,104 INFO] Step 30000/50000; acc:  72.24; ppl:  2.58; xent: 0.95; lr: 0.00010; 9214/3896 tok/s;   6508 sec
[2021-04-23 05:33:48,105 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/strict/valid.txt, align=None)...
[2021-04-23 05:33:56,856 INFO] Validation perplexity: 2.89724
[2021-04-23 05:33:56,856 INFO] Validation accuracy: 70.0669
[2021-04-23 05:33:56,858 INFO] Saving checkpoint ../models/default_params/strict_ops/model_step_30000.pt
[2021-04-23 05:34:08,903 INFO] Step 30050/50000; acc:  71.88; ppl:  2.62; xent: 0.96; lr: 0.00010; 4948/2130 tok/s;   6529 sec
[2021-04-23 05:34:19,599 INFO] Step 30100/50000; acc:  71.87; ppl:  2.62; xent: 0.96; lr: 0.00010; 9562/4026 tok/s;   6540 sec
[2021-04-23 05:34:30,099 INFO] Step 30150/50000; acc:  72.59; ppl:  2.55; xent: 0.94; lr: 0.00010; 9474/4052 tok/s;   6550 sec
[2021-04-23 05:34:40,475 INFO] Step 30200/50000; acc:  72.01; ppl:  2.59; xent: 0.95; lr: 0.00010; 9937/4098 tok/s;   6561 sec
[2021-04-23 05:34:51,567 INFO] Step 30250/50000; acc:  72.31; ppl:  2.56; xent: 0.94; lr: 0.00010; 9230/3782 tok/s;   6572 sec
[2021-04-23 05:35:03,028 INFO] Step 30300/50000; acc:  71.75; ppl:  2.62; xent: 0.96; lr: 0.00010; 9165/3821 tok/s;   6583 sec
[2021-04-23 05:35:13,229 INFO] Step 30350/50000; acc:  72.69; ppl:  2.53; xent: 0.93; lr: 0.00010; 9492/4089 tok/s;   6594 sec
[2021-04-23 05:35:24,135 INFO] Step 30400/50000; acc:  72.10; ppl:  2.59; xent: 0.95; lr: 0.00010; 9558/3962 tok/s;   6604 sec
[2021-04-23 05:35:35,018 INFO] Step 30450/50000; acc:  72.61; ppl:  2.55; xent: 0.94; lr: 0.00010; 9211/3904 tok/s;   6615 sec
[2021-04-23 05:35:46,003 INFO] Step 30500/50000; acc:  72.67; ppl:  2.55; xent: 0.94; lr: 0.00010; 9260/3840 tok/s;   6626 sec
[2021-04-23 05:35:56,353 INFO] Step 30550/50000; acc:  72.59; ppl:  2.56; xent: 0.94; lr: 0.00010; 9905/4161 tok/s;   6637 sec
[2021-04-23 05:36:07,206 INFO] Step 30600/50000; acc:  72.20; ppl:  2.59; xent: 0.95; lr: 0.00010; 9389/3941 tok/s;   6647 sec
[2021-04-23 05:36:12,123 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 05:36:17,848 INFO] Step 30650/50000; acc:  72.47; ppl:  2.56; xent: 0.94; lr: 0.00010; 9626/4056 tok/s;   6658 sec
[2021-04-23 05:36:28,462 INFO] Step 30700/50000; acc:  72.51; ppl:  2.55; xent: 0.94; lr: 0.00010; 9454/3956 tok/s;   6669 sec
[2021-04-23 05:36:40,316 INFO] Step 30750/50000; acc:  72.01; ppl:  2.59; xent: 0.95; lr: 0.00010; 8617/3659 tok/s;   6681 sec
[2021-04-23 05:36:50,601 INFO] Step 30800/50000; acc:  72.60; ppl:  2.55; xent: 0.93; lr: 0.00010; 9635/4216 tok/s;   6691 sec
[2021-04-23 05:37:01,631 INFO] Step 30850/50000; acc:  72.02; ppl:  2.60; xent: 0.96; lr: 0.00010; 9373/3895 tok/s;   6702 sec
[2021-04-23 05:37:11,789 INFO] Step 30900/50000; acc:  72.45; ppl:  2.56; xent: 0.94; lr: 0.00010; 10026/4169 tok/s;   6712 sec
[2021-04-23 05:37:22,283 INFO] Step 30950/50000; acc:  72.58; ppl:  2.54; xent: 0.93; lr: 0.00010; 9634/4015 tok/s;   6723 sec
[2021-04-23 05:37:33,623 INFO] Step 31000/50000; acc:  72.17; ppl:  2.57; xent: 0.94; lr: 0.00010; 9123/3794 tok/s;   6734 sec
[2021-04-23 05:37:44,600 INFO] Step 31050/50000; acc:  72.39; ppl:  2.56; xent: 0.94; lr: 0.00010; 9213/3891 tok/s;   6745 sec
[2021-04-23 05:37:55,185 INFO] Step 31100/50000; acc:  72.16; ppl:  2.58; xent: 0.95; lr: 0.00010; 9904/4033 tok/s;   6755 sec
[2021-04-23 05:38:05,375 INFO] Step 31150/50000; acc:  73.14; ppl:  2.51; xent: 0.92; lr: 0.00010; 9534/4189 tok/s;   6766 sec
[2021-04-23 05:38:16,696 INFO] Step 31200/50000; acc:  72.18; ppl:  2.58; xent: 0.95; lr: 0.00010; 9186/3751 tok/s;   6777 sec
[2021-04-23 05:38:27,321 INFO] Step 31250/50000; acc:  72.84; ppl:  2.51; xent: 0.92; lr: 0.00010; 9471/4012 tok/s;   6788 sec
[2021-04-23 05:38:37,689 INFO] Step 31300/50000; acc:  72.63; ppl:  2.55; xent: 0.93; lr: 0.00010; 9715/4076 tok/s;   6798 sec
[2021-04-23 05:38:48,630 INFO] Step 31350/50000; acc:  72.36; ppl:  2.56; xent: 0.94; lr: 0.00010; 9415/3949 tok/s;   6809 sec
[2021-04-23 05:38:50,572 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 05:38:59,024 INFO] Step 31400/50000; acc:  72.61; ppl:  2.53; xent: 0.93; lr: 0.00010; 9870/4100 tok/s;   6819 sec
[2021-04-23 05:39:09,892 INFO] Step 31450/50000; acc:  72.06; ppl:  2.57; xent: 0.95; lr: 0.00010; 9379/3913 tok/s;   6830 sec
[2021-04-23 05:39:20,850 INFO] Step 31500/50000; acc:  72.49; ppl:  2.55; xent: 0.94; lr: 0.00010; 9059/4000 tok/s;   6841 sec
[2021-04-23 05:39:31,520 INFO] Step 31550/50000; acc:  72.40; ppl:  2.56; xent: 0.94; lr: 0.00010; 9606/4003 tok/s;   6852 sec
[2021-04-23 05:39:42,224 INFO] Step 31600/50000; acc:  72.68; ppl:  2.53; xent: 0.93; lr: 0.00010; 9265/4007 tok/s;   6863 sec
[2021-04-23 05:39:52,903 INFO] Step 31650/50000; acc:  72.24; ppl:  2.57; xent: 0.94; lr: 0.00010; 9685/3948 tok/s;   6873 sec
[2021-04-23 05:40:03,975 INFO] Step 31700/50000; acc:  72.39; ppl:  2.54; xent: 0.93; lr: 0.00010; 9427/3831 tok/s;   6884 sec
[2021-04-23 05:40:14,442 INFO] Step 31750/50000; acc:  72.87; ppl:  2.51; xent: 0.92; lr: 0.00010; 9525/4065 tok/s;   6895 sec
[2021-04-23 05:40:25,398 INFO] Step 31800/50000; acc:  72.15; ppl:  2.56; xent: 0.94; lr: 0.00010; 9394/3898 tok/s;   6906 sec
[2021-04-23 05:40:36,096 INFO] Step 31850/50000; acc:  72.58; ppl:  2.54; xent: 0.93; lr: 0.00010; 9475/4026 tok/s;   6916 sec
[2021-04-23 05:40:47,032 INFO] Step 31900/50000; acc:  72.15; ppl:  2.58; xent: 0.95; lr: 0.00010; 9527/3952 tok/s;   6927 sec
[2021-04-23 05:40:57,647 INFO] Step 31950/50000; acc:  73.33; ppl:  2.45; xent: 0.90; lr: 0.00010; 9240/3930 tok/s;   6938 sec
[2021-04-23 05:41:07,967 INFO] Step 32000/50000; acc:  72.28; ppl:  2.56; xent: 0.94; lr: 0.00010; 10049/4102 tok/s;   6948 sec
[2021-04-23 05:41:18,703 INFO] Step 32050/50000; acc:  72.75; ppl:  2.51; xent: 0.92; lr: 0.00010; 9293/4019 tok/s;   6959 sec
[2021-04-23 05:41:21,569 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 05:41:29,117 INFO] Step 32100/50000; acc:  72.61; ppl:  2.53; xent: 0.93; lr: 0.00010; 9775/4135 tok/s;   6969 sec
[2021-04-23 05:41:39,891 INFO] Step 32150/50000; acc:  72.84; ppl:  2.52; xent: 0.93; lr: 0.00010; 9558/3935 tok/s;   6980 sec
[2021-04-23 05:41:50,825 INFO] Step 32200/50000; acc:  72.09; ppl:  2.57; xent: 0.95; lr: 0.00010; 9289/3922 tok/s;   6991 sec
[2021-04-23 05:42:01,989 INFO] Step 32250/50000; acc:  72.66; ppl:  2.53; xent: 0.93; lr: 0.00010; 9127/3948 tok/s;   7002 sec
[2021-04-23 05:42:12,467 INFO] Step 32300/50000; acc:  72.40; ppl:  2.54; xent: 0.93; lr: 0.00010; 9503/4043 tok/s;   7013 sec
[2021-04-23 05:42:22,755 INFO] Step 32350/50000; acc:  73.05; ppl:  2.51; xent: 0.92; lr: 0.00010; 9916/4117 tok/s;   7023 sec
[2021-04-23 05:42:32,884 INFO] Step 32400/50000; acc:  72.86; ppl:  2.51; xent: 0.92; lr: 0.00010; 9917/4185 tok/s;   7033 sec
[2021-04-23 05:42:44,496 INFO] Step 32450/50000; acc:  72.44; ppl:  2.54; xent: 0.93; lr: 0.00010; 9058/3711 tok/s;   7045 sec
[2021-04-23 05:42:55,313 INFO] Step 32500/50000; acc:  72.74; ppl:  2.53; xent: 0.93; lr: 0.00010; 9455/3890 tok/s;   7056 sec
[2021-04-23 05:43:05,810 INFO] Step 32550/50000; acc:  73.14; ppl:  2.50; xent: 0.92; lr: 0.00010; 9517/4061 tok/s;   7066 sec
[2021-04-23 05:43:16,675 INFO] Step 32600/50000; acc:  72.76; ppl:  2.54; xent: 0.93; lr: 0.00010; 9445/3962 tok/s;   7077 sec
[2021-04-23 05:43:27,558 INFO] Step 32650/50000; acc:  72.90; ppl:  2.51; xent: 0.92; lr: 0.00010; 9314/3908 tok/s;   7088 sec
[2021-04-23 05:43:38,885 INFO] Step 32700/50000; acc:  72.64; ppl:  2.52; xent: 0.93; lr: 0.00010; 9292/3830 tok/s;   7099 sec
[2021-04-23 05:43:48,785 INFO] Step 32750/50000; acc:  73.49; ppl:  2.46; xent: 0.90; lr: 0.00010; 9737/4185 tok/s;   7109 sec
[2021-04-23 05:43:59,673 INFO] Step 32800/50000; acc:  72.62; ppl:  2.53; xent: 0.93; lr: 0.00010; 9564/3950 tok/s;   7120 sec
[2021-04-23 05:43:59,681 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 05:44:10,718 INFO] Step 32850/50000; acc:  72.94; ppl:  2.49; xent: 0.91; lr: 0.00010; 9067/3946 tok/s;   7131 sec
[2021-04-23 05:44:21,519 INFO] Step 32900/50000; acc:  72.39; ppl:  2.54; xent: 0.93; lr: 0.00010; 9418/3878 tok/s;   7142 sec
[2021-04-23 05:44:32,923 INFO] Step 32950/50000; acc:  72.64; ppl:  2.52; xent: 0.93; lr: 0.00010; 8929/3796 tok/s;   7153 sec
[2021-04-23 05:44:43,562 INFO] Step 33000/50000; acc:  72.66; ppl:  2.54; xent: 0.93; lr: 0.00010; 9598/4104 tok/s;   7164 sec
[2021-04-23 05:44:54,126 INFO] Step 33050/50000; acc:  72.75; ppl:  2.52; xent: 0.92; lr: 0.00010; 9677/4005 tok/s;   7174 sec
[2021-04-23 05:45:04,496 INFO] Step 33100/50000; acc:  73.29; ppl:  2.48; xent: 0.91; lr: 0.00010; 9583/4117 tok/s;   7185 sec
[2021-04-23 05:45:15,300 INFO] Step 33150/50000; acc:  72.84; ppl:  2.53; xent: 0.93; lr: 0.00010; 9620/3875 tok/s;   7196 sec
[2021-04-23 05:45:26,520 INFO] Step 33200/50000; acc:  73.02; ppl:  2.48; xent: 0.91; lr: 0.00010; 8956/3884 tok/s;   7207 sec
[2021-04-23 05:45:37,009 INFO] Step 33250/50000; acc:  72.75; ppl:  2.52; xent: 0.92; lr: 0.00010; 9817/4015 tok/s;   7217 sec
[2021-04-23 05:45:48,075 INFO] Step 33300/50000; acc:  72.88; ppl:  2.52; xent: 0.92; lr: 0.00010; 9317/3852 tok/s;   7228 sec
[2021-04-23 05:45:58,615 INFO] Step 33350/50000; acc:  73.27; ppl:  2.50; xent: 0.92; lr: 0.00010; 9447/4087 tok/s;   7239 sec
[2021-04-23 05:46:09,724 INFO] Step 33400/50000; acc:  72.94; ppl:  2.49; xent: 0.91; lr: 0.00010; 9298/3838 tok/s;   7250 sec
[2021-04-23 05:46:20,196 INFO] Step 33450/50000; acc:  73.39; ppl:  2.49; xent: 0.91; lr: 0.00010; 9649/4061 tok/s;   7260 sec
[2021-04-23 05:46:30,729 INFO] Step 33500/50000; acc:  72.76; ppl:  2.53; xent: 0.93; lr: 0.00010; 9849/4047 tok/s;   7271 sec
[2021-04-23 05:46:38,384 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 05:46:41,382 INFO] Step 33550/50000; acc:  73.35; ppl:  2.48; xent: 0.91; lr: 0.00010; 9171/4060 tok/s;   7282 sec
[2021-04-23 05:46:52,171 INFO] Step 33600/50000; acc:  73.00; ppl:  2.49; xent: 0.91; lr: 0.00010; 9647/3905 tok/s;   7292 sec
[2021-04-23 05:47:03,171 INFO] Step 33650/50000; acc:  72.80; ppl:  2.50; xent: 0.92; lr: 0.00010; 9093/3828 tok/s;   7303 sec
[2021-04-23 05:47:14,458 INFO] Step 33700/50000; acc:  72.99; ppl:  2.51; xent: 0.92; lr: 0.00010; 8932/3915 tok/s;   7315 sec
[2021-04-23 05:47:25,325 INFO] Step 33750/50000; acc:  72.67; ppl:  2.52; xent: 0.93; lr: 0.00010; 9423/3970 tok/s;   7326 sec
[2021-04-23 05:47:36,110 INFO] Step 33800/50000; acc:  72.93; ppl:  2.51; xent: 0.92; lr: 0.00010; 9467/3969 tok/s;   7336 sec
[2021-04-23 05:47:46,440 INFO] Step 33850/50000; acc:  73.01; ppl:  2.51; xent: 0.92; lr: 0.00010; 9967/4113 tok/s;   7347 sec
[2021-04-23 05:47:57,224 INFO] Step 33900/50000; acc:  73.25; ppl:  2.45; xent: 0.90; lr: 0.00010; 9344/3876 tok/s;   7358 sec
[2021-04-23 05:48:08,600 INFO] Step 33950/50000; acc:  72.88; ppl:  2.51; xent: 0.92; lr: 0.00010; 9066/3817 tok/s;   7369 sec
[2021-04-23 05:48:19,190 INFO] Step 34000/50000; acc:  73.47; ppl:  2.46; xent: 0.90; lr: 0.00010; 9381/3978 tok/s;   7379 sec
[2021-04-23 05:48:30,098 INFO] Step 34050/50000; acc:  72.87; ppl:  2.51; xent: 0.92; lr: 0.00010; 9505/3945 tok/s;   7390 sec
[2021-04-23 05:48:41,421 INFO] Step 34100/50000; acc:  73.22; ppl:  2.50; xent: 0.91; lr: 0.00010; 9066/3777 tok/s;   7402 sec
[2021-04-23 05:48:52,068 INFO] Step 34150/50000; acc:  73.45; ppl:  2.46; xent: 0.90; lr: 0.00010; 9402/3951 tok/s;   7412 sec
[2021-04-23 05:49:02,361 INFO] Step 34200/50000; acc:  73.28; ppl:  2.48; xent: 0.91; lr: 0.00010; 9934/4166 tok/s;   7423 sec
[2021-04-23 05:49:13,022 INFO] Step 34250/50000; acc:  72.86; ppl:  2.50; xent: 0.91; lr: 0.00010; 9455/4025 tok/s;   7433 sec
[2021-04-23 05:49:17,714 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 05:49:23,972 INFO] Step 34300/50000; acc:  72.91; ppl:  2.49; xent: 0.91; lr: 0.00010; 9568/3948 tok/s;   7444 sec
[2021-04-23 05:49:34,315 INFO] Step 34350/50000; acc:  73.42; ppl:  2.45; xent: 0.89; lr: 0.00010; 9456/4056 tok/s;   7455 sec
[2021-04-23 05:49:46,232 INFO] Step 34400/50000; acc:  72.67; ppl:  2.51; xent: 0.92; lr: 0.00010; 8683/3657 tok/s;   7467 sec
[2021-04-23 05:49:56,591 INFO] Step 34450/50000; acc:  73.44; ppl:  2.46; xent: 0.90; lr: 0.00010; 9626/4136 tok/s;   7477 sec
[2021-04-23 05:50:07,558 INFO] Step 34500/50000; acc:  72.90; ppl:  2.50; xent: 0.92; lr: 0.00010; 9207/3926 tok/s;   7488 sec
[2021-04-23 05:50:17,847 INFO] Step 34550/50000; acc:  73.20; ppl:  2.48; xent: 0.91; lr: 0.00010; 9940/4139 tok/s;   7498 sec
[2021-04-23 05:50:28,641 INFO] Step 34600/50000; acc:  73.08; ppl:  2.49; xent: 0.91; lr: 0.00010; 9606/3888 tok/s;   7509 sec
[2021-04-23 05:50:39,882 INFO] Step 34650/50000; acc:  72.94; ppl:  2.48; xent: 0.91; lr: 0.00010; 9158/3814 tok/s;   7520 sec
[2021-04-23 05:50:50,831 INFO] Step 34700/50000; acc:  73.15; ppl:  2.47; xent: 0.91; lr: 0.00010; 9118/3948 tok/s;   7531 sec
[2021-04-23 05:51:01,228 INFO] Step 34750/50000; acc:  73.30; ppl:  2.48; xent: 0.91; lr: 0.00010; 9901/4068 tok/s;   7542 sec
[2021-04-23 05:51:11,517 INFO] Step 34800/50000; acc:  73.52; ppl:  2.46; xent: 0.90; lr: 0.00010; 9663/4152 tok/s;   7552 sec
[2021-04-23 05:51:23,059 INFO] Step 34850/50000; acc:  73.15; ppl:  2.49; xent: 0.91; lr: 0.00010; 8988/3700 tok/s;   7563 sec
[2021-04-23 05:51:33,848 INFO] Step 34900/50000; acc:  73.56; ppl:  2.45; xent: 0.89; lr: 0.00010; 9519/3927 tok/s;   7574 sec
[2021-04-23 05:51:44,108 INFO] Step 34950/50000; acc:  73.44; ppl:  2.45; xent: 0.90; lr: 0.00010; 9688/4102 tok/s;   7584 sec
[2021-04-23 05:51:54,979 INFO] Step 35000/50000; acc:  73.12; ppl:  2.48; xent: 0.91; lr: 0.00010; 9452/4009 tok/s;   7595 sec
[2021-04-23 05:51:54,983 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/strict/valid.txt, align=None)...
[2021-04-23 05:52:03,749 INFO] Validation perplexity: 2.91017
[2021-04-23 05:52:03,749 INFO] Validation accuracy: 70.1503
[2021-04-23 05:52:03,751 INFO] Saving checkpoint ../models/default_params/strict_ops/model_step_35000.pt
[2021-04-23 05:52:05,896 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 05:52:14,835 INFO] Step 35050/50000; acc:  73.45; ppl:  2.44; xent: 0.89; lr: 0.00010; 5107/2144 tok/s;   7615 sec
[2021-04-23 05:52:25,947 INFO] Step 35100/50000; acc:  72.62; ppl:  2.52; xent: 0.92; lr: 0.00010; 9381/3846 tok/s;   7626 sec
[2021-04-23 05:52:36,866 INFO] Step 35150/50000; acc:  73.64; ppl:  2.42; xent: 0.89; lr: 0.00010; 8859/3997 tok/s;   7637 sec
[2021-04-23 05:52:47,586 INFO] Step 35200/50000; acc:  72.90; ppl:  2.51; xent: 0.92; lr: 0.00010; 9691/3983 tok/s;   7648 sec
[2021-04-23 05:52:57,904 INFO] Step 35250/50000; acc:  73.66; ppl:  2.43; xent: 0.89; lr: 0.00010; 9658/4107 tok/s;   7658 sec
[2021-04-23 05:53:08,775 INFO] Step 35300/50000; acc:  73.32; ppl:  2.47; xent: 0.90; lr: 0.00010; 9316/3924 tok/s;   7669 sec
[2021-04-23 05:53:19,919 INFO] Step 35350/50000; acc:  73.19; ppl:  2.47; xent: 0.91; lr: 0.00010; 9397/3789 tok/s;   7680 sec
[2021-04-23 05:53:30,796 INFO] Step 35400/50000; acc:  73.26; ppl:  2.46; xent: 0.90; lr: 0.00010; 9404/3936 tok/s;   7691 sec
[2021-04-23 05:53:41,840 INFO] Step 35450/50000; acc:  73.36; ppl:  2.47; xent: 0.90; lr: 0.00010; 9274/3869 tok/s;   7702 sec
[2021-04-23 05:53:52,429 INFO] Step 35500/50000; acc:  73.58; ppl:  2.44; xent: 0.89; lr: 0.00010; 9449/4057 tok/s;   7713 sec
[2021-04-23 05:54:03,191 INFO] Step 35550/50000; acc:  73.25; ppl:  2.49; xent: 0.91; lr: 0.00010; 9501/3979 tok/s;   7723 sec
[2021-04-23 05:54:14,150 INFO] Step 35600/50000; acc:  73.95; ppl:  2.40; xent: 0.87; lr: 0.00010; 9178/3860 tok/s;   7734 sec
[2021-04-23 05:54:24,499 INFO] Step 35650/50000; acc:  73.54; ppl:  2.47; xent: 0.91; lr: 0.00010; 9960/4088 tok/s;   7745 sec
[2021-04-23 05:54:35,109 INFO] Step 35700/50000; acc:  73.55; ppl:  2.44; xent: 0.89; lr: 0.00010; 9618/4045 tok/s;   7755 sec
[2021-04-23 05:54:37,559 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 05:54:45,825 INFO] Step 35750/50000; acc:  73.70; ppl:  2.44; xent: 0.89; lr: 0.00010; 9357/4047 tok/s;   7766 sec
[2021-04-23 05:54:56,587 INFO] Step 35800/50000; acc:  73.51; ppl:  2.44; xent: 0.89; lr: 0.00010; 9557/3933 tok/s;   7777 sec
[2021-04-23 05:55:07,453 INFO] Step 35850/50000; acc:  73.21; ppl:  2.46; xent: 0.90; lr: 0.00010; 9242/3923 tok/s;   7788 sec
[2021-04-23 05:55:18,637 INFO] Step 35900/50000; acc:  73.04; ppl:  2.48; xent: 0.91; lr: 0.00010; 9324/3960 tok/s;   7799 sec
[2021-04-23 05:55:28,675 INFO] Step 35950/50000; acc:  73.69; ppl:  2.42; xent: 0.88; lr: 0.00010; 9667/4200 tok/s;   7809 sec
[2021-04-23 05:55:39,274 INFO] Step 36000/50000; acc:  73.51; ppl:  2.47; xent: 0.90; lr: 0.00010; 9736/4028 tok/s;   7820 sec
[2021-04-23 05:55:49,513 INFO] Step 36050/50000; acc:  73.61; ppl:  2.44; xent: 0.89; lr: 0.00010; 9888/4088 tok/s;   7830 sec
[2021-04-23 05:56:01,074 INFO] Step 36100/50000; acc:  73.54; ppl:  2.44; xent: 0.89; lr: 0.00010; 8882/3719 tok/s;   7841 sec
[2021-04-23 05:56:12,056 INFO] Step 36150/50000; acc:  73.45; ppl:  2.46; xent: 0.90; lr: 0.00010; 9351/3883 tok/s;   7852 sec
[2021-04-23 05:56:22,699 INFO] Step 36200/50000; acc:  73.46; ppl:  2.45; xent: 0.90; lr: 0.00010; 9624/3978 tok/s;   7863 sec
[2021-04-23 05:56:33,296 INFO] Step 36250/50000; acc:  73.47; ppl:  2.46; xent: 0.90; lr: 0.00010; 9657/4071 tok/s;   7874 sec
[2021-04-23 05:56:44,153 INFO] Step 36300/50000; acc:  73.85; ppl:  2.41; xent: 0.88; lr: 0.00010; 9204/3923 tok/s;   7884 sec
[2021-04-23 05:56:55,173 INFO] Step 36350/50000; acc:  73.54; ppl:  2.44; xent: 0.89; lr: 0.00010; 9384/3901 tok/s;   7895 sec
[2021-04-23 05:57:05,230 INFO] Step 36400/50000; acc:  73.92; ppl:  2.41; xent: 0.88; lr: 0.00010; 9807/4166 tok/s;   7906 sec
[2021-04-23 05:57:15,804 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 05:57:16,162 INFO] Step 36450/50000; acc:  73.58; ppl:  2.46; xent: 0.90; lr: 0.00010; 9487/3919 tok/s;   7916 sec
[2021-04-23 05:57:27,220 INFO] Step 36500/50000; acc:  73.55; ppl:  2.43; xent: 0.89; lr: 0.00010; 9264/3946 tok/s;   7928 sec
[2021-04-23 05:57:37,900 INFO] Step 36550/50000; acc:  73.36; ppl:  2.44; xent: 0.89; lr: 0.00010; 9387/3939 tok/s;   7938 sec
[2021-04-23 05:57:49,155 INFO] Step 36600/50000; acc:  73.58; ppl:  2.44; xent: 0.89; lr: 0.00010; 9024/3860 tok/s;   7949 sec
[2021-04-23 05:57:59,781 INFO] Step 36650/50000; acc:  73.54; ppl:  2.46; xent: 0.90; lr: 0.00010; 9514/4066 tok/s;   7960 sec
[2021-04-23 05:58:10,553 INFO] Step 36700/50000; acc:  73.34; ppl:  2.46; xent: 0.90; lr: 0.00010; 9697/3942 tok/s;   7971 sec
[2021-04-23 05:58:20,859 INFO] Step 36750/50000; acc:  73.95; ppl:  2.40; xent: 0.87; lr: 0.00010; 9419/4152 tok/s;   7981 sec
[2021-04-23 05:58:31,668 INFO] Step 36800/50000; acc:  73.48; ppl:  2.45; xent: 0.89; lr: 0.00010; 9726/3873 tok/s;   7992 sec
[2021-04-23 05:58:43,044 INFO] Step 36850/50000; acc:  73.91; ppl:  2.42; xent: 0.88; lr: 0.00010; 8891/3812 tok/s;   8003 sec
[2021-04-23 05:58:53,416 INFO] Step 36900/50000; acc:  73.81; ppl:  2.42; xent: 0.88; lr: 0.00010; 9691/4061 tok/s;   8014 sec
[2021-04-23 05:59:04,391 INFO] Step 36950/50000; acc:  73.81; ppl:  2.43; xent: 0.89; lr: 0.00010; 9432/3899 tok/s;   8025 sec
[2021-04-23 05:59:15,254 INFO] Step 37000/50000; acc:  73.57; ppl:  2.46; xent: 0.90; lr: 0.00010; 9387/3975 tok/s;   8036 sec
[2021-04-23 05:59:26,443 INFO] Step 37050/50000; acc:  73.75; ppl:  2.41; xent: 0.88; lr: 0.00010; 9196/3800 tok/s;   8047 sec
[2021-04-23 05:59:36,527 INFO] Step 37100/50000; acc:  74.16; ppl:  2.40; xent: 0.88; lr: 0.00010; 9875/4206 tok/s;   8057 sec
[2021-04-23 05:59:46,943 INFO] Step 37150/50000; acc:  73.70; ppl:  2.42; xent: 0.88; lr: 0.00010; 9797/4064 tok/s;   8067 sec
[2021-04-23 05:59:54,477 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 05:59:57,789 INFO] Step 37200/50000; acc:  73.89; ppl:  2.42; xent: 0.88; lr: 0.00010; 9233/4008 tok/s;   8078 sec
[2021-04-23 06:00:08,644 INFO] Step 37250/50000; acc:  73.65; ppl:  2.42; xent: 0.88; lr: 0.00010; 9560/3894 tok/s;   8089 sec
[2021-04-23 06:00:20,124 INFO] Step 37300/50000; acc:  73.21; ppl:  2.44; xent: 0.89; lr: 0.00010; 8895/3702 tok/s;   8100 sec
[2021-04-23 06:00:30,832 INFO] Step 37350/50000; acc:  73.97; ppl:  2.42; xent: 0.88; lr: 0.00010; 9269/4093 tok/s;   8111 sec
[2021-04-23 06:00:41,514 INFO] Step 37400/50000; acc:  73.42; ppl:  2.44; xent: 0.89; lr: 0.00010; 9576/4042 tok/s;   8122 sec
[2021-04-23 06:00:52,307 INFO] Step 37450/50000; acc:  73.72; ppl:  2.43; xent: 0.89; lr: 0.00010; 9340/3948 tok/s;   8133 sec
[2021-04-23 06:01:02,965 INFO] Step 37500/50000; acc:  73.41; ppl:  2.45; xent: 0.90; lr: 0.00010; 9887/4033 tok/s;   8143 sec
[2021-04-23 06:01:13,264 INFO] Step 37550/50000; acc:  74.44; ppl:  2.35; xent: 0.85; lr: 0.00010; 9564/3989 tok/s;   8154 sec
[2021-04-23 06:01:24,940 INFO] Step 37600/50000; acc:  73.50; ppl:  2.44; xent: 0.89; lr: 0.00010; 8929/3740 tok/s;   8165 sec
[2021-04-23 06:01:35,408 INFO] Step 37650/50000; acc:  74.27; ppl:  2.38; xent: 0.87; lr: 0.00010; 9544/4026 tok/s;   8176 sec
[2021-04-23 06:01:46,207 INFO] Step 37700/50000; acc:  74.05; ppl:  2.41; xent: 0.88; lr: 0.00010; 9389/3975 tok/s;   8186 sec
[2021-04-23 06:01:57,577 INFO] Step 37750/50000; acc:  73.98; ppl:  2.41; xent: 0.88; lr: 0.00010; 9058/3789 tok/s;   8198 sec
[2021-04-23 06:02:08,285 INFO] Step 37800/50000; acc:  74.09; ppl:  2.40; xent: 0.87; lr: 0.00010; 9589/3904 tok/s;   8209 sec
[2021-04-23 06:02:18,550 INFO] Step 37850/50000; acc:  74.00; ppl:  2.40; xent: 0.88; lr: 0.00010; 9925/4208 tok/s;   8219 sec
[2021-04-23 06:02:29,179 INFO] Step 37900/50000; acc:  73.93; ppl:  2.40; xent: 0.88; lr: 0.00010; 9345/4024 tok/s;   8229 sec
[2021-04-23 06:02:33,408 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 06:02:39,792 INFO] Step 37950/50000; acc:  73.86; ppl:  2.40; xent: 0.88; lr: 0.00010; 9713/4027 tok/s;   8240 sec
[2021-04-23 06:02:50,443 INFO] Step 38000/50000; acc:  73.94; ppl:  2.39; xent: 0.87; lr: 0.00010; 9368/3970 tok/s;   8251 sec
[2021-04-23 06:03:02,477 INFO] Step 38050/50000; acc:  73.36; ppl:  2.43; xent: 0.89; lr: 0.00010; 8578/3642 tok/s;   8263 sec
[2021-04-23 06:03:12,779 INFO] Step 38100/50000; acc:  73.91; ppl:  2.40; xent: 0.88; lr: 0.00010; 9896/4147 tok/s;   8273 sec
[2021-04-23 06:03:23,419 INFO] Step 38150/50000; acc:  73.87; ppl:  2.41; xent: 0.88; lr: 0.00010; 9350/4037 tok/s;   8284 sec
[2021-04-23 06:03:33,770 INFO] Step 38200/50000; acc:  73.92; ppl:  2.40; xent: 0.88; lr: 0.00010; 9867/4133 tok/s;   8294 sec
[2021-04-23 06:03:44,335 INFO] Step 38250/50000; acc:  73.75; ppl:  2.40; xent: 0.88; lr: 0.00010; 9715/3971 tok/s;   8305 sec
[2021-04-23 06:03:55,808 INFO] Step 38300/50000; acc:  73.37; ppl:  2.44; xent: 0.89; lr: 0.00010; 9172/3747 tok/s;   8316 sec
[2021-04-23 06:04:06,630 INFO] Step 38350/50000; acc:  74.57; ppl:  2.37; xent: 0.86; lr: 0.00010; 8993/3971 tok/s;   8327 sec
[2021-04-23 06:04:17,007 INFO] Step 38400/50000; acc:  73.73; ppl:  2.41; xent: 0.88; lr: 0.00010; 10050/4061 tok/s;   8337 sec
[2021-04-23 06:04:27,446 INFO] Step 38450/50000; acc:  74.22; ppl:  2.38; xent: 0.87; lr: 0.00010; 9575/4102 tok/s;   8348 sec
[2021-04-23 06:04:38,757 INFO] Step 38500/50000; acc:  74.07; ppl:  2.39; xent: 0.87; lr: 0.00010; 8975/3767 tok/s;   8359 sec
[2021-04-23 06:04:49,507 INFO] Step 38550/50000; acc:  74.21; ppl:  2.39; xent: 0.87; lr: 0.00010; 9572/3952 tok/s;   8370 sec
[2021-04-23 06:05:00,090 INFO] Step 38600/50000; acc:  74.07; ppl:  2.40; xent: 0.87; lr: 0.00010; 9637/3990 tok/s;   8380 sec
[2021-04-23 06:05:10,879 INFO] Step 38650/50000; acc:  73.59; ppl:  2.42; xent: 0.88; lr: 0.00010; 9496/4033 tok/s;   8391 sec
[2021-04-23 06:05:12,013 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 06:05:21,314 INFO] Step 38700/50000; acc:  74.66; ppl:  2.34; xent: 0.85; lr: 0.00010; 9592/4087 tok/s;   8402 sec
[2021-04-23 06:05:32,263 INFO] Step 38750/50000; acc:  73.48; ppl:  2.44; xent: 0.89; lr: 0.00010; 9339/3863 tok/s;   8413 sec
[2021-04-23 06:05:43,349 INFO] Step 38800/50000; acc:  74.25; ppl:  2.36; xent: 0.86; lr: 0.00010; 8942/3963 tok/s;   8424 sec
[2021-04-23 06:05:53,985 INFO] Step 38850/50000; acc:  73.77; ppl:  2.42; xent: 0.88; lr: 0.00010; 9718/4015 tok/s;   8434 sec
[2021-04-23 06:06:04,507 INFO] Step 38900/50000; acc:  74.21; ppl:  2.39; xent: 0.87; lr: 0.00010; 9688/4038 tok/s;   8445 sec
[2021-04-23 06:06:14,997 INFO] Step 38950/50000; acc:  74.17; ppl:  2.37; xent: 0.86; lr: 0.00010; 9523/4055 tok/s;   8455 sec
[2021-04-23 06:06:25,980 INFO] Step 39000/50000; acc:  73.97; ppl:  2.40; xent: 0.87; lr: 0.00010; 9513/3852 tok/s;   8466 sec
[2021-04-23 06:06:36,756 INFO] Step 39050/50000; acc:  74.34; ppl:  2.38; xent: 0.87; lr: 0.00010; 9387/3964 tok/s;   8477 sec
[2021-04-23 06:06:47,993 INFO] Step 39100/50000; acc:  73.83; ppl:  2.41; xent: 0.88; lr: 0.00010; 9309/3856 tok/s;   8488 sec
[2021-04-23 06:06:58,260 INFO] Step 39150/50000; acc:  74.48; ppl:  2.34; xent: 0.85; lr: 0.00010; 9504/4143 tok/s;   8499 sec
[2021-04-23 06:07:09,092 INFO] Step 39200/50000; acc:  73.79; ppl:  2.41; xent: 0.88; lr: 0.00010; 9563/3958 tok/s;   8509 sec
[2021-04-23 06:07:20,185 INFO] Step 39250/50000; acc:  74.59; ppl:  2.33; xent: 0.85; lr: 0.00010; 9119/3808 tok/s;   8520 sec
[2021-04-23 06:07:30,453 INFO] Step 39300/50000; acc:  74.45; ppl:  2.39; xent: 0.87; lr: 0.00010; 9827/4094 tok/s;   8531 sec
[2021-04-23 06:07:41,201 INFO] Step 39350/50000; acc:  74.04; ppl:  2.37; xent: 0.86; lr: 0.00010; 9524/4013 tok/s;   8541 sec
[2021-04-23 06:07:43,279 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 06:07:51,845 INFO] Step 39400/50000; acc:  74.23; ppl:  2.39; xent: 0.87; lr: 0.00010; 9642/4077 tok/s;   8552 sec
[2021-04-23 06:08:02,450 INFO] Step 39450/50000; acc:  74.20; ppl:  2.37; xent: 0.86; lr: 0.00010; 9674/3974 tok/s;   8563 sec
[2021-04-23 06:08:13,401 INFO] Step 39500/50000; acc:  74.29; ppl:  2.38; xent: 0.87; lr: 0.00010; 9024/3932 tok/s;   8574 sec
[2021-04-23 06:08:24,208 INFO] Step 39550/50000; acc:  74.22; ppl:  2.38; xent: 0.87; lr: 0.00010; 9492/4031 tok/s;   8584 sec
[2021-04-23 06:08:34,579 INFO] Step 39600/50000; acc:  74.17; ppl:  2.38; xent: 0.87; lr: 0.00010; 9585/4089 tok/s;   8595 sec
[2021-04-23 06:08:45,005 INFO] Step 39650/50000; acc:  74.24; ppl:  2.38; xent: 0.87; lr: 0.00010; 9863/4096 tok/s;   8605 sec
[2021-04-23 06:08:55,485 INFO] Step 39700/50000; acc:  74.21; ppl:  2.39; xent: 0.87; lr: 0.00010; 9877/3991 tok/s;   8616 sec
[2021-04-23 06:09:06,868 INFO] Step 39750/50000; acc:  74.37; ppl:  2.34; xent: 0.85; lr: 0.00010; 8869/3793 tok/s;   8627 sec
[2021-04-23 06:09:18,027 INFO] Step 39800/50000; acc:  74.48; ppl:  2.36; xent: 0.86; lr: 0.00010; 9199/3853 tok/s;   8638 sec
[2021-04-23 06:09:28,490 INFO] Step 39850/50000; acc:  74.29; ppl:  2.36; xent: 0.86; lr: 0.00010; 9667/4029 tok/s;   8649 sec
[2021-04-23 06:09:39,418 INFO] Step 39900/50000; acc:  74.09; ppl:  2.39; xent: 0.87; lr: 0.00010; 9586/3937 tok/s;   8660 sec
[2021-04-23 06:09:50,084 INFO] Step 39950/50000; acc:  75.00; ppl:  2.31; xent: 0.84; lr: 0.00010; 9148/3980 tok/s;   8670 sec
[2021-04-23 06:10:01,210 INFO] Step 40000/50000; acc:  73.99; ppl:  2.39; xent: 0.87; lr: 0.00010; 9416/3846 tok/s;   8681 sec
[2021-04-23 06:10:01,216 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/strict/valid.txt, align=None)...
[2021-04-23 06:10:09,972 INFO] Validation perplexity: 2.95224
[2021-04-23 06:10:09,972 INFO] Validation accuracy: 70.235
[2021-04-23 06:10:09,974 INFO] Saving checkpoint ../models/default_params/strict_ops/model_step_40000.pt
[2021-04-23 06:10:20,694 INFO] Step 40050/50000; acc:  74.65; ppl:  2.35; xent: 0.85; lr: 0.00010; 5082/2165 tok/s;   8701 sec
[2021-04-23 06:10:30,770 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 06:10:31,438 INFO] Step 40100/50000; acc:  74.31; ppl:  2.37; xent: 0.86; lr: 0.00010; 9451/3980 tok/s;   8712 sec
[2021-04-23 06:10:42,490 INFO] Step 40150/50000; acc:  74.22; ppl:  2.35; xent: 0.86; lr: 0.00010; 9292/3923 tok/s;   8723 sec
[2021-04-23 06:10:53,364 INFO] Step 40200/50000; acc:  74.07; ppl:  2.39; xent: 0.87; lr: 0.00010; 9440/3890 tok/s;   8734 sec
[2021-04-23 06:11:04,386 INFO] Step 40250/50000; acc:  74.43; ppl:  2.35; xent: 0.86; lr: 0.00010; 9198/3964 tok/s;   8745 sec
[2021-04-23 06:11:14,933 INFO] Step 40300/50000; acc:  74.47; ppl:  2.36; xent: 0.86; lr: 0.00010; 9441/4068 tok/s;   8755 sec
[2021-04-23 06:11:25,563 INFO] Step 40350/50000; acc:  74.42; ppl:  2.37; xent: 0.86; lr: 0.00010; 9648/3992 tok/s;   8766 sec
[2021-04-23 06:11:35,851 INFO] Step 40400/50000; acc:  74.44; ppl:  2.36; xent: 0.86; lr: 0.00010; 9682/4174 tok/s;   8776 sec
[2021-04-23 06:11:46,577 INFO] Step 40450/50000; acc:  74.35; ppl:  2.36; xent: 0.86; lr: 0.00010; 9757/3876 tok/s;   8787 sec
[2021-04-23 06:11:58,148 INFO] Step 40500/50000; acc:  74.18; ppl:  2.36; xent: 0.86; lr: 0.00010; 8930/3788 tok/s;   8798 sec
[2021-04-23 06:12:08,275 INFO] Step 40550/50000; acc:  74.70; ppl:  2.32; xent: 0.84; lr: 0.00010; 9786/4145 tok/s;   8809 sec
[2021-04-23 06:12:19,206 INFO] Step 40600/50000; acc:  74.50; ppl:  2.35; xent: 0.86; lr: 0.00010; 9446/3931 tok/s;   8819 sec
[2021-04-23 06:12:30,094 INFO] Step 40650/50000; acc:  74.62; ppl:  2.35; xent: 0.85; lr: 0.00010; 9264/3965 tok/s;   8830 sec
[2021-04-23 06:12:41,200 INFO] Step 40700/50000; acc:  74.19; ppl:  2.36; xent: 0.86; lr: 0.00010; 9467/3821 tok/s;   8841 sec
[2021-04-23 06:12:51,310 INFO] Step 40750/50000; acc:  75.04; ppl:  2.30; xent: 0.83; lr: 0.00010; 9606/4146 tok/s;   8852 sec
[2021-04-23 06:13:01,995 INFO] Step 40800/50000; acc:  74.41; ppl:  2.37; xent: 0.86; lr: 0.00010; 9666/4014 tok/s;   8862 sec
[2021-04-23 06:13:09,002 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 06:13:12,755 INFO] Step 40850/50000; acc:  74.23; ppl:  2.35; xent: 0.85; lr: 0.00010; 9358/4034 tok/s;   8873 sec
[2021-04-23 06:13:23,389 INFO] Step 40900/50000; acc:  74.51; ppl:  2.33; xent: 0.85; lr: 0.00010; 9569/3965 tok/s;   8884 sec
[2021-04-23 06:13:34,905 INFO] Step 40950/50000; acc:  74.31; ppl:  2.36; xent: 0.86; lr: 0.00010; 8882/3708 tok/s;   8895 sec
[2021-04-23 06:13:45,938 INFO] Step 41000/50000; acc:  74.57; ppl:  2.35; xent: 0.86; lr: 0.00010; 9216/3987 tok/s;   8906 sec
[2021-04-23 06:13:56,624 INFO] Step 41050/50000; acc:  74.34; ppl:  2.37; xent: 0.86; lr: 0.00010; 9558/4008 tok/s;   8917 sec
[2021-04-23 06:14:07,371 INFO] Step 41100/50000; acc:  74.73; ppl:  2.34; xent: 0.85; lr: 0.00010; 9243/4002 tok/s;   8928 sec
[2021-04-23 06:14:17,675 INFO] Step 41150/50000; acc:  74.46; ppl:  2.35; xent: 0.86; lr: 0.00010; 10028/4094 tok/s;   8938 sec
[2021-04-23 06:14:28,326 INFO] Step 41200/50000; acc:  74.88; ppl:  2.30; xent: 0.83; lr: 0.00010; 9492/3903 tok/s;   8949 sec
[2021-04-23 06:14:39,912 INFO] Step 41250/50000; acc:  74.26; ppl:  2.36; xent: 0.86; lr: 0.00010; 8946/3771 tok/s;   8960 sec
[2021-04-23 06:14:50,692 INFO] Step 41300/50000; acc:  74.70; ppl:  2.34; xent: 0.85; lr: 0.00010; 9494/3908 tok/s;   8971 sec
[2021-04-23 06:15:01,168 INFO] Step 41350/50000; acc:  74.94; ppl:  2.33; xent: 0.84; lr: 0.00010; 9529/4082 tok/s;   8981 sec
[2021-04-23 06:15:12,507 INFO] Step 41400/50000; acc:  74.64; ppl:  2.34; xent: 0.85; lr: 0.00010; 9076/3806 tok/s;   8993 sec
[2021-04-23 06:15:23,336 INFO] Step 41450/50000; acc:  74.87; ppl:  2.31; xent: 0.84; lr: 0.00010; 9359/3864 tok/s;   9004 sec
[2021-04-23 06:15:33,788 INFO] Step 41500/50000; acc:  74.26; ppl:  2.37; xent: 0.86; lr: 0.00010; 9960/4141 tok/s;   9014 sec
[2021-04-23 06:15:44,322 INFO] Step 41550/50000; acc:  74.92; ppl:  2.30; xent: 0.83; lr: 0.00010; 9216/4048 tok/s;   9025 sec
[2021-04-23 06:15:48,169 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 06:15:54,842 INFO] Step 41600/50000; acc:  74.69; ppl:  2.33; xent: 0.84; lr: 0.00010; 9889/4077 tok/s;   9035 sec
[2021-04-23 06:16:05,617 INFO] Step 41650/50000; acc:  74.79; ppl:  2.33; xent: 0.85; lr: 0.00010; 9335/3906 tok/s;   9046 sec
[2021-04-23 06:16:17,241 INFO] Step 41700/50000; acc:  74.61; ppl:  2.34; xent: 0.85; lr: 0.00010; 8682/3758 tok/s;   9058 sec
[2021-04-23 06:16:27,694 INFO] Step 41750/50000; acc:  74.55; ppl:  2.35; xent: 0.85; lr: 0.00010; 9791/4151 tok/s;   9068 sec
[2021-04-23 06:16:38,499 INFO] Step 41800/50000; acc:  74.65; ppl:  2.33; xent: 0.85; lr: 0.00010; 9441/3933 tok/s;   9079 sec
[2021-04-23 06:16:48,946 INFO] Step 41850/50000; acc:  74.38; ppl:  2.35; xent: 0.85; lr: 0.00010; 9745/4109 tok/s;   9089 sec
[2021-04-23 06:16:59,224 INFO] Step 41900/50000; acc:  74.92; ppl:  2.30; xent: 0.83; lr: 0.00010; 9861/4032 tok/s;   9100 sec
[2021-04-23 06:17:10,468 INFO] Step 41950/50000; acc:  74.74; ppl:  2.33; xent: 0.84; lr: 0.00010; 9150/3821 tok/s;   9111 sec
[2021-04-23 06:17:21,312 INFO] Step 42000/50000; acc:  74.74; ppl:  2.31; xent: 0.84; lr: 0.00010; 9227/3944 tok/s;   9122 sec
[2021-04-23 06:17:31,794 INFO] Step 42050/50000; acc:  74.59; ppl:  2.34; xent: 0.85; lr: 0.00010; 9895/4073 tok/s;   9132 sec
[2021-04-23 06:17:42,527 INFO] Step 42100/50000; acc:  74.95; ppl:  2.32; xent: 0.84; lr: 0.00010; 9523/3982 tok/s;   9143 sec
[2021-04-23 06:17:53,624 INFO] Step 42150/50000; acc:  75.05; ppl:  2.30; xent: 0.83; lr: 0.00010; 9027/3842 tok/s;   9154 sec
[2021-04-23 06:18:04,246 INFO] Step 42200/50000; acc:  74.96; ppl:  2.31; xent: 0.84; lr: 0.00010; 9662/3992 tok/s;   9165 sec
[2021-04-23 06:18:14,851 INFO] Step 42250/50000; acc:  74.72; ppl:  2.32; xent: 0.84; lr: 0.00010; 9504/4030 tok/s;   9175 sec
[2021-04-23 06:18:19,492 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 06:18:25,911 INFO] Step 42300/50000; acc:  74.47; ppl:  2.35; xent: 0.86; lr: 0.00010; 9483/3921 tok/s;   9186 sec
[2021-04-23 06:18:36,238 INFO] Step 42350/50000; acc:  75.55; ppl:  2.26; xent: 0.81; lr: 0.00010; 9449/4103 tok/s;   9197 sec
[2021-04-23 06:18:46,949 INFO] Step 42400/50000; acc:  74.08; ppl:  2.37; xent: 0.86; lr: 0.00010; 9654/3979 tok/s;   9207 sec
[2021-04-23 06:18:58,174 INFO] Step 42450/50000; acc:  75.00; ppl:  2.31; xent: 0.84; lr: 0.00010; 8884/3922 tok/s;   9218 sec
[2021-04-23 06:19:08,618 INFO] Step 42500/50000; acc:  74.81; ppl:  2.32; xent: 0.84; lr: 0.00010; 9687/4070 tok/s;   9229 sec
[2021-04-23 06:19:18,812 INFO] Step 42550/50000; acc:  74.84; ppl:  2.32; xent: 0.84; lr: 0.00010; 10025/4125 tok/s;   9239 sec
[2021-04-23 06:19:29,520 INFO] Step 42600/50000; acc:  74.80; ppl:  2.31; xent: 0.84; lr: 0.00010; 9568/3971 tok/s;   9250 sec
[2021-04-23 06:19:40,661 INFO] Step 42650/50000; acc:  74.48; ppl:  2.32; xent: 0.84; lr: 0.00010; 9368/3866 tok/s;   9261 sec
[2021-04-23 06:19:51,326 INFO] Step 42700/50000; acc:  74.97; ppl:  2.29; xent: 0.83; lr: 0.00010; 9341/3971 tok/s;   9272 sec
[2021-04-23 06:20:02,370 INFO] Step 42750/50000; acc:  74.78; ppl:  2.32; xent: 0.84; lr: 0.00010; 9290/3880 tok/s;   9283 sec
[2021-04-23 06:20:13,013 INFO] Step 42800/50000; acc:  75.01; ppl:  2.29; xent: 0.83; lr: 0.00010; 9392/4031 tok/s;   9293 sec
[2021-04-23 06:20:23,855 INFO] Step 42850/50000; acc:  74.44; ppl:  2.34; xent: 0.85; lr: 0.00010; 9539/3951 tok/s;   9304 sec
[2021-04-23 06:20:35,176 INFO] Step 42900/50000; acc:  75.04; ppl:  2.30; xent: 0.83; lr: 0.00010; 9118/3759 tok/s;   9315 sec
[2021-04-23 06:20:45,274 INFO] Step 42950/50000; acc:  75.35; ppl:  2.28; xent: 0.83; lr: 0.00010; 9833/4127 tok/s;   9326 sec
[2021-04-23 06:20:55,946 INFO] Step 43000/50000; acc:  74.82; ppl:  2.31; xent: 0.84; lr: 0.00010; 9582/4055 tok/s;   9336 sec
[2021-04-23 06:20:57,732 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 06:21:06,718 INFO] Step 43050/50000; acc:  75.19; ppl:  2.29; xent: 0.83; lr: 0.00010; 9406/4016 tok/s;   9347 sec
[2021-04-23 06:21:17,720 INFO] Step 43100/50000; acc:  74.48; ppl:  2.33; xent: 0.85; lr: 0.00010; 9541/3867 tok/s;   9358 sec
[2021-04-23 06:21:28,522 INFO] Step 43150/50000; acc:  75.17; ppl:  2.27; xent: 0.82; lr: 0.00010; 8926/3967 tok/s;   9369 sec
[2021-04-23 06:21:39,625 INFO] Step 43200/50000; acc:  74.84; ppl:  2.33; xent: 0.85; lr: 0.00010; 9353/3918 tok/s;   9380 sec
[2021-04-23 06:21:49,786 INFO] Step 43250/50000; acc:  74.84; ppl:  2.30; xent: 0.83; lr: 0.00010; 9837/4174 tok/s;   9390 sec
[2021-04-23 06:22:00,113 INFO] Step 43300/50000; acc:  75.09; ppl:  2.29; xent: 0.83; lr: 0.00010; 9738/4120 tok/s;   9400 sec
[2021-04-23 06:22:10,942 INFO] Step 43350/50000; acc:  74.67; ppl:  2.31; xent: 0.84; lr: 0.00010; 9615/3896 tok/s;   9411 sec
[2021-04-23 06:22:22,567 INFO] Step 43400/50000; acc:  74.66; ppl:  2.32; xent: 0.84; lr: 0.00010; 8876/3706 tok/s;   9423 sec
[2021-04-23 06:22:33,275 INFO] Step 43450/50000; acc:  75.29; ppl:  2.29; xent: 0.83; lr: 0.00010; 9564/3996 tok/s;   9434 sec
[2021-04-23 06:22:44,011 INFO] Step 43500/50000; acc:  75.46; ppl:  2.27; xent: 0.82; lr: 0.00010; 9292/3954 tok/s;   9444 sec
[2021-04-23 06:22:54,520 INFO] Step 43550/50000; acc:  75.04; ppl:  2.32; xent: 0.84; lr: 0.00010; 9765/4058 tok/s;   9455 sec
[2021-04-23 06:23:05,127 INFO] Step 43600/50000; acc:  75.53; ppl:  2.26; xent: 0.81; lr: 0.00010; 9446/4021 tok/s;   9465 sec
[2021-04-23 06:23:16,177 INFO] Step 43650/50000; acc:  74.83; ppl:  2.31; xent: 0.84; lr: 0.00010; 9428/3877 tok/s;   9476 sec
[2021-04-23 06:23:26,607 INFO] Step 43700/50000; acc:  75.01; ppl:  2.30; xent: 0.83; lr: 0.00010; 9725/4055 tok/s;   9487 sec
[2021-04-23 06:23:35,902 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 06:23:37,137 INFO] Step 43750/50000; acc:  75.29; ppl:  2.27; xent: 0.82; lr: 0.00010; 9494/4048 tok/s;   9497 sec
[2021-04-23 06:23:47,911 INFO] Step 43800/50000; acc:  75.42; ppl:  2.27; xent: 0.82; lr: 0.00010; 9519/4009 tok/s;   9508 sec
[2021-04-23 06:23:58,729 INFO] Step 43850/50000; acc:  74.71; ppl:  2.31; xent: 0.84; lr: 0.00010; 9367/3902 tok/s;   9519 sec
[2021-04-23 06:24:10,190 INFO] Step 43900/50000; acc:  74.58; ppl:  2.32; xent: 0.84; lr: 0.00010; 9048/3856 tok/s;   9530 sec
[2021-04-23 06:24:20,427 INFO] Step 43950/50000; acc:  75.59; ppl:  2.26; xent: 0.82; lr: 0.00010; 9494/4151 tok/s;   9541 sec
[2021-04-23 06:24:31,125 INFO] Step 44000/50000; acc:  74.82; ppl:  2.32; xent: 0.84; lr: 0.00010; 9702/3986 tok/s;   9551 sec
[2021-04-23 06:24:41,760 INFO] Step 44050/50000; acc:  75.14; ppl:  2.29; xent: 0.83; lr: 0.00010; 9430/4046 tok/s;   9562 sec
[2021-04-23 06:24:52,305 INFO] Step 44100/50000; acc:  75.12; ppl:  2.27; xent: 0.82; lr: 0.00010; 9709/3906 tok/s;   9573 sec
[2021-04-23 06:25:03,967 INFO] Step 44150/50000; acc:  75.05; ppl:  2.29; xent: 0.83; lr: 0.00010; 8881/3769 tok/s;   9584 sec
[2021-04-23 06:25:14,388 INFO] Step 44200/50000; acc:  75.25; ppl:  2.28; xent: 0.83; lr: 0.00010; 9769/4027 tok/s;   9595 sec
[2021-04-23 06:25:25,406 INFO] Step 44250/50000; acc:  75.34; ppl:  2.27; xent: 0.82; lr: 0.00010; 9334/3931 tok/s;   9606 sec
[2021-04-23 06:25:36,149 INFO] Step 44300/50000; acc:  75.22; ppl:  2.28; xent: 0.82; lr: 0.00010; 9263/3989 tok/s;   9616 sec
[2021-04-23 06:25:47,142 INFO] Step 44350/50000; acc:  75.21; ppl:  2.28; xent: 0.82; lr: 0.00010; 9383/3829 tok/s;   9627 sec
[2021-04-23 06:25:57,247 INFO] Step 44400/50000; acc:  75.42; ppl:  2.27; xent: 0.82; lr: 0.00010; 9843/4195 tok/s;   9638 sec
[2021-04-23 06:26:08,336 INFO] Step 44450/50000; acc:  74.97; ppl:  2.29; xent: 0.83; lr: 0.00010; 9284/3902 tok/s;   9649 sec
[2021-04-23 06:26:14,590 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 06:26:19,090 INFO] Step 44500/50000; acc:  75.20; ppl:  2.27; xent: 0.82; lr: 0.00010; 9566/3992 tok/s;   9659 sec
[2021-04-23 06:26:29,665 INFO] Step 44550/50000; acc:  75.41; ppl:  2.26; xent: 0.81; lr: 0.00010; 9497/3981 tok/s;   9670 sec
[2021-04-23 06:26:40,975 INFO] Step 44600/50000; acc:  74.99; ppl:  2.30; xent: 0.83; lr: 0.00010; 9009/3792 tok/s;   9681 sec
[2021-04-23 06:26:51,960 INFO] Step 44650/50000; acc:  75.42; ppl:  2.28; xent: 0.82; lr: 0.00010; 9159/3989 tok/s;   9692 sec
[2021-04-23 06:27:02,839 INFO] Step 44700/50000; acc:  74.68; ppl:  2.31; xent: 0.84; lr: 0.00010; 9599/3955 tok/s;   9703 sec
[2021-04-23 06:27:13,067 INFO] Step 44750/50000; acc:  75.76; ppl:  2.25; xent: 0.81; lr: 0.00010; 9455/4162 tok/s;   9713 sec
[2021-04-23 06:27:23,650 INFO] Step 44800/50000; acc:  74.98; ppl:  2.30; xent: 0.83; lr: 0.00010; 9919/4003 tok/s;   9724 sec
[2021-04-23 06:27:34,519 INFO] Step 44850/50000; acc:  75.51; ppl:  2.25; xent: 0.81; lr: 0.00010; 9334/3861 tok/s;   9735 sec
[2021-04-23 06:27:45,769 INFO] Step 44900/50000; acc:  75.66; ppl:  2.26; xent: 0.82; lr: 0.00010; 9011/3845 tok/s;   9746 sec
[2021-04-23 06:27:56,460 INFO] Step 44950/50000; acc:  75.26; ppl:  2.27; xent: 0.82; lr: 0.00010; 9609/3937 tok/s;   9757 sec
[2021-04-23 06:28:07,266 INFO] Step 45000/50000; acc:  75.49; ppl:  2.28; xent: 0.82; lr: 0.00010; 9470/3965 tok/s;   9768 sec
[2021-04-23 06:28:07,268 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/strict/valid.txt, align=None)...
[2021-04-23 06:28:16,017 INFO] Validation perplexity: 3.0121
[2021-04-23 06:28:16,018 INFO] Validation accuracy: 70.0298
[2021-04-23 06:28:16,020 INFO] Saving checkpoint ../models/default_params/strict_ops/model_step_45000.pt
[2021-04-23 06:28:27,768 INFO] Step 45050/50000; acc:  75.32; ppl:  2.28; xent: 0.82; lr: 0.00010; 4993/2106 tok/s;   9788 sec
[2021-04-23 06:28:38,315 INFO] Step 45100/50000; acc:  75.71; ppl:  2.23; xent: 0.80; lr: 0.00010; 9494/3984 tok/s;   9799 sec
[2021-04-23 06:28:48,681 INFO] Step 45150/50000; acc:  75.52; ppl:  2.27; xent: 0.82; lr: 0.00010; 9856/4138 tok/s;   9809 sec
[2021-04-23 06:28:59,543 INFO] Step 45200/50000; acc:  75.25; ppl:  2.26; xent: 0.82; lr: 0.00010; 9165/3960 tok/s;   9820 sec
[2021-04-23 06:29:02,884 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 06:29:09,986 INFO] Step 45250/50000; acc:  75.50; ppl:  2.26; xent: 0.81; lr: 0.00010; 9932/4093 tok/s;   9830 sec
[2021-04-23 06:29:21,084 INFO] Step 45300/50000; acc:  75.05; ppl:  2.28; xent: 0.83; lr: 0.00010; 9249/3816 tok/s;   9841 sec
[2021-04-23 06:29:32,209 INFO] Step 45350/50000; acc:  75.42; ppl:  2.27; xent: 0.82; lr: 0.00010; 8924/3930 tok/s;   9852 sec
[2021-04-23 06:29:42,598 INFO] Step 45400/50000; acc:  75.47; ppl:  2.27; xent: 0.82; lr: 0.00010; 9836/4143 tok/s;   9863 sec
[2021-04-23 06:29:53,503 INFO] Step 45450/50000; acc:  75.36; ppl:  2.27; xent: 0.82; lr: 0.00010; 9254/3909 tok/s;   9874 sec
[2021-04-23 06:30:04,166 INFO] Step 45500/50000; acc:  74.89; ppl:  2.30; xent: 0.83; lr: 0.00010; 9765/4041 tok/s;   9884 sec
[2021-04-23 06:30:14,389 INFO] Step 45550/50000; acc:  76.09; ppl:  2.20; xent: 0.79; lr: 0.00010; 9694/4038 tok/s;   9895 sec
[2021-04-23 06:30:25,965 INFO] Step 45600/50000; acc:  75.36; ppl:  2.27; xent: 0.82; lr: 0.00010; 8992/3737 tok/s;   9906 sec
[2021-04-23 06:30:36,682 INFO] Step 45650/50000; acc:  75.60; ppl:  2.25; xent: 0.81; lr: 0.00010; 9379/3948 tok/s;   9917 sec
[2021-04-23 06:30:47,115 INFO] Step 45700/50000; acc:  75.47; ppl:  2.25; xent: 0.81; lr: 0.00010; 9731/4102 tok/s;   9927 sec
[2021-04-23 06:30:57,911 INFO] Step 45750/50000; acc:  75.58; ppl:  2.26; xent: 0.81; lr: 0.00010; 9488/3965 tok/s;   9938 sec
[2021-04-23 06:31:09,278 INFO] Step 45800/50000; acc:  75.75; ppl:  2.24; xent: 0.81; lr: 0.00010; 9042/3753 tok/s;   9950 sec
[2021-04-23 06:31:19,657 INFO] Step 45850/50000; acc:  75.63; ppl:  2.25; xent: 0.81; lr: 0.00010; 9854/4078 tok/s;   9960 sec
[2021-04-23 06:31:30,249 INFO] Step 45900/50000; acc:  76.03; ppl:  2.23; xent: 0.80; lr: 0.00010; 9376/4028 tok/s;   9971 sec
[2021-04-23 06:31:34,436 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 06:31:41,235 INFO] Step 45950/50000; acc:  75.25; ppl:  2.27; xent: 0.82; lr: 0.00010; 9377/3929 tok/s;   9982 sec
[2021-04-23 06:31:51,883 INFO] Step 46000/50000; acc:  76.11; ppl:  2.21; xent: 0.79; lr: 0.00010; 9381/4013 tok/s;   9992 sec
[2021-04-23 06:32:02,885 INFO] Step 46050/50000; acc:  74.62; ppl:  2.30; xent: 0.83; lr: 0.00010; 9379/3879 tok/s;  10003 sec
[2021-04-23 06:32:14,232 INFO] Step 46100/50000; acc:  75.75; ppl:  2.25; xent: 0.81; lr: 0.00010; 8970/3868 tok/s;  10015 sec
[2021-04-23 06:32:24,570 INFO] Step 46150/50000; acc:  75.37; ppl:  2.26; xent: 0.81; lr: 0.00010; 9656/4140 tok/s;  10025 sec
[2021-04-23 06:32:34,974 INFO] Step 46200/50000; acc:  75.59; ppl:  2.25; xent: 0.81; lr: 0.00010; 9796/4029 tok/s;  10035 sec
[2021-04-23 06:32:45,569 INFO] Step 46250/50000; acc:  75.74; ppl:  2.24; xent: 0.81; lr: 0.00010; 9573/3990 tok/s;  10046 sec
[2021-04-23 06:32:56,988 INFO] Step 46300/50000; acc:  75.11; ppl:  2.27; xent: 0.82; lr: 0.00010; 9322/3806 tok/s;  10057 sec
[2021-04-23 06:33:07,118 INFO] Step 46350/50000; acc:  76.38; ppl:  2.19; xent: 0.78; lr: 0.00010; 9612/4140 tok/s;  10067 sec
[2021-04-23 06:33:18,226 INFO] Step 46400/50000; acc:  75.54; ppl:  2.26; xent: 0.81; lr: 0.00010; 9357/3868 tok/s;  10079 sec
[2021-04-23 06:33:28,997 INFO] Step 46450/50000; acc:  75.80; ppl:  2.23; xent: 0.80; lr: 0.00010; 9311/3985 tok/s;  10089 sec
[2021-04-23 06:33:39,665 INFO] Step 46500/50000; acc:  75.65; ppl:  2.24; xent: 0.81; lr: 0.00010; 9491/3976 tok/s;  10100 sec
[2021-04-23 06:33:51,023 INFO] Step 46550/50000; acc:  75.72; ppl:  2.22; xent: 0.80; lr: 0.00010; 9119/3784 tok/s;  10111 sec
[2021-04-23 06:34:01,373 INFO] Step 46600/50000; acc:  75.68; ppl:  2.25; xent: 0.81; lr: 0.00010; 9827/4014 tok/s;  10122 sec
[2021-04-23 06:34:12,167 INFO] Step 46650/50000; acc:  75.49; ppl:  2.24; xent: 0.81; lr: 0.00010; 9464/4032 tok/s;  10132 sec
[2021-04-23 06:34:13,313 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 06:34:22,683 INFO] Step 46700/50000; acc:  75.98; ppl:  2.22; xent: 0.80; lr: 0.00010; 9490/4088 tok/s;  10143 sec
[2021-04-23 06:34:33,463 INFO] Step 46750/50000; acc:  75.56; ppl:  2.24; xent: 0.81; lr: 0.00010; 9563/3915 tok/s;  10154 sec
[2021-04-23 06:34:44,526 INFO] Step 46800/50000; acc:  75.74; ppl:  2.22; xent: 0.80; lr: 0.00010; 8920/3900 tok/s;  10165 sec
[2021-04-23 06:34:55,770 INFO] Step 46850/50000; acc:  75.72; ppl:  2.25; xent: 0.81; lr: 0.00010; 9202/3890 tok/s;  10176 sec
[2021-04-23 06:35:06,201 INFO] Step 46900/50000; acc:  75.32; ppl:  2.27; xent: 0.82; lr: 0.00010; 9809/4073 tok/s;  10186 sec
[2021-04-23 06:35:16,596 INFO] Step 46950/50000; acc:  76.33; ppl:  2.21; xent: 0.79; lr: 0.00010; 9525/4096 tok/s;  10197 sec
[2021-04-23 06:35:27,465 INFO] Step 47000/50000; acc:  75.63; ppl:  2.24; xent: 0.81; lr: 0.00010; 9570/3858 tok/s;  10208 sec
[2021-04-23 06:35:38,570 INFO] Step 47050/50000; acc:  75.89; ppl:  2.22; xent: 0.80; lr: 0.00010; 9175/3850 tok/s;  10219 sec
[2021-04-23 06:35:49,439 INFO] Step 47100/50000; acc:  75.68; ppl:  2.25; xent: 0.81; lr: 0.00010; 9620/3999 tok/s;  10230 sec
[2021-04-23 06:35:59,925 INFO] Step 47150/50000; acc:  76.19; ppl:  2.19; xent: 0.78; lr: 0.00010; 9305/4016 tok/s;  10240 sec
[2021-04-23 06:36:10,762 INFO] Step 47200/50000; acc:  75.75; ppl:  2.25; xent: 0.81; lr: 0.00010; 9580/3960 tok/s;  10251 sec
[2021-04-23 06:36:21,307 INFO] Step 47250/50000; acc:  76.12; ppl:  2.20; xent: 0.79; lr: 0.00010; 9557/4053 tok/s;  10262 sec
[2021-04-23 06:36:32,001 INFO] Step 47300/50000; acc:  76.06; ppl:  2.21; xent: 0.79; lr: 0.00010; 9502/3955 tok/s;  10272 sec
[2021-04-23 06:36:42,449 INFO] Step 47350/50000; acc:  75.74; ppl:  2.24; xent: 0.80; lr: 0.00010; 9743/4064 tok/s;  10283 sec
[2021-04-23 06:36:51,254 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 06:36:53,209 INFO] Step 47400/50000; acc:  75.66; ppl:  2.24; xent: 0.81; lr: 0.00010; 9539/4025 tok/s;  10293 sec
[2021-04-23 06:37:03,859 INFO] Step 47450/50000; acc:  76.21; ppl:  2.20; xent: 0.79; lr: 0.00010; 9605/4005 tok/s;  10304 sec
[2021-04-23 06:37:14,661 INFO] Step 47500/50000; acc:  75.81; ppl:  2.23; xent: 0.80; lr: 0.00010; 9236/3913 tok/s;  10315 sec
[2021-04-23 06:37:26,105 INFO] Step 47550/50000; acc:  75.80; ppl:  2.23; xent: 0.80; lr: 0.00010; 8909/3834 tok/s;  10326 sec
[2021-04-23 06:37:36,660 INFO] Step 47600/50000; acc:  75.95; ppl:  2.22; xent: 0.80; lr: 0.00010; 9429/4068 tok/s;  10337 sec
[2021-04-23 06:37:47,272 INFO] Step 47650/50000; acc:  75.40; ppl:  2.26; xent: 0.82; lr: 0.00010; 9742/4017 tok/s;  10348 sec
[2021-04-23 06:37:57,850 INFO] Step 47700/50000; acc:  75.75; ppl:  2.23; xent: 0.80; lr: 0.00010; 9700/4059 tok/s;  10358 sec
[2021-04-23 06:38:08,351 INFO] Step 47750/50000; acc:  76.13; ppl:  2.19; xent: 0.78; lr: 0.00010; 9632/3911 tok/s;  10369 sec
[2021-04-23 06:38:19,776 INFO] Step 47800/50000; acc:  75.99; ppl:  2.22; xent: 0.80; lr: 0.00010; 9019/3860 tok/s;  10380 sec
[2021-04-23 06:38:30,020 INFO] Step 47850/50000; acc:  76.06; ppl:  2.20; xent: 0.79; lr: 0.00010; 9835/4087 tok/s;  10390 sec
[2021-04-23 06:38:41,156 INFO] Step 47900/50000; acc:  75.58; ppl:  2.24; xent: 0.81; lr: 0.00010; 9430/3922 tok/s;  10401 sec
[2021-04-23 06:38:51,729 INFO] Step 47950/50000; acc:  76.42; ppl:  2.17; xent: 0.77; lr: 0.00010; 9191/3985 tok/s;  10412 sec
[2021-04-23 06:39:03,049 INFO] Step 48000/50000; acc:  75.64; ppl:  2.24; xent: 0.81; lr: 0.00010; 9215/3764 tok/s;  10423 sec
[2021-04-23 06:39:13,189 INFO] Step 48050/50000; acc:  76.35; ppl:  2.20; xent: 0.79; lr: 0.00010; 9864/4208 tok/s;  10433 sec
[2021-04-23 06:39:23,998 INFO] Step 48100/50000; acc:  76.11; ppl:  2.21; xent: 0.79; lr: 0.00010; 9323/3944 tok/s;  10444 sec
[2021-04-23 06:39:30,041 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 06:39:34,911 INFO] Step 48150/50000; acc:  75.87; ppl:  2.22; xent: 0.80; lr: 0.00010; 9452/3944 tok/s;  10455 sec
[2021-04-23 06:39:45,576 INFO] Step 48200/50000; acc:  75.87; ppl:  2.22; xent: 0.80; lr: 0.00010; 9661/3937 tok/s;  10466 sec
[2021-04-23 06:39:57,052 INFO] Step 48250/50000; acc:  75.97; ppl:  2.22; xent: 0.80; lr: 0.00010; 8848/3759 tok/s;  10477 sec
[2021-04-23 06:40:07,891 INFO] Step 48300/50000; acc:  76.24; ppl:  2.20; xent: 0.79; lr: 0.00010; 9140/4055 tok/s;  10488 sec
[2021-04-23 06:40:18,756 INFO] Step 48350/50000; acc:  75.80; ppl:  2.23; xent: 0.80; lr: 0.00010; 9440/3916 tok/s;  10499 sec
[2021-04-23 06:40:28,971 INFO] Step 48400/50000; acc:  76.50; ppl:  2.19; xent: 0.79; lr: 0.00010; 9696/4174 tok/s;  10509 sec
[2021-04-23 06:40:39,622 INFO] Step 48450/50000; acc:  75.80; ppl:  2.23; xent: 0.80; lr: 0.00010; 9828/3972 tok/s;  10520 sec
[2021-04-23 06:40:50,685 INFO] Step 48500/50000; acc:  76.07; ppl:  2.21; xent: 0.79; lr: 0.00010; 9366/3819 tok/s;  10531 sec
[2021-04-23 06:41:01,690 INFO] Step 48550/50000; acc:  76.49; ppl:  2.18; xent: 0.78; lr: 0.00010; 9084/3933 tok/s;  10542 sec
[2021-04-23 06:41:12,120 INFO] Step 48600/50000; acc:  76.22; ppl:  2.19; xent: 0.79; lr: 0.00010; 9828/4054 tok/s;  10552 sec
[2021-04-23 06:41:22,883 INFO] Step 48650/50000; acc:  76.13; ppl:  2.21; xent: 0.79; lr: 0.00010; 9396/3973 tok/s;  10563 sec
[2021-04-23 06:41:34,132 INFO] Step 48700/50000; acc:  75.79; ppl:  2.23; xent: 0.80; lr: 0.00010; 9311/3838 tok/s;  10574 sec
[2021-04-23 06:41:44,491 INFO] Step 48750/50000; acc:  76.76; ppl:  2.13; xent: 0.76; lr: 0.00010; 9424/4032 tok/s;  10585 sec
[2021-04-23 06:41:54,900 INFO] Step 48800/50000; acc:  76.00; ppl:  2.22; xent: 0.80; lr: 0.00010; 9918/4114 tok/s;  10595 sec
[2021-04-23 06:42:05,786 INFO] Step 48850/50000; acc:  76.14; ppl:  2.20; xent: 0.79; lr: 0.00010; 9226/3969 tok/s;  10606 sec
[2021-04-23 06:42:08,748 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 06:42:16,082 INFO] Step 48900/50000; acc:  76.61; ppl:  2.17; xent: 0.77; lr: 0.00010; 9860/4126 tok/s;  10616 sec
[2021-04-23 06:42:27,151 INFO] Step 48950/50000; acc:  75.75; ppl:  2.22; xent: 0.80; lr: 0.00010; 9277/3846 tok/s;  10627 sec
[2021-04-23 06:42:38,372 INFO] Step 49000/50000; acc:  75.95; ppl:  2.22; xent: 0.80; lr: 0.00010; 9077/3898 tok/s;  10639 sec
[2021-04-23 06:42:48,849 INFO] Step 49050/50000; acc:  76.34; ppl:  2.19; xent: 0.78; lr: 0.00010; 9737/4115 tok/s;  10649 sec
[2021-04-23 06:42:59,475 INFO] Step 49100/50000; acc:  76.24; ppl:  2.18; xent: 0.78; lr: 0.00010; 9343/4049 tok/s;  10660 sec
[2021-04-23 06:43:09,873 INFO] Step 49150/50000; acc:  75.91; ppl:  2.22; xent: 0.80; lr: 0.00010; 9846/4052 tok/s;  10670 sec
[2021-04-23 06:43:20,382 INFO] Step 49200/50000; acc:  76.49; ppl:  2.17; xent: 0.78; lr: 0.00010; 9658/3979 tok/s;  10681 sec
[2021-04-23 06:43:31,717 INFO] Step 49250/50000; acc:  76.10; ppl:  2.19; xent: 0.79; lr: 0.00010; 9145/3804 tok/s;  10692 sec
[2021-04-23 06:43:42,650 INFO] Step 49300/50000; acc:  76.26; ppl:  2.20; xent: 0.79; lr: 0.00010; 9407/3861 tok/s;  10703 sec
[2021-04-23 06:43:52,920 INFO] Step 49350/50000; acc:  76.52; ppl:  2.18; xent: 0.78; lr: 0.00010; 9735/4191 tok/s;  10713 sec
[2021-04-23 06:44:03,663 INFO] Step 49400/50000; acc:  76.36; ppl:  2.19; xent: 0.78; lr: 0.00010; 9508/3998 tok/s;  10724 sec
[2021-04-23 06:44:14,736 INFO] Step 49450/50000; acc:  76.51; ppl:  2.16; xent: 0.77; lr: 0.00010; 9186/3835 tok/s;  10735 sec
[2021-04-23 06:44:25,202 INFO] Step 49500/50000; acc:  76.12; ppl:  2.21; xent: 0.79; lr: 0.00010; 9999/4031 tok/s;  10745 sec
[2021-04-23 06:44:35,520 INFO] Step 49550/50000; acc:  76.68; ppl:  2.15; xent: 0.76; lr: 0.00010; 9372/4118 tok/s;  10756 sec
[2021-04-23 06:44:39,665 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-23 06:44:46,502 INFO] Step 49600/50000; acc:  75.81; ppl:  2.22; xent: 0.80; lr: 0.00010; 9518/3982 tok/s;  10767 sec
[2021-04-23 06:44:57,070 INFO] Step 49650/50000; acc:  76.64; ppl:  2.15; xent: 0.77; lr: 0.00010; 9494/4009 tok/s;  10777 sec
[2021-04-23 06:45:07,784 INFO] Step 49700/50000; acc:  76.00; ppl:  2.21; xent: 0.79; lr: 0.00010; 9394/3971 tok/s;  10788 sec
[2021-04-23 06:45:19,107 INFO] Step 49750/50000; acc:  76.38; ppl:  2.20; xent: 0.79; lr: 0.00010; 9037/3882 tok/s;  10799 sec
[2021-04-23 06:45:29,841 INFO] Step 49800/50000; acc:  76.03; ppl:  2.22; xent: 0.80; lr: 0.00010; 9530/4015 tok/s;  10810 sec
[2021-04-23 06:45:40,001 INFO] Step 49850/50000; acc:  76.52; ppl:  2.17; xent: 0.77; lr: 0.00010; 10005/4131 tok/s;  10820 sec
[2021-04-23 06:45:50,262 INFO] Step 49900/50000; acc:  76.55; ppl:  2.16; xent: 0.77; lr: 0.00010; 9761/4113 tok/s;  10831 sec
[2021-04-23 06:46:01,846 INFO] Step 49950/50000; acc:  75.93; ppl:  2.20; xent: 0.79; lr: 0.00010; 9014/3732 tok/s;  10842 sec
[2021-04-23 06:46:12,227 INFO] Step 50000/50000; acc:  76.76; ppl:  2.15; xent: 0.77; lr: 0.00005; 9604/4053 tok/s;  10853 sec
[2021-04-23 06:46:12,230 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/strict/valid.txt, align=None)...
[2021-04-23 06:46:20,991 INFO] Validation perplexity: 3.09779
[2021-04-23 06:46:20,991 INFO] Validation accuracy: 69.949
[2021-04-23 06:46:20,993 INFO] Saving checkpoint ../models/default_params/strict_ops/model_step_50000.pt

Train on loosely condensed EditOperations:

modelDefaultLoose = HephaestusModel(MODEL_DEFAULT_LOOSE)
modelDefaultLoose.train(
    DATA_SMALL_METHODS_TRAIN_BUGGY,
    DATA_SMALL_OPS_GENERAL_LOOSE_TRAIN,
    DATA_SMALL_METHODS_VALID_BUGGY,
    DATA_SMALL_OPS_GENERAL_LOOSE_VALID
)
[2021-04-23 06:46:23,510 INFO] Counter vocab from -1 samples.
[2021-04-23 06:46:23,511 INFO] n_sample=-1: Build vocab on full datasets.
[2021-04-23 06:46:23,515 INFO] corpus_1's transforms: TransformPipe()
[2021-04-23 06:46:23,516 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 06:46:24,117 INFO] Counters src:429
[2021-04-23 06:46:24,117 INFO] Counters tgt:444
[2021-04-23 06:46:24,117 WARNING] path ../models/default_params/loose_ops/save_data.vocab.src exists, may overwrite...
[2021-04-23 06:46:24,119 WARNING] path ../models/default_params/loose_ops/save_data.vocab.tgt exists, may overwrite...
[2021-04-23 06:46:24,782 INFO] Parsed 2 corpora from -data.
[2021-04-23 06:46:24,782 INFO] Get special vocabs from Transforms: {'src': set(), 'tgt': set()}.
[2021-04-23 06:46:24,782 INFO] Loading vocab from text file...
[2021-04-23 06:46:24,782 INFO] Loading src vocabulary from ../models/default_params/loose_ops/save_data.vocab.src
[2021-04-23 06:46:24,784 INFO] Loaded src vocab has 429 tokens.
[2021-04-23 06:46:24,784 INFO] Loading tgt vocabulary from ../models/default_params/loose_ops/save_data.vocab.tgt
[2021-04-23 06:46:24,786 INFO] Loaded tgt vocab has 444 tokens.
[2021-04-23 06:46:24,786 INFO] Building fields with vocab in counters...
[2021-04-23 06:46:24,786 INFO]  * tgt vocab size: 448.
[2021-04-23 06:46:24,787 INFO]  * src vocab size: 431.
[2021-04-23 06:46:24,787 INFO]  * src vocab size = 431
[2021-04-23 06:46:24,787 INFO]  * tgt vocab size = 448
[2021-04-23 06:46:24,788 INFO] Building model...
[2021-04-23 06:46:25,938 INFO] NMTModel(
  (encoder): RNNEncoder(
    (embeddings): Embeddings(
      (make_embedding): Sequential(
        (emb_luts): Elementwise(
          (0): Embedding(431, 512, padding_idx=1)
        )
      )
    )
    (rnn): LSTM(512, 256, num_layers=2, dropout=0.2)
  )
  (decoder): InputFeedRNNDecoder(
    (embeddings): Embeddings(
      (make_embedding): Sequential(
        (emb_luts): Elementwise(
          (0): Embedding(448, 512, padding_idx=1)
        )
      )
    )
    (dropout): Dropout(p=0.2, inplace=False)
    (rnn): StackedLSTM(
      (dropout): Dropout(p=0.2, inplace=False)
      (layers): ModuleList(
        (0): LSTMCell(768, 256)
        (1): LSTMCell(256, 256)
      )
    )
    (attn): GlobalAttention(
      (linear_context): Linear(in_features=256, out_features=256, bias=False)
      (linear_query): Linear(in_features=256, out_features=256, bias=True)
      (v): Linear(in_features=256, out_features=1, bias=False)
      (linear_out): Linear(in_features=512, out_features=256, bias=True)
    )
  )
  (generator): Sequential(
    (0): Linear(in_features=256, out_features=448, bias=True)
    (1): Cast()
    (2): LogSoftmax(dim=-1)
  )
)
[2021-04-23 06:46:25,939 INFO] encoder: 1535488
[2021-04-23 06:46:25,939 INFO] decoder: 2184384
[2021-04-23 06:46:25,939 INFO] * number of parameters: 3719872
[2021-04-23 06:46:25,940 INFO] Starting training on GPU: [0]
[2021-04-23 06:46:25,940 INFO] Start training loop and validate every 5000 steps...
[2021-04-23 06:46:25,940 INFO] corpus_1's transforms: TransformPipe()
[2021-04-23 06:46:25,940 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 06:46:35,693 INFO] Step 50/50000; acc:  17.98; ppl: 157.67; xent: 5.06; lr: 0.00010; 10323/4076 tok/s;     10 sec
[2021-04-23 06:46:45,302 INFO] Step 100/50000; acc:  25.21; ppl: 36.07; xent: 3.59; lr: 0.00010; 10701/4223 tok/s;     19 sec
[2021-04-23 06:46:55,325 INFO] Step 150/50000; acc:  29.15; ppl: 25.80; xent: 3.25; lr: 0.00010; 10054/4091 tok/s;     29 sec
[2021-04-23 06:47:04,579 INFO] Step 200/50000; acc:  39.33; ppl: 17.65; xent: 2.87; lr: 0.00010; 10926/4351 tok/s;     39 sec
[2021-04-23 06:47:13,632 INFO] Step 250/50000; acc:  44.59; ppl: 11.88; xent: 2.47; lr: 0.00010; 11095/4390 tok/s;     48 sec
[2021-04-23 06:47:23,586 INFO] Step 300/50000; acc:  45.52; ppl: 10.08; xent: 2.31; lr: 0.00010; 10350/4070 tok/s;     58 sec
[2021-04-23 06:47:33,166 INFO] Step 350/50000; acc:  45.95; ppl:  9.37; xent: 2.24; lr: 0.00010; 10782/4126 tok/s;     67 sec
[2021-04-23 06:47:42,942 INFO] Step 400/50000; acc:  45.72; ppl:  9.17; xent: 2.22; lr: 0.00010; 10398/4145 tok/s;     77 sec
[2021-04-23 06:47:52,503 INFO] Step 450/50000; acc:  46.44; ppl:  8.78; xent: 2.17; lr: 0.00010; 10613/4175 tok/s;     87 sec
[2021-04-23 06:48:01,849 INFO] Step 500/50000; acc:  46.43; ppl:  8.88; xent: 2.18; lr: 0.00010; 10798/4298 tok/s;     96 sec
[2021-04-23 06:48:11,628 INFO] Step 550/50000; acc:  46.61; ppl:  8.68; xent: 2.16; lr: 0.00010; 10691/4151 tok/s;    106 sec
[2021-04-23 06:48:21,139 INFO] Step 600/50000; acc:  47.56; ppl:  8.29; xent: 2.12; lr: 0.00010; 10449/4203 tok/s;    115 sec
[2021-04-23 06:48:30,047 INFO] Step 650/50000; acc:  48.08; ppl:  8.15; xent: 2.10; lr: 0.00010; 11610/4425 tok/s;    124 sec
[2021-04-23 06:48:39,369 INFO] Step 700/50000; acc:  48.22; ppl:  8.06; xent: 2.09; lr: 0.00010; 10699/4328 tok/s;    133 sec
[2021-04-23 06:48:40,234 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 06:48:49,090 INFO] Step 750/50000; acc:  48.69; ppl:  7.87; xent: 2.06; lr: 0.00010; 10485/4211 tok/s;    143 sec
[2021-04-23 06:48:58,726 INFO] Step 800/50000; acc:  49.12; ppl:  7.85; xent: 2.06; lr: 0.00010; 10627/4136 tok/s;    153 sec
[2021-04-23 06:49:08,685 INFO] Step 850/50000; acc:  49.47; ppl:  7.64; xent: 2.03; lr: 0.00010; 9986/4077 tok/s;    163 sec
[2021-04-23 06:49:18,490 INFO] Step 900/50000; acc:  49.54; ppl:  7.49; xent: 2.01; lr: 0.00010; 10549/4150 tok/s;    173 sec
[2021-04-23 06:49:27,860 INFO] Step 950/50000; acc:  50.40; ppl:  7.31; xent: 1.99; lr: 0.00010; 10769/4296 tok/s;    182 sec
[2021-04-23 06:49:36,921 INFO] Step 1000/50000; acc:  50.83; ppl:  7.08; xent: 1.96; lr: 0.00010; 11102/4366 tok/s;    191 sec
[2021-04-23 06:49:46,482 INFO] Step 1050/50000; acc:  51.45; ppl:  6.86; xent: 1.93; lr: 0.00010; 10699/4193 tok/s;    201 sec
[2021-04-23 06:49:56,950 INFO] Step 1100/50000; acc:  51.58; ppl:  6.76; xent: 1.91; lr: 0.00010; 9900/3888 tok/s;    211 sec
[2021-04-23 06:50:06,442 INFO] Step 1150/50000; acc:  52.02; ppl:  6.49; xent: 1.87; lr: 0.00010; 10681/4198 tok/s;    221 sec
[2021-04-23 06:50:15,903 INFO] Step 1200/50000; acc:  51.97; ppl:  6.49; xent: 1.87; lr: 0.00010; 10775/4249 tok/s;    230 sec
[2021-04-23 06:50:25,394 INFO] Step 1250/50000; acc:  52.42; ppl:  6.46; xent: 1.87; lr: 0.00010; 10681/4235 tok/s;    239 sec
[2021-04-23 06:50:35,060 INFO] Step 1300/50000; acc:  52.91; ppl:  6.22; xent: 1.83; lr: 0.00010; 10468/4089 tok/s;    249 sec
[2021-04-23 06:50:44,230 INFO] Step 1350/50000; acc:  53.10; ppl:  6.18; xent: 1.82; lr: 0.00010; 11459/4422 tok/s;    258 sec
[2021-04-23 06:50:53,207 INFO] Step 1400/50000; acc:  53.47; ppl:  5.92; xent: 1.78; lr: 0.00010; 10903/4445 tok/s;    267 sec
[2021-04-23 06:51:01,076 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 06:51:03,156 INFO] Step 1450/50000; acc:  53.38; ppl:  5.97; xent: 1.79; lr: 0.00010; 10448/4090 tok/s;    277 sec
[2021-04-23 06:51:12,658 INFO] Step 1500/50000; acc:  53.67; ppl:  5.87; xent: 1.77; lr: 0.00010; 10538/4224 tok/s;    287 sec
[2021-04-23 06:51:22,703 INFO] Step 1550/50000; acc:  53.58; ppl:  5.89; xent: 1.77; lr: 0.00010; 10145/3988 tok/s;    297 sec
[2021-04-23 06:51:32,416 INFO] Step 1600/50000; acc:  53.53; ppl:  5.80; xent: 1.76; lr: 0.00010; 10409/4263 tok/s;    306 sec
[2021-04-23 06:51:41,807 INFO] Step 1650/50000; acc:  54.03; ppl:  5.65; xent: 1.73; lr: 0.00010; 10687/4272 tok/s;    316 sec
[2021-04-23 06:51:51,329 INFO] Step 1700/50000; acc:  53.97; ppl:  5.73; xent: 1.75; lr: 0.00010; 10828/4222 tok/s;    325 sec
[2021-04-23 06:52:00,818 INFO] Step 1750/50000; acc:  54.46; ppl:  5.61; xent: 1.72; lr: 0.00010; 10684/4215 tok/s;    335 sec
[2021-04-23 06:52:10,392 INFO] Step 1800/50000; acc:  54.73; ppl:  5.48; xent: 1.70; lr: 0.00010; 10687/4150 tok/s;    344 sec
[2021-04-23 06:52:20,648 INFO] Step 1850/50000; acc:  55.07; ppl:  5.43; xent: 1.69; lr: 0.00010; 9922/3967 tok/s;    355 sec
[2021-04-23 06:52:29,839 INFO] Step 1900/50000; acc:  55.28; ppl:  5.33; xent: 1.67; lr: 0.00010; 11123/4289 tok/s;    364 sec
[2021-04-23 06:52:39,507 INFO] Step 1950/50000; acc:  55.14; ppl:  5.39; xent: 1.68; lr: 0.00010; 10527/4248 tok/s;    374 sec
[2021-04-23 06:52:49,174 INFO] Step 2000/50000; acc:  55.41; ppl:  5.36; xent: 1.68; lr: 0.00010; 10523/4105 tok/s;    383 sec
[2021-04-23 06:52:58,847 INFO] Step 2050/50000; acc:  55.69; ppl:  5.25; xent: 1.66; lr: 0.00010; 10554/4141 tok/s;    393 sec
[2021-04-23 06:53:07,746 INFO] Step 2100/50000; acc:  55.64; ppl:  5.20; xent: 1.65; lr: 0.00010; 11287/4451 tok/s;    402 sec
[2021-04-23 06:53:17,469 INFO] Step 2150/50000; acc:  56.18; ppl:  5.13; xent: 1.63; lr: 0.00010; 10682/4213 tok/s;    412 sec
[2021-04-23 06:53:22,391 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 06:53:26,866 INFO] Step 2200/50000; acc:  56.48; ppl:  5.10; xent: 1.63; lr: 0.00010; 10549/4286 tok/s;    421 sec
[2021-04-23 06:53:37,136 INFO] Step 2250/50000; acc:  56.27; ppl:  5.13; xent: 1.64; lr: 0.00010; 10165/3932 tok/s;    431 sec
[2021-04-23 06:53:46,966 INFO] Step 2300/50000; acc:  56.47; ppl:  5.02; xent: 1.61; lr: 0.00010; 10134/4103 tok/s;    441 sec
[2021-04-23 06:53:56,384 INFO] Step 2350/50000; acc:  56.24; ppl:  5.06; xent: 1.62; lr: 0.00010; 10744/4333 tok/s;    450 sec
[2021-04-23 06:54:05,925 INFO] Step 2400/50000; acc:  56.95; ppl:  4.98; xent: 1.61; lr: 0.00010; 10670/4234 tok/s;    460 sec
[2021-04-23 06:54:15,151 INFO] Step 2450/50000; acc:  56.99; ppl:  4.92; xent: 1.59; lr: 0.00010; 10815/4294 tok/s;    469 sec
[2021-04-23 06:54:24,605 INFO] Step 2500/50000; acc:  56.90; ppl:  4.97; xent: 1.60; lr: 0.00010; 11063/4270 tok/s;    479 sec
[2021-04-23 06:54:34,620 INFO] Step 2550/50000; acc:  57.11; ppl:  4.90; xent: 1.59; lr: 0.00010; 10201/3968 tok/s;    489 sec
[2021-04-23 06:54:44,423 INFO] Step 2600/50000; acc:  57.55; ppl:  4.85; xent: 1.58; lr: 0.00010; 10335/4101 tok/s;    498 sec
[2021-04-23 06:54:53,897 INFO] Step 2650/50000; acc:  57.53; ppl:  4.79; xent: 1.57; lr: 0.00010; 10675/4248 tok/s;    508 sec
[2021-04-23 06:55:03,563 INFO] Step 2700/50000; acc:  57.16; ppl:  4.90; xent: 1.59; lr: 0.00010; 10629/4161 tok/s;    518 sec
[2021-04-23 06:55:13,088 INFO] Step 2750/50000; acc:  57.59; ppl:  4.81; xent: 1.57; lr: 0.00010; 10635/4251 tok/s;    527 sec
[2021-04-23 06:55:22,428 INFO] Step 2800/50000; acc:  57.82; ppl:  4.74; xent: 1.56; lr: 0.00010; 10961/4208 tok/s;    536 sec
[2021-04-23 06:55:31,488 INFO] Step 2850/50000; acc:  58.20; ppl:  4.70; xent: 1.55; lr: 0.00010; 11132/4468 tok/s;    546 sec
[2021-04-23 06:55:40,858 INFO] Step 2900/50000; acc:  58.69; ppl:  4.59; xent: 1.52; lr: 0.00010; 10731/4278 tok/s;    555 sec
[2021-04-23 06:55:43,350 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 06:55:50,693 INFO] Step 2950/50000; acc:  58.07; ppl:  4.73; xent: 1.55; lr: 0.00010; 10655/4187 tok/s;    565 sec
[2021-04-23 06:56:00,159 INFO] Step 3000/50000; acc:  58.62; ppl:  4.63; xent: 1.53; lr: 0.00010; 10439/4158 tok/s;    574 sec
[2021-04-23 06:56:10,303 INFO] Step 3050/50000; acc:  58.15; ppl:  4.66; xent: 1.54; lr: 0.00010; 10198/4064 tok/s;    584 sec
[2021-04-23 06:56:19,588 INFO] Step 3100/50000; acc:  58.52; ppl:  4.61; xent: 1.53; lr: 0.00010; 10744/4356 tok/s;    594 sec
[2021-04-23 06:56:28,980 INFO] Step 3150/50000; acc:  58.57; ppl:  4.65; xent: 1.54; lr: 0.00010; 10796/4247 tok/s;    603 sec
[2021-04-23 06:56:38,379 INFO] Step 3200/50000; acc:  58.79; ppl:  4.54; xent: 1.51; lr: 0.00010; 10811/4298 tok/s;    612 sec
[2021-04-23 06:56:47,691 INFO] Step 3250/50000; acc:  58.95; ppl:  4.52; xent: 1.51; lr: 0.00010; 10981/4238 tok/s;    622 sec
[2021-04-23 06:56:57,841 INFO] Step 3300/50000; acc:  58.59; ppl:  4.60; xent: 1.53; lr: 0.00010; 10196/3999 tok/s;    632 sec
[2021-04-23 06:57:07,347 INFO] Step 3350/50000; acc:  59.25; ppl:  4.51; xent: 1.51; lr: 0.00010; 10669/4213 tok/s;    641 sec
[2021-04-23 06:57:16,711 INFO] Step 3400/50000; acc:  58.88; ppl:  4.54; xent: 1.51; lr: 0.00010; 10834/4302 tok/s;    651 sec
[2021-04-23 06:57:26,138 INFO] Step 3450/50000; acc:  59.17; ppl:  4.51; xent: 1.51; lr: 0.00010; 10692/4266 tok/s;    660 sec
[2021-04-23 06:57:35,866 INFO] Step 3500/50000; acc:  59.64; ppl:  4.45; xent: 1.49; lr: 0.00010; 10609/4085 tok/s;    670 sec
[2021-04-23 06:57:44,765 INFO] Step 3550/50000; acc:  59.74; ppl:  4.40; xent: 1.48; lr: 0.00010; 11375/4495 tok/s;    679 sec
[2021-04-23 06:57:54,107 INFO] Step 3600/50000; acc:  59.83; ppl:  4.39; xent: 1.48; lr: 0.00010; 10865/4267 tok/s;    688 sec
[2021-04-23 06:57:57,131 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 06:58:03,965 INFO] Step 3650/50000; acc:  59.65; ppl:  4.37; xent: 1.47; lr: 0.00010; 10339/4143 tok/s;    698 sec
[2021-04-23 06:58:13,637 INFO] Step 3700/50000; acc:  60.30; ppl:  4.34; xent: 1.47; lr: 0.00010; 10430/4131 tok/s;    708 sec
[2021-04-23 06:58:23,530 INFO] Step 3750/50000; acc:  59.41; ppl:  4.48; xent: 1.50; lr: 0.00010; 10506/4151 tok/s;    718 sec
[2021-04-23 06:58:33,295 INFO] Step 3800/50000; acc:  60.46; ppl:  4.28; xent: 1.46; lr: 0.00010; 10069/4147 tok/s;    727 sec
[2021-04-23 06:58:42,660 INFO] Step 3850/50000; acc:  59.57; ppl:  4.42; xent: 1.49; lr: 0.00010; 11088/4336 tok/s;    737 sec
[2021-04-23 06:58:51,756 INFO] Step 3900/50000; acc:  60.57; ppl:  4.28; xent: 1.45; lr: 0.00010; 10936/4350 tok/s;    746 sec
[2021-04-23 06:59:01,346 INFO] Step 3950/50000; acc:  60.26; ppl:  4.35; xent: 1.47; lr: 0.00010; 10644/4214 tok/s;    755 sec
[2021-04-23 06:59:10,891 INFO] Step 4000/50000; acc:  59.81; ppl:  4.32; xent: 1.46; lr: 0.00010; 10881/4148 tok/s;    765 sec
[2021-04-23 06:59:20,566 INFO] Step 4050/50000; acc:  60.71; ppl:  4.24; xent: 1.45; lr: 0.00010; 10372/4179 tok/s;    775 sec
[2021-04-23 06:59:30,319 INFO] Step 4100/50000; acc:  60.15; ppl:  4.29; xent: 1.46; lr: 0.00010; 10605/4132 tok/s;    784 sec
[2021-04-23 06:59:39,709 INFO] Step 4150/50000; acc:  60.12; ppl:  4.37; xent: 1.48; lr: 0.00010; 10779/4268 tok/s;    794 sec
[2021-04-23 06:59:49,282 INFO] Step 4200/50000; acc:  60.83; ppl:  4.26; xent: 1.45; lr: 0.00010; 10584/4205 tok/s;    803 sec
[2021-04-23 06:59:58,852 INFO] Step 4250/50000; acc:  60.75; ppl:  4.21; xent: 1.44; lr: 0.00010; 10627/4199 tok/s;    813 sec
[2021-04-23 07:00:07,730 INFO] Step 4300/50000; acc:  61.30; ppl:  4.15; xent: 1.42; lr: 0.00010; 11514/4426 tok/s;    822 sec
[2021-04-23 07:00:17,259 INFO] Step 4350/50000; acc:  61.03; ppl:  4.16; xent: 1.42; lr: 0.00010; 10599/4269 tok/s;    831 sec
[2021-04-23 07:00:17,652 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 07:00:26,789 INFO] Step 4400/50000; acc:  61.25; ppl:  4.16; xent: 1.43; lr: 0.00010; 10704/4247 tok/s;    841 sec
[2021-04-23 07:00:36,771 INFO] Step 4450/50000; acc:  61.04; ppl:  4.23; xent: 1.44; lr: 0.00010; 10216/4008 tok/s;    851 sec
[2021-04-23 07:00:46,641 INFO] Step 4500/50000; acc:  61.12; ppl:  4.17; xent: 1.43; lr: 0.00010; 10099/4135 tok/s;    861 sec
[2021-04-23 07:00:56,452 INFO] Step 4550/50000; acc:  61.04; ppl:  4.22; xent: 1.44; lr: 0.00010; 10635/4146 tok/s;    871 sec
[2021-04-23 07:01:05,927 INFO] Step 4600/50000; acc:  61.55; ppl:  4.14; xent: 1.42; lr: 0.00010; 10409/4226 tok/s;    880 sec
[2021-04-23 07:01:15,130 INFO] Step 4650/50000; acc:  61.21; ppl:  4.16; xent: 1.42; lr: 0.00010; 11212/4297 tok/s;    889 sec
[2021-04-23 07:01:24,849 INFO] Step 4700/50000; acc:  61.24; ppl:  4.13; xent: 1.42; lr: 0.00010; 10452/4145 tok/s;    899 sec
[2021-04-23 07:01:35,019 INFO] Step 4750/50000; acc:  61.02; ppl:  4.14; xent: 1.42; lr: 0.00010; 10086/3989 tok/s;    909 sec
[2021-04-23 07:01:44,493 INFO] Step 4800/50000; acc:  61.61; ppl:  4.07; xent: 1.40; lr: 0.00010; 10751/4227 tok/s;    919 sec
[2021-04-23 07:01:53,980 INFO] Step 4850/50000; acc:  61.41; ppl:  4.08; xent: 1.41; lr: 0.00010; 10602/4207 tok/s;    928 sec
[2021-04-23 07:02:03,591 INFO] Step 4900/50000; acc:  61.07; ppl:  4.18; xent: 1.43; lr: 0.00010; 10753/4194 tok/s;    938 sec
[2021-04-23 07:02:13,326 INFO] Step 4950/50000; acc:  61.57; ppl:  4.10; xent: 1.41; lr: 0.00010; 10445/4092 tok/s;    947 sec
[2021-04-23 07:02:22,248 INFO] Step 5000/50000; acc:  61.97; ppl:  4.05; xent: 1.40; lr: 0.00010; 11393/4507 tok/s;    956 sec
[2021-04-23 07:02:22,248 INFO] valid's transforms: TransformPipe()
[2021-04-23 07:02:22,251 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/loose/valid.txt, align=None)...
[2021-04-23 07:02:30,167 INFO] Validation perplexity: 3.87977
[2021-04-23 07:02:30,168 INFO] Validation accuracy: 63.3767
[2021-04-23 07:02:30,170 INFO] Saving checkpoint ../models/default_params/loose_ops/model_step_5000.pt
[2021-04-23 07:02:39,852 INFO] Step 5050/50000; acc:  62.23; ppl:  3.98; xent: 1.38; lr: 0.00010; 5686/2277 tok/s;    974 sec
[2021-04-23 07:02:47,156 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 07:02:49,468 INFO] Step 5100/50000; acc:  61.96; ppl:  4.02; xent: 1.39; lr: 0.00010; 10709/4198 tok/s;    984 sec
[2021-04-23 07:02:59,186 INFO] Step 5150/50000; acc:  62.28; ppl:  4.03; xent: 1.39; lr: 0.00010; 10416/4145 tok/s;    993 sec
[2021-04-23 07:03:09,193 INFO] Step 5200/50000; acc:  61.72; ppl:  4.06; xent: 1.40; lr: 0.00010; 10184/4018 tok/s;   1003 sec
[2021-04-23 07:03:19,013 INFO] Step 5250/50000; acc:  61.78; ppl:  4.03; xent: 1.39; lr: 0.00010; 10262/4180 tok/s;   1013 sec
[2021-04-23 07:03:28,379 INFO] Step 5300/50000; acc:  61.93; ppl:  3.99; xent: 1.38; lr: 0.00010; 10758/4296 tok/s;   1022 sec
[2021-04-23 07:03:37,897 INFO] Step 5350/50000; acc:  61.85; ppl:  4.10; xent: 1.41; lr: 0.00010; 10932/4234 tok/s;   1032 sec
[2021-04-23 07:03:47,307 INFO] Step 5400/50000; acc:  62.21; ppl:  3.98; xent: 1.38; lr: 0.00010; 10531/4266 tok/s;   1041 sec
[2021-04-23 07:03:56,988 INFO] Step 5450/50000; acc:  61.98; ppl:  4.01; xent: 1.39; lr: 0.00010; 10850/4089 tok/s;   1051 sec
[2021-04-23 07:04:07,175 INFO] Step 5500/50000; acc:  62.25; ppl:  3.96; xent: 1.38; lr: 0.00010; 9889/3989 tok/s;   1061 sec
[2021-04-23 07:04:16,333 INFO] Step 5550/50000; acc:  62.45; ppl:  3.95; xent: 1.37; lr: 0.00010; 11064/4318 tok/s;   1070 sec
[2021-04-23 07:04:26,002 INFO] Step 5600/50000; acc:  62.11; ppl:  3.98; xent: 1.38; lr: 0.00010; 10595/4250 tok/s;   1080 sec
[2021-04-23 07:04:35,564 INFO] Step 5650/50000; acc:  62.54; ppl:  3.97; xent: 1.38; lr: 0.00010; 10475/4127 tok/s;   1090 sec
[2021-04-23 07:04:45,242 INFO] Step 5700/50000; acc:  62.28; ppl:  3.96; xent: 1.38; lr: 0.00010; 10727/4176 tok/s;   1099 sec
[2021-04-23 07:04:54,256 INFO] Step 5750/50000; acc:  62.44; ppl:  3.94; xent: 1.37; lr: 0.00010; 11209/4410 tok/s;   1108 sec
[2021-04-23 07:05:03,581 INFO] Step 5800/50000; acc:  63.14; ppl:  3.85; xent: 1.35; lr: 0.00010; 10802/4319 tok/s;   1118 sec
[2021-04-23 07:05:08,302 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 07:05:13,192 INFO] Step 5850/50000; acc:  62.44; ppl:  3.94; xent: 1.37; lr: 0.00010; 10538/4222 tok/s;   1127 sec
[2021-04-23 07:05:23,358 INFO] Step 5900/50000; acc:  62.69; ppl:  3.94; xent: 1.37; lr: 0.00010; 10176/3965 tok/s;   1137 sec
[2021-04-23 07:05:33,306 INFO] Step 5950/50000; acc:  62.93; ppl:  3.87; xent: 1.35; lr: 0.00010; 10101/4085 tok/s;   1147 sec
[2021-04-23 07:05:42,914 INFO] Step 6000/50000; acc:  62.60; ppl:  3.92; xent: 1.37; lr: 0.00010; 10554/4230 tok/s;   1157 sec
[2021-04-23 07:05:52,417 INFO] Step 6050/50000; acc:  62.52; ppl:  3.92; xent: 1.37; lr: 0.00010; 10663/4258 tok/s;   1166 sec
[2021-04-23 07:06:01,542 INFO] Step 6100/50000; acc:  63.12; ppl:  3.86; xent: 1.35; lr: 0.00010; 10960/4319 tok/s;   1176 sec
[2021-04-23 07:06:10,971 INFO] Step 6150/50000; acc:  62.38; ppl:  3.93; xent: 1.37; lr: 0.00010; 11223/4271 tok/s;   1185 sec
[2021-04-23 07:06:20,930 INFO] Step 6200/50000; acc:  62.99; ppl:  3.85; xent: 1.35; lr: 0.00010; 10016/4009 tok/s;   1195 sec
[2021-04-23 07:06:31,067 INFO] Step 6250/50000; acc:  62.58; ppl:  3.92; xent: 1.37; lr: 0.00010; 10251/3973 tok/s;   1205 sec
[2021-04-23 07:06:40,235 INFO] Step 6300/50000; acc:  63.24; ppl:  3.80; xent: 1.33; lr: 0.00010; 10940/4405 tok/s;   1214 sec
[2021-04-23 07:06:49,830 INFO] Step 6350/50000; acc:  62.60; ppl:  3.92; xent: 1.37; lr: 0.00010; 10589/4193 tok/s;   1224 sec
[2021-04-23 07:06:59,291 INFO] Step 6400/50000; acc:  63.02; ppl:  3.85; xent: 1.35; lr: 0.00010; 10796/4244 tok/s;   1233 sec
[2021-04-23 07:07:08,448 INFO] Step 6450/50000; acc:  63.38; ppl:  3.78; xent: 1.33; lr: 0.00010; 11007/4316 tok/s;   1243 sec
[2021-04-23 07:07:17,738 INFO] Step 6500/50000; acc:  62.90; ppl:  3.83; xent: 1.34; lr: 0.00010; 11049/4345 tok/s;   1252 sec
[2021-04-23 07:07:27,264 INFO] Step 6550/50000; acc:  63.60; ppl:  3.76; xent: 1.33; lr: 0.00010; 10642/4261 tok/s;   1261 sec
[2021-04-23 07:07:29,230 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 07:07:36,730 INFO] Step 6600/50000; acc:  63.49; ppl:  3.82; xent: 1.34; lr: 0.00010; 10721/4272 tok/s;   1271 sec
[2021-04-23 07:07:46,409 INFO] Step 6650/50000; acc:  63.30; ppl:  3.82; xent: 1.34; lr: 0.00010; 10418/4084 tok/s;   1280 sec
[2021-04-23 07:07:56,469 INFO] Step 6700/50000; acc:  63.17; ppl:  3.82; xent: 1.34; lr: 0.00010; 10171/4089 tok/s;   1291 sec
[2021-04-23 07:08:05,987 INFO] Step 6750/50000; acc:  63.13; ppl:  3.80; xent: 1.33; lr: 0.00010; 10618/4292 tok/s;   1300 sec
[2021-04-23 07:08:15,538 INFO] Step 6800/50000; acc:  63.28; ppl:  3.82; xent: 1.34; lr: 0.00010; 10622/4156 tok/s;   1310 sec
[2021-04-23 07:08:24,855 INFO] Step 6850/50000; acc:  63.48; ppl:  3.75; xent: 1.32; lr: 0.00010; 10863/4362 tok/s;   1319 sec
[2021-04-23 07:08:34,205 INFO] Step 6900/50000; acc:  63.51; ppl:  3.76; xent: 1.33; lr: 0.00010; 10968/4177 tok/s;   1328 sec
[2021-04-23 07:08:44,217 INFO] Step 6950/50000; acc:  62.97; ppl:  3.83; xent: 1.34; lr: 0.00010; 10446/4073 tok/s;   1338 sec
[2021-04-23 07:08:53,850 INFO] Step 7000/50000; acc:  63.70; ppl:  3.73; xent: 1.32; lr: 0.00010; 10275/4173 tok/s;   1348 sec
[2021-04-23 07:09:03,333 INFO] Step 7050/50000; acc:  63.01; ppl:  3.84; xent: 1.34; lr: 0.00010; 10981/4256 tok/s;   1357 sec
[2021-04-23 07:09:12,781 INFO] Step 7100/50000; acc:  63.46; ppl:  3.77; xent: 1.33; lr: 0.00010; 10564/4216 tok/s;   1367 sec
[2021-04-23 07:09:22,482 INFO] Step 7150/50000; acc:  63.70; ppl:  3.72; xent: 1.31; lr: 0.00010; 10551/4137 tok/s;   1377 sec
[2021-04-23 07:09:31,231 INFO] Step 7200/50000; acc:  63.75; ppl:  3.71; xent: 1.31; lr: 0.00010; 11640/4537 tok/s;   1385 sec
[2021-04-23 07:09:40,685 INFO] Step 7250/50000; acc:  63.87; ppl:  3.69; xent: 1.31; lr: 0.00010; 10560/4223 tok/s;   1395 sec
[2021-04-23 07:09:43,412 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 07:09:50,532 INFO] Step 7300/50000; acc:  63.60; ppl:  3.76; xent: 1.32; lr: 0.00010; 10566/4168 tok/s;   1405 sec
[2021-04-23 07:10:00,140 INFO] Step 7350/50000; acc:  64.02; ppl:  3.71; xent: 1.31; lr: 0.00010; 10545/4181 tok/s;   1414 sec
[2021-04-23 07:10:09,914 INFO] Step 7400/50000; acc:  63.45; ppl:  3.75; xent: 1.32; lr: 0.00010; 10291/4115 tok/s;   1424 sec
[2021-04-23 07:10:19,903 INFO] Step 7450/50000; acc:  64.01; ppl:  3.70; xent: 1.31; lr: 0.00010; 10067/4117 tok/s;   1434 sec
[2021-04-23 07:10:29,046 INFO] Step 7500/50000; acc:  63.67; ppl:  3.74; xent: 1.32; lr: 0.00010; 11240/4411 tok/s;   1443 sec
[2021-04-23 07:10:38,219 INFO] Step 7550/50000; acc:  64.16; ppl:  3.69; xent: 1.30; lr: 0.00010; 10956/4355 tok/s;   1452 sec
[2021-04-23 07:10:47,855 INFO] Step 7600/50000; acc:  63.78; ppl:  3.73; xent: 1.32; lr: 0.00010; 10641/4149 tok/s;   1462 sec
[2021-04-23 07:10:57,763 INFO] Step 7650/50000; acc:  64.02; ppl:  3.71; xent: 1.31; lr: 0.00010; 10418/4040 tok/s;   1472 sec
[2021-04-23 07:11:07,172 INFO] Step 7700/50000; acc:  64.36; ppl:  3.65; xent: 1.30; lr: 0.00010; 10691/4238 tok/s;   1481 sec
[2021-04-23 07:11:17,045 INFO] Step 7750/50000; acc:  63.65; ppl:  3.69; xent: 1.31; lr: 0.00010; 10589/4092 tok/s;   1491 sec
[2021-04-23 07:11:26,356 INFO] Step 7800/50000; acc:  63.75; ppl:  3.72; xent: 1.31; lr: 0.00010; 10606/4333 tok/s;   1500 sec
[2021-04-23 07:11:36,165 INFO] Step 7850/50000; acc:  63.93; ppl:  3.71; xent: 1.31; lr: 0.00010; 10616/4073 tok/s;   1510 sec
[2021-04-23 07:11:45,529 INFO] Step 7900/50000; acc:  64.35; ppl:  3.64; xent: 1.29; lr: 0.00010; 10772/4299 tok/s;   1520 sec
[2021-04-23 07:11:54,399 INFO] Step 7950/50000; acc:  64.58; ppl:  3.57; xent: 1.27; lr: 0.00010; 11372/4449 tok/s;   1528 sec
[2021-04-23 07:12:04,079 INFO] Step 8000/50000; acc:  64.26; ppl:  3.65; xent: 1.30; lr: 0.00010; 10542/4196 tok/s;   1538 sec
[2021-04-23 07:12:04,087 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 07:12:13,632 INFO] Step 8050/50000; acc:  64.61; ppl:  3.59; xent: 1.28; lr: 0.00010; 10508/4225 tok/s;   1548 sec
[2021-04-23 07:12:23,576 INFO] Step 8100/50000; acc:  64.01; ppl:  3.72; xent: 1.31; lr: 0.00010; 10437/4034 tok/s;   1558 sec
[2021-04-23 07:12:33,578 INFO] Step 8150/50000; acc:  64.24; ppl:  3.63; xent: 1.29; lr: 0.00010; 10019/4094 tok/s;   1568 sec
[2021-04-23 07:12:43,177 INFO] Step 8200/50000; acc:  64.13; ppl:  3.65; xent: 1.29; lr: 0.00010; 10547/4194 tok/s;   1577 sec
[2021-04-23 07:12:52,702 INFO] Step 8250/50000; acc:  64.49; ppl:  3.63; xent: 1.29; lr: 0.00010; 10586/4215 tok/s;   1587 sec
[2021-04-23 07:13:02,196 INFO] Step 8300/50000; acc:  64.45; ppl:  3.64; xent: 1.29; lr: 0.00010; 10764/4207 tok/s;   1596 sec
[2021-04-23 07:13:11,775 INFO] Step 8350/50000; acc:  64.48; ppl:  3.62; xent: 1.29; lr: 0.00010; 10703/4179 tok/s;   1606 sec
[2021-04-23 07:13:22,077 INFO] Step 8400/50000; acc:  64.14; ppl:  3.64; xent: 1.29; lr: 0.00010; 9973/3960 tok/s;   1616 sec
[2021-04-23 07:13:31,262 INFO] Step 8450/50000; acc:  64.98; ppl:  3.55; xent: 1.27; lr: 0.00010; 11006/4332 tok/s;   1625 sec
[2021-04-23 07:13:40,889 INFO] Step 8500/50000; acc:  64.37; ppl:  3.62; xent: 1.29; lr: 0.00010; 10513/4158 tok/s;   1635 sec
[2021-04-23 07:13:50,657 INFO] Step 8550/50000; acc:  63.99; ppl:  3.68; xent: 1.30; lr: 0.00010; 10688/4140 tok/s;   1645 sec
[2021-04-23 07:14:00,196 INFO] Step 8600/50000; acc:  64.97; ppl:  3.55; xent: 1.27; lr: 0.00010; 10408/4152 tok/s;   1654 sec
[2021-04-23 07:14:09,231 INFO] Step 8650/50000; acc:  64.36; ppl:  3.63; xent: 1.29; lr: 0.00010; 11512/4466 tok/s;   1663 sec
[2021-04-23 07:14:18,271 INFO] Step 8700/50000; acc:  65.17; ppl:  3.51; xent: 1.25; lr: 0.00010; 10985/4445 tok/s;   1672 sec
[2021-04-23 07:14:25,267 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 07:14:27,878 INFO] Step 8750/50000; acc:  64.70; ppl:  3.56; xent: 1.27; lr: 0.00010; 10626/4183 tok/s;   1682 sec
[2021-04-23 07:14:37,649 INFO] Step 8800/50000; acc:  65.04; ppl:  3.57; xent: 1.27; lr: 0.00010; 10440/4146 tok/s;   1692 sec
[2021-04-23 07:14:47,448 INFO] Step 8850/50000; acc:  64.82; ppl:  3.57; xent: 1.27; lr: 0.00010; 10232/4086 tok/s;   1702 sec
[2021-04-23 07:14:57,286 INFO] Step 8900/50000; acc:  64.25; ppl:  3.62; xent: 1.29; lr: 0.00010; 10449/4160 tok/s;   1711 sec
[2021-04-23 07:15:06,565 INFO] Step 8950/50000; acc:  64.98; ppl:  3.53; xent: 1.26; lr: 0.00010; 10903/4379 tok/s;   1721 sec
[2021-04-23 07:15:16,043 INFO] Step 9000/50000; acc:  64.76; ppl:  3.60; xent: 1.28; lr: 0.00010; 10631/4193 tok/s;   1730 sec
[2021-04-23 07:15:25,384 INFO] Step 9050/50000; acc:  64.52; ppl:  3.58; xent: 1.28; lr: 0.00010; 10870/4315 tok/s;   1739 sec
[2021-04-23 07:15:35,299 INFO] Step 9100/50000; acc:  64.73; ppl:  3.59; xent: 1.28; lr: 0.00010; 10489/3996 tok/s;   1749 sec
[2021-04-23 07:15:45,449 INFO] Step 9150/50000; acc:  64.73; ppl:  3.54; xent: 1.27; lr: 0.00010; 10020/4039 tok/s;   1760 sec
[2021-04-23 07:15:54,600 INFO] Step 9200/50000; acc:  64.97; ppl:  3.55; xent: 1.27; lr: 0.00010; 11092/4302 tok/s;   1769 sec
[2021-04-23 07:16:04,275 INFO] Step 9250/50000; acc:  64.84; ppl:  3.53; xent: 1.26; lr: 0.00010; 10521/4213 tok/s;   1778 sec
[2021-04-23 07:16:13,712 INFO] Step 9300/50000; acc:  65.14; ppl:  3.52; xent: 1.26; lr: 0.00010; 10656/4173 tok/s;   1788 sec
[2021-04-23 07:16:23,476 INFO] Step 9350/50000; acc:  64.47; ppl:  3.59; xent: 1.28; lr: 0.00010; 10756/4166 tok/s;   1798 sec
[2021-04-23 07:16:32,358 INFO] Step 9400/50000; acc:  65.19; ppl:  3.49; xent: 1.25; lr: 0.00010; 11099/4482 tok/s;   1806 sec
[2021-04-23 07:16:41,895 INFO] Step 9450/50000; acc:  65.39; ppl:  3.49; xent: 1.25; lr: 0.00010; 10836/4242 tok/s;   1816 sec
[2021-04-23 07:16:46,096 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 07:16:51,446 INFO] Step 9500/50000; acc:  65.15; ppl:  3.51; xent: 1.26; lr: 0.00010; 10511/4250 tok/s;   1826 sec
[2021-04-23 07:17:01,401 INFO] Step 9550/50000; acc:  65.23; ppl:  3.52; xent: 1.26; lr: 0.00010; 10273/4009 tok/s;   1835 sec
[2021-04-23 07:17:11,421 INFO] Step 9600/50000; acc:  65.01; ppl:  3.51; xent: 1.26; lr: 0.00010; 10131/4093 tok/s;   1845 sec
[2021-04-23 07:17:20,877 INFO] Step 9650/50000; acc:  65.06; ppl:  3.50; xent: 1.25; lr: 0.00010; 10550/4276 tok/s;   1855 sec
[2021-04-23 07:17:30,472 INFO] Step 9700/50000; acc:  64.98; ppl:  3.54; xent: 1.26; lr: 0.00010; 10756/4227 tok/s;   1865 sec
[2021-04-23 07:17:39,813 INFO] Step 9750/50000; acc:  65.19; ppl:  3.49; xent: 1.25; lr: 0.00010; 10771/4226 tok/s;   1874 sec
[2021-04-23 07:17:48,849 INFO] Step 9800/50000; acc:  65.18; ppl:  3.50; xent: 1.25; lr: 0.00010; 11361/4393 tok/s;   1883 sec
[2021-04-23 07:17:59,164 INFO] Step 9850/50000; acc:  64.97; ppl:  3.50; xent: 1.25; lr: 0.00010; 9869/3920 tok/s;   1893 sec
[2021-04-23 07:18:09,049 INFO] Step 9900/50000; acc:  64.95; ppl:  3.52; xent: 1.26; lr: 0.00010; 10396/4031 tok/s;   1903 sec
[2021-04-23 07:18:18,239 INFO] Step 9950/50000; acc:  65.57; ppl:  3.44; xent: 1.24; lr: 0.00010; 11036/4421 tok/s;   1912 sec
[2021-04-23 07:18:27,881 INFO] Step 10000/50000; acc:  65.03; ppl:  3.53; xent: 1.26; lr: 0.00010; 10555/4175 tok/s;   1922 sec
[2021-04-23 07:18:27,885 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/loose/valid.txt, align=None)...
[2021-04-23 07:18:35,783 INFO] Validation perplexity: 3.39792
[2021-04-23 07:18:35,783 INFO] Validation accuracy: 66.0997
[2021-04-23 07:18:35,785 INFO] Saving checkpoint ../models/default_params/loose_ops/model_step_10000.pt
[2021-04-23 07:18:45,908 INFO] Step 10050/50000; acc:  65.18; ppl:  3.49; xent: 1.25; lr: 0.00010; 5638/2222 tok/s;   1940 sec
[2021-04-23 07:18:55,115 INFO] Step 10100/50000; acc:  65.75; ppl:  3.44; xent: 1.23; lr: 0.00010; 10981/4308 tok/s;   1949 sec
[2021-04-23 07:19:04,389 INFO] Step 10150/50000; acc:  65.12; ppl:  3.48; xent: 1.25; lr: 0.00010; 11198/4354 tok/s;   1958 sec
[2021-04-23 07:19:13,678 INFO] Step 10200/50000; acc:  65.90; ppl:  3.39; xent: 1.22; lr: 0.00010; 10652/4365 tok/s;   1968 sec
[2021-04-23 07:19:15,388 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 07:19:23,394 INFO] Step 10250/50000; acc:  65.35; ppl:  3.49; xent: 1.25; lr: 0.00010; 10725/4168 tok/s;   1977 sec
[2021-04-23 07:19:33,061 INFO] Step 10300/50000; acc:  65.47; ppl:  3.45; xent: 1.24; lr: 0.00010; 10327/4127 tok/s;   1987 sec
[2021-04-23 07:19:43,001 INFO] Step 10350/50000; acc:  65.76; ppl:  3.45; xent: 1.24; lr: 0.00010; 10194/4126 tok/s;   1997 sec
[2021-04-23 07:19:52,298 INFO] Step 10400/50000; acc:  65.30; ppl:  3.47; xent: 1.25; lr: 0.00010; 10942/4344 tok/s;   2006 sec
[2021-04-23 07:20:01,759 INFO] Step 10450/50000; acc:  65.63; ppl:  3.44; xent: 1.24; lr: 0.00010; 10555/4239 tok/s;   2016 sec
[2021-04-23 07:20:11,246 INFO] Step 10500/50000; acc:  65.40; ppl:  3.46; xent: 1.24; lr: 0.00010; 10894/4275 tok/s;   2025 sec
[2021-04-23 07:20:20,742 INFO] Step 10550/50000; acc:  65.54; ppl:  3.44; xent: 1.24; lr: 0.00010; 10854/4114 tok/s;   2035 sec
[2021-04-23 07:20:30,456 INFO] Step 10600/50000; acc:  65.34; ppl:  3.44; xent: 1.24; lr: 0.00010; 10414/4153 tok/s;   2045 sec
[2021-04-23 07:20:40,216 INFO] Step 10650/50000; acc:  65.66; ppl:  3.41; xent: 1.23; lr: 0.00010; 10379/4147 tok/s;   2054 sec
[2021-04-23 07:20:49,670 INFO] Step 10700/50000; acc:  65.14; ppl:  3.48; xent: 1.25; lr: 0.00010; 10897/4274 tok/s;   2064 sec
[2021-04-23 07:20:59,104 INFO] Step 10750/50000; acc:  65.79; ppl:  3.46; xent: 1.24; lr: 0.00010; 10682/4240 tok/s;   2073 sec
[2021-04-23 07:21:08,768 INFO] Step 10800/50000; acc:  65.94; ppl:  3.38; xent: 1.22; lr: 0.00010; 10631/4122 tok/s;   2083 sec
[2021-04-23 07:21:17,518 INFO] Step 10850/50000; acc:  65.91; ppl:  3.39; xent: 1.22; lr: 0.00010; 11575/4522 tok/s;   2092 sec
[2021-04-23 07:21:26,842 INFO] Step 10900/50000; acc:  65.98; ppl:  3.38; xent: 1.22; lr: 0.00010; 10749/4308 tok/s;   2101 sec
[2021-04-23 07:21:29,227 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 07:21:37,006 INFO] Step 10950/50000; acc:  65.57; ppl:  3.45; xent: 1.24; lr: 0.00010; 10328/4054 tok/s;   2111 sec
[2021-04-23 07:21:46,363 INFO] Step 11000/50000; acc:  66.35; ppl:  3.35; xent: 1.21; lr: 0.00010; 10582/4241 tok/s;   2120 sec
[2021-04-23 07:21:56,525 INFO] Step 11050/50000; acc:  65.16; ppl:  3.48; xent: 1.25; lr: 0.00010; 10146/3992 tok/s;   2131 sec
[2021-04-23 07:22:06,239 INFO] Step 11100/50000; acc:  65.72; ppl:  3.39; xent: 1.22; lr: 0.00010; 10267/4220 tok/s;   2140 sec
[2021-04-23 07:22:15,327 INFO] Step 11150/50000; acc:  65.60; ppl:  3.41; xent: 1.23; lr: 0.00010; 11189/4411 tok/s;   2149 sec
[2021-04-23 07:22:24,650 INFO] Step 11200/50000; acc:  66.07; ppl:  3.39; xent: 1.22; lr: 0.00010; 10863/4321 tok/s;   2159 sec
[2021-04-23 07:22:34,104 INFO] Step 11250/50000; acc:  65.98; ppl:  3.39; xent: 1.22; lr: 0.00010; 10690/4250 tok/s;   2168 sec
[2021-04-23 07:22:44,248 INFO] Step 11300/50000; acc:  65.64; ppl:  3.43; xent: 1.23; lr: 0.00010; 10358/3916 tok/s;   2178 sec
[2021-04-23 07:22:53,878 INFO] Step 11350/50000; acc:  65.95; ppl:  3.37; xent: 1.21; lr: 0.00010; 10491/4173 tok/s;   2188 sec
[2021-04-23 07:23:03,365 INFO] Step 11400/50000; acc:  66.00; ppl:  3.36; xent: 1.21; lr: 0.00010; 10688/4207 tok/s;   2197 sec
[2021-04-23 07:23:12,837 INFO] Step 11450/50000; acc:  65.69; ppl:  3.42; xent: 1.23; lr: 0.00010; 10659/4292 tok/s;   2207 sec
[2021-04-23 07:23:22,625 INFO] Step 11500/50000; acc:  66.09; ppl:  3.39; xent: 1.22; lr: 0.00010; 10526/4051 tok/s;   2217 sec
[2021-04-23 07:23:32,163 INFO] Step 11550/50000; acc:  66.24; ppl:  3.35; xent: 1.21; lr: 0.00010; 10688/4246 tok/s;   2226 sec
[2021-04-23 07:23:40,985 INFO] Step 11600/50000; acc:  66.23; ppl:  3.31; xent: 1.20; lr: 0.00010; 11455/4490 tok/s;   2235 sec
[2021-04-23 07:23:50,243 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 07:23:50,624 INFO] Step 11650/50000; acc:  66.38; ppl:  3.34; xent: 1.21; lr: 0.00010; 10542/4176 tok/s;   2245 sec
[2021-04-23 07:24:00,568 INFO] Step 11700/50000; acc:  66.38; ppl:  3.33; xent: 1.20; lr: 0.00010; 10114/4110 tok/s;   2255 sec
[2021-04-23 07:24:10,324 INFO] Step 11750/50000; acc:  65.76; ppl:  3.43; xent: 1.23; lr: 0.00010; 10746/4081 tok/s;   2264 sec
[2021-04-23 07:24:20,530 INFO] Step 11800/50000; acc:  66.50; ppl:  3.32; xent: 1.20; lr: 0.00010; 9592/4025 tok/s;   2275 sec
[2021-04-23 07:24:30,136 INFO] Step 11850/50000; acc:  65.68; ppl:  3.40; xent: 1.23; lr: 0.00010; 10798/4221 tok/s;   2284 sec
[2021-04-23 07:24:39,497 INFO] Step 11900/50000; acc:  66.27; ppl:  3.34; xent: 1.21; lr: 0.00010; 10695/4249 tok/s;   2294 sec
[2021-04-23 07:24:48,929 INFO] Step 11950/50000; acc:  66.08; ppl:  3.36; xent: 1.21; lr: 0.00010; 10741/4234 tok/s;   2303 sec
[2021-04-23 07:24:58,571 INFO] Step 12000/50000; acc:  66.17; ppl:  3.35; xent: 1.21; lr: 0.00010; 10716/4149 tok/s;   2313 sec
[2021-04-23 07:25:08,737 INFO] Step 12050/50000; acc:  66.01; ppl:  3.35; xent: 1.21; lr: 0.00010; 9939/4020 tok/s;   2323 sec
[2021-04-23 07:25:18,120 INFO] Step 12100/50000; acc:  66.36; ppl:  3.31; xent: 1.20; lr: 0.00010; 10967/4231 tok/s;   2332 sec
[2021-04-23 07:25:27,653 INFO] Step 12150/50000; acc:  66.20; ppl:  3.35; xent: 1.21; lr: 0.00010; 10684/4228 tok/s;   2342 sec
[2021-04-23 07:25:37,265 INFO] Step 12200/50000; acc:  66.32; ppl:  3.34; xent: 1.21; lr: 0.00010; 10512/4141 tok/s;   2351 sec
[2021-04-23 07:25:47,072 INFO] Step 12250/50000; acc:  66.69; ppl:  3.29; xent: 1.19; lr: 0.00010; 10367/4094 tok/s;   2361 sec
[2021-04-23 07:25:55,975 INFO] Step 12300/50000; acc:  66.29; ppl:  3.33; xent: 1.20; lr: 0.00010; 11533/4476 tok/s;   2370 sec
[2021-04-23 07:26:05,112 INFO] Step 12350/50000; acc:  66.44; ppl:  3.28; xent: 1.19; lr: 0.00010; 10990/4430 tok/s;   2379 sec
[2021-04-23 07:26:11,749 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 07:26:14,763 INFO] Step 12400/50000; acc:  66.35; ppl:  3.32; xent: 1.20; lr: 0.00010; 10592/4188 tok/s;   2389 sec
[2021-04-23 07:26:24,597 INFO] Step 12450/50000; acc:  66.73; ppl:  3.32; xent: 1.20; lr: 0.00010; 10342/4107 tok/s;   2399 sec
[2021-04-23 07:26:34,396 INFO] Step 12500/50000; acc:  66.27; ppl:  3.31; xent: 1.20; lr: 0.00010; 10257/4062 tok/s;   2408 sec
[2021-04-23 07:26:44,424 INFO] Step 12550/50000; acc:  65.98; ppl:  3.38; xent: 1.22; lr: 0.00010; 10363/4129 tok/s;   2418 sec
[2021-04-23 07:26:53,693 INFO] Step 12600/50000; acc:  66.79; ppl:  3.25; xent: 1.18; lr: 0.00010; 10637/4352 tok/s;   2428 sec
[2021-04-23 07:27:03,442 INFO] Step 12650/50000; acc:  66.10; ppl:  3.37; xent: 1.21; lr: 0.00010; 10618/4100 tok/s;   2438 sec
[2021-04-23 07:27:12,750 INFO] Step 12700/50000; acc:  66.57; ppl:  3.30; xent: 1.19; lr: 0.00010; 10835/4339 tok/s;   2447 sec
[2021-04-23 07:27:22,597 INFO] Step 12750/50000; acc:  66.53; ppl:  3.30; xent: 1.19; lr: 0.00010; 10431/4014 tok/s;   2457 sec
[2021-04-23 07:27:32,606 INFO] Step 12800/50000; acc:  66.24; ppl:  3.31; xent: 1.20; lr: 0.00010; 10243/4062 tok/s;   2467 sec
[2021-04-23 07:27:41,870 INFO] Step 12850/50000; acc:  66.80; ppl:  3.26; xent: 1.18; lr: 0.00010; 10790/4257 tok/s;   2476 sec
[2021-04-23 07:27:51,757 INFO] Step 12900/50000; acc:  66.22; ppl:  3.32; xent: 1.20; lr: 0.00010; 10475/4130 tok/s;   2486 sec
[2021-04-23 07:28:01,093 INFO] Step 12950/50000; acc:  66.98; ppl:  3.28; xent: 1.19; lr: 0.00010; 10859/4253 tok/s;   2495 sec
[2021-04-23 07:28:10,663 INFO] Step 13000/50000; acc:  66.24; ppl:  3.31; xent: 1.20; lr: 0.00010; 10615/4181 tok/s;   2505 sec
[2021-04-23 07:28:19,697 INFO] Step 13050/50000; acc:  66.69; ppl:  3.26; xent: 1.18; lr: 0.00010; 11155/4444 tok/s;   2514 sec
[2021-04-23 07:28:29,185 INFO] Step 13100/50000; acc:  66.95; ppl:  3.25; xent: 1.18; lr: 0.00010; 10782/4261 tok/s;   2523 sec
[2021-04-23 07:28:33,025 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 07:28:38,723 INFO] Step 13150/50000; acc:  66.73; ppl:  3.28; xent: 1.19; lr: 0.00010; 10628/4259 tok/s;   2533 sec
[2021-04-23 07:28:48,622 INFO] Step 13200/50000; acc:  66.70; ppl:  3.28; xent: 1.19; lr: 0.00010; 10348/3991 tok/s;   2543 sec
[2021-04-23 07:28:58,858 INFO] Step 13250/50000; acc:  66.58; ppl:  3.28; xent: 1.19; lr: 0.00010; 9879/4035 tok/s;   2553 sec
[2021-04-23 07:29:08,214 INFO] Step 13300/50000; acc:  66.89; ppl:  3.25; xent: 1.18; lr: 0.00010; 10701/4283 tok/s;   2562 sec
[2021-04-23 07:29:17,988 INFO] Step 13350/50000; acc:  66.60; ppl:  3.31; xent: 1.20; lr: 0.00010; 10666/4195 tok/s;   2572 sec
[2021-04-23 07:29:27,133 INFO] Step 13400/50000; acc:  67.14; ppl:  3.23; xent: 1.17; lr: 0.00010; 10741/4315 tok/s;   2581 sec
[2021-04-23 07:29:36,554 INFO] Step 13450/50000; acc:  66.41; ppl:  3.31; xent: 1.20; lr: 0.00010; 11177/4221 tok/s;   2591 sec
[2021-04-23 07:29:46,409 INFO] Step 13500/50000; acc:  66.66; ppl:  3.23; xent: 1.17; lr: 0.00010; 10223/4079 tok/s;   2600 sec
[2021-04-23 07:29:56,305 INFO] Step 13550/50000; acc:  66.67; ppl:  3.28; xent: 1.19; lr: 0.00010; 10291/4057 tok/s;   2610 sec
[2021-04-23 07:30:05,512 INFO] Step 13600/50000; acc:  67.01; ppl:  3.24; xent: 1.18; lr: 0.00010; 11112/4419 tok/s;   2620 sec
[2021-04-23 07:30:14,781 INFO] Step 13650/50000; acc:  67.02; ppl:  3.25; xent: 1.18; lr: 0.00010; 10800/4317 tok/s;   2629 sec
[2021-04-23 07:30:24,744 INFO] Step 13700/50000; acc:  66.72; ppl:  3.28; xent: 1.19; lr: 0.00010; 10410/4051 tok/s;   2639 sec
[2021-04-23 07:30:33,613 INFO] Step 13750/50000; acc:  67.02; ppl:  3.23; xent: 1.17; lr: 0.00010; 11431/4439 tok/s;   2648 sec
[2021-04-23 07:30:42,679 INFO] Step 13800/50000; acc:  67.23; ppl:  3.21; xent: 1.17; lr: 0.00010; 11118/4428 tok/s;   2657 sec
[2021-04-23 07:30:52,290 INFO] Step 13850/50000; acc:  67.42; ppl:  3.20; xent: 1.16; lr: 0.00010; 10520/4246 tok/s;   2666 sec
[2021-04-23 07:30:53,478 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 07:31:01,947 INFO] Step 13900/50000; acc:  66.91; ppl:  3.26; xent: 1.18; lr: 0.00010; 10678/4174 tok/s;   2676 sec
[2021-04-23 07:31:11,626 INFO] Step 13950/50000; acc:  66.83; ppl:  3.25; xent: 1.18; lr: 0.00010; 10415/4135 tok/s;   2686 sec
[2021-04-23 07:31:21,741 INFO] Step 14000/50000; acc:  67.16; ppl:  3.23; xent: 1.17; lr: 0.00010; 10020/4071 tok/s;   2696 sec
[2021-04-23 07:31:31,018 INFO] Step 14050/50000; acc:  66.95; ppl:  3.24; xent: 1.17; lr: 0.00010; 10946/4351 tok/s;   2705 sec
[2021-04-23 07:31:40,199 INFO] Step 14100/50000; acc:  67.11; ppl:  3.23; xent: 1.17; lr: 0.00010; 10893/4341 tok/s;   2714 sec
[2021-04-23 07:31:49,958 INFO] Step 14150/50000; acc:  66.87; ppl:  3.25; xent: 1.18; lr: 0.00010; 10718/4141 tok/s;   2724 sec
[2021-04-23 07:31:59,472 INFO] Step 14200/50000; acc:  67.07; ppl:  3.20; xent: 1.16; lr: 0.00010; 10582/4104 tok/s;   2734 sec
[2021-04-23 07:32:09,512 INFO] Step 14250/50000; acc:  66.78; ppl:  3.25; xent: 1.18; lr: 0.00010; 10335/4054 tok/s;   2744 sec
[2021-04-23 07:32:19,193 INFO] Step 14300/50000; acc:  67.10; ppl:  3.21; xent: 1.17; lr: 0.00010; 10359/4190 tok/s;   2753 sec
[2021-04-23 07:32:28,671 INFO] Step 14350/50000; acc:  67.05; ppl:  3.23; xent: 1.17; lr: 0.00010; 10768/4260 tok/s;   2763 sec
[2021-04-23 07:32:38,130 INFO] Step 14400/50000; acc:  67.04; ppl:  3.26; xent: 1.18; lr: 0.00010; 10746/4241 tok/s;   2772 sec
[2021-04-23 07:32:47,666 INFO] Step 14450/50000; acc:  67.65; ppl:  3.16; xent: 1.15; lr: 0.00010; 10607/4156 tok/s;   2782 sec
[2021-04-23 07:32:56,456 INFO] Step 14500/50000; acc:  67.20; ppl:  3.20; xent: 1.16; lr: 0.00010; 11717/4504 tok/s;   2791 sec
[2021-04-23 07:33:05,690 INFO] Step 14550/50000; acc:  67.25; ppl:  3.18; xent: 1.16; lr: 0.00010; 10914/4351 tok/s;   2800 sec
[2021-04-23 07:33:07,710 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 07:33:15,663 INFO] Step 14600/50000; acc:  67.21; ppl:  3.18; xent: 1.16; lr: 0.00010; 10200/4111 tok/s;   2810 sec
[2021-04-23 07:33:25,253 INFO] Step 14650/50000; acc:  67.46; ppl:  3.19; xent: 1.16; lr: 0.00010; 10560/4138 tok/s;   2819 sec
[2021-04-23 07:33:35,481 INFO] Step 14700/50000; acc:  66.97; ppl:  3.24; xent: 1.18; lr: 0.00010; 9982/3985 tok/s;   2830 sec
[2021-04-23 07:33:45,113 INFO] Step 14750/50000; acc:  67.02; ppl:  3.21; xent: 1.17; lr: 0.00010; 10470/4286 tok/s;   2839 sec
[2021-04-23 07:33:54,184 INFO] Step 14800/50000; acc:  67.19; ppl:  3.20; xent: 1.16; lr: 0.00010; 11219/4405 tok/s;   2848 sec
[2021-04-23 07:34:03,423 INFO] Step 14850/50000; acc:  67.22; ppl:  3.18; xent: 1.16; lr: 0.00010; 10901/4317 tok/s;   2857 sec
[2021-04-23 07:34:13,104 INFO] Step 14900/50000; acc:  67.40; ppl:  3.20; xent: 1.16; lr: 0.00010; 10500/4127 tok/s;   2867 sec
[2021-04-23 07:34:23,301 INFO] Step 14950/50000; acc:  66.63; ppl:  3.26; xent: 1.18; lr: 0.00010; 10386/3961 tok/s;   2877 sec
[2021-04-23 07:34:32,814 INFO] Step 15000/50000; acc:  67.70; ppl:  3.12; xent: 1.14; lr: 0.00010; 10378/4191 tok/s;   2887 sec
[2021-04-23 07:34:32,816 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/loose/valid.txt, align=None)...
[2021-04-23 07:34:40,718 INFO] Validation perplexity: 3.18579
[2021-04-23 07:34:40,718 INFO] Validation accuracy: 67.6295
[2021-04-23 07:34:40,720 INFO] Saving checkpoint ../models/default_params/loose_ops/model_step_15000.pt
[2021-04-23 07:34:50,887 INFO] Step 15050/50000; acc:  67.21; ppl:  3.20; xent: 1.16; lr: 0.00010; 5753/2226 tok/s;   2905 sec
[2021-04-23 07:35:00,288 INFO] Step 15100/50000; acc:  67.27; ppl:  3.19; xent: 1.16; lr: 0.00010; 10651/4301 tok/s;   2914 sec
[2021-04-23 07:35:10,048 INFO] Step 15150/50000; acc:  67.60; ppl:  3.17; xent: 1.15; lr: 0.00010; 10450/4052 tok/s;   2924 sec
[2021-04-23 07:35:19,437 INFO] Step 15200/50000; acc:  67.39; ppl:  3.17; xent: 1.15; lr: 0.00010; 10947/4325 tok/s;   2933 sec
[2021-04-23 07:35:28,249 INFO] Step 15250/50000; acc:  67.66; ppl:  3.13; xent: 1.14; lr: 0.00010; 11275/4514 tok/s;   2942 sec
[2021-04-23 07:35:37,312 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 07:35:38,089 INFO] Step 15300/50000; acc:  67.57; ppl:  3.18; xent: 1.16; lr: 0.00010; 10516/4100 tok/s;   2952 sec
[2021-04-23 07:35:47,868 INFO] Step 15350/50000; acc:  67.59; ppl:  3.14; xent: 1.14; lr: 0.00010; 10350/4122 tok/s;   2962 sec
[2021-04-23 07:35:57,500 INFO] Step 15400/50000; acc:  67.37; ppl:  3.20; xent: 1.16; lr: 0.00010; 10554/4137 tok/s;   2972 sec
[2021-04-23 07:36:07,643 INFO] Step 15450/50000; acc:  67.62; ppl:  3.15; xent: 1.15; lr: 0.00010; 9861/4092 tok/s;   2982 sec
[2021-04-23 07:36:17,153 INFO] Step 15500/50000; acc:  67.10; ppl:  3.19; xent: 1.16; lr: 0.00010; 10792/4236 tok/s;   2991 sec
[2021-04-23 07:36:26,591 INFO] Step 15550/50000; acc:  67.61; ppl:  3.17; xent: 1.15; lr: 0.00010; 10710/4259 tok/s;   3001 sec
[2021-04-23 07:36:36,030 INFO] Step 15600/50000; acc:  67.51; ppl:  3.18; xent: 1.16; lr: 0.00010; 10777/4183 tok/s;   3010 sec
[2021-04-23 07:36:45,711 INFO] Step 15650/50000; acc:  67.55; ppl:  3.15; xent: 1.15; lr: 0.00010; 10607/4142 tok/s;   3020 sec
[2021-04-23 07:36:55,870 INFO] Step 15700/50000; acc:  67.50; ppl:  3.16; xent: 1.15; lr: 0.00010; 9993/4029 tok/s;   3030 sec
[2021-04-23 07:37:05,196 INFO] Step 15750/50000; acc:  67.57; ppl:  3.15; xent: 1.15; lr: 0.00010; 11132/4254 tok/s;   3039 sec
[2021-04-23 07:37:14,584 INFO] Step 15800/50000; acc:  67.71; ppl:  3.12; xent: 1.14; lr: 0.00010; 10597/4309 tok/s;   3049 sec
[2021-04-23 07:37:24,322 INFO] Step 15850/50000; acc:  67.35; ppl:  3.19; xent: 1.16; lr: 0.00010; 10630/4113 tok/s;   3058 sec
[2021-04-23 07:37:34,047 INFO] Step 15900/50000; acc:  68.13; ppl:  3.10; xent: 1.13; lr: 0.00010; 10358/4110 tok/s;   3068 sec
[2021-04-23 07:37:42,787 INFO] Step 15950/50000; acc:  67.79; ppl:  3.14; xent: 1.15; lr: 0.00010; 11644/4538 tok/s;   3077 sec
[2021-04-23 07:37:52,073 INFO] Step 16000/50000; acc:  67.70; ppl:  3.12; xent: 1.14; lr: 0.00010; 10905/4356 tok/s;   3086 sec
[2021-04-23 07:37:58,428 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 07:38:01,740 INFO] Step 16050/50000; acc:  67.79; ppl:  3.11; xent: 1.14; lr: 0.00010; 10413/4193 tok/s;   3096 sec
[2021-04-23 07:38:11,758 INFO] Step 16100/50000; acc:  67.65; ppl:  3.16; xent: 1.15; lr: 0.00010; 10356/4052 tok/s;   3106 sec
[2021-04-23 07:38:21,592 INFO] Step 16150/50000; acc:  67.58; ppl:  3.15; xent: 1.15; lr: 0.00010; 10261/4074 tok/s;   3116 sec
[2021-04-23 07:38:31,292 INFO] Step 16200/50000; acc:  67.75; ppl:  3.13; xent: 1.14; lr: 0.00010; 10381/4212 tok/s;   3125 sec
[2021-04-23 07:38:40,635 INFO] Step 16250/50000; acc:  67.77; ppl:  3.11; xent: 1.13; lr: 0.00010; 10787/4328 tok/s;   3135 sec
[2021-04-23 07:38:50,298 INFO] Step 16300/50000; acc:  67.76; ppl:  3.15; xent: 1.15; lr: 0.00010; 10589/4141 tok/s;   3144 sec
[2021-04-23 07:38:59,654 INFO] Step 16350/50000; acc:  67.59; ppl:  3.15; xent: 1.15; lr: 0.00010; 10904/4309 tok/s;   3154 sec
[2021-04-23 07:39:09,709 INFO] Step 16400/50000; acc:  67.69; ppl:  3.13; xent: 1.14; lr: 0.00010; 10248/3947 tok/s;   3164 sec
[2021-04-23 07:39:19,727 INFO] Step 16450/50000; acc:  67.86; ppl:  3.11; xent: 1.14; lr: 0.00010; 10168/4031 tok/s;   3174 sec
[2021-04-23 07:39:29,109 INFO] Step 16500/50000; acc:  68.07; ppl:  3.10; xent: 1.13; lr: 0.00010; 10699/4228 tok/s;   3183 sec
[2021-04-23 07:39:38,955 INFO] Step 16550/50000; acc:  67.39; ppl:  3.15; xent: 1.15; lr: 0.00010; 10624/4137 tok/s;   3193 sec
[2021-04-23 07:39:48,022 INFO] Step 16600/50000; acc:  68.46; ppl:  3.08; xent: 1.12; lr: 0.00010; 10912/4377 tok/s;   3202 sec
[2021-04-23 07:39:57,570 INFO] Step 16650/50000; acc:  67.32; ppl:  3.17; xent: 1.15; lr: 0.00010; 10919/4176 tok/s;   3212 sec
[2021-04-23 07:40:06,583 INFO] Step 16700/50000; acc:  68.14; ppl:  3.07; xent: 1.12; lr: 0.00010; 11078/4478 tok/s;   3221 sec
[2021-04-23 07:40:16,078 INFO] Step 16750/50000; acc:  68.35; ppl:  3.06; xent: 1.12; lr: 0.00010; 10663/4253 tok/s;   3230 sec
[2021-04-23 07:40:19,547 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 07:40:25,630 INFO] Step 16800/50000; acc:  67.85; ppl:  3.12; xent: 1.14; lr: 0.00010; 10723/4259 tok/s;   3240 sec
[2021-04-23 07:40:35,441 INFO] Step 16850/50000; acc:  68.22; ppl:  3.08; xent: 1.13; lr: 0.00010; 10241/4041 tok/s;   3250 sec
[2021-04-23 07:40:45,691 INFO] Step 16900/50000; acc:  67.99; ppl:  3.13; xent: 1.14; lr: 0.00010; 10053/4003 tok/s;   3260 sec
[2021-04-23 07:40:55,062 INFO] Step 16950/50000; acc:  67.80; ppl:  3.09; xent: 1.13; lr: 0.00010; 10743/4295 tok/s;   3269 sec
[2021-04-23 07:41:04,629 INFO] Step 17000/50000; acc:  67.78; ppl:  3.12; xent: 1.14; lr: 0.00010; 10558/4245 tok/s;   3279 sec
[2021-04-23 07:41:13,905 INFO] Step 17050/50000; acc:  68.21; ppl:  3.08; xent: 1.12; lr: 0.00010; 10844/4276 tok/s;   3288 sec
[2021-04-23 07:41:23,371 INFO] Step 17100/50000; acc:  67.87; ppl:  3.13; xent: 1.14; lr: 0.00010; 11016/4223 tok/s;   3297 sec
[2021-04-23 07:41:33,155 INFO] Step 17150/50000; acc:  67.72; ppl:  3.09; xent: 1.13; lr: 0.00010; 10394/4094 tok/s;   3307 sec
[2021-04-23 07:41:43,024 INFO] Step 17200/50000; acc:  68.01; ppl:  3.11; xent: 1.13; lr: 0.00010; 10333/4059 tok/s;   3317 sec
[2021-04-23 07:41:52,431 INFO] Step 17250/50000; acc:  68.08; ppl:  3.07; xent: 1.12; lr: 0.00010; 10827/4312 tok/s;   3326 sec
[2021-04-23 07:42:01,917 INFO] Step 17300/50000; acc:  67.94; ppl:  3.10; xent: 1.13; lr: 0.00010; 10588/4261 tok/s;   3336 sec
[2021-04-23 07:42:11,740 INFO] Step 17350/50000; acc:  68.00; ppl:  3.11; xent: 1.14; lr: 0.00010; 10676/4081 tok/s;   3346 sec
[2021-04-23 07:42:20,459 INFO] Step 17400/50000; acc:  68.53; ppl:  3.02; xent: 1.10; lr: 0.00010; 11335/4527 tok/s;   3355 sec
[2021-04-23 07:42:29,804 INFO] Step 17450/50000; acc:  67.71; ppl:  3.10; xent: 1.13; lr: 0.00010; 11082/4311 tok/s;   3364 sec
[2021-04-23 07:42:39,402 INFO] Step 17500/50000; acc:  68.43; ppl:  3.03; xent: 1.11; lr: 0.00010; 10440/4221 tok/s;   3373 sec
[2021-04-23 07:42:40,286 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 07:42:49,111 INFO] Step 17550/50000; acc:  68.50; ppl:  3.06; xent: 1.12; lr: 0.00010; 10522/4147 tok/s;   3383 sec
[2021-04-23 07:42:58,928 INFO] Step 17600/50000; acc:  67.72; ppl:  3.11; xent: 1.14; lr: 0.00010; 10346/4100 tok/s;   3393 sec
[2021-04-23 07:43:08,919 INFO] Step 17650/50000; acc:  68.23; ppl:  3.06; xent: 1.12; lr: 0.00010; 9990/4125 tok/s;   3403 sec
[2021-04-23 07:43:18,155 INFO] Step 17700/50000; acc:  68.04; ppl:  3.10; xent: 1.13; lr: 0.00010; 11182/4364 tok/s;   3412 sec
[2021-04-23 07:43:27,528 INFO] Step 17750/50000; acc:  68.26; ppl:  3.06; xent: 1.12; lr: 0.00010; 10732/4252 tok/s;   3422 sec
[2021-04-23 07:43:37,038 INFO] Step 17800/50000; acc:  68.22; ppl:  3.06; xent: 1.12; lr: 0.00010; 10663/4236 tok/s;   3431 sec
[2021-04-23 07:43:46,604 INFO] Step 17850/50000; acc:  68.21; ppl:  3.07; xent: 1.12; lr: 0.00010; 10766/4117 tok/s;   3441 sec
[2021-04-23 07:43:56,511 INFO] Step 17900/50000; acc:  67.99; ppl:  3.08; xent: 1.12; lr: 0.00010; 10362/4113 tok/s;   3451 sec
[2021-04-23 07:44:05,930 INFO] Step 17950/50000; acc:  68.08; ppl:  3.05; xent: 1.11; lr: 0.00010; 10741/4265 tok/s;   3460 sec
[2021-04-23 07:44:15,546 INFO] Step 18000/50000; acc:  68.17; ppl:  3.06; xent: 1.12; lr: 0.00010; 10634/4211 tok/s;   3470 sec
[2021-04-23 07:44:24,910 INFO] Step 18050/50000; acc:  68.00; ppl:  3.09; xent: 1.13; lr: 0.00010; 10809/4270 tok/s;   3479 sec
[2021-04-23 07:44:34,496 INFO] Step 18100/50000; acc:  68.60; ppl:  3.01; xent: 1.10; lr: 0.00010; 10598/4170 tok/s;   3489 sec
[2021-04-23 07:44:43,527 INFO] Step 18150/50000; acc:  68.30; ppl:  3.05; xent: 1.12; lr: 0.00010; 11530/4378 tok/s;   3498 sec
[2021-04-23 07:44:52,714 INFO] Step 18200/50000; acc:  68.80; ppl:  3.01; xent: 1.10; lr: 0.00010; 10702/4381 tok/s;   3507 sec
[2021-04-23 07:44:54,385 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 07:45:02,589 INFO] Step 18250/50000; acc:  68.21; ppl:  3.07; xent: 1.12; lr: 0.00010; 10552/4138 tok/s;   3517 sec
[2021-04-23 07:45:12,143 INFO] Step 18300/50000; acc:  68.65; ppl:  3.04; xent: 1.11; lr: 0.00010; 10518/4172 tok/s;   3526 sec
[2021-04-23 07:45:22,149 INFO] Step 18350/50000; acc:  67.95; ppl:  3.07; xent: 1.12; lr: 0.00010; 10082/4038 tok/s;   3536 sec
[2021-04-23 07:45:31,823 INFO] Step 18400/50000; acc:  68.19; ppl:  3.07; xent: 1.12; lr: 0.00010; 10531/4277 tok/s;   3546 sec
[2021-04-23 07:45:41,066 INFO] Step 18450/50000; acc:  68.42; ppl:  3.03; xent: 1.11; lr: 0.00010; 10818/4318 tok/s;   3555 sec
[2021-04-23 07:45:50,312 INFO] Step 18500/50000; acc:  68.52; ppl:  3.05; xent: 1.11; lr: 0.00010; 11118/4313 tok/s;   3564 sec
[2021-04-23 07:45:59,884 INFO] Step 18550/50000; acc:  68.43; ppl:  3.04; xent: 1.11; lr: 0.00010; 10677/4161 tok/s;   3574 sec
[2021-04-23 07:46:10,092 INFO] Step 18600/50000; acc:  68.14; ppl:  3.07; xent: 1.12; lr: 0.00010; 10027/3943 tok/s;   3584 sec
[2021-04-23 07:46:19,719 INFO] Step 18650/50000; acc:  68.74; ppl:  3.00; xent: 1.10; lr: 0.00010; 10504/4180 tok/s;   3594 sec
[2021-04-23 07:46:29,193 INFO] Step 18700/50000; acc:  68.47; ppl:  3.03; xent: 1.11; lr: 0.00010; 10847/4222 tok/s;   3603 sec
[2021-04-23 07:46:38,923 INFO] Step 18750/50000; acc:  68.11; ppl:  3.07; xent: 1.12; lr: 0.00010; 10415/4182 tok/s;   3613 sec
[2021-04-23 07:46:48,635 INFO] Step 18800/50000; acc:  68.80; ppl:  3.01; xent: 1.10; lr: 0.00010; 10529/4070 tok/s;   3623 sec
[2021-04-23 07:46:57,879 INFO] Step 18850/50000; acc:  68.61; ppl:  3.00; xent: 1.10; lr: 0.00010; 11063/4352 tok/s;   3632 sec
[2021-04-23 07:47:06,753 INFO] Step 18900/50000; acc:  68.66; ppl:  2.99; xent: 1.10; lr: 0.00010; 11236/4494 tok/s;   3641 sec
[2021-04-23 07:47:15,416 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 07:47:16,767 INFO] Step 18950/50000; acc:  68.46; ppl:  3.06; xent: 1.12; lr: 0.00010; 10436/4086 tok/s;   3651 sec
[2021-04-23 07:47:26,308 INFO] Step 19000/50000; acc:  69.10; ppl:  2.97; xent: 1.09; lr: 0.00010; 10348/4196 tok/s;   3660 sec
[2021-04-23 07:47:36,325 INFO] Step 19050/50000; acc:  67.99; ppl:  3.10; xent: 1.13; lr: 0.00010; 10401/3995 tok/s;   3670 sec
[2021-04-23 07:47:46,127 INFO] Step 19100/50000; acc:  68.97; ppl:  2.97; xent: 1.09; lr: 0.00010; 10136/4205 tok/s;   3680 sec
[2021-04-23 07:47:55,580 INFO] Step 19150/50000; acc:  68.39; ppl:  3.04; xent: 1.11; lr: 0.00010; 10742/4267 tok/s;   3690 sec
[2021-04-23 07:48:05,082 INFO] Step 19200/50000; acc:  68.67; ppl:  3.03; xent: 1.11; lr: 0.00010; 10719/4235 tok/s;   3699 sec
[2021-04-23 07:48:14,401 INFO] Step 19250/50000; acc:  68.83; ppl:  3.00; xent: 1.10; lr: 0.00010; 10763/4244 tok/s;   3708 sec
[2021-04-23 07:48:24,100 INFO] Step 19300/50000; acc:  68.35; ppl:  3.04; xent: 1.11; lr: 0.00010; 10769/4135 tok/s;   3718 sec
[2021-04-23 07:48:34,527 INFO] Step 19350/50000; acc:  68.44; ppl:  3.01; xent: 1.10; lr: 0.00010; 9792/3932 tok/s;   3729 sec
[2021-04-23 07:48:43,671 INFO] Step 19400/50000; acc:  69.14; ppl:  2.97; xent: 1.09; lr: 0.00010; 10995/4288 tok/s;   3738 sec
[2021-04-23 07:48:53,136 INFO] Step 19450/50000; acc:  68.64; ppl:  3.01; xent: 1.10; lr: 0.00010; 10747/4294 tok/s;   3747 sec
[2021-04-23 07:49:02,739 INFO] Step 19500/50000; acc:  68.37; ppl:  3.03; xent: 1.11; lr: 0.00010; 10671/4138 tok/s;   3757 sec
[2021-04-23 07:49:12,473 INFO] Step 19550/50000; acc:  69.14; ppl:  2.97; xent: 1.09; lr: 0.00010; 10445/4156 tok/s;   3767 sec
[2021-04-23 07:49:21,344 INFO] Step 19600/50000; acc:  68.80; ppl:  3.01; xent: 1.10; lr: 0.00010; 11494/4478 tok/s;   3775 sec
[2021-04-23 07:49:30,883 INFO] Step 19650/50000; acc:  69.09; ppl:  2.97; xent: 1.09; lr: 0.00010; 10563/4226 tok/s;   3785 sec
[2021-04-23 07:49:36,765 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 07:49:40,533 INFO] Step 19700/50000; acc:  69.02; ppl:  2.99; xent: 1.09; lr: 0.00010; 10470/4204 tok/s;   3795 sec
[2021-04-23 07:49:50,696 INFO] Step 19750/50000; acc:  68.46; ppl:  3.03; xent: 1.11; lr: 0.00010; 10337/3996 tok/s;   3805 sec
[2021-04-23 07:50:00,247 INFO] Step 19800/50000; acc:  68.92; ppl:  2.97; xent: 1.09; lr: 0.00010; 10290/4182 tok/s;   3814 sec
[2021-04-23 07:50:10,250 INFO] Step 19850/50000; acc:  68.44; ppl:  3.04; xent: 1.11; lr: 0.00010; 10316/4105 tok/s;   3824 sec
[2021-04-23 07:50:19,459 INFO] Step 19900/50000; acc:  68.97; ppl:  2.95; xent: 1.08; lr: 0.00010; 10871/4377 tok/s;   3834 sec
[2021-04-23 07:50:28,987 INFO] Step 19950/50000; acc:  68.67; ppl:  3.01; xent: 1.10; lr: 0.00010; 10628/4181 tok/s;   3843 sec
[2021-04-23 07:50:38,437 INFO] Step 20000/50000; acc:  68.79; ppl:  3.01; xent: 1.10; lr: 0.00010; 10872/4266 tok/s;   3852 sec
[2021-04-23 07:50:38,440 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/loose/valid.txt, align=None)...
[2021-04-23 07:50:46,355 INFO] Validation perplexity: 3.07764
[2021-04-23 07:50:46,355 INFO] Validation accuracy: 68.4101
[2021-04-23 07:50:46,357 INFO] Saving checkpoint ../models/default_params/loose_ops/model_step_20000.pt
[2021-04-23 07:50:56,747 INFO] Step 20050/50000; acc:  68.67; ppl:  2.98; xent: 1.09; lr: 0.00010; 5546/2169 tok/s;   3871 sec
[2021-04-23 07:51:06,892 INFO] Step 20100/50000; acc:  68.63; ppl:  3.00; xent: 1.10; lr: 0.00010; 10210/3974 tok/s;   3881 sec
[2021-04-23 07:51:16,457 INFO] Step 20150/50000; acc:  69.21; ppl:  2.96; xent: 1.08; lr: 0.00010; 10568/4240 tok/s;   3891 sec
[2021-04-23 07:51:25,988 INFO] Step 20200/50000; acc:  68.83; ppl:  2.98; xent: 1.09; lr: 0.00010; 10639/4159 tok/s;   3900 sec
[2021-04-23 07:51:35,452 INFO] Step 20250/50000; acc:  68.92; ppl:  2.98; xent: 1.09; lr: 0.00010; 10689/4274 tok/s;   3910 sec
[2021-04-23 07:51:44,939 INFO] Step 20300/50000; acc:  68.71; ppl:  2.99; xent: 1.09; lr: 0.00010; 10870/4138 tok/s;   3919 sec
[2021-04-23 07:51:53,988 INFO] Step 20350/50000; acc:  69.10; ppl:  2.95; xent: 1.08; lr: 0.00010; 11121/4491 tok/s;   3928 sec
[2021-04-23 07:52:03,549 INFO] Step 20400/50000; acc:  69.40; ppl:  2.94; xent: 1.08; lr: 0.00010; 10635/4227 tok/s;   3938 sec
[2021-04-23 07:52:06,624 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 07:52:13,148 INFO] Step 20450/50000; acc:  68.86; ppl:  2.99; xent: 1.09; lr: 0.00010; 10600/4246 tok/s;   3947 sec
[2021-04-23 07:52:22,769 INFO] Step 20500/50000; acc:  68.92; ppl:  2.96; xent: 1.09; lr: 0.00010; 10486/4094 tok/s;   3957 sec
[2021-04-23 07:52:33,173 INFO] Step 20550/50000; acc:  68.59; ppl:  3.03; xent: 1.11; lr: 0.00010; 10010/3987 tok/s;   3967 sec
[2021-04-23 07:52:42,305 INFO] Step 20600/50000; acc:  69.50; ppl:  2.91; xent: 1.07; lr: 0.00010; 10769/4385 tok/s;   3976 sec
[2021-04-23 07:52:52,087 INFO] Step 20650/50000; acc:  68.56; ppl:  3.01; xent: 1.10; lr: 0.00010; 10589/4129 tok/s;   3986 sec
[2021-04-23 07:53:01,327 INFO] Step 20700/50000; acc:  69.10; ppl:  2.94; xent: 1.08; lr: 0.00010; 10797/4307 tok/s;   3995 sec
[2021-04-23 07:53:10,768 INFO] Step 20750/50000; acc:  69.09; ppl:  2.96; xent: 1.09; lr: 0.00010; 10945/4234 tok/s;   4005 sec
[2021-04-23 07:53:20,556 INFO] Step 20800/50000; acc:  68.76; ppl:  2.97; xent: 1.09; lr: 0.00010; 10442/4119 tok/s;   4015 sec
[2021-04-23 07:53:30,027 INFO] Step 20850/50000; acc:  69.13; ppl:  2.97; xent: 1.09; lr: 0.00010; 10622/4207 tok/s;   4024 sec
[2021-04-23 07:53:39,624 INFO] Step 20900/50000; acc:  68.99; ppl:  2.96; xent: 1.09; lr: 0.00010; 10801/4228 tok/s;   4034 sec
[2021-04-23 07:53:49,109 INFO] Step 20950/50000; acc:  69.05; ppl:  2.97; xent: 1.09; lr: 0.00010; 10645/4251 tok/s;   4043 sec
[2021-04-23 07:53:58,585 INFO] Step 21000/50000; acc:  69.33; ppl:  2.93; xent: 1.07; lr: 0.00010; 10731/4166 tok/s;   4053 sec
[2021-04-23 07:54:07,626 INFO] Step 21050/50000; acc:  69.33; ppl:  2.93; xent: 1.08; lr: 0.00010; 11177/4440 tok/s;   4062 sec
[2021-04-23 07:54:16,985 INFO] Step 21100/50000; acc:  68.91; ppl:  2.96; xent: 1.08; lr: 0.00010; 10942/4294 tok/s;   4071 sec
[2021-04-23 07:54:20,812 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 07:54:26,550 INFO] Step 21150/50000; acc:  69.28; ppl:  2.93; xent: 1.08; lr: 0.00010; 10600/4258 tok/s;   4081 sec
[2021-04-23 07:54:36,556 INFO] Step 21200/50000; acc:  69.42; ppl:  2.95; xent: 1.08; lr: 0.00010; 10220/4015 tok/s;   4091 sec
[2021-04-23 07:54:46,037 INFO] Step 21250/50000; acc:  69.11; ppl:  2.97; xent: 1.09; lr: 0.00010; 10651/4226 tok/s;   4100 sec
[2021-04-23 07:54:55,991 INFO] Step 21300/50000; acc:  69.20; ppl:  2.94; xent: 1.08; lr: 0.00010; 10068/4119 tok/s;   4110 sec
[2021-04-23 07:55:05,474 INFO] Step 21350/50000; acc:  68.71; ppl:  2.99; xent: 1.09; lr: 0.00010; 11006/4298 tok/s;   4120 sec
[2021-04-23 07:55:14,390 INFO] Step 21400/50000; acc:  69.66; ppl:  2.89; xent: 1.06; lr: 0.00010; 11012/4458 tok/s;   4128 sec
[2021-04-23 07:55:24,299 INFO] Step 21450/50000; acc:  68.79; ppl:  2.98; xent: 1.09; lr: 0.00010; 10498/4064 tok/s;   4138 sec
[2021-04-23 07:55:33,928 INFO] Step 21500/50000; acc:  69.03; ppl:  2.94; xent: 1.08; lr: 0.00010; 10623/4102 tok/s;   4148 sec
[2021-04-23 07:55:43,606 INFO] Step 21550/50000; acc:  69.46; ppl:  2.92; xent: 1.07; lr: 0.00010; 10488/4179 tok/s;   4158 sec
[2021-04-23 07:55:53,320 INFO] Step 21600/50000; acc:  69.26; ppl:  2.93; xent: 1.08; lr: 0.00010; 10499/4129 tok/s;   4167 sec
[2021-04-23 07:56:02,797 INFO] Step 21650/50000; acc:  69.25; ppl:  2.93; xent: 1.07; lr: 0.00010; 10611/4261 tok/s;   4177 sec
[2021-04-23 07:56:12,365 INFO] Step 21700/50000; acc:  68.95; ppl:  2.97; xent: 1.09; lr: 0.00010; 10809/4201 tok/s;   4186 sec
[2021-04-23 07:56:21,980 INFO] Step 21750/50000; acc:  69.42; ppl:  2.90; xent: 1.06; lr: 0.00010; 10588/4190 tok/s;   4196 sec
[2021-04-23 07:56:30,731 INFO] Step 21800/50000; acc:  69.75; ppl:  2.88; xent: 1.06; lr: 0.00010; 11530/4466 tok/s;   4205 sec
[2021-04-23 07:56:40,072 INFO] Step 21850/50000; acc:  69.70; ppl:  2.90; xent: 1.06; lr: 0.00010; 10765/4327 tok/s;   4214 sec
[2021-04-23 07:56:41,408 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 07:56:49,958 INFO] Step 21900/50000; acc:  69.51; ppl:  2.93; xent: 1.07; lr: 0.00010; 10416/4156 tok/s;   4224 sec
[2021-04-23 07:56:59,510 INFO] Step 21950/50000; acc:  69.24; ppl:  2.92; xent: 1.07; lr: 0.00010; 10635/4167 tok/s;   4234 sec
[2021-04-23 07:57:09,651 INFO] Step 22000/50000; acc:  69.25; ppl:  2.93; xent: 1.07; lr: 0.00010; 9966/4002 tok/s;   4244 sec
[2021-04-23 07:57:19,384 INFO] Step 22050/50000; acc:  69.19; ppl:  2.96; xent: 1.08; lr: 0.00010; 10427/4192 tok/s;   4253 sec
[2021-04-23 07:57:28,629 INFO] Step 22100/50000; acc:  69.34; ppl:  2.89; xent: 1.06; lr: 0.00010; 10858/4336 tok/s;   4263 sec
[2021-04-23 07:57:37,894 INFO] Step 22150/50000; acc:  69.30; ppl:  2.96; xent: 1.08; lr: 0.00010; 11201/4304 tok/s;   4272 sec
[2021-04-23 07:57:47,566 INFO] Step 22200/50000; acc:  69.53; ppl:  2.90; xent: 1.06; lr: 0.00010; 10349/4129 tok/s;   4282 sec
[2021-04-23 07:57:58,068 INFO] Step 22250/50000; acc:  68.83; ppl:  2.96; xent: 1.09; lr: 0.00010; 9971/3874 tok/s;   4292 sec
[2021-04-23 07:58:07,528 INFO] Step 22300/50000; acc:  69.78; ppl:  2.86; xent: 1.05; lr: 0.00010; 10612/4217 tok/s;   4302 sec
[2021-04-23 07:58:16,970 INFO] Step 22350/50000; acc:  69.40; ppl:  2.90; xent: 1.06; lr: 0.00010; 10773/4251 tok/s;   4311 sec
[2021-04-23 07:58:26,490 INFO] Step 22400/50000; acc:  69.29; ppl:  2.94; xent: 1.08; lr: 0.00010; 10706/4227 tok/s;   4321 sec
[2021-04-23 07:58:36,067 INFO] Step 22450/50000; acc:  69.81; ppl:  2.87; xent: 1.06; lr: 0.00010; 10525/4131 tok/s;   4330 sec
[2021-04-23 07:58:45,281 INFO] Step 22500/50000; acc:  69.37; ppl:  2.91; xent: 1.07; lr: 0.00010; 11293/4398 tok/s;   4339 sec
[2021-04-23 07:58:54,368 INFO] Step 22550/50000; acc:  69.72; ppl:  2.87; xent: 1.06; lr: 0.00010; 11040/4401 tok/s;   4348 sec
[2021-04-23 07:59:02,468 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 07:59:04,068 INFO] Step 22600/50000; acc:  69.48; ppl:  2.90; xent: 1.06; lr: 0.00010; 10440/4166 tok/s;   4358 sec
[2021-04-23 07:59:13,741 INFO] Step 22650/50000; acc:  69.92; ppl:  2.87; xent: 1.05; lr: 0.00010; 10447/4165 tok/s;   4368 sec
[2021-04-23 07:59:23,721 INFO] Step 22700/50000; acc:  68.92; ppl:  2.96; xent: 1.08; lr: 0.00010; 10312/4011 tok/s;   4378 sec
[2021-04-23 07:59:33,432 INFO] Step 22750/50000; acc:  69.77; ppl:  2.88; xent: 1.06; lr: 0.00010; 10332/4244 tok/s;   4387 sec
[2021-04-23 07:59:42,977 INFO] Step 22800/50000; acc:  69.47; ppl:  2.90; xent: 1.07; lr: 0.00010; 10673/4233 tok/s;   4397 sec
[2021-04-23 07:59:52,404 INFO] Step 22850/50000; acc:  69.58; ppl:  2.90; xent: 1.07; lr: 0.00010; 10742/4247 tok/s;   4406 sec
[2021-04-23 08:00:01,813 INFO] Step 22900/50000; acc:  69.62; ppl:  2.88; xent: 1.06; lr: 0.00010; 10706/4236 tok/s;   4416 sec
[2021-04-23 08:00:11,620 INFO] Step 22950/50000; acc:  69.42; ppl:  2.92; xent: 1.07; lr: 0.00010; 10770/4090 tok/s;   4426 sec
[2021-04-23 08:00:21,725 INFO] Step 23000/50000; acc:  69.78; ppl:  2.86; xent: 1.05; lr: 0.00010; 9848/4012 tok/s;   4436 sec
[2021-04-23 08:00:31,035 INFO] Step 23050/50000; acc:  69.56; ppl:  2.89; xent: 1.06; lr: 0.00010; 11104/4223 tok/s;   4445 sec
[2021-04-23 08:00:40,593 INFO] Step 23100/50000; acc:  69.80; ppl:  2.87; xent: 1.05; lr: 0.00010; 10541/4280 tok/s;   4455 sec
[2021-04-23 08:00:50,224 INFO] Step 23150/50000; acc:  69.84; ppl:  2.88; xent: 1.06; lr: 0.00010; 10535/4139 tok/s;   4464 sec
[2021-04-23 08:00:59,921 INFO] Step 23200/50000; acc:  69.86; ppl:  2.87; xent: 1.06; lr: 0.00010; 10579/4153 tok/s;   4474 sec
[2021-04-23 08:01:08,730 INFO] Step 23250/50000; acc:  69.73; ppl:  2.87; xent: 1.05; lr: 0.00010; 11369/4500 tok/s;   4483 sec
[2021-04-23 08:01:18,435 INFO] Step 23300/50000; acc:  69.78; ppl:  2.87; xent: 1.06; lr: 0.00010; 10589/4188 tok/s;   4492 sec
[2021-04-23 08:01:23,711 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 08:01:27,817 INFO] Step 23350/50000; acc:  69.67; ppl:  2.87; xent: 1.06; lr: 0.00010; 10820/4303 tok/s;   4502 sec
[2021-04-23 08:01:37,930 INFO] Step 23400/50000; acc:  69.77; ppl:  2.87; xent: 1.05; lr: 0.00010; 10070/3981 tok/s;   4512 sec
[2021-04-23 08:01:47,808 INFO] Step 23450/50000; acc:  69.62; ppl:  2.89; xent: 1.06; lr: 0.00010; 10174/4080 tok/s;   4522 sec
[2021-04-23 08:01:57,400 INFO] Step 23500/50000; acc:  69.80; ppl:  2.88; xent: 1.06; lr: 0.00010; 10650/4276 tok/s;   4531 sec
[2021-04-23 08:02:06,810 INFO] Step 23550/50000; acc:  69.84; ppl:  2.86; xent: 1.05; lr: 0.00010; 10742/4304 tok/s;   4541 sec
[2021-04-23 08:02:16,171 INFO] Step 23600/50000; acc:  69.79; ppl:  2.88; xent: 1.06; lr: 0.00010; 10829/4235 tok/s;   4550 sec
[2021-04-23 08:02:25,510 INFO] Step 23650/50000; acc:  69.67; ppl:  2.88; xent: 1.06; lr: 0.00010; 10981/4290 tok/s;   4560 sec
[2021-04-23 08:02:35,389 INFO] Step 23700/50000; acc:  69.63; ppl:  2.87; xent: 1.05; lr: 0.00010; 10295/4036 tok/s;   4569 sec
[2021-04-23 08:02:45,426 INFO] Step 23750/50000; acc:  69.40; ppl:  2.89; xent: 1.06; lr: 0.00010; 10423/4031 tok/s;   4579 sec
[2021-04-23 08:02:54,869 INFO] Step 23800/50000; acc:  70.01; ppl:  2.82; xent: 1.04; lr: 0.00010; 10466/4268 tok/s;   4589 sec
[2021-04-23 08:03:04,441 INFO] Step 23850/50000; acc:  69.52; ppl:  2.90; xent: 1.06; lr: 0.00010; 10852/4155 tok/s;   4599 sec
[2021-04-23 08:03:13,986 INFO] Step 23900/50000; acc:  69.92; ppl:  2.84; xent: 1.04; lr: 0.00010; 10498/4253 tok/s;   4608 sec
[2021-04-23 08:03:23,215 INFO] Step 23950/50000; acc:  69.95; ppl:  2.84; xent: 1.04; lr: 0.00010; 11073/4262 tok/s;   4617 sec
[2021-04-23 08:03:32,339 INFO] Step 24000/50000; acc:  69.99; ppl:  2.84; xent: 1.05; lr: 0.00010; 11112/4444 tok/s;   4626 sec
[2021-04-23 08:03:41,754 INFO] Step 24050/50000; acc:  70.25; ppl:  2.82; xent: 1.04; lr: 0.00010; 10642/4274 tok/s;   4636 sec
[2021-04-23 08:03:44,573 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 08:03:51,501 INFO] Step 24100/50000; acc:  69.59; ppl:  2.88; xent: 1.06; lr: 0.00010; 10639/4214 tok/s;   4646 sec
[2021-04-23 08:04:01,162 INFO] Step 24150/50000; acc:  69.77; ppl:  2.87; xent: 1.06; lr: 0.00010; 10482/4079 tok/s;   4655 sec
[2021-04-23 08:04:11,224 INFO] Step 24200/50000; acc:  69.90; ppl:  2.85; xent: 1.05; lr: 0.00010; 10029/4086 tok/s;   4665 sec
[2021-04-23 08:04:20,496 INFO] Step 24250/50000; acc:  70.23; ppl:  2.83; xent: 1.04; lr: 0.00010; 10845/4330 tok/s;   4675 sec
[2021-04-23 08:04:30,110 INFO] Step 24300/50000; acc:  69.86; ppl:  2.87; xent: 1.06; lr: 0.00010; 10661/4187 tok/s;   4684 sec
[2021-04-23 08:04:39,347 INFO] Step 24350/50000; acc:  69.89; ppl:  2.84; xent: 1.05; lr: 0.00010; 10911/4340 tok/s;   4693 sec
[2021-04-23 08:04:48,897 INFO] Step 24400/50000; acc:  69.82; ppl:  2.86; xent: 1.05; lr: 0.00010; 10861/4169 tok/s;   4703 sec
[2021-04-23 08:04:58,911 INFO] Step 24450/50000; acc:  69.87; ppl:  2.85; xent: 1.05; lr: 0.00010; 10148/4021 tok/s;   4713 sec
[2021-04-23 08:05:08,548 INFO] Step 24500/50000; acc:  70.10; ppl:  2.84; xent: 1.04; lr: 0.00010; 10473/4139 tok/s;   4723 sec
[2021-04-23 08:05:18,083 INFO] Step 24550/50000; acc:  69.79; ppl:  2.87; xent: 1.05; lr: 0.00010; 10985/4282 tok/s;   4732 sec
[2021-04-23 08:05:27,432 INFO] Step 24600/50000; acc:  70.26; ppl:  2.82; xent: 1.04; lr: 0.00010; 10537/4297 tok/s;   4741 sec
[2021-04-23 08:05:37,273 INFO] Step 24650/50000; acc:  69.85; ppl:  2.86; xent: 1.05; lr: 0.00010; 10596/4024 tok/s;   4751 sec
[2021-04-23 08:05:46,081 INFO] Step 24700/50000; acc:  70.30; ppl:  2.81; xent: 1.03; lr: 0.00010; 11383/4521 tok/s;   4760 sec
[2021-04-23 08:05:55,481 INFO] Step 24750/50000; acc:  70.08; ppl:  2.83; xent: 1.04; lr: 0.00010; 10766/4260 tok/s;   4770 sec
[2021-04-23 08:05:58,884 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 08:06:05,339 INFO] Step 24800/50000; acc:  69.99; ppl:  2.84; xent: 1.04; lr: 0.00010; 10393/4180 tok/s;   4779 sec
[2021-04-23 08:06:15,053 INFO] Step 24850/50000; acc:  70.54; ppl:  2.79; xent: 1.03; lr: 0.00010; 10344/4094 tok/s;   4789 sec
[2021-04-23 08:06:24,730 INFO] Step 24900/50000; acc:  69.61; ppl:  2.89; xent: 1.06; lr: 0.00010; 10640/4198 tok/s;   4799 sec
[2021-04-23 08:06:34,659 INFO] Step 24950/50000; acc:  70.18; ppl:  2.82; xent: 1.04; lr: 0.00010; 10138/4091 tok/s;   4809 sec
[2021-04-23 08:06:44,104 INFO] Step 25000/50000; acc:  69.95; ppl:  2.85; xent: 1.05; lr: 0.00010; 10722/4323 tok/s;   4818 sec
[2021-04-23 08:06:44,105 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/loose/valid.txt, align=None)...
[2021-04-23 08:06:52,023 INFO] Validation perplexity: 3.01747
[2021-04-23 08:06:52,023 INFO] Validation accuracy: 68.9489
[2021-04-23 08:06:52,025 INFO] Saving checkpoint ../models/default_params/loose_ops/model_step_25000.pt
[2021-04-23 08:07:01,836 INFO] Step 25050/50000; acc:  70.48; ppl:  2.80; xent: 1.03; lr: 0.00010; 5657/2228 tok/s;   4836 sec
[2021-04-23 08:07:11,585 INFO] Step 25100/50000; acc:  70.02; ppl:  2.84; xent: 1.04; lr: 0.00010; 10565/4156 tok/s;   4846 sec
[2021-04-23 08:07:21,313 INFO] Step 25150/50000; acc:  69.82; ppl:  2.85; xent: 1.05; lr: 0.00010; 10605/4058 tok/s;   4855 sec
[2021-04-23 08:07:30,981 INFO] Step 25200/50000; acc:  70.13; ppl:  2.81; xent: 1.03; lr: 0.00010; 10539/4194 tok/s;   4865 sec
[2021-04-23 08:07:40,643 INFO] Step 25250/50000; acc:  70.34; ppl:  2.80; xent: 1.03; lr: 0.00010; 10513/4146 tok/s;   4875 sec
[2021-04-23 08:07:49,962 INFO] Step 25300/50000; acc:  70.04; ppl:  2.85; xent: 1.05; lr: 0.00010; 10807/4293 tok/s;   4884 sec
[2021-04-23 08:07:59,717 INFO] Step 25350/50000; acc:  69.97; ppl:  2.85; xent: 1.05; lr: 0.00010; 10724/4164 tok/s;   4894 sec
[2021-04-23 08:08:09,217 INFO] Step 25400/50000; acc:  70.46; ppl:  2.77; xent: 1.02; lr: 0.00010; 10465/4217 tok/s;   4903 sec
[2021-04-23 08:08:18,179 INFO] Step 25450/50000; acc:  70.07; ppl:  2.82; xent: 1.04; lr: 0.00010; 11536/4389 tok/s;   4912 sec
[2021-04-23 08:08:27,548 INFO] Step 25500/50000; acc:  70.86; ppl:  2.77; xent: 1.02; lr: 0.00010; 10668/4309 tok/s;   4922 sec
[2021-04-23 08:08:28,414 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 08:08:37,203 INFO] Step 25550/50000; acc:  70.38; ppl:  2.79; xent: 1.03; lr: 0.00010; 10539/4221 tok/s;   4931 sec
[2021-04-23 08:08:46,898 INFO] Step 25600/50000; acc:  70.01; ppl:  2.84; xent: 1.04; lr: 0.00010; 10570/4131 tok/s;   4941 sec
[2021-04-23 08:08:56,946 INFO] Step 25650/50000; acc:  70.15; ppl:  2.80; xent: 1.03; lr: 0.00010; 9881/4051 tok/s;   4951 sec
[2021-04-23 08:09:06,688 INFO] Step 25700/50000; acc:  69.71; ppl:  2.86; xent: 1.05; lr: 0.00010; 10611/4157 tok/s;   4961 sec
[2021-04-23 08:09:16,195 INFO] Step 25750/50000; acc:  70.24; ppl:  2.80; xent: 1.03; lr: 0.00010; 10626/4235 tok/s;   4970 sec
[2021-04-23 08:09:25,381 INFO] Step 25800/50000; acc:  70.46; ppl:  2.80; xent: 1.03; lr: 0.00010; 10940/4312 tok/s;   4979 sec
[2021-04-23 08:09:34,985 INFO] Step 25850/50000; acc:  70.39; ppl:  2.81; xent: 1.03; lr: 0.00010; 10663/4193 tok/s;   4989 sec
[2021-04-23 08:09:45,160 INFO] Step 25900/50000; acc:  69.85; ppl:  2.82; xent: 1.04; lr: 0.00010; 10181/3967 tok/s;   4999 sec
[2021-04-23 08:09:54,560 INFO] Step 25950/50000; acc:  70.53; ppl:  2.77; xent: 1.02; lr: 0.00010; 10766/4262 tok/s;   5009 sec
[2021-04-23 08:10:04,106 INFO] Step 26000/50000; acc:  70.24; ppl:  2.81; xent: 1.03; lr: 0.00010; 10703/4214 tok/s;   5018 sec
[2021-04-23 08:10:13,733 INFO] Step 26050/50000; acc:  70.26; ppl:  2.82; xent: 1.04; lr: 0.00010; 10534/4171 tok/s;   5028 sec
[2021-04-23 08:10:23,344 INFO] Step 26100/50000; acc:  70.68; ppl:  2.77; xent: 1.02; lr: 0.00010; 10523/4109 tok/s;   5037 sec
[2021-04-23 08:10:32,525 INFO] Step 26150/50000; acc:  70.09; ppl:  2.82; xent: 1.04; lr: 0.00010; 11426/4431 tok/s;   5047 sec
[2021-04-23 08:10:41,344 INFO] Step 26200/50000; acc:  70.89; ppl:  2.74; xent: 1.01; lr: 0.00010; 11103/4511 tok/s;   5055 sec
[2021-04-23 08:10:49,070 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 08:10:51,135 INFO] Step 26250/50000; acc:  70.27; ppl:  2.81; xent: 1.03; lr: 0.00010; 10624/4151 tok/s;   5065 sec
[2021-04-23 08:11:00,696 INFO] Step 26300/50000; acc:  70.75; ppl:  2.76; xent: 1.02; lr: 0.00010; 10483/4203 tok/s;   5075 sec
[2021-04-23 08:11:10,804 INFO] Step 26350/50000; acc:  70.10; ppl:  2.82; xent: 1.04; lr: 0.00010; 10065/3957 tok/s;   5085 sec
[2021-04-23 08:11:20,581 INFO] Step 26400/50000; acc:  70.33; ppl:  2.80; xent: 1.03; lr: 0.00010; 10353/4237 tok/s;   5095 sec
[2021-04-23 08:11:29,914 INFO] Step 26450/50000; acc:  70.83; ppl:  2.76; xent: 1.01; lr: 0.00010; 10758/4311 tok/s;   5104 sec
[2021-04-23 08:11:39,380 INFO] Step 26500/50000; acc:  70.17; ppl:  2.83; xent: 1.04; lr: 0.00010; 10885/4235 tok/s;   5113 sec
[2021-04-23 08:11:48,922 INFO] Step 26550/50000; acc:  70.28; ppl:  2.81; xent: 1.03; lr: 0.00010; 10625/4221 tok/s;   5123 sec
[2021-04-23 08:11:58,631 INFO] Step 26600/50000; acc:  70.57; ppl:  2.77; xent: 1.02; lr: 0.00010; 10564/4053 tok/s;   5133 sec
[2021-04-23 08:12:08,874 INFO] Step 26650/50000; acc:  70.47; ppl:  2.78; xent: 1.02; lr: 0.00010; 9913/3984 tok/s;   5143 sec
[2021-04-23 08:12:18,087 INFO] Step 26700/50000; acc:  70.60; ppl:  2.77; xent: 1.02; lr: 0.00010; 11112/4297 tok/s;   5152 sec
[2021-04-23 08:12:27,706 INFO] Step 26750/50000; acc:  70.29; ppl:  2.78; xent: 1.02; lr: 0.00010; 10567/4250 tok/s;   5162 sec
[2021-04-23 08:12:37,401 INFO] Step 26800/50000; acc:  70.47; ppl:  2.78; xent: 1.02; lr: 0.00010; 10498/4087 tok/s;   5171 sec
[2021-04-23 08:12:47,039 INFO] Step 26850/50000; acc:  70.61; ppl:  2.76; xent: 1.02; lr: 0.00010; 10573/4176 tok/s;   5181 sec
[2021-04-23 08:12:55,891 INFO] Step 26900/50000; acc:  70.43; ppl:  2.77; xent: 1.02; lr: 0.00010; 11359/4476 tok/s;   5190 sec
[2021-04-23 08:13:05,520 INFO] Step 26950/50000; acc:  70.45; ppl:  2.79; xent: 1.03; lr: 0.00010; 10794/4234 tok/s;   5200 sec
[2021-04-23 08:13:10,458 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 08:13:14,919 INFO] Step 27000/50000; acc:  70.65; ppl:  2.76; xent: 1.01; lr: 0.00010; 10535/4277 tok/s;   5209 sec
[2021-04-23 08:13:25,227 INFO] Step 27050/50000; acc:  70.51; ppl:  2.79; xent: 1.02; lr: 0.00010; 10154/3920 tok/s;   5219 sec
[2021-04-23 08:13:35,135 INFO] Step 27100/50000; acc:  70.67; ppl:  2.76; xent: 1.01; lr: 0.00010; 10034/4103 tok/s;   5229 sec
[2021-04-23 08:13:44,692 INFO] Step 27150/50000; acc:  70.60; ppl:  2.77; xent: 1.02; lr: 0.00010; 10584/4256 tok/s;   5239 sec
[2021-04-23 08:13:54,195 INFO] Step 27200/50000; acc:  70.49; ppl:  2.78; xent: 1.02; lr: 0.00010; 10710/4252 tok/s;   5248 sec
[2021-04-23 08:14:03,333 INFO] Step 27250/50000; acc:  70.88; ppl:  2.74; xent: 1.01; lr: 0.00010; 10911/4316 tok/s;   5257 sec
[2021-04-23 08:14:12,769 INFO] Step 27300/50000; acc:  70.36; ppl:  2.78; xent: 1.02; lr: 0.00010; 11093/4273 tok/s;   5267 sec
[2021-04-23 08:14:22,772 INFO] Step 27350/50000; acc:  70.43; ppl:  2.76; xent: 1.02; lr: 0.00010; 10216/3994 tok/s;   5277 sec
[2021-04-23 08:14:32,649 INFO] Step 27400/50000; acc:  70.62; ppl:  2.76; xent: 1.02; lr: 0.00010; 10270/4070 tok/s;   5287 sec
[2021-04-23 08:14:41,947 INFO] Step 27450/50000; acc:  70.76; ppl:  2.75; xent: 1.01; lr: 0.00010; 10863/4331 tok/s;   5296 sec
[2021-04-23 08:14:51,598 INFO] Step 27500/50000; acc:  70.57; ppl:  2.77; xent: 1.02; lr: 0.00010; 10639/4163 tok/s;   5306 sec
[2021-04-23 08:15:01,099 INFO] Step 27550/50000; acc:  70.73; ppl:  2.77; xent: 1.02; lr: 0.00010; 10667/4255 tok/s;   5315 sec
[2021-04-23 08:15:10,326 INFO] Step 27600/50000; acc:  70.87; ppl:  2.74; xent: 1.01; lr: 0.00010; 11094/4285 tok/s;   5324 sec
[2021-04-23 08:15:19,487 INFO] Step 27650/50000; acc:  70.97; ppl:  2.74; xent: 1.01; lr: 0.00010; 11001/4381 tok/s;   5334 sec
[2021-04-23 08:15:28,950 INFO] Step 27700/50000; acc:  70.88; ppl:  2.73; xent: 1.00; lr: 0.00010; 10649/4275 tok/s;   5343 sec
[2021-04-23 08:15:31,387 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 08:15:38,683 INFO] Step 27750/50000; acc:  70.55; ppl:  2.78; xent: 1.02; lr: 0.00010; 10759/4200 tok/s;   5353 sec
[2021-04-23 08:15:48,267 INFO] Step 27800/50000; acc:  70.97; ppl:  2.73; xent: 1.00; lr: 0.00010; 10301/4104 tok/s;   5362 sec
[2021-04-23 08:15:58,316 INFO] Step 27850/50000; acc:  70.51; ppl:  2.79; xent: 1.03; lr: 0.00010; 10288/4098 tok/s;   5372 sec
[2021-04-23 08:16:07,612 INFO] Step 27900/50000; acc:  70.87; ppl:  2.73; xent: 1.00; lr: 0.00010; 10752/4372 tok/s;   5382 sec
[2021-04-23 08:16:17,028 INFO] Step 27950/50000; acc:  70.84; ppl:  2.74; xent: 1.01; lr: 0.00010; 10761/4210 tok/s;   5391 sec
[2021-04-23 08:16:26,425 INFO] Step 28000/50000; acc:  70.57; ppl:  2.75; xent: 1.01; lr: 0.00010; 10817/4314 tok/s;   5400 sec
[2021-04-23 08:16:35,765 INFO] Step 28050/50000; acc:  70.94; ppl:  2.72; xent: 1.00; lr: 0.00010; 10940/4221 tok/s;   5410 sec
[2021-04-23 08:16:45,820 INFO] Step 28100/50000; acc:  70.50; ppl:  2.78; xent: 1.02; lr: 0.00010; 10298/4053 tok/s;   5420 sec
[2021-04-23 08:16:55,524 INFO] Step 28150/50000; acc:  70.87; ppl:  2.75; xent: 1.01; lr: 0.00010; 10452/4127 tok/s;   5430 sec
[2021-04-23 08:17:04,904 INFO] Step 28200/50000; acc:  70.70; ppl:  2.73; xent: 1.00; lr: 0.00010; 10825/4315 tok/s;   5439 sec
[2021-04-23 08:17:14,342 INFO] Step 28250/50000; acc:  70.67; ppl:  2.75; xent: 1.01; lr: 0.00010; 10665/4212 tok/s;   5448 sec
[2021-04-23 08:17:24,155 INFO] Step 28300/50000; acc:  70.93; ppl:  2.73; xent: 1.00; lr: 0.00010; 10530/4082 tok/s;   5458 sec
[2021-04-23 08:17:32,916 INFO] Step 28350/50000; acc:  71.12; ppl:  2.72; xent: 1.00; lr: 0.00010; 11548/4538 tok/s;   5467 sec
[2021-04-23 08:17:42,446 INFO] Step 28400/50000; acc:  70.94; ppl:  2.73; xent: 1.01; lr: 0.00010; 10630/4203 tok/s;   5477 sec
[2021-04-23 08:17:45,417 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 08:17:52,207 INFO] Step 28450/50000; acc:  70.84; ppl:  2.73; xent: 1.01; lr: 0.00010; 10467/4200 tok/s;   5486 sec
[2021-04-23 08:18:01,696 INFO] Step 28500/50000; acc:  71.30; ppl:  2.68; xent: 0.99; lr: 0.00010; 10613/4206 tok/s;   5496 sec
[2021-04-23 08:18:11,628 INFO] Step 28550/50000; acc:  70.12; ppl:  2.81; xent: 1.03; lr: 0.00010; 10458/4098 tok/s;   5506 sec
[2021-04-23 08:18:21,444 INFO] Step 28600/50000; acc:  71.33; ppl:  2.69; xent: 0.99; lr: 0.00010; 10022/4171 tok/s;   5516 sec
[2021-04-23 08:18:30,722 INFO] Step 28650/50000; acc:  70.42; ppl:  2.78; xent: 1.02; lr: 0.00010; 11203/4353 tok/s;   5525 sec
[2021-04-23 08:18:39,866 INFO] Step 28700/50000; acc:  71.36; ppl:  2.70; xent: 0.99; lr: 0.00010; 10875/4333 tok/s;   5534 sec
[2021-04-23 08:18:49,487 INFO] Step 28750/50000; acc:  70.76; ppl:  2.74; xent: 1.01; lr: 0.00010; 10626/4184 tok/s;   5544 sec
[2021-04-23 08:18:59,339 INFO] Step 28800/50000; acc:  70.43; ppl:  2.76; xent: 1.01; lr: 0.00010; 10532/4051 tok/s;   5553 sec
[2021-04-23 08:19:08,785 INFO] Step 28850/50000; acc:  71.38; ppl:  2.69; xent: 0.99; lr: 0.00010; 10612/4253 tok/s;   5563 sec
[2021-04-23 08:19:18,607 INFO] Step 28900/50000; acc:  70.78; ppl:  2.74; xent: 1.01; lr: 0.00010; 10536/4109 tok/s;   5573 sec
[2021-04-23 08:19:28,007 INFO] Step 28950/50000; acc:  70.81; ppl:  2.74; xent: 1.01; lr: 0.00010; 10766/4275 tok/s;   5582 sec
[2021-04-23 08:19:37,546 INFO] Step 29000/50000; acc:  71.17; ppl:  2.71; xent: 1.00; lr: 0.00010; 10634/4192 tok/s;   5592 sec
[2021-04-23 08:19:47,135 INFO] Step 29050/50000; acc:  71.36; ppl:  2.70; xent: 0.99; lr: 0.00010; 10610/4214 tok/s;   5601 sec
[2021-04-23 08:19:56,027 INFO] Step 29100/50000; acc:  71.06; ppl:  2.70; xent: 0.99; lr: 0.00010; 11468/4428 tok/s;   5610 sec
[2021-04-23 08:20:05,648 INFO] Step 29150/50000; acc:  71.17; ppl:  2.70; xent: 0.99; lr: 0.00010; 10518/4217 tok/s;   5620 sec
[2021-04-23 08:20:06,041 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 08:20:15,193 INFO] Step 29200/50000; acc:  71.17; ppl:  2.71; xent: 1.00; lr: 0.00010; 10680/4230 tok/s;   5629 sec
[2021-04-23 08:20:25,279 INFO] Step 29250/50000; acc:  70.89; ppl:  2.74; xent: 1.01; lr: 0.00010; 10111/3978 tok/s;   5639 sec
[2021-04-23 08:20:35,176 INFO] Step 29300/50000; acc:  70.94; ppl:  2.71; xent: 1.00; lr: 0.00010; 10069/4111 tok/s;   5649 sec
[2021-04-23 08:20:44,869 INFO] Step 29350/50000; acc:  70.67; ppl:  2.76; xent: 1.01; lr: 0.00010; 10780/4190 tok/s;   5659 sec
[2021-04-23 08:20:54,387 INFO] Step 29400/50000; acc:  71.35; ppl:  2.67; xent: 0.98; lr: 0.00010; 10355/4220 tok/s;   5668 sec
[2021-04-23 08:21:03,903 INFO] Step 29450/50000; acc:  70.91; ppl:  2.72; xent: 1.00; lr: 0.00010; 10846/4174 tok/s;   5678 sec
[2021-04-23 08:21:13,376 INFO] Step 29500/50000; acc:  71.14; ppl:  2.72; xent: 1.00; lr: 0.00010; 10725/4230 tok/s;   5687 sec
[2021-04-23 08:21:23,481 INFO] Step 29550/50000; acc:  70.93; ppl:  2.71; xent: 1.00; lr: 0.00010; 10143/4034 tok/s;   5698 sec
[2021-04-23 08:21:32,759 INFO] Step 29600/50000; acc:  71.40; ppl:  2.67; xent: 0.98; lr: 0.00010; 10951/4293 tok/s;   5707 sec
[2021-04-23 08:21:42,317 INFO] Step 29650/50000; acc:  71.27; ppl:  2.70; xent: 0.99; lr: 0.00010; 10548/4189 tok/s;   5716 sec
[2021-04-23 08:21:51,833 INFO] Step 29700/50000; acc:  70.90; ppl:  2.73; xent: 1.01; lr: 0.00010; 10861/4233 tok/s;   5726 sec
[2021-04-23 08:22:01,590 INFO] Step 29750/50000; acc:  71.34; ppl:  2.69; xent: 0.99; lr: 0.00010; 10423/4081 tok/s;   5736 sec
[2021-04-23 08:22:10,600 INFO] Step 29800/50000; acc:  71.25; ppl:  2.69; xent: 0.99; lr: 0.00010; 11258/4472 tok/s;   5745 sec
[2021-04-23 08:22:19,484 INFO] Step 29850/50000; acc:  71.73; ppl:  2.66; xent: 0.98; lr: 0.00010; 11275/4490 tok/s;   5754 sec
[2021-04-23 08:22:26,984 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 08:22:29,292 INFO] Step 29900/50000; acc:  71.15; ppl:  2.70; xent: 0.99; lr: 0.00010; 10512/4149 tok/s;   5763 sec
[2021-04-23 08:22:38,933 INFO] Step 29950/50000; acc:  71.37; ppl:  2.68; xent: 0.98; lr: 0.00010; 10503/4185 tok/s;   5773 sec
[2021-04-23 08:22:48,893 INFO] Step 30000/50000; acc:  70.90; ppl:  2.72; xent: 1.00; lr: 0.00010; 10227/4027 tok/s;   5783 sec
[2021-04-23 08:22:48,895 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/loose/valid.txt, align=None)...
[2021-04-23 08:22:56,835 INFO] Validation perplexity: 3.00383
[2021-04-23 08:22:56,835 INFO] Validation accuracy: 69.196
[2021-04-23 08:22:56,837 INFO] Saving checkpoint ../models/default_params/loose_ops/model_step_30000.pt
[2021-04-23 08:23:07,208 INFO] Step 30050/50000; acc:  71.19; ppl:  2.70; xent: 1.00; lr: 0.00010; 5507/2234 tok/s;   5801 sec
[2021-04-23 08:23:16,611 INFO] Step 30100/50000; acc:  71.66; ppl:  2.67; xent: 0.98; lr: 0.00010; 10695/4287 tok/s;   5811 sec
[2021-04-23 08:23:26,380 INFO] Step 30150/50000; acc:  71.12; ppl:  2.73; xent: 1.00; lr: 0.00010; 10654/4132 tok/s;   5820 sec
[2021-04-23 08:23:35,587 INFO] Step 30200/50000; acc:  71.36; ppl:  2.68; xent: 0.98; lr: 0.00010; 10777/4335 tok/s;   5830 sec
[2021-04-23 08:23:45,592 INFO] Step 30250/50000; acc:  70.93; ppl:  2.72; xent: 1.00; lr: 0.00010; 10509/3967 tok/s;   5840 sec
[2021-04-23 08:23:55,665 INFO] Step 30300/50000; acc:  71.39; ppl:  2.66; xent: 0.98; lr: 0.00010; 9993/4041 tok/s;   5850 sec
[2021-04-23 08:24:04,834 INFO] Step 30350/50000; acc:  71.62; ppl:  2.67; xent: 0.98; lr: 0.00010; 11048/4307 tok/s;   5859 sec
[2021-04-23 08:24:14,395 INFO] Step 30400/50000; acc:  71.40; ppl:  2.68; xent: 0.99; lr: 0.00010; 10696/4291 tok/s;   5868 sec
[2021-04-23 08:24:23,893 INFO] Step 30450/50000; acc:  71.59; ppl:  2.66; xent: 0.98; lr: 0.00010; 10557/4157 tok/s;   5878 sec
[2021-04-23 08:24:33,591 INFO] Step 30500/50000; acc:  71.26; ppl:  2.70; xent: 0.99; lr: 0.00010; 10715/4149 tok/s;   5888 sec
[2021-04-23 08:24:42,679 INFO] Step 30550/50000; acc:  71.27; ppl:  2.69; xent: 0.99; lr: 0.00010; 11121/4408 tok/s;   5897 sec
[2021-04-23 08:24:51,912 INFO] Step 30600/50000; acc:  71.80; ppl:  2.64; xent: 0.97; lr: 0.00010; 10902/4354 tok/s;   5906 sec
[2021-04-23 08:24:56,694 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 08:25:01,642 INFO] Step 30650/50000; acc:  71.30; ppl:  2.69; xent: 0.99; lr: 0.00010; 10407/4197 tok/s;   5916 sec
[2021-04-23 08:25:11,717 INFO] Step 30700/50000; acc:  71.18; ppl:  2.68; xent: 0.99; lr: 0.00010; 10261/3957 tok/s;   5926 sec
[2021-04-23 08:25:21,763 INFO] Step 30750/50000; acc:  71.26; ppl:  2.68; xent: 0.98; lr: 0.00010; 10021/4096 tok/s;   5936 sec
[2021-04-23 08:25:31,296 INFO] Step 30800/50000; acc:  71.29; ppl:  2.68; xent: 0.99; lr: 0.00010; 10624/4232 tok/s;   5945 sec
[2021-04-23 08:25:40,774 INFO] Step 30850/50000; acc:  71.51; ppl:  2.66; xent: 0.98; lr: 0.00010; 10698/4257 tok/s;   5955 sec
[2021-04-23 08:25:49,826 INFO] Step 30900/50000; acc:  71.74; ppl:  2.66; xent: 0.98; lr: 0.00010; 11048/4353 tok/s;   5964 sec
[2021-04-23 08:25:59,287 INFO] Step 30950/50000; acc:  71.23; ppl:  2.70; xent: 0.99; lr: 0.00010; 11185/4255 tok/s;   5973 sec
[2021-04-23 08:26:09,264 INFO] Step 31000/50000; acc:  71.69; ppl:  2.65; xent: 0.97; lr: 0.00010; 9992/4005 tok/s;   5983 sec
[2021-04-23 08:26:19,246 INFO] Step 31050/50000; acc:  71.17; ppl:  2.69; xent: 0.99; lr: 0.00010; 10402/4021 tok/s;   5993 sec
[2021-04-23 08:26:28,437 INFO] Step 31100/50000; acc:  71.77; ppl:  2.64; xent: 0.97; lr: 0.00010; 10919/4420 tok/s;   6002 sec
[2021-04-23 08:26:37,892 INFO] Step 31150/50000; acc:  71.63; ppl:  2.67; xent: 0.98; lr: 0.00010; 10751/4247 tok/s;   6012 sec
[2021-04-23 08:26:47,602 INFO] Step 31200/50000; acc:  71.49; ppl:  2.66; xent: 0.98; lr: 0.00010; 10509/4131 tok/s;   6022 sec
[2021-04-23 08:26:56,689 INFO] Step 31250/50000; acc:  71.63; ppl:  2.64; xent: 0.97; lr: 0.00010; 11094/4358 tok/s;   6031 sec
[2021-04-23 08:27:05,922 INFO] Step 31300/50000; acc:  71.50; ppl:  2.66; xent: 0.98; lr: 0.00010; 11131/4365 tok/s;   6040 sec
[2021-04-23 08:27:15,365 INFO] Step 31350/50000; acc:  71.64; ppl:  2.64; xent: 0.97; lr: 0.00010; 10730/4278 tok/s;   6049 sec
[2021-04-23 08:27:17,376 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 08:27:24,971 INFO] Step 31400/50000; acc:  71.57; ppl:  2.66; xent: 0.98; lr: 0.00010; 10577/4230 tok/s;   6059 sec
[2021-04-23 08:27:34,703 INFO] Step 31450/50000; acc:  71.30; ppl:  2.67; xent: 0.98; lr: 0.00010; 10351/4110 tok/s;   6069 sec
[2021-04-23 08:27:44,671 INFO] Step 31500/50000; acc:  71.49; ppl:  2.67; xent: 0.98; lr: 0.00010; 10265/4088 tok/s;   6079 sec
[2021-04-23 08:27:54,046 INFO] Step 31550/50000; acc:  71.40; ppl:  2.65; xent: 0.98; lr: 0.00010; 10771/4340 tok/s;   6088 sec
[2021-04-23 08:28:03,550 INFO] Step 31600/50000; acc:  71.66; ppl:  2.65; xent: 0.97; lr: 0.00010; 10678/4215 tok/s;   6098 sec
[2021-04-23 08:28:12,940 INFO] Step 31650/50000; acc:  71.63; ppl:  2.65; xent: 0.98; lr: 0.00010; 10791/4308 tok/s;   6107 sec
[2021-04-23 08:28:22,310 INFO] Step 31700/50000; acc:  71.79; ppl:  2.64; xent: 0.97; lr: 0.00010; 10942/4166 tok/s;   6116 sec
[2021-04-23 08:28:32,256 INFO] Step 31750/50000; acc:  71.22; ppl:  2.69; xent: 0.99; lr: 0.00010; 10500/4090 tok/s;   6126 sec
[2021-04-23 08:28:41,951 INFO] Step 31800/50000; acc:  71.96; ppl:  2.63; xent: 0.97; lr: 0.00010; 10223/4153 tok/s;   6136 sec
[2021-04-23 08:28:51,439 INFO] Step 31850/50000; acc:  71.13; ppl:  2.68; xent: 0.98; lr: 0.00010; 10969/4253 tok/s;   6145 sec
[2021-04-23 08:29:00,894 INFO] Step 31900/50000; acc:  71.71; ppl:  2.65; xent: 0.97; lr: 0.00010; 10559/4214 tok/s;   6155 sec
[2021-04-23 08:29:10,528 INFO] Step 31950/50000; acc:  71.84; ppl:  2.63; xent: 0.97; lr: 0.00010; 10634/4157 tok/s;   6165 sec
[2021-04-23 08:29:19,273 INFO] Step 32000/50000; acc:  71.89; ppl:  2.62; xent: 0.96; lr: 0.00010; 11640/4533 tok/s;   6173 sec
[2021-04-23 08:29:28,595 INFO] Step 32050/50000; acc:  71.82; ppl:  2.62; xent: 0.96; lr: 0.00010; 10712/4318 tok/s;   6183 sec
[2021-04-23 08:29:31,255 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 08:29:38,368 INFO] Step 32100/50000; acc:  71.57; ppl:  2.65; xent: 0.97; lr: 0.00010; 10635/4177 tok/s;   6192 sec
[2021-04-23 08:29:47,998 INFO] Step 32150/50000; acc:  72.08; ppl:  2.63; xent: 0.97; lr: 0.00010; 10527/4151 tok/s;   6202 sec
[2021-04-23 08:29:57,713 INFO] Step 32200/50000; acc:  71.45; ppl:  2.67; xent: 0.98; lr: 0.00010; 10351/4138 tok/s;   6212 sec
[2021-04-23 08:30:07,684 INFO] Step 32250/50000; acc:  71.80; ppl:  2.63; xent: 0.97; lr: 0.00010; 10092/4136 tok/s;   6222 sec
[2021-04-23 08:30:16,884 INFO] Step 32300/50000; acc:  71.52; ppl:  2.65; xent: 0.97; lr: 0.00010; 11164/4375 tok/s;   6231 sec
[2021-04-23 08:30:26,086 INFO] Step 32350/50000; acc:  71.95; ppl:  2.63; xent: 0.97; lr: 0.00010; 10917/4367 tok/s;   6240 sec
[2021-04-23 08:30:35,648 INFO] Step 32400/50000; acc:  71.66; ppl:  2.64; xent: 0.97; lr: 0.00010; 10734/4200 tok/s;   6250 sec
[2021-04-23 08:30:45,576 INFO] Step 32450/50000; acc:  71.47; ppl:  2.66; xent: 0.98; lr: 0.00010; 10395/4008 tok/s;   6260 sec
[2021-04-23 08:30:55,003 INFO] Step 32500/50000; acc:  71.93; ppl:  2.61; xent: 0.96; lr: 0.00010; 10657/4229 tok/s;   6269 sec
[2021-04-23 08:31:04,808 INFO] Step 32550/50000; acc:  71.59; ppl:  2.65; xent: 0.97; lr: 0.00010; 10669/4122 tok/s;   6279 sec
[2021-04-23 08:31:14,229 INFO] Step 32600/50000; acc:  71.92; ppl:  2.62; xent: 0.96; lr: 0.00010; 10486/4292 tok/s;   6288 sec
[2021-04-23 08:31:23,990 INFO] Step 32650/50000; acc:  71.80; ppl:  2.64; xent: 0.97; lr: 0.00010; 10668/4081 tok/s;   6298 sec
[2021-04-23 08:31:33,328 INFO] Step 32700/50000; acc:  72.03; ppl:  2.60; xent: 0.95; lr: 0.00010; 10790/4316 tok/s;   6307 sec
[2021-04-23 08:31:42,242 INFO] Step 32750/50000; acc:  72.23; ppl:  2.59; xent: 0.95; lr: 0.00010; 11330/4441 tok/s;   6316 sec
[2021-04-23 08:31:51,914 INFO] Step 32800/50000; acc:  71.89; ppl:  2.63; xent: 0.97; lr: 0.00010; 10552/4178 tok/s;   6326 sec
[2021-04-23 08:31:51,924 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 08:32:01,761 INFO] Step 32850/50000; acc:  72.37; ppl:  2.57; xent: 0.94; lr: 0.00010; 10183/4130 tok/s;   6336 sec
[2021-04-23 08:32:11,554 INFO] Step 32900/50000; acc:  71.42; ppl:  2.67; xent: 0.98; lr: 0.00010; 10601/4072 tok/s;   6346 sec
[2021-04-23 08:32:21,568 INFO] Step 32950/50000; acc:  72.02; ppl:  2.61; xent: 0.96; lr: 0.00010; 10013/4112 tok/s;   6356 sec
[2021-04-23 08:32:31,045 INFO] Step 33000/50000; acc:  71.76; ppl:  2.64; xent: 0.97; lr: 0.00010; 10670/4239 tok/s;   6365 sec
[2021-04-23 08:32:40,666 INFO] Step 33050/50000; acc:  72.11; ppl:  2.59; xent: 0.95; lr: 0.00010; 10485/4188 tok/s;   6375 sec
[2021-04-23 08:32:49,941 INFO] Step 33100/50000; acc:  71.82; ppl:  2.64; xent: 0.97; lr: 0.00010; 11037/4281 tok/s;   6384 sec
[2021-04-23 08:32:59,433 INFO] Step 33150/50000; acc:  71.80; ppl:  2.63; xent: 0.97; lr: 0.00010; 10805/4213 tok/s;   6393 sec
[2021-04-23 08:33:09,786 INFO] Step 33200/50000; acc:  71.59; ppl:  2.63; xent: 0.97; lr: 0.00010; 9912/3939 tok/s;   6404 sec
[2021-04-23 08:33:19,023 INFO] Step 33250/50000; acc:  72.25; ppl:  2.58; xent: 0.95; lr: 0.00010; 10935/4313 tok/s;   6413 sec
[2021-04-23 08:33:28,622 INFO] Step 33300/50000; acc:  71.88; ppl:  2.62; xent: 0.96; lr: 0.00010; 10550/4138 tok/s;   6423 sec
[2021-04-23 08:33:38,456 INFO] Step 33350/50000; acc:  71.80; ppl:  2.64; xent: 0.97; lr: 0.00010; 10615/4150 tok/s;   6433 sec
[2021-04-23 08:33:48,006 INFO] Step 33400/50000; acc:  72.47; ppl:  2.56; xent: 0.94; lr: 0.00010; 10404/4139 tok/s;   6442 sec
[2021-04-23 08:33:56,984 INFO] Step 33450/50000; acc:  71.74; ppl:  2.64; xent: 0.97; lr: 0.00010; 11578/4477 tok/s;   6451 sec
[2021-04-23 08:34:06,000 INFO] Step 33500/50000; acc:  72.33; ppl:  2.57; xent: 0.95; lr: 0.00010; 11011/4459 tok/s;   6460 sec
[2021-04-23 08:34:13,068 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 08:34:15,753 INFO] Step 33550/50000; acc:  72.11; ppl:  2.60; xent: 0.96; lr: 0.00010; 10466/4148 tok/s;   6470 sec
[2021-04-23 08:34:25,431 INFO] Step 33600/50000; acc:  72.13; ppl:  2.59; xent: 0.95; lr: 0.00010; 10554/4172 tok/s;   6479 sec
[2021-04-23 08:34:35,292 INFO] Step 33650/50000; acc:  71.96; ppl:  2.60; xent: 0.96; lr: 0.00010; 10157/4052 tok/s;   6489 sec
[2021-04-23 08:34:45,241 INFO] Step 33700/50000; acc:  71.65; ppl:  2.64; xent: 0.97; lr: 0.00010; 10343/4128 tok/s;   6499 sec
[2021-04-23 08:34:54,694 INFO] Step 33750/50000; acc:  72.32; ppl:  2.57; xent: 0.95; lr: 0.00010; 10687/4302 tok/s;   6509 sec
[2021-04-23 08:35:04,152 INFO] Step 33800/50000; acc:  71.82; ppl:  2.62; xent: 0.96; lr: 0.00010; 10662/4207 tok/s;   6518 sec
[2021-04-23 08:35:13,594 INFO] Step 33850/50000; acc:  72.08; ppl:  2.60; xent: 0.95; lr: 0.00010; 10761/4268 tok/s;   6528 sec
[2021-04-23 08:35:23,406 INFO] Step 33900/50000; acc:  71.73; ppl:  2.62; xent: 0.96; lr: 0.00010; 10583/4028 tok/s;   6537 sec
[2021-04-23 08:35:33,575 INFO] Step 33950/50000; acc:  72.07; ppl:  2.59; xent: 0.95; lr: 0.00010; 10007/4022 tok/s;   6548 sec
[2021-04-23 08:35:42,793 INFO] Step 34000/50000; acc:  72.32; ppl:  2.59; xent: 0.95; lr: 0.00010; 11012/4272 tok/s;   6557 sec
[2021-04-23 08:35:52,542 INFO] Step 34050/50000; acc:  72.09; ppl:  2.58; xent: 0.95; lr: 0.00010; 10432/4188 tok/s;   6567 sec
[2021-04-23 08:36:01,949 INFO] Step 34100/50000; acc:  72.62; ppl:  2.56; xent: 0.94; lr: 0.00010; 10712/4182 tok/s;   6576 sec
[2021-04-23 08:36:11,817 INFO] Step 34150/50000; acc:  71.60; ppl:  2.64; xent: 0.97; lr: 0.00010; 10628/4130 tok/s;   6586 sec
[2021-04-23 08:36:20,613 INFO] Step 34200/50000; acc:  72.58; ppl:  2.55; xent: 0.94; lr: 0.00010; 11208/4507 tok/s;   6595 sec
[2021-04-23 08:36:30,198 INFO] Step 34250/50000; acc:  72.43; ppl:  2.58; xent: 0.95; lr: 0.00010; 10781/4244 tok/s;   6604 sec
[2021-04-23 08:36:34,368 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 08:36:39,699 INFO] Step 34300/50000; acc:  72.12; ppl:  2.58; xent: 0.95; lr: 0.00010; 10572/4263 tok/s;   6614 sec
[2021-04-23 08:36:49,665 INFO] Step 34350/50000; acc:  71.97; ppl:  2.59; xent: 0.95; lr: 0.00010; 10256/3994 tok/s;   6624 sec
[2021-04-23 08:36:59,710 INFO] Step 34400/50000; acc:  71.96; ppl:  2.61; xent: 0.96; lr: 0.00010; 10108/4092 tok/s;   6634 sec
[2021-04-23 08:37:09,160 INFO] Step 34450/50000; acc:  72.42; ppl:  2.57; xent: 0.94; lr: 0.00010; 10556/4256 tok/s;   6643 sec
[2021-04-23 08:37:18,843 INFO] Step 34500/50000; acc:  72.06; ppl:  2.58; xent: 0.95; lr: 0.00010; 10659/4213 tok/s;   6653 sec
[2021-04-23 08:37:28,080 INFO] Step 34550/50000; acc:  72.27; ppl:  2.59; xent: 0.95; lr: 0.00010; 10897/4270 tok/s;   6662 sec
[2021-04-23 08:37:37,255 INFO] Step 34600/50000; acc:  72.44; ppl:  2.58; xent: 0.95; lr: 0.00010; 11187/4327 tok/s;   6671 sec
[2021-04-23 08:37:47,534 INFO] Step 34650/50000; acc:  72.43; ppl:  2.56; xent: 0.94; lr: 0.00010; 9891/3917 tok/s;   6682 sec
[2021-04-23 08:37:57,314 INFO] Step 34700/50000; acc:  71.87; ppl:  2.60; xent: 0.96; lr: 0.00010; 10524/4101 tok/s;   6691 sec
[2021-04-23 08:38:06,576 INFO] Step 34750/50000; acc:  72.34; ppl:  2.57; xent: 0.94; lr: 0.00010; 10952/4391 tok/s;   6701 sec
[2021-04-23 08:38:15,952 INFO] Step 34800/50000; acc:  72.26; ppl:  2.58; xent: 0.95; lr: 0.00010; 10842/4290 tok/s;   6710 sec
[2021-04-23 08:38:25,803 INFO] Step 34850/50000; acc:  72.27; ppl:  2.58; xent: 0.95; lr: 0.00010; 10323/4066 tok/s;   6720 sec
[2021-04-23 08:38:34,887 INFO] Step 34900/50000; acc:  72.44; ppl:  2.55; xent: 0.94; lr: 0.00010; 11118/4344 tok/s;   6729 sec
[2021-04-23 08:38:44,119 INFO] Step 34950/50000; acc:  72.03; ppl:  2.59; xent: 0.95; lr: 0.00010; 11268/4390 tok/s;   6738 sec
[2021-04-23 08:38:53,552 INFO] Step 35000/50000; acc:  72.77; ppl:  2.52; xent: 0.92; lr: 0.00010; 10481/4280 tok/s;   6748 sec
[2021-04-23 08:38:53,556 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/loose/valid.txt, align=None)...
[2021-04-23 08:39:01,473 INFO] Validation perplexity: 3.01464
[2021-04-23 08:39:01,473 INFO] Validation accuracy: 69.4281
[2021-04-23 08:39:01,475 INFO] Saving checkpoint ../models/default_params/loose_ops/model_step_35000.pt
[2021-04-23 08:39:03,703 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 08:39:12,000 INFO] Step 35050/50000; acc:  71.89; ppl:  2.61; xent: 0.96; lr: 0.00010; 5648/2197 tok/s;   6766 sec
[2021-04-23 08:39:21,498 INFO] Step 35100/50000; acc:  72.35; ppl:  2.56; xent: 0.94; lr: 0.00010; 10513/4190 tok/s;   6776 sec
[2021-04-23 08:39:31,560 INFO] Step 35150/50000; acc:  72.14; ppl:  2.59; xent: 0.95; lr: 0.00010; 10052/4107 tok/s;   6786 sec
[2021-04-23 08:39:40,832 INFO] Step 35200/50000; acc:  72.45; ppl:  2.56; xent: 0.94; lr: 0.00010; 10996/4344 tok/s;   6795 sec
[2021-04-23 08:39:50,161 INFO] Step 35250/50000; acc:  72.48; ppl:  2.54; xent: 0.93; lr: 0.00010; 10691/4299 tok/s;   6804 sec
[2021-04-23 08:39:59,840 INFO] Step 35300/50000; acc:  72.41; ppl:  2.59; xent: 0.95; lr: 0.00010; 10688/4177 tok/s;   6814 sec
[2021-04-23 08:40:09,436 INFO] Step 35350/50000; acc:  72.16; ppl:  2.56; xent: 0.94; lr: 0.00010; 10740/4061 tok/s;   6823 sec
[2021-04-23 08:40:19,218 INFO] Step 35400/50000; acc:  72.40; ppl:  2.56; xent: 0.94; lr: 0.00010; 10350/4149 tok/s;   6833 sec
[2021-04-23 08:40:28,955 INFO] Step 35450/50000; acc:  72.47; ppl:  2.56; xent: 0.94; lr: 0.00010; 10389/4171 tok/s;   6843 sec
[2021-04-23 08:40:38,529 INFO] Step 35500/50000; acc:  72.02; ppl:  2.58; xent: 0.95; lr: 0.00010; 10769/4195 tok/s;   6853 sec
[2021-04-23 08:40:47,882 INFO] Step 35550/50000; acc:  72.54; ppl:  2.56; xent: 0.94; lr: 0.00010; 10782/4282 tok/s;   6862 sec
[2021-04-23 08:40:57,557 INFO] Step 35600/50000; acc:  72.50; ppl:  2.54; xent: 0.93; lr: 0.00010; 10621/4135 tok/s;   6872 sec
[2021-04-23 08:41:06,363 INFO] Step 35650/50000; acc:  72.84; ppl:  2.53; xent: 0.93; lr: 0.00010; 11479/4490 tok/s;   6880 sec
[2021-04-23 08:41:15,486 INFO] Step 35700/50000; acc:  72.78; ppl:  2.53; xent: 0.93; lr: 0.00010; 10984/4383 tok/s;   6890 sec
[2021-04-23 08:41:17,930 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 08:41:25,618 INFO] Step 35750/50000; acc:  72.18; ppl:  2.58; xent: 0.95; lr: 0.00010; 10362/4077 tok/s;   6900 sec
[2021-04-23 08:41:35,138 INFO] Step 35800/50000; acc:  73.10; ppl:  2.51; xent: 0.92; lr: 0.00010; 10398/4157 tok/s;   6909 sec
[2021-04-23 08:41:45,281 INFO] Step 35850/50000; acc:  71.95; ppl:  2.60; xent: 0.96; lr: 0.00010; 10176/4012 tok/s;   6919 sec
[2021-04-23 08:41:54,838 INFO] Step 35900/50000; acc:  72.47; ppl:  2.56; xent: 0.94; lr: 0.00010; 10448/4284 tok/s;   6929 sec
[2021-04-23 08:42:03,910 INFO] Step 35950/50000; acc:  72.76; ppl:  2.52; xent: 0.93; lr: 0.00010; 11195/4421 tok/s;   6938 sec
[2021-04-23 08:42:13,260 INFO] Step 36000/50000; acc:  72.41; ppl:  2.55; xent: 0.94; lr: 0.00010; 10820/4303 tok/s;   6947 sec
[2021-04-23 08:42:22,781 INFO] Step 36050/50000; acc:  72.41; ppl:  2.56; xent: 0.94; lr: 0.00010; 10636/4202 tok/s;   6957 sec
[2021-04-23 08:42:32,931 INFO] Step 36100/50000; acc:  71.90; ppl:  2.59; xent: 0.95; lr: 0.00010; 10333/3937 tok/s;   6967 sec
[2021-04-23 08:42:42,600 INFO] Step 36150/50000; acc:  72.68; ppl:  2.53; xent: 0.93; lr: 0.00010; 10456/4161 tok/s;   6977 sec
[2021-04-23 08:42:52,048 INFO] Step 36200/50000; acc:  73.00; ppl:  2.52; xent: 0.92; lr: 0.00010; 10733/4214 tok/s;   6986 sec
[2021-04-23 08:43:01,500 INFO] Step 36250/50000; acc:  72.39; ppl:  2.56; xent: 0.94; lr: 0.00010; 10692/4309 tok/s;   6996 sec
[2021-04-23 08:43:11,188 INFO] Step 36300/50000; acc:  72.71; ppl:  2.55; xent: 0.93; lr: 0.00010; 10631/4082 tok/s;   7005 sec
[2021-04-23 08:43:20,623 INFO] Step 36350/50000; acc:  72.67; ppl:  2.53; xent: 0.93; lr: 0.00010; 10808/4299 tok/s;   7015 sec
[2021-04-23 08:43:29,440 INFO] Step 36400/50000; acc:  72.72; ppl:  2.52; xent: 0.93; lr: 0.00010; 11451/4499 tok/s;   7023 sec
[2021-04-23 08:43:38,780 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 08:43:39,135 INFO] Step 36450/50000; acc:  72.80; ppl:  2.53; xent: 0.93; lr: 0.00010; 10470/4162 tok/s;   7033 sec
[2021-04-23 08:43:48,986 INFO] Step 36500/50000; acc:  73.06; ppl:  2.51; xent: 0.92; lr: 0.00010; 10224/4120 tok/s;   7043 sec
[2021-04-23 08:43:58,768 INFO] Step 36550/50000; acc:  72.27; ppl:  2.58; xent: 0.95; lr: 0.00010; 10728/4080 tok/s;   7053 sec
[2021-04-23 08:44:08,771 INFO] Step 36600/50000; acc:  72.97; ppl:  2.51; xent: 0.92; lr: 0.00010; 9783/4124 tok/s;   7063 sec
[2021-04-23 08:44:18,359 INFO] Step 36650/50000; acc:  72.46; ppl:  2.56; xent: 0.94; lr: 0.00010; 10819/4208 tok/s;   7072 sec
[2021-04-23 08:44:27,774 INFO] Step 36700/50000; acc:  72.89; ppl:  2.51; xent: 0.92; lr: 0.00010; 10620/4242 tok/s;   7082 sec
[2021-04-23 08:44:37,146 INFO] Step 36750/50000; acc:  72.76; ppl:  2.54; xent: 0.93; lr: 0.00010; 10823/4245 tok/s;   7091 sec
[2021-04-23 08:44:46,800 INFO] Step 36800/50000; acc:  72.43; ppl:  2.53; xent: 0.93; lr: 0.00010; 10694/4149 tok/s;   7101 sec
[2021-04-23 08:44:57,007 INFO] Step 36850/50000; acc:  72.63; ppl:  2.53; xent: 0.93; lr: 0.00010; 9913/3999 tok/s;   7111 sec
[2021-04-23 08:45:06,364 INFO] Step 36900/50000; acc:  72.90; ppl:  2.51; xent: 0.92; lr: 0.00010; 10972/4257 tok/s;   7120 sec
[2021-04-23 08:45:15,901 INFO] Step 36950/50000; acc:  72.84; ppl:  2.54; xent: 0.93; lr: 0.00010; 10685/4226 tok/s;   7130 sec
[2021-04-23 08:45:25,471 INFO] Step 37000/50000; acc:  73.16; ppl:  2.50; xent: 0.92; lr: 0.00010; 10552/4172 tok/s;   7140 sec
[2021-04-23 08:45:35,191 INFO] Step 37050/50000; acc:  72.93; ppl:  2.51; xent: 0.92; lr: 0.00010; 10445/4119 tok/s;   7149 sec
[2021-04-23 08:45:44,038 INFO] Step 37100/50000; acc:  72.81; ppl:  2.53; xent: 0.93; lr: 0.00010; 11634/4469 tok/s;   7158 sec
[2021-04-23 08:45:53,193 INFO] Step 37150/50000; acc:  72.91; ppl:  2.51; xent: 0.92; lr: 0.00010; 10973/4449 tok/s;   7167 sec
[2021-04-23 08:46:00,004 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 08:46:02,982 INFO] Step 37200/50000; acc:  72.76; ppl:  2.52; xent: 0.92; lr: 0.00010; 10449/4138 tok/s;   7177 sec
[2021-04-23 08:46:12,890 INFO] Step 37250/50000; acc:  73.06; ppl:  2.52; xent: 0.92; lr: 0.00010; 10273/4063 tok/s;   7187 sec
[2021-04-23 08:46:22,687 INFO] Step 37300/50000; acc:  72.50; ppl:  2.52; xent: 0.92; lr: 0.00010; 10241/4093 tok/s;   7197 sec
[2021-04-23 08:46:32,616 INFO] Step 37350/50000; acc:  72.58; ppl:  2.55; xent: 0.94; lr: 0.00010; 10464/4154 tok/s;   7207 sec
[2021-04-23 08:46:41,866 INFO] Step 37400/50000; acc:  73.47; ppl:  2.46; xent: 0.90; lr: 0.00010; 10666/4354 tok/s;   7216 sec
[2021-04-23 08:46:51,570 INFO] Step 37450/50000; acc:  72.32; ppl:  2.56; xent: 0.94; lr: 0.00010; 10654/4114 tok/s;   7226 sec
[2021-04-23 08:47:00,869 INFO] Step 37500/50000; acc:  73.08; ppl:  2.51; xent: 0.92; lr: 0.00010; 10852/4351 tok/s;   7235 sec
[2021-04-23 08:47:10,726 INFO] Step 37550/50000; acc:  72.86; ppl:  2.51; xent: 0.92; lr: 0.00010; 10439/4019 tok/s;   7245 sec
[2021-04-23 08:47:20,714 INFO] Step 37600/50000; acc:  72.51; ppl:  2.53; xent: 0.93; lr: 0.00010; 10252/4056 tok/s;   7255 sec
[2021-04-23 08:47:29,955 INFO] Step 37650/50000; acc:  73.26; ppl:  2.48; xent: 0.91; lr: 0.00010; 10817/4256 tok/s;   7264 sec
[2021-04-23 08:47:39,810 INFO] Step 37700/50000; acc:  72.80; ppl:  2.53; xent: 0.93; lr: 0.00010; 10508/4161 tok/s;   7274 sec
[2021-04-23 08:47:49,157 INFO] Step 37750/50000; acc:  73.11; ppl:  2.50; xent: 0.91; lr: 0.00010; 10846/4245 tok/s;   7283 sec
[2021-04-23 08:47:58,606 INFO] Step 37800/50000; acc:  72.94; ppl:  2.50; xent: 0.92; lr: 0.00010; 10755/4212 tok/s;   7293 sec
[2021-04-23 08:48:07,731 INFO] Step 37850/50000; acc:  73.03; ppl:  2.50; xent: 0.92; lr: 0.00010; 11037/4424 tok/s;   7302 sec
[2021-04-23 08:48:17,241 INFO] Step 37900/50000; acc:  73.17; ppl:  2.49; xent: 0.91; lr: 0.00010; 10748/4254 tok/s;   7311 sec
[2021-04-23 08:48:21,065 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 08:48:26,649 INFO] Step 37950/50000; acc:  72.70; ppl:  2.51; xent: 0.92; lr: 0.00010; 10811/4319 tok/s;   7321 sec
[2021-04-23 08:48:36,635 INFO] Step 38000/50000; acc:  73.01; ppl:  2.51; xent: 0.92; lr: 0.00010; 10218/3977 tok/s;   7331 sec
[2021-04-23 08:48:46,790 INFO] Step 38050/50000; acc:  72.99; ppl:  2.52; xent: 0.92; lr: 0.00010; 9965/4032 tok/s;   7341 sec
[2021-04-23 08:48:56,143 INFO] Step 38100/50000; acc:  73.19; ppl:  2.47; xent: 0.91; lr: 0.00010; 10700/4308 tok/s;   7350 sec
[2021-04-23 08:49:05,730 INFO] Step 38150/50000; acc:  72.69; ppl:  2.53; xent: 0.93; lr: 0.00010; 10879/4264 tok/s;   7360 sec
[2021-04-23 08:49:14,961 INFO] Step 38200/50000; acc:  73.43; ppl:  2.46; xent: 0.90; lr: 0.00010; 10646/4276 tok/s;   7369 sec
[2021-04-23 08:49:24,324 INFO] Step 38250/50000; acc:  72.49; ppl:  2.55; xent: 0.93; lr: 0.00010; 11250/4241 tok/s;   7378 sec
[2021-04-23 08:49:34,073 INFO] Step 38300/50000; acc:  72.92; ppl:  2.47; xent: 0.91; lr: 0.00010; 10341/4127 tok/s;   7388 sec
[2021-04-23 08:49:44,075 INFO] Step 38350/50000; acc:  73.01; ppl:  2.51; xent: 0.92; lr: 0.00010; 10175/4019 tok/s;   7398 sec
[2021-04-23 08:49:53,292 INFO] Step 38400/50000; acc:  72.96; ppl:  2.49; xent: 0.91; lr: 0.00010; 11104/4395 tok/s;   7407 sec
[2021-04-23 08:50:02,705 INFO] Step 38450/50000; acc:  73.49; ppl:  2.48; xent: 0.91; lr: 0.00010; 10629/4271 tok/s;   7417 sec
[2021-04-23 08:50:12,502 INFO] Step 38500/50000; acc:  73.17; ppl:  2.49; xent: 0.91; lr: 0.00010; 10586/4098 tok/s;   7427 sec
[2021-04-23 08:50:21,566 INFO] Step 38550/50000; acc:  73.02; ppl:  2.49; xent: 0.91; lr: 0.00010; 11184/4364 tok/s;   7436 sec
[2021-04-23 08:50:30,706 INFO] Step 38600/50000; acc:  73.23; ppl:  2.47; xent: 0.91; lr: 0.00010; 11037/4382 tok/s;   7445 sec
[2021-04-23 08:50:40,399 INFO] Step 38650/50000; acc:  73.18; ppl:  2.47; xent: 0.91; lr: 0.00010; 10424/4209 tok/s;   7454 sec
[2021-04-23 08:50:41,623 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 08:50:50,143 INFO] Step 38700/50000; acc:  72.73; ppl:  2.50; xent: 0.92; lr: 0.00010; 10592/4129 tok/s;   7464 sec
[2021-04-23 08:50:59,776 INFO] Step 38750/50000; acc:  73.11; ppl:  2.49; xent: 0.91; lr: 0.00010; 10466/4174 tok/s;   7474 sec
[2021-04-23 08:51:09,968 INFO] Step 38800/50000; acc:  72.94; ppl:  2.50; xent: 0.92; lr: 0.00010; 9944/4049 tok/s;   7484 sec
[2021-04-23 08:51:19,176 INFO] Step 38850/50000; acc:  73.12; ppl:  2.49; xent: 0.91; lr: 0.00010; 11008/4366 tok/s;   7493 sec
[2021-04-23 08:51:28,466 INFO] Step 38900/50000; acc:  73.22; ppl:  2.47; xent: 0.90; lr: 0.00010; 10777/4298 tok/s;   7503 sec
[2021-04-23 08:51:38,050 INFO] Step 38950/50000; acc:  72.99; ppl:  2.51; xent: 0.92; lr: 0.00010; 10918/4209 tok/s;   7512 sec
[2021-04-23 08:51:47,452 INFO] Step 39000/50000; acc:  73.32; ppl:  2.46; xent: 0.90; lr: 0.00010; 10702/4170 tok/s;   7522 sec
[2021-04-23 08:51:57,519 INFO] Step 39050/50000; acc:  72.91; ppl:  2.49; xent: 0.91; lr: 0.00010; 10310/4040 tok/s;   7532 sec
[2021-04-23 08:52:06,870 INFO] Step 39100/50000; acc:  73.40; ppl:  2.47; xent: 0.90; lr: 0.00010; 10720/4320 tok/s;   7541 sec
[2021-04-23 08:52:16,436 INFO] Step 39150/50000; acc:  73.41; ppl:  2.47; xent: 0.90; lr: 0.00010; 10664/4231 tok/s;   7550 sec
[2021-04-23 08:52:25,967 INFO] Step 39200/50000; acc:  73.28; ppl:  2.49; xent: 0.91; lr: 0.00010; 10668/4214 tok/s;   7560 sec
[2021-04-23 08:52:35,482 INFO] Step 39250/50000; acc:  73.52; ppl:  2.44; xent: 0.89; lr: 0.00010; 10635/4178 tok/s;   7570 sec
[2021-04-23 08:52:44,436 INFO] Step 39300/50000; acc:  73.23; ppl:  2.48; xent: 0.91; lr: 0.00010; 11520/4410 tok/s;   7578 sec
[2021-04-23 08:52:53,758 INFO] Step 39350/50000; acc:  73.25; ppl:  2.47; xent: 0.90; lr: 0.00010; 10801/4319 tok/s;   7588 sec
[2021-04-23 08:52:55,769 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 08:53:03,566 INFO] Step 39400/50000; acc:  73.39; ppl:  2.46; xent: 0.90; lr: 0.00010; 10364/4167 tok/s;   7598 sec
[2021-04-23 08:53:13,196 INFO] Step 39450/50000; acc:  73.52; ppl:  2.45; xent: 0.90; lr: 0.00010; 10511/4130 tok/s;   7607 sec
[2021-04-23 08:53:23,278 INFO] Step 39500/50000; acc:  72.74; ppl:  2.51; xent: 0.92; lr: 0.00010; 10120/4021 tok/s;   7617 sec
[2021-04-23 08:53:32,876 INFO] Step 39550/50000; acc:  73.20; ppl:  2.48; xent: 0.91; lr: 0.00010; 10528/4312 tok/s;   7627 sec
[2021-04-23 08:53:42,136 INFO] Step 39600/50000; acc:  73.38; ppl:  2.45; xent: 0.90; lr: 0.00010; 10979/4322 tok/s;   7636 sec
[2021-04-23 08:53:51,282 INFO] Step 39650/50000; acc:  73.41; ppl:  2.47; xent: 0.91; lr: 0.00010; 11013/4357 tok/s;   7645 sec
[2021-04-23 08:54:00,909 INFO] Step 39700/50000; acc:  73.11; ppl:  2.47; xent: 0.91; lr: 0.00010; 10562/4131 tok/s;   7655 sec
[2021-04-23 08:54:11,206 INFO] Step 39750/50000; acc:  72.81; ppl:  2.51; xent: 0.92; lr: 0.00010; 10264/3928 tok/s;   7665 sec
[2021-04-23 08:54:20,798 INFO] Step 39800/50000; acc:  73.86; ppl:  2.43; xent: 0.89; lr: 0.00010; 10315/4182 tok/s;   7675 sec
[2021-04-23 08:54:30,281 INFO] Step 39850/50000; acc:  73.39; ppl:  2.48; xent: 0.91; lr: 0.00010; 10941/4219 tok/s;   7684 sec
[2021-04-23 08:54:39,729 INFO] Step 39900/50000; acc:  73.38; ppl:  2.47; xent: 0.90; lr: 0.00010; 10622/4260 tok/s;   7694 sec
[2021-04-23 08:54:49,508 INFO] Step 39950/50000; acc:  73.67; ppl:  2.43; xent: 0.89; lr: 0.00010; 10432/4079 tok/s;   7704 sec
[2021-04-23 08:54:58,830 INFO] Step 40000/50000; acc:  73.45; ppl:  2.45; xent: 0.90; lr: 0.00010; 11031/4327 tok/s;   7713 sec
[2021-04-23 08:54:58,834 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/loose/valid.txt, align=None)...
[2021-04-23 08:55:06,735 INFO] Validation perplexity: 3.04471
[2021-04-23 08:55:06,735 INFO] Validation accuracy: 69.3141
[2021-04-23 08:55:06,737 INFO] Saving checkpoint ../models/default_params/loose_ops/model_step_40000.pt
[2021-04-23 08:55:16,206 INFO] Step 40050/50000; acc:  73.48; ppl:  2.44; xent: 0.89; lr: 0.00010; 5712/2295 tok/s;   7730 sec
[2021-04-23 08:55:25,293 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 08:55:26,036 INFO] Step 40100/50000; acc:  73.17; ppl:  2.47; xent: 0.91; lr: 0.00010; 10533/4107 tok/s;   7740 sec
[2021-04-23 08:55:35,786 INFO] Step 40150/50000; acc:  73.78; ppl:  2.43; xent: 0.89; lr: 0.00010; 10377/4137 tok/s;   7750 sec
[2021-04-23 08:55:45,519 INFO] Step 40200/50000; acc:  73.42; ppl:  2.48; xent: 0.91; lr: 0.00010; 10435/4083 tok/s;   7760 sec
[2021-04-23 08:55:55,578 INFO] Step 40250/50000; acc:  73.46; ppl:  2.45; xent: 0.90; lr: 0.00010; 9960/4146 tok/s;   7770 sec
[2021-04-23 08:56:05,046 INFO] Step 40300/50000; acc:  73.52; ppl:  2.46; xent: 0.90; lr: 0.00010; 10828/4252 tok/s;   7779 sec
[2021-04-23 08:56:14,437 INFO] Step 40350/50000; acc:  73.37; ppl:  2.46; xent: 0.90; lr: 0.00010; 10764/4279 tok/s;   7788 sec
[2021-04-23 08:56:23,815 INFO] Step 40400/50000; acc:  73.28; ppl:  2.46; xent: 0.90; lr: 0.00010; 10865/4222 tok/s;   7798 sec
[2021-04-23 08:56:33,450 INFO] Step 40450/50000; acc:  73.33; ppl:  2.45; xent: 0.90; lr: 0.00010; 10642/4139 tok/s;   7808 sec
[2021-04-23 08:56:43,661 INFO] Step 40500/50000; acc:  73.43; ppl:  2.45; xent: 0.90; lr: 0.00010; 9941/4027 tok/s;   7818 sec
[2021-04-23 08:56:53,070 INFO] Step 40550/50000; acc:  73.52; ppl:  2.45; xent: 0.90; lr: 0.00010; 11038/4198 tok/s;   7827 sec
[2021-04-23 08:57:02,459 INFO] Step 40600/50000; acc:  73.88; ppl:  2.43; xent: 0.89; lr: 0.00010; 10591/4316 tok/s;   7837 sec
[2021-04-23 08:57:12,021 INFO] Step 40650/50000; acc:  73.47; ppl:  2.47; xent: 0.90; lr: 0.00010; 10835/4153 tok/s;   7846 sec
[2021-04-23 08:57:21,722 INFO] Step 40700/50000; acc:  73.72; ppl:  2.41; xent: 0.88; lr: 0.00010; 10372/4156 tok/s;   7856 sec
[2021-04-23 08:57:30,659 INFO] Step 40750/50000; acc:  73.38; ppl:  2.45; xent: 0.90; lr: 0.00010; 11394/4451 tok/s;   7865 sec
[2021-04-23 08:57:40,098 INFO] Step 40800/50000; acc:  73.40; ppl:  2.44; xent: 0.89; lr: 0.00010; 10719/4278 tok/s;   7874 sec
[2021-04-23 08:57:46,391 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 08:57:49,684 INFO] Step 40850/50000; acc:  73.71; ppl:  2.43; xent: 0.89; lr: 0.00010; 10506/4233 tok/s;   7884 sec
[2021-04-23 08:57:59,743 INFO] Step 40900/50000; acc:  73.50; ppl:  2.47; xent: 0.90; lr: 0.00010; 10337/4028 tok/s;   7894 sec
[2021-04-23 08:58:09,537 INFO] Step 40950/50000; acc:  73.36; ppl:  2.45; xent: 0.90; lr: 0.00010; 10282/4094 tok/s;   7904 sec
[2021-04-23 08:58:19,248 INFO] Step 41000/50000; acc:  73.70; ppl:  2.44; xent: 0.89; lr: 0.00010; 10361/4198 tok/s;   7913 sec
[2021-04-23 08:58:28,606 INFO] Step 41050/50000; acc:  73.91; ppl:  2.41; xent: 0.88; lr: 0.00010; 10784/4331 tok/s;   7923 sec
[2021-04-23 08:58:38,281 INFO] Step 41100/50000; acc:  73.49; ppl:  2.47; xent: 0.90; lr: 0.00010; 10577/4121 tok/s;   7932 sec
[2021-04-23 08:58:47,602 INFO] Step 41150/50000; acc:  73.51; ppl:  2.44; xent: 0.89; lr: 0.00010; 10932/4323 tok/s;   7942 sec
[2021-04-23 08:58:57,446 INFO] Step 41200/50000; acc:  73.44; ppl:  2.44; xent: 0.89; lr: 0.00010; 10487/4038 tok/s;   7952 sec
[2021-04-23 08:59:07,500 INFO] Step 41250/50000; acc:  73.64; ppl:  2.42; xent: 0.89; lr: 0.00010; 10115/4014 tok/s;   7962 sec
[2021-04-23 08:59:16,893 INFO] Step 41300/50000; acc:  74.07; ppl:  2.41; xent: 0.88; lr: 0.00010; 10695/4234 tok/s;   7971 sec
[2021-04-23 08:59:26,706 INFO] Step 41350/50000; acc:  73.38; ppl:  2.46; xent: 0.90; lr: 0.00010; 10659/4140 tok/s;   7981 sec
[2021-04-23 08:59:35,857 INFO] Step 41400/50000; acc:  74.13; ppl:  2.39; xent: 0.87; lr: 0.00010; 10815/4365 tok/s;   7990 sec
[2021-04-23 08:59:45,497 INFO] Step 41450/50000; acc:  73.32; ppl:  2.46; xent: 0.90; lr: 0.00010; 10811/4105 tok/s;   8000 sec
[2021-04-23 08:59:54,581 INFO] Step 41500/50000; acc:  73.81; ppl:  2.41; xent: 0.88; lr: 0.00010; 10976/4466 tok/s;   8009 sec
[2021-04-23 09:00:04,076 INFO] Step 41550/50000; acc:  73.84; ppl:  2.41; xent: 0.88; lr: 0.00010; 10674/4239 tok/s;   8018 sec
[2021-04-23 09:00:07,548 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 09:00:13,589 INFO] Step 41600/50000; acc:  73.63; ppl:  2.44; xent: 0.89; lr: 0.00010; 10746/4276 tok/s;   8028 sec
[2021-04-23 09:00:23,268 INFO] Step 41650/50000; acc:  73.94; ppl:  2.41; xent: 0.88; lr: 0.00010; 10400/4085 tok/s;   8037 sec
[2021-04-23 09:00:33,492 INFO] Step 41700/50000; acc:  73.29; ppl:  2.45; xent: 0.90; lr: 0.00010; 10079/4034 tok/s;   8048 sec
[2021-04-23 09:00:42,815 INFO] Step 41750/50000; acc:  73.80; ppl:  2.41; xent: 0.88; lr: 0.00010; 10798/4326 tok/s;   8057 sec
[2021-04-23 09:00:52,395 INFO] Step 41800/50000; acc:  73.68; ppl:  2.42; xent: 0.88; lr: 0.00010; 10552/4215 tok/s;   8066 sec
[2021-04-23 09:01:01,584 INFO] Step 41850/50000; acc:  73.82; ppl:  2.41; xent: 0.88; lr: 0.00010; 10938/4321 tok/s;   8076 sec
[2021-04-23 09:01:11,120 INFO] Step 41900/50000; acc:  73.79; ppl:  2.44; xent: 0.89; lr: 0.00010; 10942/4203 tok/s;   8085 sec
[2021-04-23 09:01:20,715 INFO] Step 41950/50000; acc:  73.73; ppl:  2.42; xent: 0.88; lr: 0.00010; 10581/4171 tok/s;   8095 sec
[2021-04-23 09:01:30,384 INFO] Step 42000/50000; acc:  73.45; ppl:  2.45; xent: 0.90; lr: 0.00010; 10569/4158 tok/s;   8104 sec
[2021-04-23 09:01:39,830 INFO] Step 42050/50000; acc:  73.99; ppl:  2.40; xent: 0.88; lr: 0.00010; 10770/4273 tok/s;   8114 sec
[2021-04-23 09:01:49,215 INFO] Step 42100/50000; acc:  74.21; ppl:  2.40; xent: 0.88; lr: 0.00010; 10700/4297 tok/s;   8123 sec
[2021-04-23 09:01:58,944 INFO] Step 42150/50000; acc:  73.66; ppl:  2.43; xent: 0.89; lr: 0.00010; 10786/4106 tok/s;   8133 sec
[2021-04-23 09:02:07,827 INFO] Step 42200/50000; acc:  74.07; ppl:  2.39; xent: 0.87; lr: 0.00010; 11129/4476 tok/s;   8142 sec
[2021-04-23 09:02:17,234 INFO] Step 42250/50000; acc:  73.70; ppl:  2.43; xent: 0.89; lr: 0.00010; 10998/4295 tok/s;   8151 sec
[2021-04-23 09:02:21,403 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 09:02:26,866 INFO] Step 42300/50000; acc:  74.12; ppl:  2.39; xent: 0.87; lr: 0.00010; 10418/4178 tok/s;   8161 sec
[2021-04-23 09:02:36,577 INFO] Step 42350/50000; acc:  73.99; ppl:  2.39; xent: 0.87; lr: 0.00010; 10506/4161 tok/s;   8171 sec
[2021-04-23 09:02:46,217 INFO] Step 42400/50000; acc:  73.67; ppl:  2.44; xent: 0.89; lr: 0.00010; 10533/4184 tok/s;   8180 sec
[2021-04-23 09:02:56,186 INFO] Step 42450/50000; acc:  74.09; ppl:  2.40; xent: 0.88; lr: 0.00010; 10013/4100 tok/s;   8190 sec
[2021-04-23 09:03:05,424 INFO] Step 42500/50000; acc:  73.72; ppl:  2.42; xent: 0.88; lr: 0.00010; 11186/4384 tok/s;   8199 sec
[2021-04-23 09:03:14,668 INFO] Step 42550/50000; acc:  74.10; ppl:  2.39; xent: 0.87; lr: 0.00010; 10886/4313 tok/s;   8209 sec
[2021-04-23 09:03:24,333 INFO] Step 42600/50000; acc:  73.85; ppl:  2.41; xent: 0.88; lr: 0.00010; 10483/4161 tok/s;   8218 sec
[2021-04-23 09:03:34,023 INFO] Step 42650/50000; acc:  73.74; ppl:  2.41; xent: 0.88; lr: 0.00010; 10637/4072 tok/s;   8228 sec
[2021-04-23 09:03:43,739 INFO] Step 42700/50000; acc:  73.82; ppl:  2.41; xent: 0.88; lr: 0.00010; 10563/4174 tok/s;   8238 sec
[2021-04-23 09:03:53,363 INFO] Step 42750/50000; acc:  74.03; ppl:  2.40; xent: 0.88; lr: 0.00010; 10519/4183 tok/s;   8247 sec
[2021-04-23 09:04:03,010 INFO] Step 42800/50000; acc:  73.92; ppl:  2.41; xent: 0.88; lr: 0.00010; 10591/4206 tok/s;   8257 sec
[2021-04-23 09:04:12,461 INFO] Step 42850/50000; acc:  74.02; ppl:  2.40; xent: 0.88; lr: 0.00010; 10730/4227 tok/s;   8267 sec
[2021-04-23 09:04:22,102 INFO] Step 42900/50000; acc:  74.02; ppl:  2.38; xent: 0.87; lr: 0.00010; 10512/4157 tok/s;   8276 sec
[2021-04-23 09:04:31,014 INFO] Step 42950/50000; acc:  73.99; ppl:  2.41; xent: 0.88; lr: 0.00010; 11686/4428 tok/s;   8285 sec
[2021-04-23 09:04:40,220 INFO] Step 43000/50000; acc:  74.28; ppl:  2.37; xent: 0.86; lr: 0.00010; 10683/4363 tok/s;   8294 sec
[2021-04-23 09:04:41,956 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 09:04:50,149 INFO] Step 43050/50000; acc:  73.96; ppl:  2.40; xent: 0.88; lr: 0.00010; 10483/4127 tok/s;   8304 sec
[2021-04-23 09:04:59,739 INFO] Step 43100/50000; acc:  74.18; ppl:  2.39; xent: 0.87; lr: 0.00010; 10477/4154 tok/s;   8314 sec
[2021-04-23 09:05:09,918 INFO] Step 43150/50000; acc:  73.82; ppl:  2.40; xent: 0.87; lr: 0.00010; 9919/3986 tok/s;   8324 sec
[2021-04-23 09:05:19,544 INFO] Step 43200/50000; acc:  73.98; ppl:  2.41; xent: 0.88; lr: 0.00010; 10581/4265 tok/s;   8334 sec
[2021-04-23 09:05:28,810 INFO] Step 43250/50000; acc:  74.41; ppl:  2.36; xent: 0.86; lr: 0.00010; 10798/4331 tok/s;   8343 sec
[2021-04-23 09:05:38,065 INFO] Step 43300/50000; acc:  73.98; ppl:  2.41; xent: 0.88; lr: 0.00010; 11096/4301 tok/s;   8352 sec
[2021-04-23 09:05:47,650 INFO] Step 43350/50000; acc:  73.96; ppl:  2.38; xent: 0.87; lr: 0.00010; 10692/4167 tok/s;   8362 sec
[2021-04-23 09:05:57,918 INFO] Step 43400/50000; acc:  73.90; ppl:  2.40; xent: 0.87; lr: 0.00010; 9948/3920 tok/s;   8372 sec
[2021-04-23 09:06:07,527 INFO] Step 43450/50000; acc:  74.37; ppl:  2.37; xent: 0.86; lr: 0.00010; 10537/4178 tok/s;   8382 sec
[2021-04-23 09:06:17,001 INFO] Step 43500/50000; acc:  74.21; ppl:  2.38; xent: 0.87; lr: 0.00010; 10844/4238 tok/s;   8391 sec
[2021-04-23 09:06:26,594 INFO] Step 43550/50000; acc:  74.35; ppl:  2.39; xent: 0.87; lr: 0.00010; 10545/4216 tok/s;   8401 sec
[2021-04-23 09:06:36,222 INFO] Step 43600/50000; acc:  74.24; ppl:  2.37; xent: 0.86; lr: 0.00010; 10636/4112 tok/s;   8410 sec
[2021-04-23 09:06:45,356 INFO] Step 43650/50000; acc:  74.18; ppl:  2.38; xent: 0.87; lr: 0.00010; 11184/4411 tok/s;   8419 sec
[2021-04-23 09:06:54,417 INFO] Step 43700/50000; acc:  74.37; ppl:  2.36; xent: 0.86; lr: 0.00010; 11006/4406 tok/s;   8428 sec
[2021-04-23 09:07:02,961 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 09:07:04,289 INFO] Step 43750/50000; acc:  73.76; ppl:  2.41; xent: 0.88; lr: 0.00010; 10591/4121 tok/s;   8438 sec
[2021-04-23 09:07:13,866 INFO] Step 43800/50000; acc:  74.82; ppl:  2.34; xent: 0.85; lr: 0.00010; 10322/4190 tok/s;   8448 sec
[2021-04-23 09:07:23,929 INFO] Step 43850/50000; acc:  73.56; ppl:  2.43; xent: 0.89; lr: 0.00010; 10337/3975 tok/s;   8458 sec
[2021-04-23 09:07:33,545 INFO] Step 43900/50000; acc:  74.53; ppl:  2.36; xent: 0.86; lr: 0.00010; 10328/4292 tok/s;   8468 sec
[2021-04-23 09:07:43,045 INFO] Step 43950/50000; acc:  74.07; ppl:  2.39; xent: 0.87; lr: 0.00010; 10694/4259 tok/s;   8477 sec
[2021-04-23 09:07:52,440 INFO] Step 44000/50000; acc:  74.22; ppl:  2.38; xent: 0.87; lr: 0.00010; 10837/4252 tok/s;   8486 sec
[2021-04-23 09:08:01,839 INFO] Step 44050/50000; acc:  74.33; ppl:  2.37; xent: 0.86; lr: 0.00010; 10678/4240 tok/s;   8496 sec
[2021-04-23 09:08:11,485 INFO] Step 44100/50000; acc:  73.90; ppl:  2.40; xent: 0.87; lr: 0.00010; 10836/4136 tok/s;   8506 sec
[2021-04-23 09:08:21,839 INFO] Step 44150/50000; acc:  73.92; ppl:  2.38; xent: 0.87; lr: 0.00010; 9841/3960 tok/s;   8516 sec
[2021-04-23 09:08:30,985 INFO] Step 44200/50000; acc:  74.62; ppl:  2.35; xent: 0.85; lr: 0.00010; 11019/4273 tok/s;   8525 sec
[2021-04-23 09:08:40,595 INFO] Step 44250/50000; acc:  74.53; ppl:  2.37; xent: 0.86; lr: 0.00010; 10577/4253 tok/s;   8535 sec
[2021-04-23 09:08:50,394 INFO] Step 44300/50000; acc:  74.21; ppl:  2.38; xent: 0.87; lr: 0.00010; 10459/4078 tok/s;   8544 sec
[2021-04-23 09:08:59,967 INFO] Step 44350/50000; acc:  74.49; ppl:  2.35; xent: 0.86; lr: 0.00010; 10633/4196 tok/s;   8554 sec
[2021-04-23 09:09:08,804 INFO] Step 44400/50000; acc:  74.12; ppl:  2.38; xent: 0.87; lr: 0.00010; 11522/4488 tok/s;   8563 sec
[2021-04-23 09:09:18,399 INFO] Step 44450/50000; acc:  74.30; ppl:  2.37; xent: 0.86; lr: 0.00010; 10504/4221 tok/s;   8572 sec
[2021-04-23 09:09:24,100 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 09:09:27,875 INFO] Step 44500/50000; acc:  74.49; ppl:  2.36; xent: 0.86; lr: 0.00010; 10657/4273 tok/s;   8582 sec
[2021-04-23 09:09:38,117 INFO] Step 44550/50000; acc:  73.83; ppl:  2.39; xent: 0.87; lr: 0.00010; 10273/3961 tok/s;   8592 sec
[2021-04-23 09:09:47,860 INFO] Step 44600/50000; acc:  74.43; ppl:  2.35; xent: 0.86; lr: 0.00010; 10074/4110 tok/s;   8602 sec
[2021-04-23 09:09:57,612 INFO] Step 44650/50000; acc:  74.14; ppl:  2.39; xent: 0.87; lr: 0.00010; 10585/4206 tok/s;   8612 sec
[2021-04-23 09:10:06,917 INFO] Step 44700/50000; acc:  74.59; ppl:  2.33; xent: 0.85; lr: 0.00010; 10759/4341 tok/s;   8621 sec
[2021-04-23 09:10:16,246 INFO] Step 44750/50000; acc:  74.43; ppl:  2.37; xent: 0.86; lr: 0.00010; 10836/4254 tok/s;   8630 sec
[2021-04-23 09:10:25,684 INFO] Step 44800/50000; acc:  74.30; ppl:  2.38; xent: 0.87; lr: 0.00010; 10914/4275 tok/s;   8640 sec
[2021-04-23 09:10:35,370 INFO] Step 44850/50000; acc:  74.47; ppl:  2.35; xent: 0.86; lr: 0.00010; 10476/4100 tok/s;   8649 sec
[2021-04-23 09:10:45,475 INFO] Step 44900/50000; acc:  74.30; ppl:  2.38; xent: 0.87; lr: 0.00010; 10248/3995 tok/s;   8660 sec
[2021-04-23 09:10:55,033 INFO] Step 44950/50000; acc:  74.89; ppl:  2.33; xent: 0.85; lr: 0.00010; 10580/4230 tok/s;   8669 sec
[2021-04-23 09:11:04,517 INFO] Step 45000/50000; acc:  74.56; ppl:  2.35; xent: 0.86; lr: 0.00010; 10686/4180 tok/s;   8679 sec
[2021-04-23 09:11:04,521 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/loose/valid.txt, align=None)...
[2021-04-23 09:11:12,424 INFO] Validation perplexity: 3.10618
[2021-04-23 09:11:12,424 INFO] Validation accuracy: 69.3385
[2021-04-23 09:11:12,426 INFO] Saving checkpoint ../models/default_params/loose_ops/model_step_45000.pt
[2021-04-23 09:11:22,439 INFO] Step 45050/50000; acc:  74.48; ppl:  2.35; xent: 0.85; lr: 0.00010; 5633/2262 tok/s;   8696 sec
[2021-04-23 09:11:31,842 INFO] Step 45100/50000; acc:  74.27; ppl:  2.36; xent: 0.86; lr: 0.00010; 10989/4175 tok/s;   8706 sec
[2021-04-23 09:11:40,999 INFO] Step 45150/50000; acc:  74.61; ppl:  2.33; xent: 0.85; lr: 0.00010; 10996/4453 tok/s;   8715 sec
[2021-04-23 09:11:50,495 INFO] Step 45200/50000; acc:  74.42; ppl:  2.34; xent: 0.85; lr: 0.00010; 10710/4236 tok/s;   8725 sec
[2021-04-23 09:11:53,612 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 09:12:00,052 INFO] Step 45250/50000; acc:  74.46; ppl:  2.36; xent: 0.86; lr: 0.00010; 10647/4263 tok/s;   8734 sec
[2021-04-23 09:12:09,795 INFO] Step 45300/50000; acc:  74.64; ppl:  2.33; xent: 0.85; lr: 0.00010; 10345/4040 tok/s;   8744 sec
[2021-04-23 09:12:20,005 INFO] Step 45350/50000; acc:  73.92; ppl:  2.41; xent: 0.88; lr: 0.00010; 10193/4074 tok/s;   8754 sec
[2021-04-23 09:12:29,134 INFO] Step 45400/50000; acc:  75.11; ppl:  2.29; xent: 0.83; lr: 0.00010; 10768/4385 tok/s;   8763 sec
[2021-04-23 09:12:38,859 INFO] Step 45450/50000; acc:  74.18; ppl:  2.38; xent: 0.87; lr: 0.00010; 10666/4146 tok/s;   8773 sec
[2021-04-23 09:12:48,039 INFO] Step 45500/50000; acc:  74.60; ppl:  2.34; xent: 0.85; lr: 0.00010; 10857/4342 tok/s;   8782 sec
[2021-04-23 09:12:57,503 INFO] Step 45550/50000; acc:  74.39; ppl:  2.35; xent: 0.85; lr: 0.00010; 10935/4212 tok/s;   8792 sec
[2021-04-23 09:13:07,419 INFO] Step 45600/50000; acc:  74.53; ppl:  2.35; xent: 0.85; lr: 0.00010; 10306/4080 tok/s;   8801 sec
[2021-04-23 09:13:17,091 INFO] Step 45650/50000; acc:  74.81; ppl:  2.34; xent: 0.85; lr: 0.00010; 10395/4125 tok/s;   8811 sec
[2021-04-23 09:13:26,541 INFO] Step 45700/50000; acc:  74.57; ppl:  2.35; xent: 0.85; lr: 0.00010; 10974/4296 tok/s;   8821 sec
[2021-04-23 09:13:36,050 INFO] Step 45750/50000; acc:  74.66; ppl:  2.34; xent: 0.85; lr: 0.00010; 10611/4225 tok/s;   8830 sec
[2021-04-23 09:13:45,700 INFO] Step 45800/50000; acc:  74.77; ppl:  2.33; xent: 0.85; lr: 0.00010; 10532/4105 tok/s;   8840 sec
[2021-04-23 09:13:54,605 INFO] Step 45850/50000; acc:  74.76; ppl:  2.34; xent: 0.85; lr: 0.00010; 11357/4485 tok/s;   8849 sec
[2021-04-23 09:14:03,962 INFO] Step 45900/50000; acc:  74.42; ppl:  2.34; xent: 0.85; lr: 0.00010; 10925/4291 tok/s;   8858 sec
[2021-04-23 09:14:07,748 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 09:14:13,654 INFO] Step 45950/50000; acc:  74.77; ppl:  2.32; xent: 0.84; lr: 0.00010; 10483/4218 tok/s;   8868 sec
[2021-04-23 09:14:23,672 INFO] Step 46000/50000; acc:  74.70; ppl:  2.33; xent: 0.84; lr: 0.00010; 10191/4007 tok/s;   8878 sec
[2021-04-23 09:14:33,337 INFO] Step 46050/50000; acc:  74.33; ppl:  2.35; xent: 0.86; lr: 0.00010; 10468/4151 tok/s;   8887 sec
[2021-04-23 09:14:43,238 INFO] Step 46100/50000; acc:  74.80; ppl:  2.32; xent: 0.84; lr: 0.00010; 10103/4125 tok/s;   8897 sec
[2021-04-23 09:14:52,782 INFO] Step 46150/50000; acc:  74.16; ppl:  2.37; xent: 0.86; lr: 0.00010; 10955/4298 tok/s;   8907 sec
[2021-04-23 09:15:02,029 INFO] Step 46200/50000; acc:  74.88; ppl:  2.29; xent: 0.83; lr: 0.00010; 10605/4283 tok/s;   8916 sec
[2021-04-23 09:15:11,772 INFO] Step 46250/50000; acc:  74.57; ppl:  2.36; xent: 0.86; lr: 0.00010; 10679/4131 tok/s;   8926 sec
[2021-04-23 09:15:21,378 INFO] Step 46300/50000; acc:  74.64; ppl:  2.33; xent: 0.85; lr: 0.00010; 10627/4103 tok/s;   8935 sec
[2021-04-23 09:15:30,884 INFO] Step 46350/50000; acc:  74.86; ppl:  2.32; xent: 0.84; lr: 0.00010; 10703/4262 tok/s;   8945 sec
[2021-04-23 09:15:40,589 INFO] Step 46400/50000; acc:  74.63; ppl:  2.34; xent: 0.85; lr: 0.00010; 10519/4148 tok/s;   8955 sec
[2021-04-23 09:15:50,098 INFO] Step 46450/50000; acc:  74.87; ppl:  2.32; xent: 0.84; lr: 0.00010; 10555/4242 tok/s;   8964 sec
[2021-04-23 09:15:59,745 INFO] Step 46500/50000; acc:  74.64; ppl:  2.34; xent: 0.85; lr: 0.00010; 10729/4158 tok/s;   8974 sec
[2021-04-23 09:16:09,369 INFO] Step 46550/50000; acc:  74.73; ppl:  2.32; xent: 0.84; lr: 0.00010; 10577/4199 tok/s;   8983 sec
[2021-04-23 09:16:18,164 INFO] Step 46600/50000; acc:  75.15; ppl:  2.30; xent: 0.83; lr: 0.00010; 11465/4427 tok/s;   8992 sec
[2021-04-23 09:16:27,595 INFO] Step 46650/50000; acc:  75.03; ppl:  2.32; xent: 0.84; lr: 0.00010; 10685/4291 tok/s;   9002 sec
[2021-04-23 09:16:28,934 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 09:16:37,452 INFO] Step 46700/50000; acc:  74.89; ppl:  2.33; xent: 0.84; lr: 0.00010; 10435/4142 tok/s;   9012 sec
[2021-04-23 09:16:47,010 INFO] Step 46750/50000; acc:  74.62; ppl:  2.33; xent: 0.85; lr: 0.00010; 10634/4180 tok/s;   9021 sec
[2021-04-23 09:16:57,330 INFO] Step 46800/50000; acc:  74.81; ppl:  2.33; xent: 0.85; lr: 0.00010; 9778/3964 tok/s;   9031 sec
[2021-04-23 09:17:06,977 INFO] Step 46850/50000; acc:  74.70; ppl:  2.33; xent: 0.85; lr: 0.00010; 10521/4194 tok/s;   9041 sec
[2021-04-23 09:17:16,410 INFO] Step 46900/50000; acc:  75.29; ppl:  2.29; xent: 0.83; lr: 0.00010; 10649/4256 tok/s;   9050 sec
[2021-04-23 09:17:25,691 INFO] Step 46950/50000; acc:  74.51; ppl:  2.34; xent: 0.85; lr: 0.00010; 11179/4295 tok/s;   9060 sec
[2021-04-23 09:17:35,299 INFO] Step 47000/50000; acc:  75.07; ppl:  2.31; xent: 0.84; lr: 0.00010; 10423/4168 tok/s;   9069 sec
[2021-04-23 09:17:45,519 INFO] Step 47050/50000; acc:  74.33; ppl:  2.35; xent: 0.85; lr: 0.00010; 10235/3968 tok/s;   9080 sec
[2021-04-23 09:17:54,852 INFO] Step 47100/50000; acc:  75.39; ppl:  2.27; xent: 0.82; lr: 0.00010; 10742/4287 tok/s;   9089 sec
[2021-04-23 09:18:04,450 INFO] Step 47150/50000; acc:  75.13; ppl:  2.31; xent: 0.84; lr: 0.00010; 10622/4187 tok/s;   9099 sec
[2021-04-23 09:18:14,077 INFO] Step 47200/50000; acc:  74.71; ppl:  2.33; xent: 0.84; lr: 0.00010; 10582/4179 tok/s;   9108 sec
[2021-04-23 09:18:23,563 INFO] Step 47250/50000; acc:  75.35; ppl:  2.28; xent: 0.82; lr: 0.00010; 10625/4160 tok/s;   9118 sec
[2021-04-23 09:18:32,753 INFO] Step 47300/50000; acc:  74.74; ppl:  2.34; xent: 0.85; lr: 0.00010; 11303/4432 tok/s;   9127 sec
[2021-04-23 09:18:41,809 INFO] Step 47350/50000; acc:  75.33; ppl:  2.28; xent: 0.83; lr: 0.00010; 11080/4388 tok/s;   9136 sec
[2021-04-23 09:18:49,774 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 09:18:51,358 INFO] Step 47400/50000; acc:  74.98; ppl:  2.29; xent: 0.83; lr: 0.00010; 10619/4228 tok/s;   9145 sec
[2021-04-23 09:19:01,131 INFO] Step 47450/50000; acc:  74.92; ppl:  2.31; xent: 0.84; lr: 0.00010; 10348/4134 tok/s;   9155 sec
[2021-04-23 09:19:11,182 INFO] Step 47500/50000; acc:  74.41; ppl:  2.35; xent: 0.85; lr: 0.00010; 10226/3982 tok/s;   9165 sec
[2021-04-23 09:19:20,954 INFO] Step 47550/50000; acc:  75.24; ppl:  2.30; xent: 0.83; lr: 0.00010; 10280/4200 tok/s;   9175 sec
[2021-04-23 09:19:30,272 INFO] Step 47600/50000; acc:  75.06; ppl:  2.30; xent: 0.83; lr: 0.00010; 10933/4352 tok/s;   9184 sec
[2021-04-23 09:19:39,760 INFO] Step 47650/50000; acc:  74.91; ppl:  2.30; xent: 0.83; lr: 0.00010; 10667/4213 tok/s;   9194 sec
[2021-04-23 09:19:49,274 INFO] Step 47700/50000; acc:  75.03; ppl:  2.30; xent: 0.83; lr: 0.00010; 10592/4218 tok/s;   9203 sec
[2021-04-23 09:19:58,998 INFO] Step 47750/50000; acc:  74.84; ppl:  2.33; xent: 0.85; lr: 0.00010; 10887/4092 tok/s;   9213 sec
[2021-04-23 09:20:09,069 INFO] Step 47800/50000; acc:  75.05; ppl:  2.29; xent: 0.83; lr: 0.00010; 9852/4039 tok/s;   9223 sec
[2021-04-23 09:20:18,303 INFO] Step 47850/50000; acc:  75.03; ppl:  2.31; xent: 0.84; lr: 0.00010; 11213/4281 tok/s;   9232 sec
[2021-04-23 09:20:27,786 INFO] Step 47900/50000; acc:  75.20; ppl:  2.29; xent: 0.83; lr: 0.00010; 10607/4295 tok/s;   9242 sec
[2021-04-23 09:20:37,515 INFO] Step 47950/50000; acc:  75.28; ppl:  2.29; xent: 0.83; lr: 0.00010; 10437/4087 tok/s;   9252 sec
[2021-04-23 09:20:47,140 INFO] Step 48000/50000; acc:  75.00; ppl:  2.30; xent: 0.83; lr: 0.00010; 10642/4196 tok/s;   9261 sec
[2021-04-23 09:20:55,945 INFO] Step 48050/50000; acc:  75.02; ppl:  2.29; xent: 0.83; lr: 0.00010; 11386/4509 tok/s;   9270 sec
[2021-04-23 09:21:05,669 INFO] Step 48100/50000; acc:  74.94; ppl:  2.31; xent: 0.84; lr: 0.00010; 10574/4159 tok/s;   9280 sec
[2021-04-23 09:21:10,999 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 09:21:15,122 INFO] Step 48150/50000; acc:  75.09; ppl:  2.30; xent: 0.83; lr: 0.00010; 10730/4275 tok/s;   9289 sec
[2021-04-23 09:21:25,156 INFO] Step 48200/50000; acc:  75.10; ppl:  2.30; xent: 0.83; lr: 0.00010; 10172/4011 tok/s;   9299 sec
[2021-04-23 09:21:35,182 INFO] Step 48250/50000; acc:  74.70; ppl:  2.30; xent: 0.83; lr: 0.00010; 10005/4048 tok/s;   9309 sec
[2021-04-23 09:21:44,926 INFO] Step 48300/50000; acc:  74.97; ppl:  2.30; xent: 0.83; lr: 0.00010; 10477/4194 tok/s;   9319 sec
[2021-04-23 09:21:54,342 INFO] Step 48350/50000; acc:  75.40; ppl:  2.27; xent: 0.82; lr: 0.00010; 10740/4288 tok/s;   9328 sec
[2021-04-23 09:22:03,592 INFO] Step 48400/50000; acc:  75.05; ppl:  2.30; xent: 0.83; lr: 0.00010; 10948/4283 tok/s;   9338 sec
[2021-04-23 09:22:12,900 INFO] Step 48450/50000; acc:  75.07; ppl:  2.30; xent: 0.83; lr: 0.00010; 11024/4301 tok/s;   9347 sec
[2021-04-23 09:22:22,721 INFO] Step 48500/50000; acc:  75.06; ppl:  2.28; xent: 0.83; lr: 0.00010; 10360/4062 tok/s;   9357 sec
[2021-04-23 09:22:32,794 INFO] Step 48550/50000; acc:  74.82; ppl:  2.32; xent: 0.84; lr: 0.00010; 10396/4023 tok/s;   9367 sec
[2021-04-23 09:22:42,164 INFO] Step 48600/50000; acc:  75.80; ppl:  2.25; xent: 0.81; lr: 0.00010; 10535/4307 tok/s;   9376 sec
[2021-04-23 09:22:51,770 INFO] Step 48650/50000; acc:  75.09; ppl:  2.30; xent: 0.83; lr: 0.00010; 10812/4143 tok/s;   9386 sec
[2021-04-23 09:23:01,244 INFO] Step 48700/50000; acc:  75.34; ppl:  2.27; xent: 0.82; lr: 0.00010; 10579/4268 tok/s;   9395 sec
[2021-04-23 09:23:10,472 INFO] Step 48750/50000; acc:  75.08; ppl:  2.27; xent: 0.82; lr: 0.00010; 11070/4288 tok/s;   9405 sec
[2021-04-23 09:23:19,697 INFO] Step 48800/50000; acc:  75.25; ppl:  2.27; xent: 0.82; lr: 0.00010; 10979/4360 tok/s;   9414 sec
[2021-04-23 09:23:29,215 INFO] Step 48850/50000; acc:  75.25; ppl:  2.27; xent: 0.82; lr: 0.00010; 10550/4264 tok/s;   9423 sec
[2021-04-23 09:23:31,978 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 09:23:38,740 INFO] Step 48900/50000; acc:  74.82; ppl:  2.31; xent: 0.84; lr: 0.00010; 10887/4287 tok/s;   9433 sec
[2021-04-23 09:23:48,450 INFO] Step 48950/50000; acc:  75.26; ppl:  2.28; xent: 0.82; lr: 0.00010; 10420/4051 tok/s;   9443 sec
[2021-04-23 09:23:58,291 INFO] Step 49000/50000; acc:  74.95; ppl:  2.30; xent: 0.83; lr: 0.00010; 10244/4177 tok/s;   9452 sec
[2021-04-23 09:24:07,616 INFO] Step 49050/50000; acc:  75.65; ppl:  2.25; xent: 0.81; lr: 0.00010; 10799/4335 tok/s;   9462 sec
[2021-04-23 09:24:17,205 INFO] Step 49100/50000; acc:  75.05; ppl:  2.28; xent: 0.82; lr: 0.00010; 10680/4165 tok/s;   9471 sec
[2021-04-23 09:24:26,436 INFO] Step 49150/50000; acc:  75.10; ppl:  2.29; xent: 0.83; lr: 0.00010; 10927/4364 tok/s;   9480 sec
[2021-04-23 09:24:35,977 INFO] Step 49200/50000; acc:  75.29; ppl:  2.27; xent: 0.82; lr: 0.00010; 10867/4161 tok/s;   9490 sec
[2021-04-23 09:24:45,864 INFO] Step 49250/50000; acc:  75.18; ppl:  2.28; xent: 0.83; lr: 0.00010; 10280/4091 tok/s;   9500 sec
[2021-04-23 09:24:55,504 INFO] Step 49300/50000; acc:  75.23; ppl:  2.29; xent: 0.83; lr: 0.00010; 10473/4137 tok/s;   9510 sec
[2021-04-23 09:25:04,985 INFO] Step 49350/50000; acc:  74.89; ppl:  2.30; xent: 0.83; lr: 0.00010; 11052/4314 tok/s;   9519 sec
[2021-04-23 09:25:14,268 INFO] Step 49400/50000; acc:  75.90; ppl:  2.23; xent: 0.80; lr: 0.00010; 10600/4280 tok/s;   9528 sec
[2021-04-23 09:25:24,161 INFO] Step 49450/50000; acc:  75.10; ppl:  2.30; xent: 0.83; lr: 0.00010; 10548/4046 tok/s;   9538 sec
[2021-04-23 09:25:32,817 INFO] Step 49500/50000; acc:  75.63; ppl:  2.25; xent: 0.81; lr: 0.00010; 11587/4578 tok/s;   9547 sec
[2021-04-23 09:25:42,379 INFO] Step 49550/50000; acc:  75.42; ppl:  2.27; xent: 0.82; lr: 0.00010; 10565/4199 tok/s;   9556 sec
[2021-04-23 09:25:45,725 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-23 09:25:52,001 INFO] Step 49600/50000; acc:  75.13; ppl:  2.28; xent: 0.82; lr: 0.00010; 10669/4284 tok/s;   9566 sec
[2021-04-23 09:26:01,687 INFO] Step 49650/50000; acc:  75.75; ppl:  2.24; xent: 0.81; lr: 0.00010; 10364/4108 tok/s;   9576 sec
[2021-04-23 09:26:11,160 INFO] Step 49700/50000; acc:  75.03; ppl:  2.30; xent: 0.83; lr: 0.00010; 10848/4274 tok/s;   9585 sec
[2021-04-23 09:26:21,130 INFO] Step 49750/50000; acc:  75.43; ppl:  2.26; xent: 0.82; lr: 0.00010; 10111/4119 tok/s;   9595 sec
[2021-04-23 09:26:30,483 INFO] Step 49800/50000; acc:  75.42; ppl:  2.26; xent: 0.82; lr: 0.00010; 10832/4313 tok/s;   9605 sec
[2021-04-23 09:26:39,523 INFO] Step 49850/50000; acc:  75.47; ppl:  2.24; xent: 0.81; lr: 0.00010; 11097/4372 tok/s;   9614 sec
[2021-04-23 09:26:49,357 INFO] Step 49900/50000; acc:  75.06; ppl:  2.28; xent: 0.82; lr: 0.00010; 10488/4122 tok/s;   9623 sec
[2021-04-23 09:26:59,031 INFO] Step 49950/50000; acc:  74.79; ppl:  2.29; xent: 0.83; lr: 0.00010; 10655/4103 tok/s;   9633 sec
[2021-04-23 09:27:08,690 INFO] Step 50000/50000; acc:  75.43; ppl:  2.25; xent: 0.81; lr: 0.00005; 10542/4171 tok/s;   9643 sec
[2021-04-23 09:27:08,695 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/loose/valid.txt, align=None)...
[2021-04-23 09:27:16,610 INFO] Validation perplexity: 3.18068
[2021-04-23 09:27:16,610 INFO] Validation accuracy: 69.2652
[2021-04-23 09:27:16,612 INFO] Saving checkpoint ../models/default_params/loose_ops/model_step_50000.pt

Parameter group 1

These models are trained with the GRU RNN architecture instead of the default LSTM architecture. All other parameters are the same as the default parameter group.

GROUP1_PARAMS = {
    "rnnType": "GRU"
}

Model paths

Variable name Value
MODEL_GROUP1_CONTROL "../models/group1_params/control"
MODEL_GROUP1_BASIC "../models/group1_params/basic_ops"
MODEL_GROUP1_STRICT "../models/group1_params/strict_ops"
MODEL_GROUP1_LOOSE "../models/group1_params/loose_ops"

Models

Control:

modelGroup1Control = HephaestusModel(MODEL_GROUP1_CONTROL)
modelGroup1Control.train(
    DATA_SMALL_METHODS_TRAIN_BUGGY,
    DATA_SMALL_METHODS_TRAIN_FIXED,
    DATA_SMALL_METHODS_VALID_BUGGY,
    DATA_SMALL_METHODS_VALID_FIXED,
    **GROUP1_PARAMS
)
[2021-04-24 22:32:39,517 INFO] Counter vocab from -1 samples.
[2021-04-24 22:32:39,517 INFO] n_sample=-1: Build vocab on full datasets.
[2021-04-24 22:32:39,526 INFO] corpus_1's transforms: TransformPipe()
[2021-04-24 22:32:39,527 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 22:32:40,127 INFO] Counters src:429
[2021-04-24 22:32:40,127 INFO] Counters tgt:423
[2021-04-24 22:32:40,127 WARNING] path ../models/group1_params/control/save_data.vocab.src exists, may overwrite...
[2021-04-24 22:32:40,128 WARNING] path ../models/group1_params/control/save_data.vocab.tgt exists, may overwrite...
[2021-04-24 22:32:40,765 INFO] Parsed 2 corpora from -data.
[2021-04-24 22:32:40,766 INFO] Get special vocabs from Transforms: {'src': set(), 'tgt': set()}.
[2021-04-24 22:32:40,766 INFO] Loading vocab from text file...
[2021-04-24 22:32:40,766 INFO] Loading src vocabulary from ../models/group1_params/control/save_data.vocab.src
[2021-04-24 22:32:40,768 INFO] Loaded src vocab has 429 tokens.
[2021-04-24 22:32:40,768 INFO] Loading tgt vocabulary from ../models/group1_params/control/save_data.vocab.tgt
[2021-04-24 22:32:40,769 INFO] Loaded tgt vocab has 423 tokens.
[2021-04-24 22:32:40,770 INFO] Building fields with vocab in counters...
[2021-04-24 22:32:40,770 INFO]  * tgt vocab size: 427.
[2021-04-24 22:32:40,771 INFO]  * src vocab size: 431.
[2021-04-24 22:32:40,771 INFO]  * src vocab size = 431
[2021-04-24 22:32:40,771 INFO]  * tgt vocab size = 427
[2021-04-24 22:32:40,772 INFO] Building model...
[2021-04-24 22:32:41,923 INFO] NMTModel(
  (encoder): RNNEncoder(
    (embeddings): Embeddings(
      (make_embedding): Sequential(
        (emb_luts): Elementwise(
          (0): Embedding(431, 512, padding_idx=1)
        )
      )
    )
    (rnn): GRU(512, 256, num_layers=2, dropout=0.2)
  )
  (decoder): InputFeedRNNDecoder(
    (embeddings): Embeddings(
      (make_embedding): Sequential(
        (emb_luts): Elementwise(
          (0): Embedding(427, 512, padding_idx=1)
        )
      )
    )
    (dropout): Dropout(p=0.2, inplace=False)
    (rnn): StackedGRU(
      (dropout): Dropout(p=0.2, inplace=False)
      (layers): ModuleList(
        (0): GRUCell(768, 256)
        (1): GRUCell(256, 256)
      )
    )
    (attn): GlobalAttention(
      (linear_context): Linear(in_features=256, out_features=256, bias=False)
      (linear_query): Linear(in_features=256, out_features=256, bias=True)
      (v): Linear(in_features=256, out_features=1, bias=False)
      (linear_out): Linear(in_features=512, out_features=256, bias=True)
    )
  )
  (generator): Sequential(
    (0): Linear(in_features=256, out_features=427, bias=True)
    (1): Cast()
    (2): LogSoftmax(dim=-1)
  )
)
[2021-04-24 22:32:41,923 INFO] encoder: 1206784
[2021-04-24 22:32:41,923 INFO] decoder: 1773995
[2021-04-24 22:32:41,923 INFO] * number of parameters: 2980779
[2021-04-24 22:32:41,925 INFO] Starting training on GPU: [0]
[2021-04-24 22:32:41,925 INFO] Start training loop and validate every 5000 steps...
[2021-04-24 22:32:41,925 INFO] corpus_1's transforms: TransformPipe()
[2021-04-24 22:32:41,925 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 22:32:51,382 INFO] Step 50/50000; acc:  12.74; ppl: 97.07; xent: 4.58; lr: 0.00010; 10920/10296 tok/s;      9 sec
[2021-04-24 22:33:00,398 INFO] Step 100/50000; acc:  19.10; ppl: 32.57; xent: 3.48; lr: 0.00010; 11174/10536 tok/s;     18 sec
[2021-04-24 22:33:09,683 INFO] Step 150/50000; acc:  27.56; ppl: 24.33; xent: 3.19; lr: 0.00010; 11046/10444 tok/s;     28 sec
[2021-04-24 22:33:18,805 INFO] Step 200/50000; acc:  38.75; ppl: 17.54; xent: 2.86; lr: 0.00010; 10986/10430 tok/s;     37 sec
[2021-04-24 22:33:27,883 INFO] Step 250/50000; acc:  45.94; ppl: 12.92; xent: 2.56; lr: 0.00010; 11008/10446 tok/s;     46 sec
[2021-04-24 22:33:37,273 INFO] Step 300/50000; acc:  49.51; ppl: 10.45; xent: 2.35; lr: 0.00010; 11054/10371 tok/s;     55 sec
[2021-04-24 22:33:46,561 INFO] Step 350/50000; acc:  52.65; ppl:  8.49; xent: 2.14; lr: 0.00010; 10919/10359 tok/s;     65 sec
[2021-04-24 22:33:55,987 INFO] Step 400/50000; acc:  54.48; ppl:  7.44; xent: 2.01; lr: 0.00010; 11002/10347 tok/s;     74 sec
[2021-04-24 22:34:05,243 INFO] Step 450/50000; acc:  56.95; ppl:  6.60; xent: 1.89; lr: 0.00010; 10907/10278 tok/s;     83 sec
[2021-04-24 22:34:14,595 INFO] Step 500/50000; acc:  58.40; ppl:  6.05; xent: 1.80; lr: 0.00010; 11042/10347 tok/s;     93 sec
[2021-04-24 22:34:23,794 INFO] Step 550/50000; acc:  60.21; ppl:  5.55; xent: 1.71; lr: 0.00010; 10958/10465 tok/s;    102 sec
[2021-04-24 22:34:33,027 INFO] Step 600/50000; acc:  62.26; ppl:  5.07; xent: 1.62; lr: 0.00010; 11089/10377 tok/s;    111 sec
[2021-04-24 22:34:42,202 INFO] Step 650/50000; acc:  63.77; ppl:  4.72; xent: 1.55; lr: 0.00010; 10936/10327 tok/s;    120 sec
[2021-04-24 22:34:51,318 INFO] Step 700/50000; acc:  65.52; ppl:  4.34; xent: 1.47; lr: 0.00010; 10997/10340 tok/s;    129 sec
[2021-04-24 22:34:52,077 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 22:35:00,707 INFO] Step 750/50000; acc:  66.33; ppl:  4.17; xent: 1.43; lr: 0.00010; 10938/10291 tok/s;    139 sec
[2021-04-24 22:35:09,794 INFO] Step 800/50000; acc:  67.87; ppl:  3.93; xent: 1.37; lr: 0.00010; 11145/10495 tok/s;    148 sec
[2021-04-24 22:35:19,020 INFO] Step 850/50000; acc:  68.81; ppl:  3.77; xent: 1.33; lr: 0.00010; 11054/10502 tok/s;    157 sec
[2021-04-24 22:35:28,256 INFO] Step 900/50000; acc:  70.07; ppl:  3.53; xent: 1.26; lr: 0.00010; 10973/10365 tok/s;    166 sec
[2021-04-24 22:35:37,664 INFO] Step 950/50000; acc:  71.48; ppl:  3.33; xent: 1.20; lr: 0.00010; 10927/10338 tok/s;    176 sec
[2021-04-24 22:35:46,697 INFO] Step 1000/50000; acc:  72.67; ppl:  3.20; xent: 1.16; lr: 0.00010; 11048/10382 tok/s;    185 sec
[2021-04-24 22:35:55,963 INFO] Step 1050/50000; acc:  73.64; ppl:  3.07; xent: 1.12; lr: 0.00010; 10965/10422 tok/s;    194 sec
[2021-04-24 22:36:05,375 INFO] Step 1100/50000; acc:  74.50; ppl:  2.96; xent: 1.09; lr: 0.00010; 11093/10442 tok/s;    203 sec
[2021-04-24 22:36:14,619 INFO] Step 1150/50000; acc:  75.66; ppl:  2.82; xent: 1.04; lr: 0.00010; 10766/10182 tok/s;    213 sec
[2021-04-24 22:36:24,014 INFO] Step 1200/50000; acc:  75.97; ppl:  2.80; xent: 1.03; lr: 0.00010; 11075/10353 tok/s;    222 sec
[2021-04-24 22:36:33,181 INFO] Step 1250/50000; acc:  76.88; ppl:  2.69; xent: 0.99; lr: 0.00010; 11001/10476 tok/s;    231 sec
[2021-04-24 22:36:42,554 INFO] Step 1300/50000; acc:  77.57; ppl:  2.63; xent: 0.97; lr: 0.00010; 11048/10382 tok/s;    241 sec
[2021-04-24 22:36:51,717 INFO] Step 1350/50000; acc:  78.28; ppl:  2.53; xent: 0.93; lr: 0.00010; 11063/10407 tok/s;    250 sec
[2021-04-24 22:37:00,909 INFO] Step 1400/50000; acc:  79.20; ppl:  2.45; xent: 0.90; lr: 0.00010; 10964/10255 tok/s;    259 sec
[2021-04-24 22:37:08,202 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 22:37:10,091 INFO] Step 1450/50000; acc:  79.48; ppl:  2.43; xent: 0.89; lr: 0.00010; 11007/10414 tok/s;    268 sec
[2021-04-24 22:37:19,349 INFO] Step 1500/50000; acc:  80.10; ppl:  2.37; xent: 0.86; lr: 0.00010; 10849/10274 tok/s;    277 sec
[2021-04-24 22:37:28,599 INFO] Step 1550/50000; acc:  80.06; ppl:  2.36; xent: 0.86; lr: 0.00010; 11102/10442 tok/s;    287 sec
[2021-04-24 22:37:37,666 INFO] Step 1600/50000; acc:  80.73; ppl:  2.30; xent: 0.83; lr: 0.00010; 11028/10541 tok/s;    296 sec
[2021-04-24 22:37:47,107 INFO] Step 1650/50000; acc:  81.55; ppl:  2.24; xent: 0.80; lr: 0.00010; 10900/10225 tok/s;    305 sec
[2021-04-24 22:37:56,277 INFO] Step 1700/50000; acc:  81.92; ppl:  2.20; xent: 0.79; lr: 0.00010; 11016/10435 tok/s;    314 sec
[2021-04-24 22:38:05,692 INFO] Step 1750/50000; acc:  82.23; ppl:  2.17; xent: 0.77; lr: 0.00010; 10972/10364 tok/s;    324 sec
[2021-04-24 22:38:14,906 INFO] Step 1800/50000; acc:  83.06; ppl:  2.10; xent: 0.74; lr: 0.00010; 11004/10412 tok/s;    333 sec
[2021-04-24 22:38:24,197 INFO] Step 1850/50000; acc:  82.86; ppl:  2.10; xent: 0.74; lr: 0.00010; 10887/10278 tok/s;    342 sec
[2021-04-24 22:38:33,677 INFO] Step 1900/50000; acc:  83.37; ppl:  2.08; xent: 0.73; lr: 0.00010; 10863/10169 tok/s;    352 sec
[2021-04-24 22:38:42,917 INFO] Step 1950/50000; acc:  83.65; ppl:  2.06; xent: 0.72; lr: 0.00010; 10822/10247 tok/s;    361 sec
[2021-04-24 22:38:52,238 INFO] Step 2000/50000; acc:  83.87; ppl:  2.02; xent: 0.70; lr: 0.00010; 11137/10513 tok/s;    370 sec
[2021-04-24 22:39:01,512 INFO] Step 2050/50000; acc:  84.37; ppl:  1.98; xent: 0.68; lr: 0.00010; 10947/10264 tok/s;    380 sec
[2021-04-24 22:39:10,807 INFO] Step 2100/50000; acc:  84.80; ppl:  1.97; xent: 0.68; lr: 0.00010; 11047/10368 tok/s;    389 sec
[2021-04-24 22:39:19,934 INFO] Step 2150/50000; acc:  85.09; ppl:  1.93; xent: 0.66; lr: 0.00010; 10986/10366 tok/s;    398 sec
[2021-04-24 22:39:24,590 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 22:39:29,173 INFO] Step 2200/50000; acc:  85.32; ppl:  1.93; xent: 0.66; lr: 0.00010; 11048/10359 tok/s;    407 sec
[2021-04-24 22:39:38,397 INFO] Step 2250/50000; acc:  85.51; ppl:  1.92; xent: 0.65; lr: 0.00010; 11000/10459 tok/s;    416 sec
[2021-04-24 22:39:47,345 INFO] Step 2300/50000; acc:  85.48; ppl:  1.91; xent: 0.65; lr: 0.00010; 11174/10539 tok/s;    425 sec
[2021-04-24 22:39:56,691 INFO] Step 2350/50000; acc:  85.59; ppl:  1.89; xent: 0.64; lr: 0.00010; 10905/10364 tok/s;    435 sec
[2021-04-24 22:40:05,878 INFO] Step 2400/50000; acc:  86.35; ppl:  1.83; xent: 0.61; lr: 0.00010; 10963/10379 tok/s;    444 sec
[2021-04-24 22:40:15,157 INFO] Step 2450/50000; acc:  86.39; ppl:  1.84; xent: 0.61; lr: 0.00010; 11036/10351 tok/s;    453 sec
[2021-04-24 22:40:24,534 INFO] Step 2500/50000; acc:  86.45; ppl:  1.83; xent: 0.60; lr: 0.00010; 10917/10377 tok/s;    463 sec
[2021-04-24 22:40:33,936 INFO] Step 2550/50000; acc:  87.06; ppl:  1.79; xent: 0.58; lr: 0.00010; 11079/10374 tok/s;    472 sec
[2021-04-24 22:40:43,224 INFO] Step 2600/50000; acc:  86.84; ppl:  1.80; xent: 0.59; lr: 0.00010; 10811/10276 tok/s;    481 sec
[2021-04-24 22:40:52,481 INFO] Step 2650/50000; acc:  86.99; ppl:  1.79; xent: 0.58; lr: 0.00010; 10855/10199 tok/s;    491 sec
[2021-04-24 22:41:01,877 INFO] Step 2700/50000; acc:  87.19; ppl:  1.78; xent: 0.57; lr: 0.00010; 11020/10376 tok/s;    500 sec
[2021-04-24 22:41:10,907 INFO] Step 2750/50000; acc:  87.26; ppl:  1.77; xent: 0.57; lr: 0.00010; 11010/10435 tok/s;    509 sec
[2021-04-24 22:41:20,343 INFO] Step 2800/50000; acc:  87.58; ppl:  1.74; xent: 0.55; lr: 0.00010; 11064/10363 tok/s;    518 sec
[2021-04-24 22:41:29,466 INFO] Step 2850/50000; acc:  87.63; ppl:  1.75; xent: 0.56; lr: 0.00010; 11002/10312 tok/s;    528 sec
[2021-04-24 22:41:38,768 INFO] Step 2900/50000; acc:  88.02; ppl:  1.71; xent: 0.54; lr: 0.00010; 11068/10450 tok/s;    537 sec
[2021-04-24 22:41:40,787 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 22:41:48,037 INFO] Step 2950/50000; acc:  87.90; ppl:  1.73; xent: 0.55; lr: 0.00010; 10910/10213 tok/s;    546 sec
[2021-04-24 22:41:57,169 INFO] Step 3000/50000; acc:  88.07; ppl:  1.71; xent: 0.54; lr: 0.00010; 11130/10589 tok/s;    555 sec
[2021-04-24 22:42:06,199 INFO] Step 3050/50000; acc:  88.02; ppl:  1.72; xent: 0.54; lr: 0.00010; 11131/10514 tok/s;    564 sec
[2021-04-24 22:42:15,449 INFO] Step 3100/50000; acc:  88.17; ppl:  1.70; xent: 0.53; lr: 0.00010; 10830/10295 tok/s;    574 sec
[2021-04-24 22:42:24,705 INFO] Step 3150/50000; acc:  88.66; ppl:  1.67; xent: 0.51; lr: 0.00010; 11038/10407 tok/s;    583 sec
[2021-04-24 22:42:33,834 INFO] Step 3200/50000; acc:  88.70; ppl:  1.67; xent: 0.51; lr: 0.00010; 11009/10453 tok/s;    592 sec
[2021-04-24 22:42:43,355 INFO] Step 3250/50000; acc:  89.07; ppl:  1.64; xent: 0.49; lr: 0.00010; 11010/10330 tok/s;    601 sec
[2021-04-24 22:42:52,543 INFO] Step 3300/50000; acc:  88.62; ppl:  1.67; xent: 0.51; lr: 0.00010; 11031/10401 tok/s;    611 sec
[2021-04-24 22:43:01,972 INFO] Step 3350/50000; acc:  88.91; ppl:  1.65; xent: 0.50; lr: 0.00010; 10960/10339 tok/s;    620 sec
[2021-04-24 22:43:11,215 INFO] Step 3400/50000; acc:  88.83; ppl:  1.66; xent: 0.50; lr: 0.00010; 10882/10256 tok/s;    629 sec
[2021-04-24 22:43:20,367 INFO] Step 3450/50000; acc:  88.89; ppl:  1.65; xent: 0.50; lr: 0.00010; 10942/10397 tok/s;    638 sec
[2021-04-24 22:43:29,683 INFO] Step 3500/50000; acc:  89.18; ppl:  1.62; xent: 0.48; lr: 0.00010; 11163/10511 tok/s;    648 sec
[2021-04-24 22:43:38,807 INFO] Step 3550/50000; acc:  89.41; ppl:  1.61; xent: 0.48; lr: 0.00010; 10895/10232 tok/s;    657 sec
[2021-04-24 22:43:48,200 INFO] Step 3600/50000; acc:  89.31; ppl:  1.62; xent: 0.48; lr: 0.00010; 11020/10348 tok/s;    666 sec
[2021-04-24 22:43:51,032 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 22:43:57,394 INFO] Step 3650/50000; acc:  89.33; ppl:  1.62; xent: 0.48; lr: 0.00010; 11027/10367 tok/s;    675 sec
[2021-04-24 22:44:06,733 INFO] Step 3700/50000; acc:  89.73; ppl:  1.61; xent: 0.47; lr: 0.00010; 11061/10398 tok/s;    685 sec
[2021-04-24 22:44:15,785 INFO] Step 3750/50000; acc:  89.40; ppl:  1.62; xent: 0.48; lr: 0.00010; 11084/10487 tok/s;    694 sec
[2021-04-24 22:44:25,030 INFO] Step 3800/50000; acc:  89.38; ppl:  1.61; xent: 0.48; lr: 0.00010; 10936/10384 tok/s;    703 sec
[2021-04-24 22:44:34,172 INFO] Step 3850/50000; acc:  89.83; ppl:  1.59; xent: 0.46; lr: 0.00010; 11032/10469 tok/s;    712 sec
[2021-04-24 22:44:43,284 INFO] Step 3900/50000; acc:  89.75; ppl:  1.58; xent: 0.46; lr: 0.00010; 10968/10351 tok/s;    721 sec
[2021-04-24 22:44:52,589 INFO] Step 3950/50000; acc:  90.04; ppl:  1.57; xent: 0.45; lr: 0.00010; 11052/10418 tok/s;    731 sec
[2021-04-24 22:45:01,966 INFO] Step 4000/50000; acc:  90.21; ppl:  1.56; xent: 0.45; lr: 0.00010; 10957/10366 tok/s;    740 sec
[2021-04-24 22:45:11,355 INFO] Step 4050/50000; acc:  89.88; ppl:  1.57; xent: 0.45; lr: 0.00010; 10954/10324 tok/s;    749 sec
[2021-04-24 22:45:20,589 INFO] Step 4100/50000; acc:  89.95; ppl:  1.57; xent: 0.45; lr: 0.00010; 10982/10329 tok/s;    759 sec
[2021-04-24 22:45:29,959 INFO] Step 4150/50000; acc:  90.05; ppl:  1.57; xent: 0.45; lr: 0.00010; 11001/10349 tok/s;    768 sec
[2021-04-24 22:45:39,106 INFO] Step 4200/50000; acc:  90.13; ppl:  1.56; xent: 0.45; lr: 0.00010; 10982/10445 tok/s;    777 sec
[2021-04-24 22:45:48,300 INFO] Step 4250/50000; acc:  90.21; ppl:  1.55; xent: 0.44; lr: 0.00010; 10991/10301 tok/s;    786 sec
[2021-04-24 22:45:57,640 INFO] Step 4300/50000; acc:  90.48; ppl:  1.55; xent: 0.44; lr: 0.00010; 11020/10358 tok/s;    796 sec
[2021-04-24 22:46:06,667 INFO] Step 4350/50000; acc:  90.20; ppl:  1.55; xent: 0.44; lr: 0.00010; 10996/10385 tok/s;    805 sec
[2021-04-24 22:46:07,096 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 22:46:16,167 INFO] Step 4400/50000; acc:  90.36; ppl:  1.54; xent: 0.43; lr: 0.00010; 10952/10258 tok/s;    814 sec
[2021-04-24 22:46:25,321 INFO] Step 4450/50000; acc:  90.39; ppl:  1.55; xent: 0.44; lr: 0.00010; 11082/10454 tok/s;    823 sec
[2021-04-24 22:46:34,529 INFO] Step 4500/50000; acc:  90.35; ppl:  1.54; xent: 0.44; lr: 0.00010; 11074/10471 tok/s;    833 sec
[2021-04-24 22:46:43,730 INFO] Step 4550/50000; acc:  90.41; ppl:  1.54; xent: 0.43; lr: 0.00010; 10947/10379 tok/s;    842 sec
[2021-04-24 22:46:53,027 INFO] Step 4600/50000; acc:  90.54; ppl:  1.53; xent: 0.43; lr: 0.00010; 10922/10390 tok/s;    851 sec
[2021-04-24 22:47:02,155 INFO] Step 4650/50000; acc:  90.63; ppl:  1.53; xent: 0.42; lr: 0.00010; 10984/10308 tok/s;    860 sec
[2021-04-24 22:47:11,511 INFO] Step 4700/50000; acc:  90.94; ppl:  1.51; xent: 0.42; lr: 0.00010; 10898/10364 tok/s;    870 sec
[2021-04-24 22:47:20,888 INFO] Step 4750/50000; acc:  90.74; ppl:  1.51; xent: 0.41; lr: 0.00010; 11013/10402 tok/s;    879 sec
[2021-04-24 22:47:30,240 INFO] Step 4800/50000; acc:  90.70; ppl:  1.52; xent: 0.42; lr: 0.00010; 10777/10136 tok/s;    888 sec
[2021-04-24 22:47:39,636 INFO] Step 4850/50000; acc:  90.83; ppl:  1.51; xent: 0.41; lr: 0.00010; 10985/10295 tok/s;    898 sec
[2021-04-24 22:47:48,810 INFO] Step 4900/50000; acc:  90.80; ppl:  1.52; xent: 0.42; lr: 0.00010; 11026/10495 tok/s;    907 sec
[2021-04-24 22:47:58,161 INFO] Step 4950/50000; acc:  90.85; ppl:  1.51; xent: 0.41; lr: 0.00010; 11087/10373 tok/s;    916 sec
[2021-04-24 22:48:07,308 INFO] Step 5000/50000; acc:  90.95; ppl:  1.50; xent: 0.41; lr: 0.00010; 11001/10409 tok/s;    925 sec
[2021-04-24 22:48:07,309 INFO] valid's transforms: TransformPipe()
[2021-04-24 22:48:07,311 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/abstract_methods/valid_fixed.txt, align=None)...
[2021-04-24 22:48:16,108 INFO] Validation perplexity: 1.47109
[2021-04-24 22:48:16,108 INFO] Validation accuracy: 91.6229
[2021-04-24 22:48:16,110 INFO] Saving checkpoint ../models/group1_params/control/model_step_5000.pt
[2021-04-24 22:48:25,662 INFO] Step 5050/50000; acc:  91.03; ppl:  1.50; xent: 0.41; lr: 0.00010; 5424/5073 tok/s;    944 sec
[2021-04-24 22:48:32,686 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 22:48:34,986 INFO] Step 5100/50000; acc:  91.00; ppl:  1.50; xent: 0.41; lr: 0.00010; 11134/10469 tok/s;    953 sec
[2021-04-24 22:48:44,190 INFO] Step 5150/50000; acc:  90.98; ppl:  1.50; xent: 0.41; lr: 0.00010; 10788/10272 tok/s;    962 sec
[2021-04-24 22:48:53,561 INFO] Step 5200/50000; acc:  91.06; ppl:  1.50; xent: 0.41; lr: 0.00010; 11098/10414 tok/s;    972 sec
[2021-04-24 22:49:02,663 INFO] Step 5250/50000; acc:  90.99; ppl:  1.50; xent: 0.41; lr: 0.00010; 11008/10476 tok/s;    981 sec
[2021-04-24 22:49:12,156 INFO] Step 5300/50000; acc:  91.13; ppl:  1.49; xent: 0.40; lr: 0.00010; 10862/10222 tok/s;    990 sec
[2021-04-24 22:49:21,253 INFO] Step 5350/50000; acc:  91.21; ppl:  1.49; xent: 0.40; lr: 0.00010; 11041/10454 tok/s;    999 sec
[2021-04-24 22:49:30,607 INFO] Step 5400/50000; acc:  91.22; ppl:  1.49; xent: 0.40; lr: 0.00010; 10910/10345 tok/s;   1009 sec
[2021-04-24 22:49:39,881 INFO] Step 5450/50000; acc:  91.65; ppl:  1.46; xent: 0.38; lr: 0.00010; 11005/10373 tok/s;   1018 sec
[2021-04-24 22:49:49,177 INFO] Step 5500/50000; acc:  91.21; ppl:  1.48; xent: 0.39; lr: 0.00010; 10876/10307 tok/s;   1027 sec
[2021-04-24 22:49:58,582 INFO] Step 5550/50000; acc:  91.23; ppl:  1.48; xent: 0.40; lr: 0.00010; 10850/10175 tok/s;   1037 sec
[2021-04-24 22:50:07,937 INFO] Step 5600/50000; acc:  91.38; ppl:  1.47; xent: 0.39; lr: 0.00010; 10835/10241 tok/s;   1046 sec
[2021-04-24 22:50:17,229 INFO] Step 5650/50000; acc:  91.31; ppl:  1.47; xent: 0.39; lr: 0.00010; 11060/10458 tok/s;   1055 sec
[2021-04-24 22:50:26,456 INFO] Step 5700/50000; acc:  91.46; ppl:  1.46; xent: 0.38; lr: 0.00010; 11019/10309 tok/s;   1065 sec
[2021-04-24 22:50:35,752 INFO] Step 5750/50000; acc:  91.42; ppl:  1.47; xent: 0.39; lr: 0.00010; 11061/10371 tok/s;   1074 sec
[2021-04-24 22:50:44,864 INFO] Step 5800/50000; acc:  91.50; ppl:  1.46; xent: 0.38; lr: 0.00010; 10977/10372 tok/s;   1083 sec
[2021-04-24 22:50:49,225 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 22:50:54,095 INFO] Step 5850/50000; acc:  91.16; ppl:  1.48; xent: 0.39; lr: 0.00010; 10896/10255 tok/s;   1092 sec
[2021-04-24 22:51:03,581 INFO] Step 5900/50000; acc:  91.75; ppl:  1.46; xent: 0.38; lr: 0.00010; 10998/10360 tok/s;   1102 sec
[2021-04-24 22:51:12,424 INFO] Step 5950/50000; acc:  91.26; ppl:  1.47; xent: 0.39; lr: 0.00010; 11149/10607 tok/s;   1110 sec
[2021-04-24 22:51:21,899 INFO] Step 6000/50000; acc:  91.55; ppl:  1.46; xent: 0.38; lr: 0.00010; 10918/10300 tok/s;   1120 sec
[2021-04-24 22:51:31,181 INFO] Step 6050/50000; acc:  91.56; ppl:  1.46; xent: 0.38; lr: 0.00010; 10862/10309 tok/s;   1129 sec
[2021-04-24 22:51:40,381 INFO] Step 6100/50000; acc:  91.67; ppl:  1.46; xent: 0.38; lr: 0.00010; 11126/10430 tok/s;   1138 sec
[2021-04-24 22:51:49,743 INFO] Step 6150/50000; acc:  91.74; ppl:  1.45; xent: 0.37; lr: 0.00010; 10900/10387 tok/s;   1148 sec
[2021-04-24 22:51:59,025 INFO] Step 6200/50000; acc:  91.82; ppl:  1.44; xent: 0.37; lr: 0.00010; 11074/10401 tok/s;   1157 sec
[2021-04-24 22:52:08,298 INFO] Step 6250/50000; acc:  91.65; ppl:  1.45; xent: 0.37; lr: 0.00010; 10887/10253 tok/s;   1166 sec
[2021-04-24 22:52:17,593 INFO] Step 6300/50000; acc:  91.57; ppl:  1.46; xent: 0.38; lr: 0.00010; 10829/10244 tok/s;   1176 sec
[2021-04-24 22:52:26,876 INFO] Step 6350/50000; acc:  91.81; ppl:  1.45; xent: 0.37; lr: 0.00010; 11029/10384 tok/s;   1185 sec
[2021-04-24 22:52:36,001 INFO] Step 6400/50000; acc:  91.67; ppl:  1.45; xent: 0.37; lr: 0.00010; 11070/10484 tok/s;   1194 sec
[2021-04-24 22:52:45,419 INFO] Step 6450/50000; acc:  91.88; ppl:  1.43; xent: 0.36; lr: 0.00010; 10967/10281 tok/s;   1203 sec
[2021-04-24 22:52:54,564 INFO] Step 6500/50000; acc:  91.66; ppl:  1.46; xent: 0.38; lr: 0.00010; 11000/10289 tok/s;   1213 sec
[2021-04-24 22:53:03,906 INFO] Step 6550/50000; acc:  91.83; ppl:  1.44; xent: 0.36; lr: 0.00010; 11066/10431 tok/s;   1222 sec
[2021-04-24 22:53:05,557 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 22:53:13,114 INFO] Step 6600/50000; acc:  91.83; ppl:  1.45; xent: 0.37; lr: 0.00010; 10918/10262 tok/s;   1231 sec
[2021-04-24 22:53:22,189 INFO] Step 6650/50000; acc:  91.81; ppl:  1.44; xent: 0.37; lr: 0.00010; 11036/10491 tok/s;   1240 sec
[2021-04-24 22:53:31,448 INFO] Step 6700/50000; acc:  91.65; ppl:  1.45; xent: 0.37; lr: 0.00010; 11134/10507 tok/s;   1250 sec
[2021-04-24 22:53:40,597 INFO] Step 6750/50000; acc:  91.75; ppl:  1.44; xent: 0.36; lr: 0.00010; 10852/10325 tok/s;   1259 sec
[2021-04-24 22:53:49,971 INFO] Step 6800/50000; acc:  91.93; ppl:  1.43; xent: 0.36; lr: 0.00010; 11038/10386 tok/s;   1268 sec
[2021-04-24 22:53:59,130 INFO] Step 6850/50000; acc:  92.11; ppl:  1.43; xent: 0.36; lr: 0.00010; 10993/10427 tok/s;   1277 sec
[2021-04-24 22:54:08,664 INFO] Step 6900/50000; acc:  92.35; ppl:  1.41; xent: 0.34; lr: 0.00010; 11002/10338 tok/s;   1287 sec
[2021-04-24 22:54:17,815 INFO] Step 6950/50000; acc:  91.76; ppl:  1.44; xent: 0.36; lr: 0.00010; 11030/10383 tok/s;   1296 sec
[2021-04-24 22:54:27,193 INFO] Step 7000/50000; acc:  91.98; ppl:  1.43; xent: 0.36; lr: 0.00010; 10865/10284 tok/s;   1305 sec
[2021-04-24 22:54:36,476 INFO] Step 7050/50000; acc:  92.03; ppl:  1.43; xent: 0.36; lr: 0.00010; 10904/10256 tok/s;   1315 sec
[2021-04-24 22:54:45,634 INFO] Step 7100/50000; acc:  91.79; ppl:  1.44; xent: 0.36; lr: 0.00010; 10935/10406 tok/s;   1324 sec
[2021-04-24 22:54:54,951 INFO] Step 7150/50000; acc:  92.16; ppl:  1.41; xent: 0.34; lr: 0.00010; 11070/10386 tok/s;   1333 sec
[2021-04-24 22:55:04,151 INFO] Step 7200/50000; acc:  92.16; ppl:  1.42; xent: 0.35; lr: 0.00010; 10947/10290 tok/s;   1342 sec
[2021-04-24 22:55:13,459 INFO] Step 7250/50000; acc:  92.14; ppl:  1.42; xent: 0.35; lr: 0.00010; 10996/10377 tok/s;   1352 sec
[2021-04-24 22:55:15,948 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 22:55:22,650 INFO] Step 7300/50000; acc:  91.96; ppl:  1.43; xent: 0.36; lr: 0.00010; 11096/10373 tok/s;   1361 sec
[2021-04-24 22:55:32,048 INFO] Step 7350/50000; acc:  92.26; ppl:  1.41; xent: 0.35; lr: 0.00010; 10984/10368 tok/s;   1370 sec
[2021-04-24 22:55:41,047 INFO] Step 7400/50000; acc:  91.90; ppl:  1.44; xent: 0.36; lr: 0.00010; 11076/10501 tok/s;   1379 sec
[2021-04-24 22:55:50,245 INFO] Step 7450/50000; acc:  91.89; ppl:  1.43; xent: 0.36; lr: 0.00010; 10863/10348 tok/s;   1388 sec
[2021-04-24 22:55:59,537 INFO] Step 7500/50000; acc:  92.29; ppl:  1.41; xent: 0.34; lr: 0.00010; 11145/10462 tok/s;   1398 sec
[2021-04-24 22:56:08,589 INFO] Step 7550/50000; acc:  92.01; ppl:  1.42; xent: 0.35; lr: 0.00010; 10906/10375 tok/s;   1407 sec
[2021-04-24 22:56:18,017 INFO] Step 7600/50000; acc:  92.35; ppl:  1.41; xent: 0.34; lr: 0.00010; 11088/10395 tok/s;   1416 sec
[2021-04-24 22:56:27,368 INFO] Step 7650/50000; acc:  92.45; ppl:  1.40; xent: 0.34; lr: 0.00010; 10978/10389 tok/s;   1425 sec
[2021-04-24 22:56:36,727 INFO] Step 7700/50000; acc:  92.18; ppl:  1.41; xent: 0.34; lr: 0.00010; 10993/10359 tok/s;   1435 sec
[2021-04-24 22:56:45,976 INFO] Step 7750/50000; acc:  92.11; ppl:  1.42; xent: 0.35; lr: 0.00010; 10920/10256 tok/s;   1444 sec
[2021-04-24 22:56:55,247 INFO] Step 7800/50000; acc:  92.22; ppl:  1.41; xent: 0.35; lr: 0.00010; 10967/10376 tok/s;   1453 sec
[2021-04-24 22:57:04,393 INFO] Step 7850/50000; acc:  92.34; ppl:  1.41; xent: 0.34; lr: 0.00010; 11056/10457 tok/s;   1462 sec
[2021-04-24 22:57:13,629 INFO] Step 7900/50000; acc:  92.22; ppl:  1.41; xent: 0.34; lr: 0.00010; 10971/10305 tok/s;   1472 sec
[2021-04-24 22:57:22,868 INFO] Step 7950/50000; acc:  92.38; ppl:  1.41; xent: 0.34; lr: 0.00010; 10987/10351 tok/s;   1481 sec
[2021-04-24 22:57:32,088 INFO] Step 8000/50000; acc:  92.36; ppl:  1.40; xent: 0.34; lr: 0.00010; 10957/10317 tok/s;   1490 sec
[2021-04-24 22:57:32,098 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 22:57:41,534 INFO] Step 8050/50000; acc:  92.30; ppl:  1.40; xent: 0.34; lr: 0.00010; 10894/10229 tok/s;   1500 sec
[2021-04-24 22:57:50,686 INFO] Step 8100/50000; acc:  92.36; ppl:  1.41; xent: 0.34; lr: 0.00010; 11117/10486 tok/s;   1509 sec
[2021-04-24 22:57:59,929 INFO] Step 8150/50000; acc:  92.24; ppl:  1.41; xent: 0.35; lr: 0.00010; 11035/10449 tok/s;   1518 sec
[2021-04-24 22:58:09,109 INFO] Step 8200/50000; acc:  92.28; ppl:  1.40; xent: 0.34; lr: 0.00010; 10936/10345 tok/s;   1527 sec
[2021-04-24 22:58:18,284 INFO] Step 8250/50000; acc:  92.23; ppl:  1.41; xent: 0.34; lr: 0.00010; 10924/10409 tok/s;   1536 sec
[2021-04-24 22:58:27,647 INFO] Step 8300/50000; acc:  92.45; ppl:  1.40; xent: 0.33; lr: 0.00010; 11002/10304 tok/s;   1546 sec
[2021-04-24 22:58:36,930 INFO] Step 8350/50000; acc:  92.53; ppl:  1.40; xent: 0.33; lr: 0.00010; 10847/10349 tok/s;   1555 sec
[2021-04-24 22:58:46,425 INFO] Step 8400/50000; acc:  92.49; ppl:  1.39; xent: 0.33; lr: 0.00010; 11030/10363 tok/s;   1565 sec
[2021-04-24 22:58:55,788 INFO] Step 8450/50000; acc:  92.45; ppl:  1.39; xent: 0.33; lr: 0.00010; 10745/10109 tok/s;   1574 sec
[2021-04-24 22:59:05,196 INFO] Step 8500/50000; acc:  92.40; ppl:  1.40; xent: 0.34; lr: 0.00010; 11008/10291 tok/s;   1583 sec
[2021-04-24 22:59:14,369 INFO] Step 8550/50000; acc:  92.52; ppl:  1.40; xent: 0.33; lr: 0.00010; 10977/10492 tok/s;   1592 sec
[2021-04-24 22:59:23,640 INFO] Step 8600/50000; acc:  92.37; ppl:  1.39; xent: 0.33; lr: 0.00010; 11040/10342 tok/s;   1602 sec
[2021-04-24 22:59:32,795 INFO] Step 8650/50000; acc:  92.50; ppl:  1.39; xent: 0.33; lr: 0.00010; 11018/10423 tok/s;   1611 sec
[2021-04-24 22:59:41,899 INFO] Step 8700/50000; acc:  92.56; ppl:  1.39; xent: 0.33; lr: 0.00010; 10963/10297 tok/s;   1620 sec
[2021-04-24 22:59:48,549 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 22:59:51,209 INFO] Step 8750/50000; acc:  92.44; ppl:  1.39; xent: 0.33; lr: 0.00010; 11051/10385 tok/s;   1629 sec
[2021-04-24 23:00:00,413 INFO] Step 8800/50000; acc:  92.52; ppl:  1.40; xent: 0.33; lr: 0.00010; 10956/10343 tok/s;   1638 sec
[2021-04-24 23:00:09,704 INFO] Step 8850/50000; acc:  92.50; ppl:  1.39; xent: 0.33; lr: 0.00010; 11065/10450 tok/s;   1648 sec
[2021-04-24 23:00:18,879 INFO] Step 8900/50000; acc:  92.42; ppl:  1.40; xent: 0.34; lr: 0.00010; 10973/10434 tok/s;   1657 sec
[2021-04-24 23:00:28,271 INFO] Step 8950/50000; acc:  92.52; ppl:  1.39; xent: 0.33; lr: 0.00010; 10970/10321 tok/s;   1666 sec
[2021-04-24 23:00:37,370 INFO] Step 9000/50000; acc:  92.34; ppl:  1.40; xent: 0.33; lr: 0.00010; 10986/10400 tok/s;   1675 sec
[2021-04-24 23:00:46,625 INFO] Step 9050/50000; acc:  92.51; ppl:  1.39; xent: 0.33; lr: 0.00010; 10903/10355 tok/s;   1685 sec
[2021-04-24 23:00:56,080 INFO] Step 9100/50000; acc:  92.85; ppl:  1.37; xent: 0.31; lr: 0.00010; 11078/10441 tok/s;   1694 sec
[2021-04-24 23:01:05,315 INFO] Step 9150/50000; acc:  92.47; ppl:  1.39; xent: 0.33; lr: 0.00010; 10816/10229 tok/s;   1703 sec
[2021-04-24 23:01:14,760 INFO] Step 9200/50000; acc:  92.64; ppl:  1.38; xent: 0.32; lr: 0.00010; 10959/10254 tok/s;   1713 sec
[2021-04-24 23:01:24,090 INFO] Step 9250/50000; acc:  92.66; ppl:  1.38; xent: 0.32; lr: 0.00010; 10855/10250 tok/s;   1722 sec
[2021-04-24 23:01:33,352 INFO] Step 9300/50000; acc:  92.55; ppl:  1.39; xent: 0.33; lr: 0.00010; 11119/10519 tok/s;   1731 sec
[2021-04-24 23:01:42,522 INFO] Step 9350/50000; acc:  92.70; ppl:  1.37; xent: 0.32; lr: 0.00010; 11048/10337 tok/s;   1741 sec
[2021-04-24 23:01:51,755 INFO] Step 9400/50000; acc:  92.62; ppl:  1.39; xent: 0.33; lr: 0.00010; 10987/10331 tok/s;   1750 sec
[2021-04-24 23:02:00,875 INFO] Step 9450/50000; acc:  92.61; ppl:  1.38; xent: 0.32; lr: 0.00010; 11020/10409 tok/s;   1759 sec
[2021-04-24 23:02:04,897 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 23:02:10,109 INFO] Step 9500/50000; acc:  92.42; ppl:  1.39; xent: 0.33; lr: 0.00010; 10905/10284 tok/s;   1768 sec
[2021-04-24 23:02:19,422 INFO] Step 9550/50000; acc:  92.82; ppl:  1.37; xent: 0.32; lr: 0.00010; 11076/10417 tok/s;   1777 sec
[2021-04-24 23:02:28,458 INFO] Step 9600/50000; acc:  92.42; ppl:  1.39; xent: 0.33; lr: 0.00010; 11102/10557 tok/s;   1787 sec
[2021-04-24 23:02:37,874 INFO] Step 9650/50000; acc:  92.55; ppl:  1.38; xent: 0.32; lr: 0.00010; 10869/10233 tok/s;   1796 sec
[2021-04-24 23:02:47,121 INFO] Step 9700/50000; acc:  92.70; ppl:  1.38; xent: 0.32; lr: 0.00010; 10932/10398 tok/s;   1805 sec
[2021-04-24 23:02:56,400 INFO] Step 9750/50000; acc:  92.71; ppl:  1.38; xent: 0.32; lr: 0.00010; 11047/10369 tok/s;   1814 sec
[2021-04-24 23:03:05,737 INFO] Step 9800/50000; acc:  92.83; ppl:  1.37; xent: 0.31; lr: 0.00010; 10889/10368 tok/s;   1824 sec
[2021-04-24 23:03:14,886 INFO] Step 9850/50000; acc:  92.73; ppl:  1.37; xent: 0.32; lr: 0.00010; 11071/10404 tok/s;   1833 sec
[2021-04-24 23:03:24,376 INFO] Step 9900/50000; acc:  92.69; ppl:  1.37; xent: 0.32; lr: 0.00010; 10905/10235 tok/s;   1842 sec
[2021-04-24 23:03:33,574 INFO] Step 9950/50000; acc:  92.66; ppl:  1.38; xent: 0.32; lr: 0.00010; 10830/10284 tok/s;   1852 sec
[2021-04-24 23:03:42,942 INFO] Step 10000/50000; acc:  92.78; ppl:  1.37; xent: 0.32; lr: 0.00010; 11088/10397 tok/s;   1861 sec
[2021-04-24 23:03:42,945 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/abstract_methods/valid_fixed.txt, align=None)...
[2021-04-24 23:03:51,724 INFO] Validation perplexity: 1.36485
[2021-04-24 23:03:51,724 INFO] Validation accuracy: 92.9276
[2021-04-24 23:03:51,726 INFO] Saving checkpoint ../models/group1_params/control/model_step_10000.pt
[2021-04-24 23:04:01,315 INFO] Step 10050/50000; acc:  92.63; ppl:  1.38; xent: 0.32; lr: 0.00010; 5502/5210 tok/s;   1879 sec
[2021-04-24 23:04:10,708 INFO] Step 10100/50000; acc:  92.80; ppl:  1.36; xent: 0.31; lr: 0.00010; 11002/10313 tok/s;   1889 sec
[2021-04-24 23:04:19,764 INFO] Step 10150/50000; acc:  92.79; ppl:  1.37; xent: 0.32; lr: 0.00010; 11066/10406 tok/s;   1898 sec
[2021-04-24 23:04:28,987 INFO] Step 10200/50000; acc:  92.61; ppl:  1.38; xent: 0.32; lr: 0.00010; 11060/10424 tok/s;   1907 sec
[2021-04-24 23:04:30,283 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 23:04:38,243 INFO] Step 10250/50000; acc:  92.76; ppl:  1.37; xent: 0.32; lr: 0.00010; 10932/10297 tok/s;   1916 sec
[2021-04-24 23:04:47,254 INFO] Step 10300/50000; acc:  92.79; ppl:  1.37; xent: 0.32; lr: 0.00010; 11116/10508 tok/s;   1925 sec
[2021-04-24 23:04:56,438 INFO] Step 10350/50000; acc:  92.59; ppl:  1.38; xent: 0.32; lr: 0.00010; 11113/10562 tok/s;   1935 sec
[2021-04-24 23:05:05,590 INFO] Step 10400/50000; acc:  92.71; ppl:  1.37; xent: 0.31; lr: 0.00010; 10997/10399 tok/s;   1944 sec
[2021-04-24 23:05:14,955 INFO] Step 10450/50000; acc:  92.79; ppl:  1.37; xent: 0.31; lr: 0.00010; 10940/10294 tok/s;   1953 sec
[2021-04-24 23:05:24,122 INFO] Step 10500/50000; acc:  92.86; ppl:  1.37; xent: 0.31; lr: 0.00010; 11033/10449 tok/s;   1962 sec
[2021-04-24 23:05:33,666 INFO] Step 10550/50000; acc:  93.09; ppl:  1.35; xent: 0.30; lr: 0.00010; 11009/10366 tok/s;   1972 sec
[2021-04-24 23:05:42,813 INFO] Step 10600/50000; acc:  92.79; ppl:  1.37; xent: 0.31; lr: 0.00010; 10961/10354 tok/s;   1981 sec
[2021-04-24 23:05:52,079 INFO] Step 10650/50000; acc:  92.86; ppl:  1.37; xent: 0.31; lr: 0.00010; 10864/10305 tok/s;   1990 sec
[2021-04-24 23:06:01,521 INFO] Step 10700/50000; acc:  92.91; ppl:  1.36; xent: 0.31; lr: 0.00010; 10993/10270 tok/s;   2000 sec
[2021-04-24 23:06:10,569 INFO] Step 10750/50000; acc:  92.64; ppl:  1.38; xent: 0.32; lr: 0.00010; 10936/10456 tok/s;   2009 sec
[2021-04-24 23:06:19,963 INFO] Step 10800/50000; acc:  92.94; ppl:  1.35; xent: 0.30; lr: 0.00010; 11149/10413 tok/s;   2018 sec
[2021-04-24 23:06:29,232 INFO] Step 10850/50000; acc:  92.99; ppl:  1.36; xent: 0.31; lr: 0.00010; 10875/10211 tok/s;   2027 sec
[2021-04-24 23:06:38,503 INFO] Step 10900/50000; acc:  92.89; ppl:  1.36; xent: 0.31; lr: 0.00010; 11058/10381 tok/s;   2037 sec
[2021-04-24 23:06:40,563 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 23:06:47,647 INFO] Step 10950/50000; acc:  92.83; ppl:  1.37; xent: 0.31; lr: 0.00010; 11087/10430 tok/s;   2046 sec
[2021-04-24 23:06:56,933 INFO] Step 11000/50000; acc:  92.92; ppl:  1.36; xent: 0.31; lr: 0.00010; 10979/10396 tok/s;   2055 sec
[2021-04-24 23:07:05,952 INFO] Step 11050/50000; acc:  92.74; ppl:  1.37; xent: 0.32; lr: 0.00010; 11098/10516 tok/s;   2064 sec
[2021-04-24 23:07:15,127 INFO] Step 11100/50000; acc:  92.70; ppl:  1.37; xent: 0.31; lr: 0.00010; 10915/10368 tok/s;   2073 sec
[2021-04-24 23:07:24,429 INFO] Step 11150/50000; acc:  92.92; ppl:  1.36; xent: 0.31; lr: 0.00010; 11012/10400 tok/s;   2083 sec
[2021-04-24 23:07:33,518 INFO] Step 11200/50000; acc:  92.84; ppl:  1.37; xent: 0.31; lr: 0.00010; 11024/10410 tok/s;   2092 sec
[2021-04-24 23:07:42,878 INFO] Step 11250/50000; acc:  93.11; ppl:  1.35; xent: 0.30; lr: 0.00010; 11076/10445 tok/s;   2101 sec
[2021-04-24 23:07:52,159 INFO] Step 11300/50000; acc:  93.15; ppl:  1.35; xent: 0.30; lr: 0.00010; 11086/10444 tok/s;   2110 sec
[2021-04-24 23:08:01,544 INFO] Step 11350/50000; acc:  92.99; ppl:  1.35; xent: 0.30; lr: 0.00010; 10960/10347 tok/s;   2120 sec
[2021-04-24 23:08:10,762 INFO] Step 11400/50000; acc:  92.83; ppl:  1.36; xent: 0.31; lr: 0.00010; 10914/10257 tok/s;   2129 sec
[2021-04-24 23:08:19,897 INFO] Step 11450/50000; acc:  92.92; ppl:  1.36; xent: 0.31; lr: 0.00010; 10985/10432 tok/s;   2138 sec
[2021-04-24 23:08:29,234 INFO] Step 11500/50000; acc:  93.01; ppl:  1.35; xent: 0.30; lr: 0.00010; 11123/10451 tok/s;   2147 sec
[2021-04-24 23:08:38,382 INFO] Step 11550/50000; acc:  92.89; ppl:  1.35; xent: 0.30; lr: 0.00010; 10937/10305 tok/s;   2156 sec
[2021-04-24 23:08:47,676 INFO] Step 11600/50000; acc:  93.12; ppl:  1.35; xent: 0.30; lr: 0.00010; 11088/10402 tok/s;   2166 sec
[2021-04-24 23:08:56,456 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 23:08:56,922 INFO] Step 11650/50000; acc:  92.91; ppl:  1.35; xent: 0.30; lr: 0.00010; 10941/10312 tok/s;   2175 sec
[2021-04-24 23:09:06,327 INFO] Step 11700/50000; acc:  92.92; ppl:  1.36; xent: 0.31; lr: 0.00010; 10936/10244 tok/s;   2184 sec
[2021-04-24 23:09:15,473 INFO] Step 11750/50000; acc:  93.04; ppl:  1.35; xent: 0.30; lr: 0.00010; 11075/10494 tok/s;   2194 sec
[2021-04-24 23:09:24,617 INFO] Step 11800/50000; acc:  92.83; ppl:  1.36; xent: 0.31; lr: 0.00010; 11007/10442 tok/s;   2203 sec
[2021-04-24 23:09:33,823 INFO] Step 11850/50000; acc:  92.84; ppl:  1.36; xent: 0.31; lr: 0.00010; 10955/10337 tok/s;   2212 sec
[2021-04-24 23:09:43,031 INFO] Step 11900/50000; acc:  92.90; ppl:  1.35; xent: 0.30; lr: 0.00010; 10912/10408 tok/s;   2221 sec
[2021-04-24 23:09:52,324 INFO] Step 11950/50000; acc:  93.06; ppl:  1.35; xent: 0.30; lr: 0.00010; 10991/10295 tok/s;   2230 sec
[2021-04-24 23:10:01,672 INFO] Step 12000/50000; acc:  93.16; ppl:  1.34; xent: 0.30; lr: 0.00010; 10933/10403 tok/s;   2240 sec
[2021-04-24 23:10:11,108 INFO] Step 12050/50000; acc:  93.02; ppl:  1.35; xent: 0.30; lr: 0.00010; 10980/10328 tok/s;   2249 sec
[2021-04-24 23:10:20,411 INFO] Step 12100/50000; acc:  92.95; ppl:  1.35; xent: 0.30; lr: 0.00010; 10835/10179 tok/s;   2258 sec
[2021-04-24 23:10:29,840 INFO] Step 12150/50000; acc:  93.08; ppl:  1.35; xent: 0.30; lr: 0.00010; 11001/10307 tok/s;   2268 sec
[2021-04-24 23:10:39,039 INFO] Step 12200/50000; acc:  93.00; ppl:  1.35; xent: 0.30; lr: 0.00010; 10892/10416 tok/s;   2277 sec
[2021-04-24 23:10:48,199 INFO] Step 12250/50000; acc:  93.12; ppl:  1.34; xent: 0.30; lr: 0.00010; 11030/10353 tok/s;   2286 sec
[2021-04-24 23:10:57,562 INFO] Step 12300/50000; acc:  93.17; ppl:  1.35; xent: 0.30; lr: 0.00010; 11041/10407 tok/s;   2296 sec
[2021-04-24 23:11:06,581 INFO] Step 12350/50000; acc:  93.10; ppl:  1.34; xent: 0.29; lr: 0.00010; 10940/10308 tok/s;   2305 sec
[2021-04-24 23:11:12,876 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 23:11:15,964 INFO] Step 12400/50000; acc:  93.11; ppl:  1.35; xent: 0.30; lr: 0.00010; 11122/10357 tok/s;   2314 sec
[2021-04-24 23:11:25,207 INFO] Step 12450/50000; acc:  93.08; ppl:  1.35; xent: 0.30; lr: 0.00010; 10940/10363 tok/s;   2323 sec
[2021-04-24 23:11:34,444 INFO] Step 12500/50000; acc:  93.04; ppl:  1.35; xent: 0.30; lr: 0.00010; 11127/10485 tok/s;   2333 sec
[2021-04-24 23:11:43,539 INFO] Step 12550/50000; acc:  93.01; ppl:  1.35; xent: 0.30; lr: 0.00010; 11030/10522 tok/s;   2342 sec
[2021-04-24 23:11:52,882 INFO] Step 12600/50000; acc:  92.94; ppl:  1.35; xent: 0.30; lr: 0.00010; 10863/10273 tok/s;   2351 sec
[2021-04-24 23:12:02,004 INFO] Step 12650/50000; acc:  92.98; ppl:  1.35; xent: 0.30; lr: 0.00010; 11026/10425 tok/s;   2360 sec
[2021-04-24 23:12:11,300 INFO] Step 12700/50000; acc:  93.06; ppl:  1.35; xent: 0.30; lr: 0.00010; 10893/10318 tok/s;   2369 sec
[2021-04-24 23:12:20,646 INFO] Step 12750/50000; acc:  93.37; ppl:  1.33; xent: 0.29; lr: 0.00010; 11072/10433 tok/s;   2379 sec
[2021-04-24 23:12:29,996 INFO] Step 12800/50000; acc:  93.02; ppl:  1.35; xent: 0.30; lr: 0.00010; 10841/10258 tok/s;   2388 sec
[2021-04-24 23:12:39,412 INFO] Step 12850/50000; acc:  93.15; ppl:  1.34; xent: 0.29; lr: 0.00010; 10888/10192 tok/s;   2397 sec
[2021-04-24 23:12:48,707 INFO] Step 12900/50000; acc:  93.21; ppl:  1.34; xent: 0.29; lr: 0.00010; 10912/10309 tok/s;   2407 sec
[2021-04-24 23:12:58,005 INFO] Step 12950/50000; acc:  93.09; ppl:  1.35; xent: 0.30; lr: 0.00010; 11116/10490 tok/s;   2416 sec
[2021-04-24 23:13:07,126 INFO] Step 13000/50000; acc:  93.28; ppl:  1.33; xent: 0.28; lr: 0.00010; 11034/10382 tok/s;   2425 sec
[2021-04-24 23:13:16,191 INFO] Step 13050/50000; acc:  93.07; ppl:  1.35; xent: 0.30; lr: 0.00010; 11043/10344 tok/s;   2434 sec
[2021-04-24 23:13:25,521 INFO] Step 13100/50000; acc:  93.13; ppl:  1.34; xent: 0.29; lr: 0.00010; 11064/10424 tok/s;   2444 sec
[2021-04-24 23:13:29,124 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 23:13:34,678 INFO] Step 13150/50000; acc:  93.07; ppl:  1.35; xent: 0.30; lr: 0.00010; 10861/10292 tok/s;   2453 sec
[2021-04-24 23:13:44,068 INFO] Step 13200/50000; acc:  93.33; ppl:  1.34; xent: 0.29; lr: 0.00010; 11137/10419 tok/s;   2462 sec
[2021-04-24 23:13:53,158 INFO] Step 13250/50000; acc:  92.98; ppl:  1.35; xent: 0.30; lr: 0.00010; 11052/10516 tok/s;   2471 sec
[2021-04-24 23:14:02,575 INFO] Step 13300/50000; acc:  93.08; ppl:  1.34; xent: 0.29; lr: 0.00010; 10881/10228 tok/s;   2481 sec
[2021-04-24 23:14:11,766 INFO] Step 13350/50000; acc:  93.14; ppl:  1.34; xent: 0.29; lr: 0.00010; 10950/10436 tok/s;   2490 sec
[2021-04-24 23:14:21,000 INFO] Step 13400/50000; acc:  93.12; ppl:  1.34; xent: 0.29; lr: 0.00010; 10955/10360 tok/s;   2499 sec
[2021-04-24 23:14:30,385 INFO] Step 13450/50000; acc:  93.34; ppl:  1.33; xent: 0.29; lr: 0.00010; 10897/10323 tok/s;   2508 sec
[2021-04-24 23:14:39,576 INFO] Step 13500/50000; acc:  93.12; ppl:  1.34; xent: 0.29; lr: 0.00010; 11013/10325 tok/s;   2518 sec
[2021-04-24 23:14:49,010 INFO] Step 13550/50000; acc:  93.25; ppl:  1.34; xent: 0.29; lr: 0.00010; 10871/10274 tok/s;   2527 sec
[2021-04-24 23:14:58,257 INFO] Step 13600/50000; acc:  93.20; ppl:  1.34; xent: 0.29; lr: 0.00010; 10944/10322 tok/s;   2536 sec
[2021-04-24 23:15:07,541 INFO] Step 13650/50000; acc:  93.24; ppl:  1.34; xent: 0.29; lr: 0.00010; 11065/10415 tok/s;   2546 sec
[2021-04-24 23:15:16,755 INFO] Step 13700/50000; acc:  93.29; ppl:  1.34; xent: 0.29; lr: 0.00010; 11017/10420 tok/s;   2555 sec
[2021-04-24 23:15:26,150 INFO] Step 13750/50000; acc:  93.30; ppl:  1.33; xent: 0.28; lr: 0.00010; 10986/10296 tok/s;   2564 sec
[2021-04-24 23:15:35,206 INFO] Step 13800/50000; acc:  93.20; ppl:  1.34; xent: 0.29; lr: 0.00010; 11035/10378 tok/s;   2573 sec
[2021-04-24 23:15:44,314 INFO] Step 13850/50000; acc:  93.17; ppl:  1.34; xent: 0.29; lr: 0.00010; 11041/10423 tok/s;   2582 sec
[2021-04-24 23:15:45,424 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 23:15:53,809 INFO] Step 13900/50000; acc:  93.38; ppl:  1.33; xent: 0.29; lr: 0.00010; 10943/10271 tok/s;   2592 sec
[2021-04-24 23:16:02,737 INFO] Step 13950/50000; acc:  93.20; ppl:  1.34; xent: 0.29; lr: 0.00010; 11082/10493 tok/s;   2601 sec
[2021-04-24 23:16:12,065 INFO] Step 14000/50000; acc:  93.13; ppl:  1.34; xent: 0.29; lr: 0.00010; 11087/10484 tok/s;   2610 sec
[2021-04-24 23:16:21,231 INFO] Step 14050/50000; acc:  93.29; ppl:  1.33; xent: 0.29; lr: 0.00010; 11019/10435 tok/s;   2619 sec
[2021-04-24 23:16:30,495 INFO] Step 14100/50000; acc:  93.24; ppl:  1.33; xent: 0.29; lr: 0.00010; 11051/10445 tok/s;   2629 sec
[2021-04-24 23:16:39,658 INFO] Step 14150/50000; acc:  93.18; ppl:  1.33; xent: 0.29; lr: 0.00010; 11008/10398 tok/s;   2638 sec
[2021-04-24 23:16:49,154 INFO] Step 14200/50000; acc:  93.54; ppl:  1.32; xent: 0.28; lr: 0.00010; 10921/10295 tok/s;   2647 sec
[2021-04-24 23:16:58,359 INFO] Step 14250/50000; acc:  93.21; ppl:  1.33; xent: 0.29; lr: 0.00010; 10950/10342 tok/s;   2656 sec
[2021-04-24 23:17:07,618 INFO] Step 14300/50000; acc:  93.20; ppl:  1.34; xent: 0.29; lr: 0.00010; 10872/10323 tok/s;   2666 sec
[2021-04-24 23:17:16,935 INFO] Step 14350/50000; acc:  93.34; ppl:  1.33; xent: 0.29; lr: 0.00010; 11033/10306 tok/s;   2675 sec
[2021-04-24 23:17:26,136 INFO] Step 14400/50000; acc:  93.15; ppl:  1.34; xent: 0.29; lr: 0.00010; 10927/10448 tok/s;   2684 sec
[2021-04-24 23:17:35,485 INFO] Step 14450/50000; acc:  93.34; ppl:  1.32; xent: 0.28; lr: 0.00010; 11091/10362 tok/s;   2694 sec
[2021-04-24 23:17:44,695 INFO] Step 14500/50000; acc:  93.28; ppl:  1.33; xent: 0.29; lr: 0.00010; 10955/10296 tok/s;   2703 sec
[2021-04-24 23:17:54,055 INFO] Step 14550/50000; acc:  93.37; ppl:  1.32; xent: 0.28; lr: 0.00010; 10969/10276 tok/s;   2712 sec
[2021-04-24 23:17:55,723 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 23:18:03,169 INFO] Step 14600/50000; acc:  93.27; ppl:  1.34; xent: 0.29; lr: 0.00010; 11064/10409 tok/s;   2721 sec
[2021-04-24 23:18:12,330 INFO] Step 14650/50000; acc:  93.33; ppl:  1.33; xent: 0.29; lr: 0.00010; 10990/10420 tok/s;   2730 sec
[2021-04-24 23:18:21,623 INFO] Step 14700/50000; acc:  93.31; ppl:  1.33; xent: 0.29; lr: 0.00010; 11061/10435 tok/s;   2740 sec
[2021-04-24 23:18:30,682 INFO] Step 14750/50000; acc:  93.07; ppl:  1.34; xent: 0.29; lr: 0.00010; 10941/10426 tok/s;   2749 sec
[2021-04-24 23:18:40,104 INFO] Step 14800/50000; acc:  93.35; ppl:  1.32; xent: 0.28; lr: 0.00010; 11015/10348 tok/s;   2758 sec
[2021-04-24 23:18:49,228 INFO] Step 14850/50000; acc:  93.26; ppl:  1.33; xent: 0.29; lr: 0.00010; 10984/10414 tok/s;   2767 sec
[2021-04-24 23:18:58,593 INFO] Step 14900/50000; acc:  93.58; ppl:  1.32; xent: 0.28; lr: 0.00010; 11103/10473 tok/s;   2777 sec
[2021-04-24 23:19:07,896 INFO] Step 14950/50000; acc:  93.52; ppl:  1.32; xent: 0.28; lr: 0.00010; 10990/10371 tok/s;   2786 sec
[2021-04-24 23:19:17,223 INFO] Step 15000/50000; acc:  93.29; ppl:  1.33; xent: 0.28; lr: 0.00010; 10887/10258 tok/s;   2795 sec
[2021-04-24 23:19:17,226 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/abstract_methods/valid_fixed.txt, align=None)...
[2021-04-24 23:19:25,998 INFO] Validation perplexity: 1.34298
[2021-04-24 23:19:25,998 INFO] Validation accuracy: 93.2873
[2021-04-24 23:19:26,000 INFO] Saving checkpoint ../models/group1_params/control/model_step_15000.pt
[2021-04-24 23:19:35,660 INFO] Step 15050/50000; acc:  93.23; ppl:  1.33; xent: 0.29; lr: 0.00010; 5485/5161 tok/s;   2814 sec
[2021-04-24 23:19:44,847 INFO] Step 15100/50000; acc:  93.37; ppl:  1.33; xent: 0.28; lr: 0.00010; 10943/10391 tok/s;   2823 sec
[2021-04-24 23:19:54,124 INFO] Step 15150/50000; acc:  93.40; ppl:  1.32; xent: 0.28; lr: 0.00010; 11076/10452 tok/s;   2832 sec
[2021-04-24 23:20:03,333 INFO] Step 15200/50000; acc:  93.44; ppl:  1.32; xent: 0.28; lr: 0.00010; 11038/10370 tok/s;   2841 sec
[2021-04-24 23:20:12,606 INFO] Step 15250/50000; acc:  93.47; ppl:  1.32; xent: 0.28; lr: 0.00010; 10983/10302 tok/s;   2851 sec
[2021-04-24 23:20:21,029 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 23:20:21,835 INFO] Step 15300/50000; acc:  93.25; ppl:  1.33; xent: 0.28; lr: 0.00010; 10989/10350 tok/s;   2860 sec
[2021-04-24 23:20:31,272 INFO] Step 15350/50000; acc:  93.40; ppl:  1.32; xent: 0.28; lr: 0.00010; 10918/10267 tok/s;   2869 sec
[2021-04-24 23:20:40,393 INFO] Step 15400/50000; acc:  93.40; ppl:  1.33; xent: 0.28; lr: 0.00010; 11061/10442 tok/s;   2878 sec
[2021-04-24 23:20:49,429 INFO] Step 15450/50000; acc:  93.15; ppl:  1.34; xent: 0.29; lr: 0.00010; 10994/10455 tok/s;   2888 sec
[2021-04-24 23:20:58,833 INFO] Step 15500/50000; acc:  93.35; ppl:  1.32; xent: 0.28; lr: 0.00010; 11005/10337 tok/s;   2897 sec
[2021-04-24 23:21:07,970 INFO] Step 15550/50000; acc:  93.31; ppl:  1.33; xent: 0.28; lr: 0.00010; 10858/10366 tok/s;   2906 sec
[2021-04-24 23:21:17,347 INFO] Step 15600/50000; acc:  93.42; ppl:  1.32; xent: 0.28; lr: 0.00010; 11063/10400 tok/s;   2915 sec
[2021-04-24 23:21:26,685 INFO] Step 15650/50000; acc:  93.62; ppl:  1.32; xent: 0.27; lr: 0.00010; 10945/10398 tok/s;   2925 sec
[2021-04-24 23:21:36,091 INFO] Step 15700/50000; acc:  93.40; ppl:  1.32; xent: 0.28; lr: 0.00010; 11041/10351 tok/s;   2934 sec
[2021-04-24 23:21:45,385 INFO] Step 15750/50000; acc:  93.33; ppl:  1.32; xent: 0.28; lr: 0.00010; 10781/10129 tok/s;   2943 sec
[2021-04-24 23:21:54,702 INFO] Step 15800/50000; acc:  93.42; ppl:  1.32; xent: 0.28; lr: 0.00010; 10991/10355 tok/s;   2953 sec
[2021-04-24 23:22:03,875 INFO] Step 15850/50000; acc:  93.43; ppl:  1.32; xent: 0.28; lr: 0.00010; 10964/10445 tok/s;   2962 sec
[2021-04-24 23:22:13,099 INFO] Step 15900/50000; acc:  93.40; ppl:  1.31; xent: 0.27; lr: 0.00010; 10961/10278 tok/s;   2971 sec
[2021-04-24 23:22:22,414 INFO] Step 15950/50000; acc:  93.45; ppl:  1.32; xent: 0.28; lr: 0.00010; 10992/10394 tok/s;   2980 sec
[2021-04-24 23:22:31,558 INFO] Step 16000/50000; acc:  93.52; ppl:  1.31; xent: 0.27; lr: 0.00010; 10964/10277 tok/s;   2990 sec
[2021-04-24 23:22:37,460 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 23:22:40,913 INFO] Step 16050/50000; acc:  93.37; ppl:  1.32; xent: 0.28; lr: 0.00010; 11047/10340 tok/s;   2999 sec
[2021-04-24 23:22:50,196 INFO] Step 16100/50000; acc:  93.42; ppl:  1.32; xent: 0.28; lr: 0.00010; 10937/10362 tok/s;   3008 sec
[2021-04-24 23:22:59,449 INFO] Step 16150/50000; acc:  93.48; ppl:  1.32; xent: 0.28; lr: 0.00010; 11101/10457 tok/s;   3018 sec
[2021-04-24 23:23:08,567 INFO] Step 16200/50000; acc:  93.30; ppl:  1.33; xent: 0.28; lr: 0.00010; 10953/10443 tok/s;   3027 sec
[2021-04-24 23:23:17,826 INFO] Step 16250/50000; acc:  93.35; ppl:  1.32; xent: 0.28; lr: 0.00010; 10817/10257 tok/s;   3036 sec
[2021-04-24 23:23:27,139 INFO] Step 16300/50000; acc:  93.42; ppl:  1.32; xent: 0.28; lr: 0.00010; 11079/10403 tok/s;   3045 sec
[2021-04-24 23:23:36,367 INFO] Step 16350/50000; acc:  93.37; ppl:  1.32; xent: 0.28; lr: 0.00010; 10850/10331 tok/s;   3054 sec
[2021-04-24 23:23:45,847 INFO] Step 16400/50000; acc:  93.76; ppl:  1.31; xent: 0.27; lr: 0.00010; 11091/10355 tok/s;   3064 sec
[2021-04-24 23:23:55,240 INFO] Step 16450/50000; acc:  93.38; ppl:  1.32; xent: 0.28; lr: 0.00010; 10777/10246 tok/s;   3073 sec
[2021-04-24 23:24:04,610 INFO] Step 16500/50000; acc:  93.53; ppl:  1.32; xent: 0.27; lr: 0.00010; 10969/10257 tok/s;   3083 sec
[2021-04-24 23:24:13,825 INFO] Step 16550/50000; acc:  93.51; ppl:  1.31; xent: 0.27; lr: 0.00010; 10953/10376 tok/s;   3092 sec
[2021-04-24 23:24:22,970 INFO] Step 16600/50000; acc:  93.38; ppl:  1.32; xent: 0.28; lr: 0.00010; 11153/10536 tok/s;   3101 sec
[2021-04-24 23:24:32,121 INFO] Step 16650/50000; acc:  93.60; ppl:  1.31; xent: 0.27; lr: 0.00010; 11052/10407 tok/s;   3110 sec
[2021-04-24 23:24:41,209 INFO] Step 16700/50000; acc:  93.47; ppl:  1.32; xent: 0.28; lr: 0.00010; 11031/10331 tok/s;   3119 sec
[2021-04-24 23:24:50,503 INFO] Step 16750/50000; acc:  93.42; ppl:  1.32; xent: 0.27; lr: 0.00010; 10986/10399 tok/s;   3129 sec
[2021-04-24 23:24:53,775 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 23:24:59,780 INFO] Step 16800/50000; acc:  93.45; ppl:  1.32; xent: 0.27; lr: 0.00010; 10913/10266 tok/s;   3138 sec
[2021-04-24 23:25:09,110 INFO] Step 16850/50000; acc:  93.58; ppl:  1.32; xent: 0.27; lr: 0.00010; 11054/10411 tok/s;   3147 sec
[2021-04-24 23:25:18,221 INFO] Step 16900/50000; acc:  93.44; ppl:  1.32; xent: 0.28; lr: 0.00010; 11068/10493 tok/s;   3156 sec
[2021-04-24 23:25:27,648 INFO] Step 16950/50000; acc:  93.32; ppl:  1.32; xent: 0.28; lr: 0.00010; 10879/10236 tok/s;   3166 sec
[2021-04-24 23:25:36,723 INFO] Step 17000/50000; acc:  93.37; ppl:  1.32; xent: 0.27; lr: 0.00010; 11033/10530 tok/s;   3175 sec
[2021-04-24 23:25:45,813 INFO] Step 17050/50000; acc:  93.43; ppl:  1.32; xent: 0.28; lr: 0.00010; 11002/10385 tok/s;   3184 sec
[2021-04-24 23:25:55,379 INFO] Step 17100/50000; acc:  93.79; ppl:  1.30; xent: 0.26; lr: 0.00010; 10974/10333 tok/s;   3193 sec
[2021-04-24 23:26:04,435 INFO] Step 17150/50000; acc:  93.44; ppl:  1.32; xent: 0.27; lr: 0.00010; 11035/10416 tok/s;   3203 sec
[2021-04-24 23:26:13,982 INFO] Step 17200/50000; acc:  93.52; ppl:  1.31; xent: 0.27; lr: 0.00010; 10898/10264 tok/s;   3212 sec
[2021-04-24 23:26:23,276 INFO] Step 17250/50000; acc:  93.59; ppl:  1.31; xent: 0.27; lr: 0.00010; 10899/10250 tok/s;   3221 sec
[2021-04-24 23:26:32,518 INFO] Step 17300/50000; acc:  93.50; ppl:  1.31; xent: 0.27; lr: 0.00010; 11123/10482 tok/s;   3231 sec
[2021-04-24 23:26:41,630 INFO] Step 17350/50000; acc:  93.53; ppl:  1.31; xent: 0.27; lr: 0.00010; 11098/10507 tok/s;   3240 sec
[2021-04-24 23:26:50,964 INFO] Step 17400/50000; acc:  93.50; ppl:  1.31; xent: 0.27; lr: 0.00010; 10902/10222 tok/s;   3249 sec
[2021-04-24 23:27:00,061 INFO] Step 17450/50000; acc:  93.50; ppl:  1.31; xent: 0.27; lr: 0.00010; 11055/10458 tok/s;   3258 sec
[2021-04-24 23:27:09,185 INFO] Step 17500/50000; acc:  93.42; ppl:  1.31; xent: 0.27; lr: 0.00010; 11037/10367 tok/s;   3267 sec
[2021-04-24 23:27:09,903 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 23:27:18,612 INFO] Step 17550/50000; acc:  93.64; ppl:  1.31; xent: 0.27; lr: 0.00010; 10919/10243 tok/s;   3277 sec
[2021-04-24 23:27:27,643 INFO] Step 17600/50000; acc:  93.56; ppl:  1.32; xent: 0.27; lr: 0.00010; 11119/10524 tok/s;   3286 sec
[2021-04-24 23:27:36,872 INFO] Step 17650/50000; acc:  93.46; ppl:  1.32; xent: 0.27; lr: 0.00010; 11095/10513 tok/s;   3295 sec
[2021-04-24 23:27:46,114 INFO] Step 17700/50000; acc:  93.63; ppl:  1.31; xent: 0.27; lr: 0.00010; 10942/10389 tok/s;   3304 sec
[2021-04-24 23:27:55,400 INFO] Step 17750/50000; acc:  93.54; ppl:  1.31; xent: 0.27; lr: 0.00010; 11042/10423 tok/s;   3313 sec
[2021-04-24 23:28:04,550 INFO] Step 17800/50000; acc:  93.51; ppl:  1.31; xent: 0.27; lr: 0.00010; 10987/10374 tok/s;   3323 sec
[2021-04-24 23:28:13,877 INFO] Step 17850/50000; acc:  93.77; ppl:  1.30; xent: 0.26; lr: 0.00010; 10971/10362 tok/s;   3332 sec
[2021-04-24 23:28:23,300 INFO] Step 17900/50000; acc:  93.64; ppl:  1.30; xent: 0.27; lr: 0.00010; 10973/10314 tok/s;   3341 sec
[2021-04-24 23:28:32,370 INFO] Step 17950/50000; acc:  93.52; ppl:  1.31; xent: 0.27; lr: 0.00010; 10955/10420 tok/s;   3350 sec
[2021-04-24 23:28:41,855 INFO] Step 18000/50000; acc:  93.75; ppl:  1.30; xent: 0.27; lr: 0.00010; 10999/10251 tok/s;   3360 sec
[2021-04-24 23:28:51,081 INFO] Step 18050/50000; acc:  93.51; ppl:  1.32; xent: 0.27; lr: 0.00010; 10915/10443 tok/s;   3369 sec
[2021-04-24 23:29:00,407 INFO] Step 18100/50000; acc:  93.74; ppl:  1.30; xent: 0.26; lr: 0.00010; 11139/10384 tok/s;   3378 sec
[2021-04-24 23:29:09,611 INFO] Step 18150/50000; acc:  93.58; ppl:  1.31; xent: 0.27; lr: 0.00010; 10914/10297 tok/s;   3388 sec
[2021-04-24 23:29:18,890 INFO] Step 18200/50000; acc:  93.54; ppl:  1.31; xent: 0.27; lr: 0.00010; 10915/10253 tok/s;   3397 sec
[2021-04-24 23:29:20,209 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 23:29:28,048 INFO] Step 18250/50000; acc:  93.60; ppl:  1.31; xent: 0.27; lr: 0.00010; 11056/10393 tok/s;   3406 sec
[2021-04-24 23:29:37,172 INFO] Step 18300/50000; acc:  93.66; ppl:  1.31; xent: 0.27; lr: 0.00010; 11059/10451 tok/s;   3415 sec
[2021-04-24 23:29:46,402 INFO] Step 18350/50000; acc:  93.57; ppl:  1.31; xent: 0.27; lr: 0.00010; 11003/10444 tok/s;   3424 sec
[2021-04-24 23:29:55,601 INFO] Step 18400/50000; acc:  93.48; ppl:  1.31; xent: 0.27; lr: 0.00010; 10960/10369 tok/s;   3434 sec
[2021-04-24 23:30:04,960 INFO] Step 18450/50000; acc:  93.58; ppl:  1.31; xent: 0.27; lr: 0.00010; 10959/10340 tok/s;   3443 sec
[2021-04-24 23:30:14,070 INFO] Step 18500/50000; acc:  93.60; ppl:  1.31; xent: 0.27; lr: 0.00010; 11046/10450 tok/s;   3452 sec
[2021-04-24 23:30:23,507 INFO] Step 18550/50000; acc:  93.84; ppl:  1.30; xent: 0.26; lr: 0.00010; 11036/10411 tok/s;   3462 sec
[2021-04-24 23:30:32,753 INFO] Step 18600/50000; acc:  93.72; ppl:  1.30; xent: 0.26; lr: 0.00010; 10979/10366 tok/s;   3471 sec
[2021-04-24 23:30:42,006 INFO] Step 18650/50000; acc:  93.49; ppl:  1.31; xent: 0.27; lr: 0.00010; 10851/10269 tok/s;   3480 sec
[2021-04-24 23:30:51,370 INFO] Step 18700/50000; acc:  93.62; ppl:  1.31; xent: 0.27; lr: 0.00010; 11062/10341 tok/s;   3489 sec
[2021-04-24 23:31:00,458 INFO] Step 18750/50000; acc:  93.70; ppl:  1.30; xent: 0.27; lr: 0.00010; 10946/10427 tok/s;   3499 sec
[2021-04-24 23:31:09,852 INFO] Step 18800/50000; acc:  93.71; ppl:  1.30; xent: 0.26; lr: 0.00010; 11107/10443 tok/s;   3508 sec
[2021-04-24 23:31:19,093 INFO] Step 18850/50000; acc:  93.77; ppl:  1.30; xent: 0.26; lr: 0.00010; 11008/10342 tok/s;   3517 sec
[2021-04-24 23:31:28,326 INFO] Step 18900/50000; acc:  93.75; ppl:  1.30; xent: 0.26; lr: 0.00010; 11044/10352 tok/s;   3526 sec
[2021-04-24 23:31:36,321 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 23:31:37,525 INFO] Step 18950/50000; acc:  93.55; ppl:  1.31; xent: 0.27; lr: 0.00010; 10973/10349 tok/s;   3536 sec
[2021-04-24 23:31:46,863 INFO] Step 19000/50000; acc:  93.65; ppl:  1.30; xent: 0.26; lr: 0.00010; 10875/10256 tok/s;   3545 sec
[2021-04-24 23:31:56,021 INFO] Step 19050/50000; acc:  93.66; ppl:  1.31; xent: 0.27; lr: 0.00010; 11066/10431 tok/s;   3554 sec
[2021-04-24 23:32:05,092 INFO] Step 19100/50000; acc:  93.40; ppl:  1.31; xent: 0.27; lr: 0.00010; 10989/10490 tok/s;   3563 sec
[2021-04-24 23:32:14,413 INFO] Step 19150/50000; acc:  93.70; ppl:  1.30; xent: 0.26; lr: 0.00010; 10979/10300 tok/s;   3572 sec
[2021-04-24 23:32:23,598 INFO] Step 19200/50000; acc:  93.52; ppl:  1.31; xent: 0.27; lr: 0.00010; 10967/10442 tok/s;   3582 sec
[2021-04-24 23:32:32,982 INFO] Step 19250/50000; acc:  93.70; ppl:  1.30; xent: 0.26; lr: 0.00010; 10958/10353 tok/s;   3591 sec
[2021-04-24 23:32:42,274 INFO] Step 19300/50000; acc:  93.90; ppl:  1.29; xent: 0.26; lr: 0.00010; 11012/10411 tok/s;   3600 sec
[2021-04-24 23:32:51,765 INFO] Step 19350/50000; acc:  93.68; ppl:  1.30; xent: 0.26; lr: 0.00010; 10962/10304 tok/s;   3610 sec
[2021-04-24 23:33:00,999 INFO] Step 19400/50000; acc:  93.61; ppl:  1.31; xent: 0.27; lr: 0.00010; 10794/10135 tok/s;   3619 sec
[2021-04-24 23:33:10,321 INFO] Step 19450/50000; acc:  93.74; ppl:  1.30; xent: 0.26; lr: 0.00010; 10841/10241 tok/s;   3628 sec
[2021-04-24 23:33:19,616 INFO] Step 19500/50000; acc:  93.77; ppl:  1.30; xent: 0.26; lr: 0.00010; 11117/10506 tok/s;   3638 sec
[2021-04-24 23:33:28,722 INFO] Step 19550/50000; acc:  93.69; ppl:  1.29; xent: 0.26; lr: 0.00010; 10958/10321 tok/s;   3647 sec
[2021-04-24 23:33:38,111 INFO] Step 19600/50000; acc:  93.75; ppl:  1.30; xent: 0.26; lr: 0.00010; 11068/10404 tok/s;   3656 sec
[2021-04-24 23:33:47,281 INFO] Step 19650/50000; acc:  93.67; ppl:  1.30; xent: 0.26; lr: 0.00010; 10943/10301 tok/s;   3665 sec
[2021-04-24 23:33:52,702 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 23:33:56,586 INFO] Step 19700/50000; acc:  93.66; ppl:  1.30; xent: 0.26; lr: 0.00010; 11111/10397 tok/s;   3675 sec
[2021-04-24 23:34:05,798 INFO] Step 19750/50000; acc:  93.68; ppl:  1.30; xent: 0.26; lr: 0.00010; 11007/10434 tok/s;   3684 sec
[2021-04-24 23:34:14,878 INFO] Step 19800/50000; acc:  93.74; ppl:  1.30; xent: 0.26; lr: 0.00010; 11134/10485 tok/s;   3693 sec
[2021-04-24 23:34:24,048 INFO] Step 19850/50000; acc:  93.63; ppl:  1.30; xent: 0.26; lr: 0.00010; 10940/10450 tok/s;   3702 sec
[2021-04-24 23:34:33,298 INFO] Step 19900/50000; acc:  93.53; ppl:  1.30; xent: 0.26; lr: 0.00010; 10857/10264 tok/s;   3711 sec
[2021-04-24 23:34:42,505 INFO] Step 19950/50000; acc:  93.72; ppl:  1.30; xent: 0.26; lr: 0.00010; 11090/10450 tok/s;   3721 sec
[2021-04-24 23:34:51,823 INFO] Step 20000/50000; acc:  93.69; ppl:  1.30; xent: 0.26; lr: 0.00010; 10904/10355 tok/s;   3730 sec
[2021-04-24 23:34:51,824 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/abstract_methods/valid_fixed.txt, align=None)...
[2021-04-24 23:35:00,619 INFO] Validation perplexity: 1.32637
[2021-04-24 23:35:00,619 INFO] Validation accuracy: 93.4719
[2021-04-24 23:35:00,622 INFO] Saving checkpoint ../models/group1_params/control/model_step_20000.pt
[2021-04-24 23:35:10,493 INFO] Step 20050/50000; acc:  93.92; ppl:  1.29; xent: 0.25; lr: 0.00010; 5581/5226 tok/s;   3749 sec
[2021-04-24 23:35:19,871 INFO] Step 20100/50000; acc:  93.68; ppl:  1.30; xent: 0.26; lr: 0.00010; 10813/10232 tok/s;   3758 sec
[2021-04-24 23:35:29,333 INFO] Step 20150/50000; acc:  93.80; ppl:  1.29; xent: 0.26; lr: 0.00010; 10887/10216 tok/s;   3767 sec
[2021-04-24 23:35:38,531 INFO] Step 20200/50000; acc:  93.74; ppl:  1.30; xent: 0.26; lr: 0.00010; 10920/10337 tok/s;   3777 sec
[2021-04-24 23:35:47,590 INFO] Step 20250/50000; acc:  93.70; ppl:  1.30; xent: 0.26; lr: 0.00010; 11109/10503 tok/s;   3786 sec
[2021-04-24 23:35:56,951 INFO] Step 20300/50000; acc:  93.89; ppl:  1.29; xent: 0.25; lr: 0.00010; 11088/10406 tok/s;   3795 sec
[2021-04-24 23:36:05,955 INFO] Step 20350/50000; acc:  93.66; ppl:  1.30; xent: 0.26; lr: 0.00010; 10981/10308 tok/s;   3804 sec
[2021-04-24 23:36:15,327 INFO] Step 20400/50000; acc:  93.74; ppl:  1.29; xent: 0.26; lr: 0.00010; 11073/10462 tok/s;   3813 sec
[2021-04-24 23:36:18,176 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 23:36:24,630 INFO] Step 20450/50000; acc:  93.68; ppl:  1.30; xent: 0.26; lr: 0.00010; 10872/10226 tok/s;   3823 sec
[2021-04-24 23:36:33,891 INFO] Step 20500/50000; acc:  93.87; ppl:  1.29; xent: 0.26; lr: 0.00010; 11153/10518 tok/s;   3832 sec
[2021-04-24 23:36:42,973 INFO] Step 20550/50000; acc:  93.70; ppl:  1.30; xent: 0.26; lr: 0.00010; 11063/10462 tok/s;   3841 sec
[2021-04-24 23:36:52,309 INFO] Step 20600/50000; acc:  93.61; ppl:  1.30; xent: 0.26; lr: 0.00010; 10844/10246 tok/s;   3850 sec
[2021-04-24 23:37:01,435 INFO] Step 20650/50000; acc:  93.77; ppl:  1.29; xent: 0.26; lr: 0.00010; 11026/10505 tok/s;   3860 sec
[2021-04-24 23:37:10,586 INFO] Step 20700/50000; acc:  93.65; ppl:  1.30; xent: 0.26; lr: 0.00010; 10945/10331 tok/s;   3869 sec
[2021-04-24 23:37:20,052 INFO] Step 20750/50000; acc:  94.03; ppl:  1.28; xent: 0.25; lr: 0.00010; 10994/10339 tok/s;   3878 sec
[2021-04-24 23:37:29,230 INFO] Step 20800/50000; acc:  93.76; ppl:  1.29; xent: 0.26; lr: 0.00010; 11015/10420 tok/s;   3887 sec
[2021-04-24 23:37:38,702 INFO] Step 20850/50000; acc:  93.79; ppl:  1.29; xent: 0.26; lr: 0.00010; 10896/10237 tok/s;   3897 sec
[2021-04-24 23:37:47,988 INFO] Step 20900/50000; acc:  93.90; ppl:  1.29; xent: 0.26; lr: 0.00010; 10931/10288 tok/s;   3906 sec
[2021-04-24 23:37:57,262 INFO] Step 20950/50000; acc:  93.80; ppl:  1.29; xent: 0.26; lr: 0.00010; 11090/10490 tok/s;   3915 sec
[2021-04-24 23:38:06,357 INFO] Step 21000/50000; acc:  93.80; ppl:  1.29; xent: 0.25; lr: 0.00010; 11083/10474 tok/s;   3924 sec
[2021-04-24 23:38:15,615 INFO] Step 21050/50000; acc:  93.81; ppl:  1.29; xent: 0.25; lr: 0.00010; 10850/10197 tok/s;   3934 sec
[2021-04-24 23:38:24,944 INFO] Step 21100/50000; acc:  93.79; ppl:  1.29; xent: 0.26; lr: 0.00010; 11052/10404 tok/s;   3943 sec
[2021-04-24 23:38:28,491 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 23:38:33,944 INFO] Step 21150/50000; acc:  93.61; ppl:  1.30; xent: 0.26; lr: 0.00010; 11072/10422 tok/s;   3952 sec
[2021-04-24 23:38:43,496 INFO] Step 21200/50000; acc:  93.87; ppl:  1.29; xent: 0.25; lr: 0.00010; 10920/10292 tok/s;   3962 sec
[2021-04-24 23:38:52,505 INFO] Step 21250/50000; acc:  93.79; ppl:  1.30; xent: 0.26; lr: 0.00010; 11147/10490 tok/s;   3971 sec
[2021-04-24 23:39:01,758 INFO] Step 21300/50000; acc:  93.74; ppl:  1.30; xent: 0.26; lr: 0.00010; 11080/10477 tok/s;   3980 sec
[2021-04-24 23:39:10,932 INFO] Step 21350/50000; acc:  93.71; ppl:  1.29; xent: 0.26; lr: 0.00010; 10983/10433 tok/s;   3989 sec
[2021-04-24 23:39:20,109 INFO] Step 21400/50000; acc:  93.69; ppl:  1.29; xent: 0.26; lr: 0.00010; 11026/10436 tok/s;   3998 sec
[2021-04-24 23:39:29,302 INFO] Step 21450/50000; acc:  93.71; ppl:  1.30; xent: 0.26; lr: 0.00010; 10990/10357 tok/s;   4007 sec
[2021-04-24 23:39:38,668 INFO] Step 21500/50000; acc:  94.00; ppl:  1.28; xent: 0.25; lr: 0.00010; 10961/10368 tok/s;   4017 sec
[2021-04-24 23:39:48,005 INFO] Step 21550/50000; acc:  93.75; ppl:  1.29; xent: 0.25; lr: 0.00010; 10948/10334 tok/s;   4026 sec
[2021-04-24 23:39:57,225 INFO] Step 21600/50000; acc:  93.72; ppl:  1.30; xent: 0.26; lr: 0.00010; 10943/10324 tok/s;   4035 sec
[2021-04-24 23:40:06,603 INFO] Step 21650/50000; acc:  93.92; ppl:  1.29; xent: 0.25; lr: 0.00010; 11002/10306 tok/s;   4045 sec
[2021-04-24 23:40:15,849 INFO] Step 21700/50000; acc:  93.75; ppl:  1.29; xent: 0.26; lr: 0.00010; 10948/10441 tok/s;   4054 sec
[2021-04-24 23:40:25,156 INFO] Step 21750/50000; acc:  93.92; ppl:  1.28; xent: 0.25; lr: 0.00010; 11152/10383 tok/s;   4063 sec
[2021-04-24 23:40:34,317 INFO] Step 21800/50000; acc:  93.75; ppl:  1.29; xent: 0.25; lr: 0.00010; 10903/10330 tok/s;   4072 sec
[2021-04-24 23:40:43,411 INFO] Step 21850/50000; acc:  93.79; ppl:  1.29; xent: 0.25; lr: 0.00010; 10998/10328 tok/s;   4081 sec
[2021-04-24 23:40:44,549 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 23:40:52,854 INFO] Step 21900/50000; acc:  93.86; ppl:  1.29; xent: 0.26; lr: 0.00010; 10988/10314 tok/s;   4091 sec
[2021-04-24 23:41:01,908 INFO] Step 21950/50000; acc:  93.95; ppl:  1.29; xent: 0.25; lr: 0.00010; 11019/10412 tok/s;   4100 sec
[2021-04-24 23:41:11,204 INFO] Step 22000/50000; acc:  93.88; ppl:  1.29; xent: 0.25; lr: 0.00010; 11089/10510 tok/s;   4109 sec
[2021-04-24 23:41:20,441 INFO] Step 22050/50000; acc:  93.67; ppl:  1.30; xent: 0.26; lr: 0.00010; 10928/10331 tok/s;   4119 sec
[2021-04-24 23:41:29,814 INFO] Step 22100/50000; acc:  93.90; ppl:  1.29; xent: 0.25; lr: 0.00010; 10962/10354 tok/s;   4128 sec
[2021-04-24 23:41:38,907 INFO] Step 22150/50000; acc:  93.80; ppl:  1.29; xent: 0.26; lr: 0.00010; 11009/10375 tok/s;   4137 sec
[2021-04-24 23:41:48,276 INFO] Step 22200/50000; acc:  94.09; ppl:  1.28; xent: 0.25; lr: 0.00010; 11003/10436 tok/s;   4146 sec
[2021-04-24 23:41:57,560 INFO] Step 22250/50000; acc:  93.90; ppl:  1.28; xent: 0.25; lr: 0.00010; 10960/10361 tok/s;   4156 sec
[2021-04-24 23:42:06,902 INFO] Step 22300/50000; acc:  93.76; ppl:  1.29; xent: 0.26; lr: 0.00010; 10781/10177 tok/s;   4165 sec
[2021-04-24 23:42:16,221 INFO] Step 22350/50000; acc:  93.86; ppl:  1.29; xent: 0.25; lr: 0.00010; 11000/10297 tok/s;   4174 sec
[2021-04-24 23:42:25,362 INFO] Step 22400/50000; acc:  94.00; ppl:  1.28; xent: 0.25; lr: 0.00010; 11027/10498 tok/s;   4183 sec
[2021-04-24 23:42:34,760 INFO] Step 22450/50000; acc:  93.87; ppl:  1.29; xent: 0.25; lr: 0.00010; 11003/10389 tok/s;   4193 sec
[2021-04-24 23:42:43,962 INFO] Step 22500/50000; acc:  93.92; ppl:  1.28; xent: 0.25; lr: 0.00010; 11074/10357 tok/s;   4202 sec
[2021-04-24 23:42:53,255 INFO] Step 22550/50000; acc:  94.04; ppl:  1.28; xent: 0.25; lr: 0.00010; 10991/10274 tok/s;   4211 sec
[2021-04-24 23:43:00,875 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 23:43:02,424 INFO] Step 22600/50000; acc:  93.67; ppl:  1.29; xent: 0.25; lr: 0.00010; 10959/10365 tok/s;   4220 sec
[2021-04-24 23:43:11,662 INFO] Step 22650/50000; acc:  93.79; ppl:  1.29; xent: 0.25; lr: 0.00010; 10863/10285 tok/s;   4230 sec
[2021-04-24 23:43:21,028 INFO] Step 22700/50000; acc:  93.98; ppl:  1.29; xent: 0.25; lr: 0.00010; 11076/10416 tok/s;   4239 sec
[2021-04-24 23:43:29,981 INFO] Step 22750/50000; acc:  93.68; ppl:  1.29; xent: 0.25; lr: 0.00010; 11003/10524 tok/s;   4248 sec
[2021-04-24 23:43:39,476 INFO] Step 22800/50000; acc:  93.90; ppl:  1.28; xent: 0.25; lr: 0.00010; 10944/10239 tok/s;   4258 sec
[2021-04-24 23:43:48,686 INFO] Step 22850/50000; acc:  93.87; ppl:  1.29; xent: 0.25; lr: 0.00010; 10940/10405 tok/s;   4267 sec
[2021-04-24 23:43:58,047 INFO] Step 22900/50000; acc:  93.83; ppl:  1.29; xent: 0.25; lr: 0.00010; 11009/10377 tok/s;   4276 sec
[2021-04-24 23:44:07,313 INFO] Step 22950/50000; acc:  94.13; ppl:  1.28; xent: 0.24; lr: 0.00010; 10998/10422 tok/s;   4285 sec
[2021-04-24 23:44:16,729 INFO] Step 23000/50000; acc:  93.83; ppl:  1.29; xent: 0.25; lr: 0.00010; 10885/10260 tok/s;   4295 sec
[2021-04-24 23:44:26,021 INFO] Step 23050/50000; acc:  93.88; ppl:  1.29; xent: 0.25; lr: 0.00010; 10804/10151 tok/s;   4304 sec
[2021-04-24 23:44:35,340 INFO] Step 23100/50000; acc:  93.93; ppl:  1.28; xent: 0.24; lr: 0.00010; 10857/10243 tok/s;   4313 sec
[2021-04-24 23:44:44,565 INFO] Step 23150/50000; acc:  93.98; ppl:  1.28; xent: 0.25; lr: 0.00010; 11086/10502 tok/s;   4323 sec
[2021-04-24 23:44:53,806 INFO] Step 23200/50000; acc:  93.95; ppl:  1.28; xent: 0.25; lr: 0.00010; 10974/10297 tok/s;   4332 sec
[2021-04-24 23:45:03,143 INFO] Step 23250/50000; acc:  93.91; ppl:  1.28; xent: 0.25; lr: 0.00010; 10994/10358 tok/s;   4341 sec
[2021-04-24 23:45:12,292 INFO] Step 23300/50000; acc:  93.97; ppl:  1.28; xent: 0.25; lr: 0.00010; 11007/10335 tok/s;   4350 sec
[2021-04-24 23:45:17,371 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 23:45:21,612 INFO] Step 23350/50000; acc:  93.91; ppl:  1.28; xent: 0.25; lr: 0.00010; 11099/10392 tok/s;   4360 sec
[2021-04-24 23:45:30,821 INFO] Step 23400/50000; acc:  93.98; ppl:  1.28; xent: 0.25; lr: 0.00010; 10968/10411 tok/s;   4369 sec
[2021-04-24 23:45:39,740 INFO] Step 23450/50000; acc:  93.91; ppl:  1.29; xent: 0.25; lr: 0.00010; 11191/10556 tok/s;   4378 sec
[2021-04-24 23:45:49,143 INFO] Step 23500/50000; acc:  93.86; ppl:  1.28; xent: 0.25; lr: 0.00010; 10947/10413 tok/s;   4387 sec
[2021-04-24 23:45:58,303 INFO] Step 23550/50000; acc:  93.84; ppl:  1.28; xent: 0.25; lr: 0.00010; 10835/10279 tok/s;   4396 sec
[2021-04-24 23:46:07,601 INFO] Step 23600/50000; acc:  94.01; ppl:  1.28; xent: 0.25; lr: 0.00010; 11128/10429 tok/s;   4406 sec
[2021-04-24 23:46:16,968 INFO] Step 23650/50000; acc:  93.96; ppl:  1.28; xent: 0.25; lr: 0.00010; 10883/10349 tok/s;   4415 sec
[2021-04-24 23:46:26,317 INFO] Step 23700/50000; acc:  94.17; ppl:  1.27; xent: 0.24; lr: 0.00010; 11139/10447 tok/s;   4424 sec
[2021-04-24 23:46:35,649 INFO] Step 23750/50000; acc:  93.85; ppl:  1.28; xent: 0.25; lr: 0.00010; 10819/10224 tok/s;   4434 sec
[2021-04-24 23:46:45,031 INFO] Step 23800/50000; acc:  94.00; ppl:  1.28; xent: 0.24; lr: 0.00010; 10843/10187 tok/s;   4443 sec
[2021-04-24 23:46:54,265 INFO] Step 23850/50000; acc:  94.03; ppl:  1.28; xent: 0.24; lr: 0.00010; 10926/10339 tok/s;   4452 sec
[2021-04-24 23:47:03,375 INFO] Step 23900/50000; acc:  93.86; ppl:  1.28; xent: 0.25; lr: 0.00010; 11047/10457 tok/s;   4461 sec
[2021-04-24 23:47:12,713 INFO] Step 23950/50000; acc:  94.00; ppl:  1.27; xent: 0.24; lr: 0.00010; 11014/10358 tok/s;   4471 sec
[2021-04-24 23:47:21,801 INFO] Step 24000/50000; acc:  93.94; ppl:  1.28; xent: 0.25; lr: 0.00010; 11040/10309 tok/s;   4480 sec
[2021-04-24 23:47:31,141 INFO] Step 24050/50000; acc:  93.97; ppl:  1.28; xent: 0.24; lr: 0.00010; 11010/10419 tok/s;   4489 sec
[2021-04-24 23:47:33,613 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 23:47:40,431 INFO] Step 24100/50000; acc:  93.97; ppl:  1.28; xent: 0.25; lr: 0.00010; 10930/10219 tok/s;   4499 sec
[2021-04-24 23:47:49,673 INFO] Step 24150/50000; acc:  94.10; ppl:  1.28; xent: 0.24; lr: 0.00010; 11158/10588 tok/s;   4508 sec
[2021-04-24 23:47:58,701 INFO] Step 24200/50000; acc:  93.83; ppl:  1.29; xent: 0.25; lr: 0.00010; 11076/10490 tok/s;   4517 sec
[2021-04-24 23:48:07,926 INFO] Step 24250/50000; acc:  93.81; ppl:  1.28; xent: 0.25; lr: 0.00010; 10837/10274 tok/s;   4526 sec
[2021-04-24 23:48:17,278 INFO] Step 24300/50000; acc:  94.05; ppl:  1.28; xent: 0.24; lr: 0.00010; 11046/10392 tok/s;   4535 sec
[2021-04-24 23:48:26,341 INFO] Step 24350/50000; acc:  93.86; ppl:  1.28; xent: 0.25; lr: 0.00010; 10917/10395 tok/s;   4544 sec
[2021-04-24 23:48:35,923 INFO] Step 24400/50000; acc:  94.27; ppl:  1.27; xent: 0.24; lr: 0.00010; 11033/10348 tok/s;   4554 sec
[2021-04-24 23:48:45,130 INFO] Step 24450/50000; acc:  93.88; ppl:  1.28; xent: 0.25; lr: 0.00010; 10982/10383 tok/s;   4563 sec
[2021-04-24 23:48:54,551 INFO] Step 24500/50000; acc:  94.07; ppl:  1.28; xent: 0.24; lr: 0.00010; 10965/10316 tok/s;   4573 sec
[2021-04-24 23:49:03,818 INFO] Step 24550/50000; acc:  94.00; ppl:  1.27; xent: 0.24; lr: 0.00010; 10904/10277 tok/s;   4582 sec
[2021-04-24 23:49:13,077 INFO] Step 24600/50000; acc:  94.02; ppl:  1.28; xent: 0.25; lr: 0.00010; 10958/10398 tok/s;   4591 sec
[2021-04-24 23:49:22,182 INFO] Step 24650/50000; acc:  94.05; ppl:  1.27; xent: 0.24; lr: 0.00010; 11129/10509 tok/s;   4600 sec
[2021-04-24 23:49:31,426 INFO] Step 24700/50000; acc:  94.06; ppl:  1.27; xent: 0.24; lr: 0.00010; 10887/10192 tok/s;   4610 sec
[2021-04-24 23:49:40,692 INFO] Step 24750/50000; acc:  94.03; ppl:  1.27; xent: 0.24; lr: 0.00010; 11001/10400 tok/s;   4619 sec
[2021-04-24 23:49:43,958 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 23:49:49,846 INFO] Step 24800/50000; acc:  93.90; ppl:  1.28; xent: 0.25; lr: 0.00010; 11075/10366 tok/s;   4628 sec
[2021-04-24 23:49:59,222 INFO] Step 24850/50000; acc:  94.07; ppl:  1.28; xent: 0.25; lr: 0.00010; 10993/10366 tok/s;   4637 sec
[2021-04-24 23:50:08,352 INFO] Step 24900/50000; acc:  94.02; ppl:  1.28; xent: 0.24; lr: 0.00010; 11049/10447 tok/s;   4646 sec
[2021-04-24 23:50:17,635 INFO] Step 24950/50000; acc:  93.93; ppl:  1.28; xent: 0.25; lr: 0.00010; 11036/10447 tok/s;   4656 sec
[2021-04-24 23:50:26,765 INFO] Step 25000/50000; acc:  93.88; ppl:  1.28; xent: 0.25; lr: 0.00010; 10994/10443 tok/s;   4665 sec
[2021-04-24 23:50:26,766 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/abstract_methods/valid_fixed.txt, align=None)...
[2021-04-24 23:50:35,568 INFO] Validation perplexity: 1.31985
[2021-04-24 23:50:35,568 INFO] Validation accuracy: 93.5379
[2021-04-24 23:50:35,570 INFO] Saving checkpoint ../models/group1_params/control/model_step_25000.pt
[2021-04-24 23:50:45,165 INFO] Step 25050/50000; acc:  93.97; ppl:  1.28; xent: 0.24; lr: 0.00010; 5423/5138 tok/s;   4683 sec
[2021-04-24 23:50:54,560 INFO] Step 25100/50000; acc:  94.12; ppl:  1.28; xent: 0.24; lr: 0.00010; 11046/10363 tok/s;   4693 sec
[2021-04-24 23:51:03,816 INFO] Step 25150/50000; acc:  94.11; ppl:  1.27; xent: 0.24; lr: 0.00010; 10943/10360 tok/s;   4702 sec
[2021-04-24 23:51:13,264 INFO] Step 25200/50000; acc:  94.06; ppl:  1.27; xent: 0.24; lr: 0.00010; 11000/10342 tok/s;   4711 sec
[2021-04-24 23:51:22,521 INFO] Step 25250/50000; acc:  94.01; ppl:  1.28; xent: 0.25; lr: 0.00010; 10920/10308 tok/s;   4721 sec
[2021-04-24 23:51:31,886 INFO] Step 25300/50000; acc:  94.06; ppl:  1.27; xent: 0.24; lr: 0.00010; 11006/10338 tok/s;   4730 sec
[2021-04-24 23:51:41,092 INFO] Step 25350/50000; acc:  94.05; ppl:  1.27; xent: 0.24; lr: 0.00010; 10955/10421 tok/s;   4739 sec
[2021-04-24 23:51:50,342 INFO] Step 25400/50000; acc:  94.07; ppl:  1.27; xent: 0.24; lr: 0.00010; 11074/10354 tok/s;   4748 sec
[2021-04-24 23:51:59,506 INFO] Step 25450/50000; acc:  94.03; ppl:  1.27; xent: 0.24; lr: 0.00010; 10941/10349 tok/s;   4758 sec
[2021-04-24 23:52:08,664 INFO] Step 25500/50000; acc:  94.03; ppl:  1.27; xent: 0.24; lr: 0.00010; 10975/10320 tok/s;   4767 sec
[2021-04-24 23:52:09,396 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 23:52:18,013 INFO] Step 25550/50000; acc:  93.99; ppl:  1.28; xent: 0.24; lr: 0.00010; 10962/10326 tok/s;   4776 sec
[2021-04-24 23:52:27,119 INFO] Step 25600/50000; acc:  94.14; ppl:  1.27; xent: 0.24; lr: 0.00010; 11132/10462 tok/s;   4785 sec
[2021-04-24 23:52:36,350 INFO] Step 25650/50000; acc:  94.08; ppl:  1.27; xent: 0.24; lr: 0.00010; 11036/10467 tok/s;   4794 sec
[2021-04-24 23:52:45,621 INFO] Step 25700/50000; acc:  93.91; ppl:  1.28; xent: 0.25; lr: 0.00010; 10918/10331 tok/s;   4804 sec
[2021-04-24 23:52:55,019 INFO] Step 25750/50000; acc:  94.03; ppl:  1.27; xent: 0.24; lr: 0.00010; 10952/10362 tok/s;   4813 sec
[2021-04-24 23:53:04,132 INFO] Step 25800/50000; acc:  93.95; ppl:  1.28; xent: 0.25; lr: 0.00010; 10940/10275 tok/s;   4822 sec
[2021-04-24 23:53:13,474 INFO] Step 25850/50000; acc:  94.20; ppl:  1.27; xent: 0.24; lr: 0.00010; 10888/10365 tok/s;   4832 sec
[2021-04-24 23:53:22,989 INFO] Step 25900/50000; acc:  94.20; ppl:  1.26; xent: 0.23; lr: 0.00010; 10964/10342 tok/s;   4841 sec
[2021-04-24 23:53:32,255 INFO] Step 25950/50000; acc:  93.94; ppl:  1.27; xent: 0.24; lr: 0.00010; 10725/10131 tok/s;   4850 sec
[2021-04-24 23:53:41,690 INFO] Step 26000/50000; acc:  94.15; ppl:  1.27; xent: 0.24; lr: 0.00010; 11050/10315 tok/s;   4860 sec
[2021-04-24 23:53:50,906 INFO] Step 26050/50000; acc:  94.15; ppl:  1.27; xent: 0.24; lr: 0.00010; 10947/10439 tok/s;   4869 sec
[2021-04-24 23:54:00,255 INFO] Step 26100/50000; acc:  94.10; ppl:  1.27; xent: 0.24; lr: 0.00010; 11073/10393 tok/s;   4878 sec
[2021-04-24 23:54:09,411 INFO] Step 26150/50000; acc:  94.20; ppl:  1.26; xent: 0.23; lr: 0.00010; 11048/10430 tok/s;   4887 sec
[2021-04-24 23:54:18,592 INFO] Step 26200/50000; acc:  94.10; ppl:  1.27; xent: 0.24; lr: 0.00010; 10981/10260 tok/s;   4897 sec
[2021-04-24 23:54:25,881 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 23:54:27,792 INFO] Step 26250/50000; acc:  94.02; ppl:  1.27; xent: 0.24; lr: 0.00010; 10994/10387 tok/s;   4906 sec
[2021-04-24 23:54:37,052 INFO] Step 26300/50000; acc:  94.11; ppl:  1.27; xent: 0.24; lr: 0.00010; 10858/10252 tok/s;   4915 sec
[2021-04-24 23:54:46,309 INFO] Step 26350/50000; acc:  94.14; ppl:  1.27; xent: 0.24; lr: 0.00010; 11074/10452 tok/s;   4924 sec
[2021-04-24 23:54:55,364 INFO] Step 26400/50000; acc:  93.94; ppl:  1.27; xent: 0.24; lr: 0.00010; 11057/10547 tok/s;   4933 sec
[2021-04-24 23:55:04,893 INFO] Step 26450/50000; acc:  94.06; ppl:  1.27; xent: 0.24; lr: 0.00010; 10806/10162 tok/s;   4943 sec
[2021-04-24 23:55:13,977 INFO] Step 26500/50000; acc:  94.07; ppl:  1.27; xent: 0.24; lr: 0.00010; 11111/10499 tok/s;   4952 sec
[2021-04-24 23:55:23,435 INFO] Step 26550/50000; acc:  94.07; ppl:  1.27; xent: 0.24; lr: 0.00010; 10921/10343 tok/s;   4962 sec
[2021-04-24 23:55:32,675 INFO] Step 26600/50000; acc:  94.46; ppl:  1.25; xent: 0.23; lr: 0.00010; 11004/10363 tok/s;   4971 sec
[2021-04-24 23:55:41,937 INFO] Step 26650/50000; acc:  94.00; ppl:  1.27; xent: 0.24; lr: 0.00010; 10895/10317 tok/s;   4980 sec
[2021-04-24 23:55:51,418 INFO] Step 26700/50000; acc:  94.15; ppl:  1.27; xent: 0.24; lr: 0.00010; 10876/10191 tok/s;   4989 sec
[2021-04-24 23:56:00,640 INFO] Step 26750/50000; acc:  94.06; ppl:  1.27; xent: 0.24; lr: 0.00010; 10828/10247 tok/s;   4999 sec
[2021-04-24 23:56:09,990 INFO] Step 26800/50000; acc:  94.21; ppl:  1.26; xent: 0.24; lr: 0.00010; 11109/10510 tok/s;   5008 sec
[2021-04-24 23:56:19,216 INFO] Step 26850/50000; acc:  94.10; ppl:  1.27; xent: 0.24; lr: 0.00010; 10985/10286 tok/s;   5017 sec
[2021-04-24 23:56:28,482 INFO] Step 26900/50000; acc:  94.14; ppl:  1.27; xent: 0.24; lr: 0.00010; 11092/10395 tok/s;   5027 sec
[2021-04-24 23:56:37,644 INFO] Step 26950/50000; acc:  94.14; ppl:  1.27; xent: 0.24; lr: 0.00010; 10954/10354 tok/s;   5036 sec
[2021-04-24 23:56:42,299 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 23:56:46,936 INFO] Step 27000/50000; acc:  94.06; ppl:  1.27; xent: 0.24; lr: 0.00010; 10972/10297 tok/s;   5045 sec
[2021-04-24 23:56:56,205 INFO] Step 27050/50000; acc:  94.21; ppl:  1.27; xent: 0.24; lr: 0.00010; 10977/10380 tok/s;   5054 sec
[2021-04-24 23:57:05,161 INFO] Step 27100/50000; acc:  94.03; ppl:  1.27; xent: 0.24; lr: 0.00010; 11143/10553 tok/s;   5063 sec
[2021-04-24 23:57:14,536 INFO] Step 27150/50000; acc:  94.05; ppl:  1.27; xent: 0.24; lr: 0.00010; 10865/10315 tok/s;   5073 sec
[2021-04-24 23:57:23,769 INFO] Step 27200/50000; acc:  94.07; ppl:  1.27; xent: 0.24; lr: 0.00010; 10906/10335 tok/s;   5082 sec
[2021-04-24 23:57:32,984 INFO] Step 27250/50000; acc:  94.15; ppl:  1.27; xent: 0.24; lr: 0.00010; 11103/10405 tok/s;   5091 sec
[2021-04-24 23:57:42,364 INFO] Step 27300/50000; acc:  94.26; ppl:  1.26; xent: 0.23; lr: 0.00010; 10924/10390 tok/s;   5100 sec
[2021-04-24 23:57:51,712 INFO] Step 27350/50000; acc:  94.33; ppl:  1.26; xent: 0.23; lr: 0.00010; 11147/10441 tok/s;   5110 sec
[2021-04-24 23:58:01,011 INFO] Step 27400/50000; acc:  94.05; ppl:  1.27; xent: 0.24; lr: 0.00010; 10813/10246 tok/s;   5119 sec
[2021-04-24 23:58:10,235 INFO] Step 27450/50000; acc:  94.08; ppl:  1.27; xent: 0.24; lr: 0.00010; 10877/10249 tok/s;   5128 sec
[2021-04-24 23:58:19,593 INFO] Step 27500/50000; acc:  94.30; ppl:  1.26; xent: 0.23; lr: 0.00010; 11057/10403 tok/s;   5138 sec
[2021-04-24 23:58:28,625 INFO] Step 27550/50000; acc:  94.07; ppl:  1.27; xent: 0.24; lr: 0.00010; 11015/10440 tok/s;   5147 sec
[2021-04-24 23:58:38,091 INFO] Step 27600/50000; acc:  94.32; ppl:  1.26; xent: 0.23; lr: 0.00010; 11026/10352 tok/s;   5156 sec
[2021-04-24 23:58:47,212 INFO] Step 27650/50000; acc:  94.08; ppl:  1.27; xent: 0.24; lr: 0.00010; 10998/10279 tok/s;   5165 sec
[2021-04-24 23:58:56,476 INFO] Step 27700/50000; acc:  94.21; ppl:  1.26; xent: 0.23; lr: 0.00010; 11136/10516 tok/s;   5175 sec
[2021-04-24 23:58:58,489 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-24 23:59:05,711 INFO] Step 27750/50000; acc:  94.17; ppl:  1.27; xent: 0.24; lr: 0.00010; 10945/10277 tok/s;   5184 sec
[2021-04-24 23:59:14,810 INFO] Step 27800/50000; acc:  94.15; ppl:  1.27; xent: 0.24; lr: 0.00010; 11160/10588 tok/s;   5193 sec
[2021-04-24 23:59:23,863 INFO] Step 27850/50000; acc:  94.02; ppl:  1.27; xent: 0.24; lr: 0.00010; 11097/10489 tok/s;   5202 sec
[2021-04-24 23:59:33,074 INFO] Step 27900/50000; acc:  94.07; ppl:  1.26; xent: 0.23; lr: 0.00010; 10895/10362 tok/s;   5211 sec
[2021-04-24 23:59:42,376 INFO] Step 27950/50000; acc:  94.23; ppl:  1.26; xent: 0.23; lr: 0.00010; 10973/10340 tok/s;   5220 sec
[2021-04-24 23:59:51,507 INFO] Step 28000/50000; acc:  94.17; ppl:  1.27; xent: 0.24; lr: 0.00010; 11013/10443 tok/s;   5230 sec
[2021-04-25 00:00:01,052 INFO] Step 28050/50000; acc:  94.39; ppl:  1.25; xent: 0.23; lr: 0.00010; 10974/10312 tok/s;   5239 sec
[2021-04-25 00:00:10,277 INFO] Step 28100/50000; acc:  94.22; ppl:  1.26; xent: 0.23; lr: 0.00010; 10994/10337 tok/s;   5248 sec
[2021-04-25 00:00:19,708 INFO] Step 28150/50000; acc:  94.28; ppl:  1.26; xent: 0.23; lr: 0.00010; 10956/10352 tok/s;   5258 sec
[2021-04-25 00:00:29,022 INFO] Step 28200/50000; acc:  94.13; ppl:  1.26; xent: 0.23; lr: 0.00010; 10807/10180 tok/s;   5267 sec
[2021-04-25 00:00:38,143 INFO] Step 28250/50000; acc:  94.15; ppl:  1.27; xent: 0.24; lr: 0.00010; 10967/10420 tok/s;   5276 sec
[2021-04-25 00:00:47,509 INFO] Step 28300/50000; acc:  94.32; ppl:  1.25; xent: 0.23; lr: 0.00010; 11117/10430 tok/s;   5286 sec
[2021-04-25 00:00:56,640 INFO] Step 28350/50000; acc:  94.19; ppl:  1.26; xent: 0.23; lr: 0.00010; 10882/10259 tok/s;   5295 sec
[2021-04-25 00:01:05,972 INFO] Step 28400/50000; acc:  94.20; ppl:  1.26; xent: 0.23; lr: 0.00010; 11071/10407 tok/s;   5304 sec
[2021-04-25 00:01:08,819 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 00:01:15,167 INFO] Step 28450/50000; acc:  94.06; ppl:  1.27; xent: 0.24; lr: 0.00010; 11053/10369 tok/s;   5313 sec
[2021-04-25 00:01:24,553 INFO] Step 28500/50000; acc:  94.41; ppl:  1.26; xent: 0.23; lr: 0.00010; 10985/10362 tok/s;   5323 sec
[2021-04-25 00:01:33,592 INFO] Step 28550/50000; acc:  94.14; ppl:  1.27; xent: 0.24; lr: 0.00010; 11092/10473 tok/s;   5332 sec
[2021-04-25 00:01:42,851 INFO] Step 28600/50000; acc:  94.03; ppl:  1.27; xent: 0.24; lr: 0.00010; 10927/10412 tok/s;   5341 sec
[2021-04-25 00:01:51,956 INFO] Step 28650/50000; acc:  94.21; ppl:  1.26; xent: 0.23; lr: 0.00010; 11091/10486 tok/s;   5350 sec
[2021-04-25 00:02:01,093 INFO] Step 28700/50000; acc:  94.14; ppl:  1.26; xent: 0.23; lr: 0.00010; 10932/10313 tok/s;   5359 sec
[2021-04-25 00:02:10,395 INFO] Step 28750/50000; acc:  94.30; ppl:  1.26; xent: 0.23; lr: 0.00010; 11070/10433 tok/s;   5368 sec
[2021-04-25 00:02:19,715 INFO] Step 28800/50000; acc:  94.37; ppl:  1.25; xent: 0.23; lr: 0.00010; 11014/10437 tok/s;   5378 sec
[2021-04-25 00:02:29,090 INFO] Step 28850/50000; acc:  94.21; ppl:  1.26; xent: 0.23; lr: 0.00010; 10960/10328 tok/s;   5387 sec
[2021-04-25 00:02:38,357 INFO] Step 28900/50000; acc:  94.25; ppl:  1.26; xent: 0.23; lr: 0.00010; 10948/10284 tok/s;   5396 sec
[2021-04-25 00:02:47,692 INFO] Step 28950/50000; acc:  94.36; ppl:  1.26; xent: 0.23; lr: 0.00010; 11040/10403 tok/s;   5406 sec
[2021-04-25 00:02:56,821 INFO] Step 29000/50000; acc:  94.26; ppl:  1.26; xent: 0.23; lr: 0.00010; 11015/10441 tok/s;   5415 sec
[2021-04-25 00:03:06,017 INFO] Step 29050/50000; acc:  94.21; ppl:  1.26; xent: 0.23; lr: 0.00010; 10997/10317 tok/s;   5424 sec
[2021-04-25 00:03:15,332 INFO] Step 29100/50000; acc:  94.26; ppl:  1.26; xent: 0.23; lr: 0.00010; 11022/10365 tok/s;   5433 sec
[2021-04-25 00:03:24,421 INFO] Step 29150/50000; acc:  94.12; ppl:  1.26; xent: 0.23; lr: 0.00010; 10943/10343 tok/s;   5442 sec
[2021-04-25 00:03:24,852 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 00:03:33,950 INFO] Step 29200/50000; acc:  94.26; ppl:  1.26; xent: 0.23; lr: 0.00010; 10913/10222 tok/s;   5452 sec
[2021-04-25 00:03:43,107 INFO] Step 29250/50000; acc:  94.23; ppl:  1.26; xent: 0.23; lr: 0.00010; 11077/10454 tok/s;   5461 sec
[2021-04-25 00:03:52,306 INFO] Step 29300/50000; acc:  94.22; ppl:  1.26; xent: 0.23; lr: 0.00010; 11082/10494 tok/s;   5470 sec
[2021-04-25 00:04:01,500 INFO] Step 29350/50000; acc:  94.14; ppl:  1.26; xent: 0.23; lr: 0.00010; 10971/10363 tok/s;   5480 sec
[2021-04-25 00:04:10,801 INFO] Step 29400/50000; acc:  94.12; ppl:  1.26; xent: 0.23; lr: 0.00010; 10913/10393 tok/s;   5489 sec
[2021-04-25 00:04:19,916 INFO] Step 29450/50000; acc:  94.21; ppl:  1.26; xent: 0.23; lr: 0.00010; 10999/10331 tok/s;   5498 sec
[2021-04-25 00:04:29,254 INFO] Step 29500/50000; acc:  94.43; ppl:  1.25; xent: 0.23; lr: 0.00010; 10923/10379 tok/s;   5507 sec
[2021-04-25 00:04:38,652 INFO] Step 29550/50000; acc:  94.25; ppl:  1.26; xent: 0.23; lr: 0.00010; 10980/10378 tok/s;   5517 sec
[2021-04-25 00:04:47,947 INFO] Step 29600/50000; acc:  94.22; ppl:  1.26; xent: 0.23; lr: 0.00010; 10816/10172 tok/s;   5526 sec
[2021-04-25 00:04:57,351 INFO] Step 29650/50000; acc:  94.29; ppl:  1.25; xent: 0.23; lr: 0.00010; 10998/10278 tok/s;   5535 sec
[2021-04-25 00:05:06,552 INFO] Step 29700/50000; acc:  94.38; ppl:  1.25; xent: 0.23; lr: 0.00010; 10995/10510 tok/s;   5545 sec
[2021-04-25 00:05:15,932 INFO] Step 29750/50000; acc:  94.27; ppl:  1.25; xent: 0.23; lr: 0.00010; 11059/10320 tok/s;   5554 sec
[2021-04-25 00:05:25,080 INFO] Step 29800/50000; acc:  94.23; ppl:  1.26; xent: 0.23; lr: 0.00010; 10972/10416 tok/s;   5563 sec
[2021-04-25 00:05:34,151 INFO] Step 29850/50000; acc:  94.23; ppl:  1.26; xent: 0.23; lr: 0.00010; 10984/10267 tok/s;   5572 sec
[2021-04-25 00:05:41,192 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 00:05:43,515 INFO] Step 29900/50000; acc:  94.36; ppl:  1.26; xent: 0.23; lr: 0.00010; 11100/10455 tok/s;   5582 sec
[2021-04-25 00:05:52,642 INFO] Step 29950/50000; acc:  94.25; ppl:  1.26; xent: 0.23; lr: 0.00010; 10883/10315 tok/s;   5591 sec
[2021-04-25 00:06:01,987 INFO] Step 30000/50000; acc:  94.34; ppl:  1.26; xent: 0.23; lr: 0.00010; 11122/10446 tok/s;   5600 sec
[2021-04-25 00:06:01,989 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/abstract_methods/valid_fixed.txt, align=None)...
[2021-04-25 00:06:10,773 INFO] Validation perplexity: 1.31758
[2021-04-25 00:06:10,773 INFO] Validation accuracy: 93.6066
[2021-04-25 00:06:10,775 INFO] Saving checkpoint ../models/group1_params/control/model_step_30000.pt
[2021-04-25 00:06:20,376 INFO] Step 30050/50000; acc:  94.17; ppl:  1.26; xent: 0.23; lr: 0.00010; 5454/5194 tok/s;   5618 sec
[2021-04-25 00:06:29,802 INFO] Step 30100/50000; acc:  94.26; ppl:  1.26; xent: 0.23; lr: 0.00010; 10919/10270 tok/s;   5628 sec
[2021-04-25 00:06:38,905 INFO] Step 30150/50000; acc:  94.27; ppl:  1.26; xent: 0.23; lr: 0.00010; 11037/10449 tok/s;   5637 sec
[2021-04-25 00:06:48,178 INFO] Step 30200/50000; acc:  94.22; ppl:  1.26; xent: 0.23; lr: 0.00010; 11018/10446 tok/s;   5646 sec
[2021-04-25 00:06:57,530 INFO] Step 30250/50000; acc:  94.50; ppl:  1.25; xent: 0.22; lr: 0.00010; 10918/10332 tok/s;   5656 sec
[2021-04-25 00:07:06,819 INFO] Step 30300/50000; acc:  94.17; ppl:  1.26; xent: 0.23; lr: 0.00010; 10882/10272 tok/s;   5665 sec
[2021-04-25 00:07:16,194 INFO] Step 30350/50000; acc:  94.31; ppl:  1.26; xent: 0.23; lr: 0.00010; 10878/10202 tok/s;   5674 sec
[2021-04-25 00:07:25,467 INFO] Step 30400/50000; acc:  94.34; ppl:  1.25; xent: 0.22; lr: 0.00010; 10914/10310 tok/s;   5684 sec
[2021-04-25 00:07:34,760 INFO] Step 30450/50000; acc:  94.44; ppl:  1.25; xent: 0.23; lr: 0.00010; 11074/10462 tok/s;   5693 sec
[2021-04-25 00:07:43,965 INFO] Step 30500/50000; acc:  94.42; ppl:  1.24; xent: 0.22; lr: 0.00010; 11054/10354 tok/s;   5702 sec
[2021-04-25 00:07:53,274 INFO] Step 30550/50000; acc:  94.31; ppl:  1.26; xent: 0.23; lr: 0.00010; 11048/10366 tok/s;   5711 sec
[2021-04-25 00:08:02,390 INFO] Step 30600/50000; acc:  94.30; ppl:  1.25; xent: 0.23; lr: 0.00010; 10964/10353 tok/s;   5720 sec
[2021-04-25 00:08:06,743 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 00:08:11,549 INFO] Step 30650/50000; acc:  94.26; ppl:  1.26; xent: 0.23; lr: 0.00010; 10979/10350 tok/s;   5730 sec
[2021-04-25 00:08:20,988 INFO] Step 30700/50000; acc:  94.48; ppl:  1.25; xent: 0.22; lr: 0.00010; 11045/10351 tok/s;   5739 sec
[2021-04-25 00:08:29,880 INFO] Step 30750/50000; acc:  94.16; ppl:  1.26; xent: 0.23; lr: 0.00010; 11108/10615 tok/s;   5748 sec
[2021-04-25 00:08:39,364 INFO] Step 30800/50000; acc:  94.31; ppl:  1.25; xent: 0.23; lr: 0.00010; 10896/10257 tok/s;   5757 sec
[2021-04-25 00:08:48,611 INFO] Step 30850/50000; acc:  94.26; ppl:  1.26; xent: 0.23; lr: 0.00010; 10911/10360 tok/s;   5767 sec
[2021-04-25 00:08:57,848 INFO] Step 30900/50000; acc:  94.32; ppl:  1.26; xent: 0.23; lr: 0.00010; 11079/10400 tok/s;   5776 sec
[2021-04-25 00:09:07,228 INFO] Step 30950/50000; acc:  94.41; ppl:  1.25; xent: 0.22; lr: 0.00010; 10880/10368 tok/s;   5785 sec
[2021-04-25 00:09:16,492 INFO] Step 31000/50000; acc:  94.45; ppl:  1.25; xent: 0.22; lr: 0.00010; 11089/10397 tok/s;   5795 sec
[2021-04-25 00:09:25,734 INFO] Step 31050/50000; acc:  94.24; ppl:  1.26; xent: 0.23; lr: 0.00010; 10912/10303 tok/s;   5804 sec
[2021-04-25 00:09:34,985 INFO] Step 31100/50000; acc:  94.32; ppl:  1.25; xent: 0.23; lr: 0.00010; 10889/10279 tok/s;   5813 sec
[2021-04-25 00:09:44,275 INFO] Step 31150/50000; acc:  94.42; ppl:  1.25; xent: 0.22; lr: 0.00010; 11026/10391 tok/s;   5822 sec
[2021-04-25 00:09:53,372 INFO] Step 31200/50000; acc:  94.37; ppl:  1.25; xent: 0.22; lr: 0.00010; 11091/10505 tok/s;   5831 sec
[2021-04-25 00:10:02,779 INFO] Step 31250/50000; acc:  94.37; ppl:  1.25; xent: 0.22; lr: 0.00010; 10982/10307 tok/s;   5841 sec
[2021-04-25 00:10:11,862 INFO] Step 31300/50000; acc:  94.32; ppl:  1.25; xent: 0.23; lr: 0.00010; 11089/10371 tok/s;   5850 sec
[2021-04-25 00:10:21,169 INFO] Step 31350/50000; acc:  94.38; ppl:  1.25; xent: 0.22; lr: 0.00010; 11100/10454 tok/s;   5859 sec
[2021-04-25 00:10:22,814 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 00:10:30,421 INFO] Step 31400/50000; acc:  94.33; ppl:  1.26; xent: 0.23; lr: 0.00010; 10881/10252 tok/s;   5868 sec
[2021-04-25 00:10:39,434 INFO] Step 31450/50000; acc:  94.40; ppl:  1.25; xent: 0.23; lr: 0.00010; 11102/10536 tok/s;   5878 sec
[2021-04-25 00:10:48,643 INFO] Step 31500/50000; acc:  94.33; ppl:  1.26; xent: 0.23; lr: 0.00010; 11195/10551 tok/s;   5887 sec
[2021-04-25 00:10:57,752 INFO] Step 31550/50000; acc:  94.22; ppl:  1.25; xent: 0.23; lr: 0.00010; 10888/10368 tok/s;   5896 sec
[2021-04-25 00:11:07,138 INFO] Step 31600/50000; acc:  94.36; ppl:  1.25; xent: 0.22; lr: 0.00010; 11028/10370 tok/s;   5905 sec
[2021-04-25 00:11:16,325 INFO] Step 31650/50000; acc:  94.38; ppl:  1.25; xent: 0.23; lr: 0.00010; 10973/10418 tok/s;   5914 sec
[2021-04-25 00:11:25,876 INFO] Step 31700/50000; acc:  94.68; ppl:  1.24; xent: 0.21; lr: 0.00010; 10981/10317 tok/s;   5924 sec
[2021-04-25 00:11:35,025 INFO] Step 31750/50000; acc:  94.30; ppl:  1.25; xent: 0.23; lr: 0.00010; 11014/10368 tok/s;   5933 sec
[2021-04-25 00:11:44,446 INFO] Step 31800/50000; acc:  94.39; ppl:  1.25; xent: 0.23; lr: 0.00010; 10829/10270 tok/s;   5943 sec
[2021-04-25 00:11:53,704 INFO] Step 31850/50000; acc:  94.43; ppl:  1.25; xent: 0.22; lr: 0.00010; 10927/10254 tok/s;   5952 sec
[2021-04-25 00:12:02,805 INFO] Step 31900/50000; acc:  94.26; ppl:  1.26; xent: 0.23; lr: 0.00010; 11010/10500 tok/s;   5961 sec
[2021-04-25 00:12:12,107 INFO] Step 31950/50000; acc:  94.47; ppl:  1.24; xent: 0.22; lr: 0.00010; 11095/10402 tok/s;   5970 sec
[2021-04-25 00:12:21,312 INFO] Step 32000/50000; acc:  94.50; ppl:  1.25; xent: 0.22; lr: 0.00010; 10939/10257 tok/s;   5979 sec
[2021-04-25 00:12:30,595 INFO] Step 32050/50000; acc:  94.31; ppl:  1.25; xent: 0.22; lr: 0.00010; 11027/10385 tok/s;   5989 sec
[2021-04-25 00:12:33,109 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 00:12:39,804 INFO] Step 32100/50000; acc:  94.37; ppl:  1.25; xent: 0.23; lr: 0.00010; 11062/10370 tok/s;   5998 sec
[2021-04-25 00:12:49,152 INFO] Step 32150/50000; acc:  94.55; ppl:  1.24; xent: 0.22; lr: 0.00010; 11048/10460 tok/s;   6007 sec
[2021-04-25 00:12:58,097 INFO] Step 32200/50000; acc:  94.27; ppl:  1.26; xent: 0.23; lr: 0.00010; 11138/10523 tok/s;   6016 sec
[2021-04-25 00:13:07,229 INFO] Step 32250/50000; acc:  94.26; ppl:  1.26; xent: 0.23; lr: 0.00010; 10955/10422 tok/s;   6025 sec
[2021-04-25 00:13:16,613 INFO] Step 32300/50000; acc:  94.49; ppl:  1.24; xent: 0.22; lr: 0.00010; 11028/10381 tok/s;   6035 sec
[2021-04-25 00:13:25,655 INFO] Step 32350/50000; acc:  94.30; ppl:  1.25; xent: 0.23; lr: 0.00010; 10913/10358 tok/s;   6044 sec
[2021-04-25 00:13:35,048 INFO] Step 32400/50000; acc:  94.54; ppl:  1.25; xent: 0.22; lr: 0.00010; 11144/10484 tok/s;   6053 sec
[2021-04-25 00:13:44,358 INFO] Step 32450/50000; acc:  94.51; ppl:  1.24; xent: 0.22; lr: 0.00010; 11021/10405 tok/s;   6062 sec
[2021-04-25 00:13:53,686 INFO] Step 32500/50000; acc:  94.40; ppl:  1.25; xent: 0.22; lr: 0.00010; 11017/10366 tok/s;   6072 sec
[2021-04-25 00:14:02,941 INFO] Step 32550/50000; acc:  94.32; ppl:  1.25; xent: 0.22; lr: 0.00010; 10917/10266 tok/s;   6081 sec
[2021-04-25 00:14:12,178 INFO] Step 32600/50000; acc:  94.55; ppl:  1.24; xent: 0.22; lr: 0.00010; 11015/10435 tok/s;   6090 sec
[2021-04-25 00:14:21,334 INFO] Step 32650/50000; acc:  94.45; ppl:  1.25; xent: 0.22; lr: 0.00010; 11044/10439 tok/s;   6099 sec
[2021-04-25 00:14:30,589 INFO] Step 32700/50000; acc:  94.33; ppl:  1.24; xent: 0.22; lr: 0.00010; 10935/10257 tok/s;   6109 sec
[2021-04-25 00:14:39,789 INFO] Step 32750/50000; acc:  94.51; ppl:  1.25; xent: 0.22; lr: 0.00010; 11047/10415 tok/s;   6118 sec
[2021-04-25 00:14:48,993 INFO] Step 32800/50000; acc:  94.45; ppl:  1.25; xent: 0.22; lr: 0.00010; 10978/10336 tok/s;   6127 sec
[2021-04-25 00:14:49,002 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 00:14:58,410 INFO] Step 32850/50000; acc:  94.45; ppl:  1.25; xent: 0.22; lr: 0.00010; 10917/10236 tok/s;   6136 sec
[2021-04-25 00:15:07,615 INFO] Step 32900/50000; acc:  94.51; ppl:  1.25; xent: 0.22; lr: 0.00010; 11055/10460 tok/s;   6146 sec
[2021-04-25 00:15:16,867 INFO] Step 32950/50000; acc:  94.32; ppl:  1.25; xent: 0.22; lr: 0.00010; 11032/10444 tok/s;   6155 sec
[2021-04-25 00:15:26,006 INFO] Step 33000/50000; acc:  94.34; ppl:  1.25; xent: 0.22; lr: 0.00010; 10970/10356 tok/s;   6164 sec
[2021-04-25 00:15:35,189 INFO] Step 33050/50000; acc:  94.32; ppl:  1.25; xent: 0.22; lr: 0.00010; 10920/10415 tok/s;   6173 sec
[2021-04-25 00:15:44,534 INFO] Step 33100/50000; acc:  94.50; ppl:  1.24; xent: 0.22; lr: 0.00010; 11043/10349 tok/s;   6183 sec
[2021-04-25 00:15:53,800 INFO] Step 33150/50000; acc:  94.54; ppl:  1.24; xent: 0.22; lr: 0.00010; 10870/10346 tok/s;   6192 sec
[2021-04-25 00:16:03,255 INFO] Step 33200/50000; acc:  94.57; ppl:  1.24; xent: 0.21; lr: 0.00010; 11068/10382 tok/s;   6201 sec
[2021-04-25 00:16:12,588 INFO] Step 33250/50000; acc:  94.33; ppl:  1.25; xent: 0.22; lr: 0.00010; 10767/10154 tok/s;   6211 sec
[2021-04-25 00:16:21,981 INFO] Step 33300/50000; acc:  94.54; ppl:  1.24; xent: 0.22; lr: 0.00010; 11031/10330 tok/s;   6220 sec
[2021-04-25 00:16:31,203 INFO] Step 33350/50000; acc:  94.47; ppl:  1.24; xent: 0.22; lr: 0.00010; 10918/10421 tok/s;   6229 sec
[2021-04-25 00:16:40,493 INFO] Step 33400/50000; acc:  94.48; ppl:  1.24; xent: 0.22; lr: 0.00010; 11025/10333 tok/s;   6239 sec
[2021-04-25 00:16:49,652 INFO] Step 33450/50000; acc:  94.38; ppl:  1.24; xent: 0.22; lr: 0.00010; 11004/10396 tok/s;   6248 sec
[2021-04-25 00:16:58,764 INFO] Step 33500/50000; acc:  94.47; ppl:  1.24; xent: 0.22; lr: 0.00010; 10950/10297 tok/s;   6257 sec
[2021-04-25 00:17:05,403 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 00:17:08,072 INFO] Step 33550/50000; acc:  94.46; ppl:  1.25; xent: 0.22; lr: 0.00010; 11050/10380 tok/s;   6266 sec
[2021-04-25 00:17:17,261 INFO] Step 33600/50000; acc:  94.41; ppl:  1.24; xent: 0.22; lr: 0.00010; 10990/10371 tok/s;   6275 sec
[2021-04-25 00:17:26,535 INFO] Step 33650/50000; acc:  94.54; ppl:  1.25; xent: 0.22; lr: 0.00010; 11074/10451 tok/s;   6285 sec
[2021-04-25 00:17:35,685 INFO] Step 33700/50000; acc:  94.34; ppl:  1.25; xent: 0.22; lr: 0.00010; 11014/10503 tok/s;   6294 sec
[2021-04-25 00:17:45,052 INFO] Step 33750/50000; acc:  94.48; ppl:  1.24; xent: 0.22; lr: 0.00010; 10983/10334 tok/s;   6303 sec
[2021-04-25 00:17:54,168 INFO] Step 33800/50000; acc:  94.37; ppl:  1.25; xent: 0.22; lr: 0.00010; 10976/10378 tok/s;   6312 sec
[2021-04-25 00:18:03,393 INFO] Step 33850/50000; acc:  94.45; ppl:  1.25; xent: 0.22; lr: 0.00010; 10944/10402 tok/s;   6321 sec
[2021-04-25 00:18:12,839 INFO] Step 33900/50000; acc:  94.71; ppl:  1.23; xent: 0.21; lr: 0.00010; 11076/10399 tok/s;   6331 sec
[2021-04-25 00:18:22,078 INFO] Step 33950/50000; acc:  94.33; ppl:  1.25; xent: 0.22; lr: 0.00010; 10815/10263 tok/s;   6340 sec
[2021-04-25 00:18:31,527 INFO] Step 34000/50000; acc:  94.57; ppl:  1.24; xent: 0.22; lr: 0.00010; 10958/10234 tok/s;   6350 sec
[2021-04-25 00:18:40,817 INFO] Step 34050/50000; acc:  94.59; ppl:  1.24; xent: 0.21; lr: 0.00010; 10890/10309 tok/s;   6359 sec
[2021-04-25 00:18:50,098 INFO] Step 34100/50000; acc:  94.53; ppl:  1.24; xent: 0.22; lr: 0.00010; 11118/10491 tok/s;   6368 sec
[2021-04-25 00:18:59,211 INFO] Step 34150/50000; acc:  94.55; ppl:  1.23; xent: 0.21; lr: 0.00010; 11099/10409 tok/s;   6377 sec
[2021-04-25 00:19:08,390 INFO] Step 34200/50000; acc:  94.43; ppl:  1.24; xent: 0.22; lr: 0.00010; 11053/10366 tok/s;   6386 sec
[2021-04-25 00:19:17,530 INFO] Step 34250/50000; acc:  94.41; ppl:  1.24; xent: 0.22; lr: 0.00010; 10994/10392 tok/s;   6396 sec
[2021-04-25 00:19:21,549 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 00:19:26,781 INFO] Step 34300/50000; acc:  94.42; ppl:  1.25; xent: 0.22; lr: 0.00010; 10892/10297 tok/s;   6405 sec
[2021-04-25 00:19:36,022 INFO] Step 34350/50000; acc:  94.74; ppl:  1.24; xent: 0.21; lr: 0.00010; 11153/10478 tok/s;   6414 sec
[2021-04-25 00:19:45,061 INFO] Step 34400/50000; acc:  94.41; ppl:  1.25; xent: 0.22; lr: 0.00010; 11103/10549 tok/s;   6423 sec
[2021-04-25 00:19:54,507 INFO] Step 34450/50000; acc:  94.42; ppl:  1.24; xent: 0.22; lr: 0.00010; 10835/10214 tok/s;   6433 sec
[2021-04-25 00:20:03,686 INFO] Step 34500/50000; acc:  94.51; ppl:  1.24; xent: 0.22; lr: 0.00010; 11012/10458 tok/s;   6442 sec
[2021-04-25 00:20:13,007 INFO] Step 34550/50000; acc:  94.51; ppl:  1.24; xent: 0.22; lr: 0.00010; 11003/10344 tok/s;   6451 sec
[2021-04-25 00:20:22,371 INFO] Step 34600/50000; acc:  94.59; ppl:  1.24; xent: 0.21; lr: 0.00010; 10854/10337 tok/s;   6460 sec
[2021-04-25 00:20:31,538 INFO] Step 34650/50000; acc:  94.48; ppl:  1.24; xent: 0.22; lr: 0.00010; 11037/10344 tok/s;   6470 sec
[2021-04-25 00:20:41,045 INFO] Step 34700/50000; acc:  94.49; ppl:  1.24; xent: 0.22; lr: 0.00010; 10901/10268 tok/s;   6479 sec
[2021-04-25 00:20:50,198 INFO] Step 34750/50000; acc:  94.39; ppl:  1.24; xent: 0.22; lr: 0.00010; 10888/10302 tok/s;   6488 sec
[2021-04-25 00:20:59,577 INFO] Step 34800/50000; acc:  94.69; ppl:  1.24; xent: 0.21; lr: 0.00010; 11063/10382 tok/s;   6498 sec
[2021-04-25 00:21:08,773 INFO] Step 34850/50000; acc:  94.53; ppl:  1.24; xent: 0.22; lr: 0.00010; 10996/10425 tok/s;   6507 sec
[2021-04-25 00:21:18,142 INFO] Step 34900/50000; acc:  94.58; ppl:  1.23; xent: 0.21; lr: 0.00010; 11020/10319 tok/s;   6516 sec
[2021-04-25 00:21:27,232 INFO] Step 34950/50000; acc:  94.49; ppl:  1.24; xent: 0.22; lr: 0.00010; 11046/10376 tok/s;   6525 sec
[2021-04-25 00:21:36,469 INFO] Step 35000/50000; acc:  94.40; ppl:  1.24; xent: 0.22; lr: 0.00010; 11032/10412 tok/s;   6535 sec
[2021-04-25 00:21:36,471 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/abstract_methods/valid_fixed.txt, align=None)...
[2021-04-25 00:21:45,267 INFO] Validation perplexity: 1.31621
[2021-04-25 00:21:45,267 INFO] Validation accuracy: 93.485
[2021-04-25 00:21:45,269 INFO] Saving checkpoint ../models/group1_params/control/model_step_35000.pt
[2021-04-25 00:21:47,046 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 00:21:55,028 INFO] Step 35050/50000; acc:  94.58; ppl:  1.24; xent: 0.22; lr: 0.00010; 5452/5142 tok/s;   6553 sec
[2021-04-25 00:22:04,003 INFO] Step 35100/50000; acc:  94.58; ppl:  1.24; xent: 0.21; lr: 0.00010; 11165/10522 tok/s;   6562 sec
[2021-04-25 00:22:13,254 INFO] Step 35150/50000; acc:  94.41; ppl:  1.25; xent: 0.22; lr: 0.00010; 11013/10473 tok/s;   6571 sec
[2021-04-25 00:22:22,362 INFO] Step 35200/50000; acc:  94.56; ppl:  1.24; xent: 0.21; lr: 0.00010; 11074/10485 tok/s;   6580 sec
[2021-04-25 00:22:31,693 INFO] Step 35250/50000; acc:  94.52; ppl:  1.24; xent: 0.22; lr: 0.00010; 10967/10343 tok/s;   6590 sec
[2021-04-25 00:22:40,869 INFO] Step 35300/50000; acc:  94.48; ppl:  1.24; xent: 0.22; lr: 0.00010; 11032/10432 tok/s;   6599 sec
[2021-04-25 00:22:50,470 INFO] Step 35350/50000; acc:  94.73; ppl:  1.23; xent: 0.21; lr: 0.00010; 10944/10289 tok/s;   6609 sec
[2021-04-25 00:22:59,680 INFO] Step 35400/50000; acc:  94.47; ppl:  1.24; xent: 0.21; lr: 0.00010; 10895/10292 tok/s;   6618 sec
[2021-04-25 00:23:08,920 INFO] Step 35450/50000; acc:  94.51; ppl:  1.24; xent: 0.22; lr: 0.00010; 10876/10310 tok/s;   6627 sec
[2021-04-25 00:23:18,369 INFO] Step 35500/50000; acc:  94.63; ppl:  1.23; xent: 0.21; lr: 0.00010; 10995/10295 tok/s;   6636 sec
[2021-04-25 00:23:27,444 INFO] Step 35550/50000; acc:  94.42; ppl:  1.24; xent: 0.22; lr: 0.00010; 10911/10439 tok/s;   6646 sec
[2021-04-25 00:23:36,844 INFO] Step 35600/50000; acc:  94.67; ppl:  1.23; xent: 0.21; lr: 0.00010; 11145/10412 tok/s;   6655 sec
[2021-04-25 00:23:46,087 INFO] Step 35650/50000; acc:  94.61; ppl:  1.23; xent: 0.21; lr: 0.00010; 10885/10201 tok/s;   6664 sec
[2021-04-25 00:23:55,364 INFO] Step 35700/50000; acc:  94.56; ppl:  1.24; xent: 0.21; lr: 0.00010; 11050/10383 tok/s;   6673 sec
[2021-04-25 00:23:57,425 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 00:24:04,535 INFO] Step 35750/50000; acc:  94.52; ppl:  1.24; xent: 0.21; lr: 0.00010; 11055/10402 tok/s;   6683 sec
[2021-04-25 00:24:13,841 INFO] Step 35800/50000; acc:  94.64; ppl:  1.24; xent: 0.21; lr: 0.00010; 10953/10358 tok/s;   6692 sec
[2021-04-25 00:24:22,922 INFO] Step 35850/50000; acc:  94.47; ppl:  1.24; xent: 0.22; lr: 0.00010; 11034/10444 tok/s;   6701 sec
[2021-04-25 00:24:32,029 INFO] Step 35900/50000; acc:  94.40; ppl:  1.24; xent: 0.22; lr: 0.00010; 11009/10479 tok/s;   6710 sec
[2021-04-25 00:24:41,390 INFO] Step 35950/50000; acc:  94.57; ppl:  1.24; xent: 0.21; lr: 0.00010; 10928/10293 tok/s;   6719 sec
[2021-04-25 00:24:50,468 INFO] Step 36000/50000; acc:  94.57; ppl:  1.24; xent: 0.21; lr: 0.00010; 11025/10430 tok/s;   6729 sec
[2021-04-25 00:24:59,864 INFO] Step 36050/50000; acc:  94.72; ppl:  1.23; xent: 0.21; lr: 0.00010; 11056/10433 tok/s;   6738 sec
[2021-04-25 00:25:09,181 INFO] Step 36100/50000; acc:  94.65; ppl:  1.23; xent: 0.21; lr: 0.00010; 11022/10401 tok/s;   6747 sec
[2021-04-25 00:25:18,566 INFO] Step 36150/50000; acc:  94.53; ppl:  1.24; xent: 0.21; lr: 0.00010; 10969/10316 tok/s;   6757 sec
[2021-04-25 00:25:27,742 INFO] Step 36200/50000; acc:  94.52; ppl:  1.24; xent: 0.22; lr: 0.00010; 10962/10319 tok/s;   6766 sec
[2021-04-25 00:25:36,879 INFO] Step 36250/50000; acc:  94.70; ppl:  1.23; xent: 0.21; lr: 0.00010; 10993/10429 tok/s;   6775 sec
[2021-04-25 00:25:46,233 INFO] Step 36300/50000; acc:  94.67; ppl:  1.23; xent: 0.21; lr: 0.00010; 11098/10441 tok/s;   6784 sec
[2021-04-25 00:25:55,388 INFO] Step 36350/50000; acc:  94.60; ppl:  1.23; xent: 0.21; lr: 0.00010; 10934/10324 tok/s;   6793 sec
[2021-04-25 00:26:04,717 INFO] Step 36400/50000; acc:  94.66; ppl:  1.24; xent: 0.21; lr: 0.00010; 11036/10325 tok/s;   6803 sec
[2021-04-25 00:26:13,471 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 00:26:13,955 INFO] Step 36450/50000; acc:  94.51; ppl:  1.24; xent: 0.21; lr: 0.00010; 10938/10321 tok/s;   6812 sec
[2021-04-25 00:26:23,341 INFO] Step 36500/50000; acc:  94.68; ppl:  1.24; xent: 0.21; lr: 0.00010; 10973/10291 tok/s;   6821 sec
[2021-04-25 00:26:32,511 INFO] Step 36550/50000; acc:  94.63; ppl:  1.24; xent: 0.21; lr: 0.00010; 11055/10442 tok/s;   6831 sec
[2021-04-25 00:26:41,627 INFO] Step 36600/50000; acc:  94.46; ppl:  1.24; xent: 0.21; lr: 0.00010; 11040/10497 tok/s;   6840 sec
[2021-04-25 00:26:50,852 INFO] Step 36650/50000; acc:  94.53; ppl:  1.24; xent: 0.22; lr: 0.00010; 10932/10298 tok/s;   6849 sec
[2021-04-25 00:27:00,081 INFO] Step 36700/50000; acc:  94.53; ppl:  1.24; xent: 0.21; lr: 0.00010; 10873/10380 tok/s;   6858 sec
[2021-04-25 00:27:09,354 INFO] Step 36750/50000; acc:  94.69; ppl:  1.24; xent: 0.21; lr: 0.00010; 11026/10358 tok/s;   6867 sec
[2021-04-25 00:27:18,656 INFO] Step 36800/50000; acc:  94.73; ppl:  1.23; xent: 0.21; lr: 0.00010; 10981/10422 tok/s;   6877 sec
[2021-04-25 00:27:28,122 INFO] Step 36850/50000; acc:  94.65; ppl:  1.23; xent: 0.21; lr: 0.00010; 10960/10311 tok/s;   6886 sec
[2021-04-25 00:27:37,430 INFO] Step 36900/50000; acc:  94.56; ppl:  1.24; xent: 0.21; lr: 0.00010; 10805/10153 tok/s;   6896 sec
[2021-04-25 00:27:46,840 INFO] Step 36950/50000; acc:  94.68; ppl:  1.23; xent: 0.21; lr: 0.00010; 11027/10330 tok/s;   6905 sec
[2021-04-25 00:27:56,021 INFO] Step 37000/50000; acc:  94.65; ppl:  1.23; xent: 0.21; lr: 0.00010; 10908/10434 tok/s;   6914 sec
[2021-04-25 00:28:05,220 INFO] Step 37050/50000; acc:  94.66; ppl:  1.23; xent: 0.21; lr: 0.00010; 10965/10287 tok/s;   6923 sec
[2021-04-25 00:28:14,600 INFO] Step 37100/50000; acc:  94.71; ppl:  1.23; xent: 0.21; lr: 0.00010; 11046/10403 tok/s;   6933 sec
[2021-04-25 00:28:23,642 INFO] Step 37150/50000; acc:  94.64; ppl:  1.23; xent: 0.21; lr: 0.00010; 10917/10280 tok/s;   6942 sec
[2021-04-25 00:28:29,959 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 00:28:33,034 INFO] Step 37200/50000; acc:  94.62; ppl:  1.23; xent: 0.21; lr: 0.00010; 11119/10377 tok/s;   6951 sec
[2021-04-25 00:28:42,282 INFO] Step 37250/50000; acc:  94.65; ppl:  1.24; xent: 0.21; lr: 0.00010; 10940/10356 tok/s;   6960 sec
[2021-04-25 00:28:51,483 INFO] Step 37300/50000; acc:  94.69; ppl:  1.23; xent: 0.21; lr: 0.00010; 11153/10510 tok/s;   6970 sec
[2021-04-25 00:29:00,595 INFO] Step 37350/50000; acc:  94.62; ppl:  1.24; xent: 0.21; lr: 0.00010; 11007/10502 tok/s;   6979 sec
[2021-04-25 00:29:09,933 INFO] Step 37400/50000; acc:  94.56; ppl:  1.23; xent: 0.21; lr: 0.00010; 10875/10277 tok/s;   6988 sec
[2021-04-25 00:29:19,054 INFO] Step 37450/50000; acc:  94.60; ppl:  1.24; xent: 0.21; lr: 0.00010; 11015/10413 tok/s;   6997 sec
[2021-04-25 00:29:28,353 INFO] Step 37500/50000; acc:  94.58; ppl:  1.23; xent: 0.21; lr: 0.00010; 10893/10329 tok/s;   7006 sec
[2021-04-25 00:29:37,749 INFO] Step 37550/50000; acc:  94.86; ppl:  1.22; xent: 0.20; lr: 0.00010; 11037/10355 tok/s;   7016 sec
[2021-04-25 00:29:47,076 INFO] Step 37600/50000; acc:  94.55; ppl:  1.24; xent: 0.21; lr: 0.00010; 10853/10306 tok/s;   7025 sec
[2021-04-25 00:29:56,457 INFO] Step 37650/50000; acc:  94.56; ppl:  1.23; xent: 0.21; lr: 0.00010; 10925/10233 tok/s;   7035 sec
[2021-04-25 00:30:05,711 INFO] Step 37700/50000; acc:  94.83; ppl:  1.22; xent: 0.20; lr: 0.00010; 10964/10355 tok/s;   7044 sec
[2021-04-25 00:30:14,946 INFO] Step 37750/50000; acc:  94.59; ppl:  1.23; xent: 0.21; lr: 0.00010; 11192/10550 tok/s;   7053 sec
[2021-04-25 00:30:24,059 INFO] Step 37800/50000; acc:  94.77; ppl:  1.22; xent: 0.20; lr: 0.00010; 11043/10405 tok/s;   7062 sec
[2021-04-25 00:30:33,078 INFO] Step 37850/50000; acc:  94.55; ppl:  1.24; xent: 0.21; lr: 0.00010; 11096/10383 tok/s;   7071 sec
[2021-04-25 00:30:42,437 INFO] Step 37900/50000; acc:  94.57; ppl:  1.23; xent: 0.21; lr: 0.00010; 11017/10396 tok/s;   7081 sec
[2021-04-25 00:30:46,042 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 00:30:51,640 INFO] Step 37950/50000; acc:  94.62; ppl:  1.23; xent: 0.21; lr: 0.00010; 10845/10255 tok/s;   7090 sec
[2021-04-25 00:31:01,029 INFO] Step 38000/50000; acc:  94.80; ppl:  1.23; xent: 0.21; lr: 0.00010; 11096/10428 tok/s;   7099 sec
[2021-04-25 00:31:10,106 INFO] Step 38050/50000; acc:  94.59; ppl:  1.23; xent: 0.21; lr: 0.00010; 11075/10503 tok/s;   7108 sec
[2021-04-25 00:31:19,525 INFO] Step 38100/50000; acc:  94.60; ppl:  1.23; xent: 0.21; lr: 0.00010; 10876/10234 tok/s;   7118 sec
[2021-04-25 00:31:28,643 INFO] Step 38150/50000; acc:  94.64; ppl:  1.23; xent: 0.21; lr: 0.00010; 11040/10514 tok/s;   7127 sec
[2021-04-25 00:31:37,852 INFO] Step 38200/50000; acc:  94.53; ppl:  1.24; xent: 0.21; lr: 0.00010; 10992/10383 tok/s;   7136 sec
[2021-04-25 00:31:47,183 INFO] Step 38250/50000; acc:  94.83; ppl:  1.22; xent: 0.20; lr: 0.00010; 10962/10370 tok/s;   7145 sec
[2021-04-25 00:31:56,301 INFO] Step 38300/50000; acc:  94.68; ppl:  1.23; xent: 0.21; lr: 0.00010; 11110/10440 tok/s;   7154 sec
[2021-04-25 00:32:05,743 INFO] Step 38350/50000; acc:  94.60; ppl:  1.23; xent: 0.21; lr: 0.00010; 10853/10278 tok/s;   7164 sec
[2021-04-25 00:32:14,990 INFO] Step 38400/50000; acc:  94.65; ppl:  1.23; xent: 0.21; lr: 0.00010; 10950/10308 tok/s;   7173 sec
[2021-04-25 00:32:24,266 INFO] Step 38450/50000; acc:  94.85; ppl:  1.23; xent: 0.20; lr: 0.00010; 11067/10417 tok/s;   7182 sec
[2021-04-25 00:32:33,429 INFO] Step 38500/50000; acc:  94.75; ppl:  1.23; xent: 0.20; lr: 0.00010; 11080/10471 tok/s;   7192 sec
[2021-04-25 00:32:42,822 INFO] Step 38550/50000; acc:  94.79; ppl:  1.22; xent: 0.20; lr: 0.00010; 10988/10279 tok/s;   7201 sec
[2021-04-25 00:32:51,919 INFO] Step 38600/50000; acc:  94.61; ppl:  1.23; xent: 0.21; lr: 0.00010; 10990/10380 tok/s;   7210 sec
[2021-04-25 00:33:00,996 INFO] Step 38650/50000; acc:  94.60; ppl:  1.23; xent: 0.21; lr: 0.00010; 11073/10437 tok/s;   7219 sec
[2021-04-25 00:33:02,104 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 00:33:10,517 INFO] Step 38700/50000; acc:  94.85; ppl:  1.23; xent: 0.20; lr: 0.00010; 10924/10239 tok/s;   7229 sec
[2021-04-25 00:33:19,428 INFO] Step 38750/50000; acc:  94.63; ppl:  1.23; xent: 0.21; lr: 0.00010; 11105/10520 tok/s;   7238 sec
[2021-04-25 00:33:28,711 INFO] Step 38800/50000; acc:  94.62; ppl:  1.23; xent: 0.21; lr: 0.00010; 11140/10537 tok/s;   7247 sec
[2021-04-25 00:33:37,919 INFO] Step 38850/50000; acc:  94.58; ppl:  1.23; xent: 0.21; lr: 0.00010; 10949/10386 tok/s;   7256 sec
[2021-04-25 00:33:47,197 INFO] Step 38900/50000; acc:  94.76; ppl:  1.23; xent: 0.21; lr: 0.00010; 11046/10427 tok/s;   7265 sec
[2021-04-25 00:33:56,365 INFO] Step 38950/50000; acc:  94.66; ppl:  1.23; xent: 0.21; lr: 0.00010; 11004/10401 tok/s;   7274 sec
[2021-04-25 00:34:05,811 INFO] Step 39000/50000; acc:  94.88; ppl:  1.22; xent: 0.20; lr: 0.00010; 10974/10348 tok/s;   7284 sec
[2021-04-25 00:34:15,015 INFO] Step 39050/50000; acc:  94.66; ppl:  1.23; xent: 0.20; lr: 0.00010; 10956/10345 tok/s;   7293 sec
[2021-04-25 00:34:24,171 INFO] Step 39100/50000; acc:  94.64; ppl:  1.23; xent: 0.21; lr: 0.00010; 10986/10408 tok/s;   7302 sec
[2021-04-25 00:34:33,551 INFO] Step 39150/50000; acc:  94.71; ppl:  1.23; xent: 0.20; lr: 0.00010; 10957/10251 tok/s;   7312 sec
[2021-04-25 00:34:42,731 INFO] Step 39200/50000; acc:  94.63; ppl:  1.23; xent: 0.21; lr: 0.00010; 10955/10481 tok/s;   7321 sec
[2021-04-25 00:34:52,032 INFO] Step 39250/50000; acc:  94.84; ppl:  1.22; xent: 0.20; lr: 0.00010; 11156/10400 tok/s;   7330 sec
[2021-04-25 00:35:01,304 INFO] Step 39300/50000; acc:  94.67; ppl:  1.23; xent: 0.20; lr: 0.00010; 10896/10256 tok/s;   7339 sec
[2021-04-25 00:35:10,619 INFO] Step 39350/50000; acc:  94.74; ppl:  1.23; xent: 0.20; lr: 0.00010; 11013/10321 tok/s;   7349 sec
[2021-04-25 00:35:12,277 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 00:35:19,721 INFO] Step 39400/50000; acc:  94.63; ppl:  1.23; xent: 0.21; lr: 0.00010; 11069/10418 tok/s;   7358 sec
[2021-04-25 00:35:28,830 INFO] Step 39450/50000; acc:  94.79; ppl:  1.23; xent: 0.20; lr: 0.00010; 11048/10465 tok/s;   7367 sec
[2021-04-25 00:35:38,103 INFO] Step 39500/50000; acc:  94.76; ppl:  1.23; xent: 0.21; lr: 0.00010; 11076/10467 tok/s;   7376 sec
[2021-04-25 00:35:47,219 INFO] Step 39550/50000; acc:  94.55; ppl:  1.24; xent: 0.21; lr: 0.00010; 10894/10348 tok/s;   7385 sec
[2021-04-25 00:35:56,629 INFO] Step 39600/50000; acc:  94.76; ppl:  1.23; xent: 0.20; lr: 0.00010; 11021/10376 tok/s;   7395 sec
[2021-04-25 00:36:05,755 INFO] Step 39650/50000; acc:  94.74; ppl:  1.23; xent: 0.21; lr: 0.00010; 10982/10419 tok/s;   7404 sec
[2021-04-25 00:36:15,155 INFO] Step 39700/50000; acc:  94.90; ppl:  1.22; xent: 0.20; lr: 0.00010; 11066/10425 tok/s;   7413 sec
[2021-04-25 00:36:24,409 INFO] Step 39750/50000; acc:  94.92; ppl:  1.22; xent: 0.20; lr: 0.00010; 11022/10407 tok/s;   7422 sec
[2021-04-25 00:36:33,767 INFO] Step 39800/50000; acc:  94.58; ppl:  1.23; xent: 0.21; lr: 0.00010; 10875/10260 tok/s;   7432 sec
[2021-04-25 00:36:42,958 INFO] Step 39850/50000; acc:  94.65; ppl:  1.23; xent: 0.21; lr: 0.00010; 10981/10335 tok/s;   7441 sec
[2021-04-25 00:36:52,130 INFO] Step 39900/50000; acc:  94.83; ppl:  1.22; xent: 0.20; lr: 0.00010; 10982/10398 tok/s;   7450 sec
[2021-04-25 00:37:01,439 INFO] Step 39950/50000; acc:  94.78; ppl:  1.22; xent: 0.20; lr: 0.00010; 11042/10420 tok/s;   7460 sec
[2021-04-25 00:37:10,650 INFO] Step 40000/50000; acc:  94.79; ppl:  1.22; xent: 0.20; lr: 0.00010; 11040/10380 tok/s;   7469 sec
[2021-04-25 00:37:10,652 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/abstract_methods/valid_fixed.txt, align=None)...
[2021-04-25 00:37:19,431 INFO] Validation perplexity: 1.32237
[2021-04-25 00:37:19,431 INFO] Validation accuracy: 93.5725
[2021-04-25 00:37:19,433 INFO] Saving checkpoint ../models/group1_params/control/model_step_40000.pt
[2021-04-25 00:37:29,203 INFO] Step 40050/50000; acc:  94.72; ppl:  1.23; xent: 0.20; lr: 0.00010; 5484/5157 tok/s;   7487 sec
[2021-04-25 00:37:37,630 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 00:37:38,421 INFO] Step 40100/50000; acc:  94.68; ppl:  1.23; xent: 0.20; lr: 0.00010; 11009/10348 tok/s;   7496 sec
[2021-04-25 00:37:47,866 INFO] Step 40150/50000; acc:  94.75; ppl:  1.22; xent: 0.20; lr: 0.00010; 10903/10250 tok/s;   7506 sec
[2021-04-25 00:37:57,005 INFO] Step 40200/50000; acc:  94.75; ppl:  1.23; xent: 0.21; lr: 0.00010; 11029/10400 tok/s;   7515 sec
[2021-04-25 00:38:06,042 INFO] Step 40250/50000; acc:  94.67; ppl:  1.23; xent: 0.21; lr: 0.00010; 11011/10497 tok/s;   7524 sec
[2021-04-25 00:38:15,432 INFO] Step 40300/50000; acc:  94.77; ppl:  1.23; xent: 0.20; lr: 0.00010; 11010/10330 tok/s;   7534 sec
[2021-04-25 00:38:24,557 INFO] Step 40350/50000; acc:  94.60; ppl:  1.23; xent: 0.21; lr: 0.00010; 10873/10383 tok/s;   7543 sec
[2021-04-25 00:38:33,926 INFO] Step 40400/50000; acc:  94.83; ppl:  1.23; xent: 0.20; lr: 0.00010; 11086/10429 tok/s;   7552 sec
[2021-04-25 00:38:43,282 INFO] Step 40450/50000; acc:  94.86; ppl:  1.22; xent: 0.20; lr: 0.00010; 10911/10357 tok/s;   7561 sec
[2021-04-25 00:38:52,723 INFO] Step 40500/50000; acc:  94.75; ppl:  1.22; xent: 0.20; lr: 0.00010; 10999/10319 tok/s;   7571 sec
[2021-04-25 00:39:02,011 INFO] Step 40550/50000; acc:  94.68; ppl:  1.23; xent: 0.20; lr: 0.00010; 10790/10135 tok/s;   7580 sec
[2021-04-25 00:39:11,388 INFO] Step 40600/50000; acc:  94.81; ppl:  1.22; xent: 0.20; lr: 0.00010; 10916/10283 tok/s;   7589 sec
[2021-04-25 00:39:20,574 INFO] Step 40650/50000; acc:  94.84; ppl:  1.22; xent: 0.20; lr: 0.00010; 10960/10418 tok/s;   7599 sec
[2021-04-25 00:39:29,774 INFO] Step 40700/50000; acc:  94.81; ppl:  1.22; xent: 0.20; lr: 0.00010; 10978/10309 tok/s;   7608 sec
[2021-04-25 00:39:39,068 INFO] Step 40750/50000; acc:  94.79; ppl:  1.22; xent: 0.20; lr: 0.00010; 11025/10401 tok/s;   7617 sec
[2021-04-25 00:39:48,198 INFO] Step 40800/50000; acc:  94.75; ppl:  1.22; xent: 0.20; lr: 0.00010; 10971/10311 tok/s;   7626 sec
[2021-04-25 00:39:54,091 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 00:39:57,532 INFO] Step 40850/50000; acc:  94.81; ppl:  1.22; xent: 0.20; lr: 0.00010; 11076/10366 tok/s;   7636 sec
[2021-04-25 00:40:06,781 INFO] Step 40900/50000; acc:  94.81; ppl:  1.22; xent: 0.20; lr: 0.00010; 11004/10430 tok/s;   7645 sec
[2021-04-25 00:40:15,972 INFO] Step 40950/50000; acc:  94.82; ppl:  1.22; xent: 0.20; lr: 0.00010; 11154/10496 tok/s;   7654 sec
[2021-04-25 00:40:25,114 INFO] Step 41000/50000; acc:  94.71; ppl:  1.23; xent: 0.20; lr: 0.00010; 10914/10424 tok/s;   7663 sec
[2021-04-25 00:40:34,360 INFO] Step 41050/50000; acc:  94.74; ppl:  1.23; xent: 0.20; lr: 0.00010; 10848/10276 tok/s;   7672 sec
[2021-04-25 00:40:43,644 INFO] Step 41100/50000; acc:  94.80; ppl:  1.23; xent: 0.20; lr: 0.00010; 11113/10427 tok/s;   7682 sec
[2021-04-25 00:40:52,849 INFO] Step 41150/50000; acc:  94.70; ppl:  1.23; xent: 0.20; lr: 0.00010; 10866/10340 tok/s;   7691 sec
[2021-04-25 00:41:02,332 INFO] Step 41200/50000; acc:  95.13; ppl:  1.21; xent: 0.19; lr: 0.00010; 11108/10390 tok/s;   7700 sec
[2021-04-25 00:41:11,720 INFO] Step 41250/50000; acc:  94.66; ppl:  1.23; xent: 0.20; lr: 0.00010; 10763/10218 tok/s;   7710 sec
[2021-04-25 00:41:21,132 INFO] Step 41300/50000; acc:  94.87; ppl:  1.22; xent: 0.20; lr: 0.00010; 10929/10232 tok/s;   7719 sec
[2021-04-25 00:41:30,367 INFO] Step 41350/50000; acc:  94.94; ppl:  1.22; xent: 0.20; lr: 0.00010; 10928/10351 tok/s;   7728 sec
[2021-04-25 00:41:39,563 INFO] Step 41400/50000; acc:  94.83; ppl:  1.22; xent: 0.20; lr: 0.00010; 11098/10452 tok/s;   7738 sec
[2021-04-25 00:41:48,727 INFO] Step 41450/50000; acc:  94.89; ppl:  1.22; xent: 0.19; lr: 0.00010; 11027/10399 tok/s;   7747 sec
[2021-04-25 00:41:57,840 INFO] Step 41500/50000; acc:  94.77; ppl:  1.23; xent: 0.20; lr: 0.00010; 10986/10292 tok/s;   7756 sec
[2021-04-25 00:42:07,129 INFO] Step 41550/50000; acc:  94.78; ppl:  1.22; xent: 0.20; lr: 0.00010; 11003/10434 tok/s;   7765 sec
[2021-04-25 00:42:10,403 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 00:42:16,374 INFO] Step 41600/50000; acc:  94.82; ppl:  1.22; xent: 0.20; lr: 0.00010; 10930/10275 tok/s;   7774 sec
[2021-04-25 00:42:25,671 INFO] Step 41650/50000; acc:  94.98; ppl:  1.22; xent: 0.20; lr: 0.00010; 11108/10464 tok/s;   7784 sec
[2021-04-25 00:42:34,790 INFO] Step 41700/50000; acc:  94.75; ppl:  1.23; xent: 0.20; lr: 0.00010; 11063/10476 tok/s;   7793 sec
[2021-04-25 00:42:44,213 INFO] Step 41750/50000; acc:  94.79; ppl:  1.22; xent: 0.20; lr: 0.00010; 10887/10249 tok/s;   7802 sec
[2021-04-25 00:42:53,316 INFO] Step 41800/50000; acc:  94.79; ppl:  1.22; xent: 0.20; lr: 0.00010; 11006/10508 tok/s;   7811 sec
[2021-04-25 00:43:02,402 INFO] Step 41850/50000; acc:  94.75; ppl:  1.23; xent: 0.20; lr: 0.00010; 10998/10380 tok/s;   7820 sec
[2021-04-25 00:43:11,992 INFO] Step 41900/50000; acc:  95.04; ppl:  1.21; xent: 0.19; lr: 0.00010; 10954/10318 tok/s;   7830 sec
[2021-04-25 00:43:21,033 INFO] Step 41950/50000; acc:  94.75; ppl:  1.22; xent: 0.20; lr: 0.00010; 11031/10405 tok/s;   7839 sec
[2021-04-25 00:43:30,615 INFO] Step 42000/50000; acc:  94.85; ppl:  1.22; xent: 0.20; lr: 0.00010; 10883/10221 tok/s;   7849 sec
[2021-04-25 00:43:39,893 INFO] Step 42050/50000; acc:  94.91; ppl:  1.22; xent: 0.20; lr: 0.00010; 10903/10271 tok/s;   7858 sec
[2021-04-25 00:43:49,155 INFO] Step 42100/50000; acc:  94.94; ppl:  1.22; xent: 0.20; lr: 0.00010; 11096/10481 tok/s;   7867 sec
[2021-04-25 00:43:58,291 INFO] Step 42150/50000; acc:  94.90; ppl:  1.22; xent: 0.20; lr: 0.00010; 11079/10487 tok/s;   7876 sec
[2021-04-25 00:44:07,669 INFO] Step 42200/50000; acc:  94.81; ppl:  1.22; xent: 0.20; lr: 0.00010; 10857/10175 tok/s;   7886 sec
[2021-04-25 00:44:16,808 INFO] Step 42250/50000; acc:  94.77; ppl:  1.22; xent: 0.20; lr: 0.00010; 10991/10368 tok/s;   7895 sec
[2021-04-25 00:44:20,775 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 00:44:25,918 INFO] Step 42300/50000; acc:  94.74; ppl:  1.22; xent: 0.20; lr: 0.00010; 11069/10424 tok/s;   7904 sec
[2021-04-25 00:44:35,321 INFO] Step 42350/50000; acc:  94.95; ppl:  1.22; xent: 0.20; lr: 0.00010; 10929/10293 tok/s;   7913 sec
[2021-04-25 00:44:44,312 INFO] Step 42400/50000; acc:  94.85; ppl:  1.22; xent: 0.20; lr: 0.00010; 11169/10535 tok/s;   7922 sec
[2021-04-25 00:44:53,586 INFO] Step 42450/50000; acc:  94.75; ppl:  1.22; xent: 0.20; lr: 0.00010; 11042/10463 tok/s;   7932 sec
[2021-04-25 00:45:02,792 INFO] Step 42500/50000; acc:  94.74; ppl:  1.22; xent: 0.20; lr: 0.00010; 10993/10434 tok/s;   7941 sec
[2021-04-25 00:45:12,061 INFO] Step 42550/50000; acc:  94.87; ppl:  1.22; xent: 0.20; lr: 0.00010; 11065/10427 tok/s;   7950 sec
[2021-04-25 00:45:21,217 INFO] Step 42600/50000; acc:  94.83; ppl:  1.22; xent: 0.20; lr: 0.00010; 10971/10357 tok/s;   7959 sec
[2021-04-25 00:45:30,567 INFO] Step 42650/50000; acc:  95.01; ppl:  1.21; xent: 0.19; lr: 0.00010; 10955/10366 tok/s;   7969 sec
[2021-04-25 00:45:39,965 INFO] Step 42700/50000; acc:  94.83; ppl:  1.22; xent: 0.20; lr: 0.00010; 10998/10352 tok/s;   7978 sec
[2021-04-25 00:45:49,081 INFO] Step 42750/50000; acc:  94.81; ppl:  1.22; xent: 0.20; lr: 0.00010; 10908/10331 tok/s;   7987 sec
[2021-04-25 00:45:58,540 INFO] Step 42800/50000; acc:  95.02; ppl:  1.21; xent: 0.19; lr: 0.00010; 11022/10290 tok/s;   7997 sec
[2021-04-25 00:46:07,798 INFO] Step 42850/50000; acc:  94.81; ppl:  1.22; xent: 0.20; lr: 0.00010; 10897/10433 tok/s;   8006 sec
[2021-04-25 00:46:17,107 INFO] Step 42900/50000; acc:  94.95; ppl:  1.21; xent: 0.19; lr: 0.00010; 11136/10358 tok/s;   8015 sec
[2021-04-25 00:46:26,271 INFO] Step 42950/50000; acc:  94.96; ppl:  1.21; xent: 0.19; lr: 0.00010; 10962/10370 tok/s;   8024 sec
[2021-04-25 00:46:35,534 INFO] Step 43000/50000; acc:  94.77; ppl:  1.22; xent: 0.20; lr: 0.00010; 10937/10262 tok/s;   8034 sec
[2021-04-25 00:46:36,865 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 00:46:44,754 INFO] Step 43050/50000; acc:  94.84; ppl:  1.22; xent: 0.20; lr: 0.00010; 10967/10321 tok/s;   8043 sec
[2021-04-25 00:46:53,885 INFO] Step 43100/50000; acc:  94.92; ppl:  1.22; xent: 0.20; lr: 0.00010; 11047/10436 tok/s;   8052 sec
[2021-04-25 00:47:03,145 INFO] Step 43150/50000; acc:  94.90; ppl:  1.22; xent: 0.20; lr: 0.00010; 10979/10411 tok/s;   8061 sec
[2021-04-25 00:47:12,315 INFO] Step 43200/50000; acc:  94.79; ppl:  1.22; xent: 0.20; lr: 0.00010; 10991/10400 tok/s;   8070 sec
[2021-04-25 00:47:21,740 INFO] Step 43250/50000; acc:  94.92; ppl:  1.22; xent: 0.20; lr: 0.00010; 10893/10294 tok/s;   8080 sec
[2021-04-25 00:47:30,851 INFO] Step 43300/50000; acc:  94.94; ppl:  1.22; xent: 0.20; lr: 0.00010; 11031/10393 tok/s;   8089 sec
[2021-04-25 00:47:40,395 INFO] Step 43350/50000; acc:  94.98; ppl:  1.22; xent: 0.20; lr: 0.00010; 10945/10351 tok/s;   8098 sec
[2021-04-25 00:47:49,612 INFO] Step 43400/50000; acc:  95.01; ppl:  1.21; xent: 0.19; lr: 0.00010; 10986/10383 tok/s;   8108 sec
[2021-04-25 00:47:58,942 INFO] Step 43450/50000; acc:  94.71; ppl:  1.22; xent: 0.20; lr: 0.00010; 10781/10193 tok/s;   8117 sec
[2021-04-25 00:48:08,338 INFO] Step 43500/50000; acc:  94.87; ppl:  1.22; xent: 0.20; lr: 0.00010; 11022/10304 tok/s;   8126 sec
[2021-04-25 00:48:17,384 INFO] Step 43550/50000; acc:  94.95; ppl:  1.22; xent: 0.20; lr: 0.00010; 10977/10465 tok/s;   8135 sec
[2021-04-25 00:48:26,838 INFO] Step 43600/50000; acc:  94.95; ppl:  1.21; xent: 0.19; lr: 0.00010; 11051/10411 tok/s;   8145 sec
[2021-04-25 00:48:36,064 INFO] Step 43650/50000; acc:  94.91; ppl:  1.21; xent: 0.19; lr: 0.00010; 11013/10314 tok/s;   8154 sec
[2021-04-25 00:48:45,332 INFO] Step 43700/50000; acc:  94.97; ppl:  1.21; xent: 0.19; lr: 0.00010; 11010/10311 tok/s;   8163 sec
[2021-04-25 00:48:53,333 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 00:48:54,543 INFO] Step 43750/50000; acc:  94.77; ppl:  1.22; xent: 0.20; lr: 0.00010; 10959/10347 tok/s;   8173 sec
[2021-04-25 00:49:03,920 INFO] Step 43800/50000; acc:  94.96; ppl:  1.21; xent: 0.19; lr: 0.00010; 10842/10229 tok/s;   8182 sec
[2021-04-25 00:49:13,083 INFO] Step 43850/50000; acc:  94.85; ppl:  1.22; xent: 0.20; lr: 0.00010; 11041/10423 tok/s;   8191 sec
[2021-04-25 00:49:22,134 INFO] Step 43900/50000; acc:  94.79; ppl:  1.22; xent: 0.20; lr: 0.00010; 11009/10511 tok/s;   8200 sec
[2021-04-25 00:49:31,535 INFO] Step 43950/50000; acc:  94.89; ppl:  1.22; xent: 0.20; lr: 0.00010; 10891/10203 tok/s;   8210 sec
[2021-04-25 00:49:40,712 INFO] Step 44000/50000; acc:  94.89; ppl:  1.22; xent: 0.20; lr: 0.00010; 10972/10440 tok/s;   8219 sec
[2021-04-25 00:49:50,097 INFO] Step 44050/50000; acc:  94.89; ppl:  1.22; xent: 0.20; lr: 0.00010; 10965/10363 tok/s;   8228 sec
[2021-04-25 00:49:59,402 INFO] Step 44100/50000; acc:  95.11; ppl:  1.21; xent: 0.19; lr: 0.00010; 11004/10409 tok/s;   8237 sec
[2021-04-25 00:50:08,920 INFO] Step 44150/50000; acc:  94.85; ppl:  1.22; xent: 0.20; lr: 0.00010; 10908/10269 tok/s;   8247 sec
[2021-04-25 00:50:18,170 INFO] Step 44200/50000; acc:  94.88; ppl:  1.22; xent: 0.20; lr: 0.00010; 10802/10138 tok/s;   8256 sec
[2021-04-25 00:50:27,483 INFO] Step 44250/50000; acc:  95.03; ppl:  1.21; xent: 0.19; lr: 0.00010; 10847/10231 tok/s;   8266 sec
[2021-04-25 00:50:36,787 INFO] Step 44300/50000; acc:  95.04; ppl:  1.21; xent: 0.19; lr: 0.00010; 11108/10506 tok/s;   8275 sec
[2021-04-25 00:50:45,927 INFO] Step 44350/50000; acc:  94.94; ppl:  1.21; xent: 0.19; lr: 0.00010; 10926/10282 tok/s;   8284 sec
[2021-04-25 00:50:55,334 INFO] Step 44400/50000; acc:  95.01; ppl:  1.21; xent: 0.19; lr: 0.00010; 11030/10382 tok/s;   8293 sec
[2021-04-25 00:51:04,510 INFO] Step 44450/50000; acc:  94.87; ppl:  1.22; xent: 0.20; lr: 0.00010; 10940/10284 tok/s;   8303 sec
[2021-04-25 00:51:09,911 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 00:51:13,828 INFO] Step 44500/50000; acc:  94.98; ppl:  1.21; xent: 0.19; lr: 0.00010; 11090/10380 tok/s;   8312 sec
[2021-04-25 00:51:23,102 INFO] Step 44550/50000; acc:  94.95; ppl:  1.22; xent: 0.19; lr: 0.00010; 10952/10366 tok/s;   8321 sec
[2021-04-25 00:51:32,189 INFO] Step 44600/50000; acc:  94.92; ppl:  1.22; xent: 0.20; lr: 0.00010; 11110/10488 tok/s;   8330 sec
[2021-04-25 00:51:41,363 INFO] Step 44650/50000; acc:  94.85; ppl:  1.22; xent: 0.19; lr: 0.00010; 10940/10453 tok/s;   8339 sec
[2021-04-25 00:51:50,641 INFO] Step 44700/50000; acc:  94.88; ppl:  1.22; xent: 0.20; lr: 0.00010; 10823/10232 tok/s;   8349 sec
[2021-04-25 00:51:59,838 INFO] Step 44750/50000; acc:  95.00; ppl:  1.22; xent: 0.19; lr: 0.00010; 11083/10433 tok/s;   8358 sec
[2021-04-25 00:52:09,170 INFO] Step 44800/50000; acc:  94.93; ppl:  1.21; xent: 0.19; lr: 0.00010; 10917/10356 tok/s;   8367 sec
[2021-04-25 00:52:18,555 INFO] Step 44850/50000; acc:  95.11; ppl:  1.21; xent: 0.19; lr: 0.00010; 11094/10390 tok/s;   8377 sec
[2021-04-25 00:52:27,914 INFO] Step 44900/50000; acc:  94.84; ppl:  1.22; xent: 0.20; lr: 0.00010; 10831/10243 tok/s;   8386 sec
[2021-04-25 00:52:37,385 INFO] Step 44950/50000; acc:  94.95; ppl:  1.21; xent: 0.19; lr: 0.00010; 10879/10202 tok/s;   8395 sec
[2021-04-25 00:52:46,639 INFO] Step 45000/50000; acc:  95.07; ppl:  1.21; xent: 0.19; lr: 0.00010; 10850/10279 tok/s;   8405 sec
[2021-04-25 00:52:46,642 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/abstract_methods/valid_fixed.txt, align=None)...
[2021-04-25 00:52:55,468 INFO] Validation perplexity: 1.32551
[2021-04-25 00:52:55,469 INFO] Validation accuracy: 93.5049
[2021-04-25 00:52:55,471 INFO] Saving checkpoint ../models/group1_params/control/model_step_45000.pt
[2021-04-25 00:53:05,019 INFO] Step 45050/50000; acc:  94.97; ppl:  1.21; xent: 0.19; lr: 0.00010; 5464/5167 tok/s;   8423 sec
[2021-04-25 00:53:14,470 INFO] Step 45100/50000; acc:  95.06; ppl:  1.20; xent: 0.19; lr: 0.00010; 11003/10339 tok/s;   8433 sec
[2021-04-25 00:53:23,532 INFO] Step 45150/50000; acc:  94.94; ppl:  1.21; xent: 0.19; lr: 0.00010; 10917/10240 tok/s;   8442 sec
[2021-04-25 00:53:32,883 INFO] Step 45200/50000; acc:  94.95; ppl:  1.21; xent: 0.19; lr: 0.00010; 11103/10480 tok/s;   8451 sec
[2021-04-25 00:53:35,693 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 00:53:42,142 INFO] Step 45250/50000; acc:  94.93; ppl:  1.21; xent: 0.19; lr: 0.00010; 10923/10263 tok/s;   8460 sec
[2021-04-25 00:53:51,398 INFO] Step 45300/50000; acc:  95.03; ppl:  1.21; xent: 0.19; lr: 0.00010; 11146/10523 tok/s;   8469 sec
[2021-04-25 00:54:00,478 INFO] Step 45350/50000; acc:  94.92; ppl:  1.22; xent: 0.20; lr: 0.00010; 11057/10471 tok/s;   8479 sec
[2021-04-25 00:54:09,765 INFO] Step 45400/50000; acc:  94.86; ppl:  1.21; xent: 0.19; lr: 0.00010; 10898/10308 tok/s;   8488 sec
[2021-04-25 00:54:18,925 INFO] Step 45450/50000; acc:  95.01; ppl:  1.21; xent: 0.19; lr: 0.00010; 11000/10454 tok/s;   8497 sec
[2021-04-25 00:54:28,047 INFO] Step 45500/50000; acc:  94.85; ppl:  1.22; xent: 0.20; lr: 0.00010; 10969/10380 tok/s;   8506 sec
[2021-04-25 00:54:37,538 INFO] Step 45550/50000; acc:  95.16; ppl:  1.20; xent: 0.18; lr: 0.00010; 10981/10311 tok/s;   8516 sec
[2021-04-25 00:54:46,698 INFO] Step 45600/50000; acc:  94.89; ppl:  1.21; xent: 0.19; lr: 0.00010; 11039/10432 tok/s;   8525 sec
[2021-04-25 00:54:56,148 INFO] Step 45650/50000; acc:  95.00; ppl:  1.21; xent: 0.19; lr: 0.00010; 10914/10298 tok/s;   8534 sec
[2021-04-25 00:55:05,449 INFO] Step 45700/50000; acc:  95.06; ppl:  1.21; xent: 0.19; lr: 0.00010; 10918/10263 tok/s;   8544 sec
[2021-04-25 00:55:14,771 INFO] Step 45750/50000; acc:  95.05; ppl:  1.21; xent: 0.19; lr: 0.00010; 11028/10412 tok/s;   8553 sec
[2021-04-25 00:55:23,875 INFO] Step 45800/50000; acc:  95.01; ppl:  1.21; xent: 0.19; lr: 0.00010; 11068/10472 tok/s;   8562 sec
[2021-04-25 00:55:33,082 INFO] Step 45850/50000; acc:  94.97; ppl:  1.21; xent: 0.19; lr: 0.00010; 10916/10239 tok/s;   8571 sec
[2021-04-25 00:55:42,399 INFO] Step 45900/50000; acc:  95.00; ppl:  1.21; xent: 0.19; lr: 0.00010; 11051/10406 tok/s;   8580 sec
[2021-04-25 00:55:45,992 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 00:55:51,489 INFO] Step 45950/50000; acc:  94.75; ppl:  1.22; xent: 0.20; lr: 0.00010; 10985/10328 tok/s;   8590 sec
[2021-04-25 00:56:00,960 INFO] Step 46000/50000; acc:  95.07; ppl:  1.21; xent: 0.19; lr: 0.00010; 10996/10368 tok/s;   8599 sec
[2021-04-25 00:56:10,048 INFO] Step 46050/50000; acc:  94.98; ppl:  1.21; xent: 0.19; lr: 0.00010; 11070/10442 tok/s;   8608 sec
[2021-04-25 00:56:19,335 INFO] Step 46100/50000; acc:  94.97; ppl:  1.21; xent: 0.19; lr: 0.00010; 11022/10433 tok/s;   8617 sec
[2021-04-25 00:56:28,511 INFO] Step 46150/50000; acc:  94.95; ppl:  1.21; xent: 0.19; lr: 0.00010; 10998/10437 tok/s;   8627 sec
[2021-04-25 00:56:37,727 INFO] Step 46200/50000; acc:  94.98; ppl:  1.21; xent: 0.19; lr: 0.00010; 10966/10383 tok/s;   8636 sec
[2021-04-25 00:56:46,916 INFO] Step 46250/50000; acc:  94.97; ppl:  1.21; xent: 0.19; lr: 0.00010; 10999/10363 tok/s;   8645 sec
[2021-04-25 00:56:56,273 INFO] Step 46300/50000; acc:  95.11; ppl:  1.20; xent: 0.18; lr: 0.00010; 10949/10337 tok/s;   8654 sec
[2021-04-25 00:57:05,602 INFO] Step 46350/50000; acc:  95.00; ppl:  1.21; xent: 0.19; lr: 0.00010; 10982/10363 tok/s;   8664 sec
[2021-04-25 00:57:14,839 INFO] Step 46400/50000; acc:  94.98; ppl:  1.21; xent: 0.19; lr: 0.00010; 10935/10336 tok/s;   8673 sec
[2021-04-25 00:57:24,214 INFO] Step 46450/50000; acc:  95.15; ppl:  1.20; xent: 0.18; lr: 0.00010; 10987/10304 tok/s;   8682 sec
[2021-04-25 00:57:33,478 INFO] Step 46500/50000; acc:  95.04; ppl:  1.21; xent: 0.19; lr: 0.00010; 10932/10387 tok/s;   8692 sec
[2021-04-25 00:57:42,790 INFO] Step 46550/50000; acc:  95.14; ppl:  1.20; xent: 0.18; lr: 0.00010; 11145/10379 tok/s;   8701 sec
[2021-04-25 00:57:51,933 INFO] Step 46600/50000; acc:  95.05; ppl:  1.21; xent: 0.19; lr: 0.00010; 10914/10346 tok/s;   8710 sec
[2021-04-25 00:58:01,038 INFO] Step 46650/50000; acc:  94.96; ppl:  1.21; xent: 0.19; lr: 0.00010; 11011/10342 tok/s;   8719 sec
[2021-04-25 00:58:02,165 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 00:58:10,457 INFO] Step 46700/50000; acc:  95.02; ppl:  1.21; xent: 0.19; lr: 0.00010; 11002/10337 tok/s;   8729 sec
[2021-04-25 00:58:19,505 INFO] Step 46750/50000; acc:  95.07; ppl:  1.21; xent: 0.19; lr: 0.00010; 11035/10409 tok/s;   8738 sec
[2021-04-25 00:58:28,827 INFO] Step 46800/50000; acc:  95.05; ppl:  1.21; xent: 0.19; lr: 0.00010; 11047/10466 tok/s;   8747 sec
[2021-04-25 00:58:38,055 INFO] Step 46850/50000; acc:  94.87; ppl:  1.21; xent: 0.19; lr: 0.00010; 10934/10357 tok/s;   8756 sec
[2021-04-25 00:58:47,386 INFO] Step 46900/50000; acc:  95.09; ppl:  1.21; xent: 0.19; lr: 0.00010; 11021/10405 tok/s;   8765 sec
[2021-04-25 00:58:56,497 INFO] Step 46950/50000; acc:  94.90; ppl:  1.21; xent: 0.19; lr: 0.00010; 10980/10342 tok/s;   8775 sec
[2021-04-25 00:59:05,939 INFO] Step 47000/50000; acc:  95.12; ppl:  1.20; xent: 0.19; lr: 0.00010; 10924/10382 tok/s;   8784 sec
[2021-04-25 00:59:15,175 INFO] Step 47050/50000; acc:  95.07; ppl:  1.20; xent: 0.19; lr: 0.00010; 11005/10400 tok/s;   8793 sec
[2021-04-25 00:59:24,499 INFO] Step 47100/50000; acc:  94.98; ppl:  1.21; xent: 0.19; lr: 0.00010; 10789/10184 tok/s;   8803 sec
[2021-04-25 00:59:33,854 INFO] Step 47150/50000; acc:  95.06; ppl:  1.21; xent: 0.19; lr: 0.00010; 10980/10257 tok/s;   8812 sec
[2021-04-25 00:59:43,005 INFO] Step 47200/50000; acc:  95.18; ppl:  1.21; xent: 0.19; lr: 0.00010; 11011/10508 tok/s;   8821 sec
[2021-04-25 00:59:52,369 INFO] Step 47250/50000; acc:  95.05; ppl:  1.21; xent: 0.19; lr: 0.00010; 11045/10403 tok/s;   8830 sec
[2021-04-25 01:00:01,546 INFO] Step 47300/50000; acc:  95.04; ppl:  1.20; xent: 0.18; lr: 0.00010; 11083/10408 tok/s;   8840 sec
[2021-04-25 01:00:10,790 INFO] Step 47350/50000; acc:  95.16; ppl:  1.20; xent: 0.19; lr: 0.00010; 11051/10329 tok/s;   8849 sec
[2021-04-25 01:00:18,413 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 01:00:19,963 INFO] Step 47400/50000; acc:  94.94; ppl:  1.21; xent: 0.19; lr: 0.00010; 10970/10355 tok/s;   8858 sec
[2021-04-25 01:00:29,217 INFO] Step 47450/50000; acc:  95.12; ppl:  1.20; xent: 0.19; lr: 0.00010; 10852/10246 tok/s;   8867 sec
[2021-04-25 01:00:38,557 INFO] Step 47500/50000; acc:  95.10; ppl:  1.21; xent: 0.19; lr: 0.00010; 11092/10456 tok/s;   8877 sec
[2021-04-25 01:00:47,526 INFO] Step 47550/50000; acc:  94.96; ppl:  1.21; xent: 0.19; lr: 0.00010; 10998/10503 tok/s;   8886 sec
[2021-04-25 01:00:57,048 INFO] Step 47600/50000; acc:  95.02; ppl:  1.21; xent: 0.19; lr: 0.00010; 10916/10220 tok/s;   8895 sec
[2021-04-25 01:01:06,208 INFO] Step 47650/50000; acc:  95.05; ppl:  1.21; xent: 0.19; lr: 0.00010; 10991/10444 tok/s;   8904 sec
[2021-04-25 01:01:15,582 INFO] Step 47700/50000; acc:  95.01; ppl:  1.21; xent: 0.19; lr: 0.00010; 10996/10400 tok/s;   8914 sec
[2021-04-25 01:01:24,822 INFO] Step 47750/50000; acc:  95.32; ppl:  1.20; xent: 0.18; lr: 0.00010; 11059/10423 tok/s;   8923 sec
[2021-04-25 01:01:34,203 INFO] Step 47800/50000; acc:  94.94; ppl:  1.21; xent: 0.19; lr: 0.00010; 10891/10296 tok/s;   8932 sec
[2021-04-25 01:01:43,454 INFO] Step 47850/50000; acc:  95.05; ppl:  1.21; xent: 0.19; lr: 0.00010; 10869/10212 tok/s;   8942 sec
[2021-04-25 01:01:52,810 INFO] Step 47900/50000; acc:  95.25; ppl:  1.20; xent: 0.18; lr: 0.00010; 10798/10187 tok/s;   8951 sec
[2021-04-25 01:02:02,051 INFO] Step 47950/50000; acc:  95.18; ppl:  1.21; xent: 0.19; lr: 0.00010; 11074/10529 tok/s;   8960 sec
[2021-04-25 01:02:11,222 INFO] Step 48000/50000; acc:  95.13; ppl:  1.20; xent: 0.18; lr: 0.00010; 11044/10334 tok/s;   8969 sec
[2021-04-25 01:02:20,548 INFO] Step 48050/50000; acc:  95.10; ppl:  1.21; xent: 0.19; lr: 0.00010; 11015/10373 tok/s;   8979 sec
[2021-04-25 01:02:29,716 INFO] Step 48100/50000; acc:  95.01; ppl:  1.21; xent: 0.19; lr: 0.00010; 10992/10326 tok/s;   8988 sec
[2021-04-25 01:02:34,785 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 01:02:39,078 INFO] Step 48150/50000; acc:  95.11; ppl:  1.21; xent: 0.19; lr: 0.00010; 11039/10357 tok/s;   8997 sec
[2021-04-25 01:02:48,309 INFO] Step 48200/50000; acc:  95.21; ppl:  1.20; xent: 0.18; lr: 0.00010; 10968/10348 tok/s;   9006 sec
[2021-04-25 01:02:57,269 INFO] Step 48250/50000; acc:  95.03; ppl:  1.21; xent: 0.19; lr: 0.00010; 11121/10539 tok/s;   9015 sec
[2021-04-25 01:03:06,688 INFO] Step 48300/50000; acc:  95.05; ppl:  1.21; xent: 0.19; lr: 0.00010; 10919/10371 tok/s;   9025 sec
[2021-04-25 01:03:15,865 INFO] Step 48350/50000; acc:  94.97; ppl:  1.21; xent: 0.19; lr: 0.00010; 10820/10264 tok/s;   9034 sec
[2021-04-25 01:03:25,134 INFO] Step 48400/50000; acc:  95.15; ppl:  1.21; xent: 0.19; lr: 0.00010; 11151/10447 tok/s;   9043 sec
[2021-04-25 01:03:34,481 INFO] Step 48450/50000; acc:  95.02; ppl:  1.20; xent: 0.19; lr: 0.00010; 10915/10396 tok/s;   9053 sec
[2021-04-25 01:03:43,793 INFO] Step 48500/50000; acc:  95.31; ppl:  1.20; xent: 0.18; lr: 0.00010; 11189/10473 tok/s;   9062 sec
[2021-04-25 01:03:53,164 INFO] Step 48550/50000; acc:  95.00; ppl:  1.21; xent: 0.19; lr: 0.00010; 10781/10190 tok/s;   9071 sec
[2021-04-25 01:04:02,468 INFO] Step 48600/50000; acc:  95.14; ppl:  1.20; xent: 0.18; lr: 0.00010; 10922/10283 tok/s;   9081 sec
[2021-04-25 01:04:11,655 INFO] Step 48650/50000; acc:  95.20; ppl:  1.20; xent: 0.18; lr: 0.00010; 10981/10349 tok/s;   9090 sec
[2021-04-25 01:04:20,752 INFO] Step 48700/50000; acc:  95.08; ppl:  1.21; xent: 0.19; lr: 0.00010; 11066/10494 tok/s;   9099 sec
[2021-04-25 01:04:30,128 INFO] Step 48750/50000; acc:  95.18; ppl:  1.20; xent: 0.18; lr: 0.00010; 10963/10337 tok/s;   9108 sec
[2021-04-25 01:04:39,205 INFO] Step 48800/50000; acc:  95.12; ppl:  1.20; xent: 0.19; lr: 0.00010; 11043/10290 tok/s;   9117 sec
[2021-04-25 01:04:48,482 INFO] Step 48850/50000; acc:  95.01; ppl:  1.20; xent: 0.19; lr: 0.00010; 11107/10521 tok/s;   9127 sec
[2021-04-25 01:04:50,952 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 01:04:57,742 INFO] Step 48900/50000; acc:  95.14; ppl:  1.21; xent: 0.19; lr: 0.00010; 10967/10283 tok/s;   9136 sec
[2021-04-25 01:05:07,000 INFO] Step 48950/50000; acc:  95.16; ppl:  1.20; xent: 0.18; lr: 0.00010; 11127/10533 tok/s;   9145 sec
[2021-04-25 01:05:15,955 INFO] Step 49000/50000; acc:  95.02; ppl:  1.21; xent: 0.19; lr: 0.00010; 11159/10553 tok/s;   9154 sec
[2021-04-25 01:05:25,148 INFO] Step 49050/50000; acc:  95.10; ppl:  1.20; xent: 0.19; lr: 0.00010; 10891/10339 tok/s;   9163 sec
[2021-04-25 01:05:34,474 INFO] Step 49100/50000; acc:  95.20; ppl:  1.20; xent: 0.18; lr: 0.00010; 11065/10405 tok/s;   9173 sec
[2021-04-25 01:05:43,531 INFO] Step 49150/50000; acc:  95.00; ppl:  1.21; xent: 0.19; lr: 0.00010; 10934/10410 tok/s;   9182 sec
[2021-04-25 01:05:53,134 INFO] Step 49200/50000; acc:  95.36; ppl:  1.19; xent: 0.18; lr: 0.00010; 11005/10321 tok/s;   9191 sec
[2021-04-25 01:06:02,339 INFO] Step 49250/50000; acc:  95.08; ppl:  1.20; xent: 0.19; lr: 0.00010; 10988/10363 tok/s;   9200 sec
[2021-04-25 01:06:11,758 INFO] Step 49300/50000; acc:  95.15; ppl:  1.20; xent: 0.18; lr: 0.00010; 10968/10337 tok/s;   9210 sec
[2021-04-25 01:06:21,073 INFO] Step 49350/50000; acc:  95.15; ppl:  1.20; xent: 0.18; lr: 0.00010; 10854/10241 tok/s;   9219 sec
[2021-04-25 01:06:30,309 INFO] Step 49400/50000; acc:  95.14; ppl:  1.20; xent: 0.18; lr: 0.00010; 10972/10389 tok/s;   9228 sec
[2021-04-25 01:06:39,445 INFO] Step 49450/50000; acc:  95.20; ppl:  1.20; xent: 0.18; lr: 0.00010; 11098/10466 tok/s;   9238 sec
[2021-04-25 01:06:48,691 INFO] Step 49500/50000; acc:  95.16; ppl:  1.20; xent: 0.18; lr: 0.00010; 10891/10219 tok/s;   9247 sec
[2021-04-25 01:06:57,902 INFO] Step 49550/50000; acc:  95.09; ppl:  1.20; xent: 0.18; lr: 0.00010; 11047/10454 tok/s;   9256 sec
[2021-04-25 01:07:01,179 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/abstract_methods/train_fixed.txt, align=None)...
[2021-04-25 01:07:07,065 INFO] Step 49600/50000; acc:  94.97; ppl:  1.21; xent: 0.19; lr: 0.00010; 11086/10357 tok/s;   9265 sec
[2021-04-25 01:07:16,474 INFO] Step 49650/50000; acc:  95.20; ppl:  1.20; xent: 0.18; lr: 0.00010; 10944/10352 tok/s;   9275 sec
[2021-04-25 01:07:25,523 INFO] Step 49700/50000; acc:  95.25; ppl:  1.20; xent: 0.18; lr: 0.00010; 11124/10498 tok/s;   9284 sec
[2021-04-25 01:07:34,827 INFO] Step 49750/50000; acc:  95.08; ppl:  1.21; xent: 0.19; lr: 0.00010; 11033/10467 tok/s;   9293 sec
[2021-04-25 01:07:43,933 INFO] Step 49800/50000; acc:  95.11; ppl:  1.20; xent: 0.18; lr: 0.00010; 11024/10438 tok/s;   9302 sec
[2021-04-25 01:07:53,058 INFO] Step 49850/50000; acc:  95.11; ppl:  1.20; xent: 0.19; lr: 0.00010; 10938/10366 tok/s;   9311 sec
[2021-04-25 01:08:02,431 INFO] Step 49900/50000; acc:  95.19; ppl:  1.20; xent: 0.19; lr: 0.00010; 11084/10385 tok/s;   9321 sec
[2021-04-25 01:08:11,651 INFO] Step 49950/50000; acc:  95.23; ppl:  1.19; xent: 0.18; lr: 0.00010; 10979/10434 tok/s;   9330 sec
[2021-04-25 01:08:21,072 INFO] Step 50000/50000; acc:  95.18; ppl:  1.20; xent: 0.18; lr: 0.00005; 11023/10353 tok/s;   9339 sec
[2021-04-25 01:08:21,074 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/abstract_methods/valid_fixed.txt, align=None)...
[2021-04-25 01:08:29,869 INFO] Validation perplexity: 1.33122
[2021-04-25 01:08:29,869 INFO] Validation accuracy: 93.4697
[2021-04-25 01:08:29,871 INFO] Saving checkpoint ../models/group1_params/control/model_step_50000.pt

Basic condensed EditOperations:

modelGroup1Basic = HephaestusModel(MODEL_GROUP1_BASIC)
modelGroup1Basic.train(
    DATA_SMALL_METHODS_TRAIN_BUGGY,
    DATA_SMALL_OPS_GENERAL_BASIC_TRAIN,
    DATA_SMALL_METHODS_VALID_BUGGY,
    DATA_SMALL_OPS_GENERAL_BASIC_VALID,
    **GROUP1_PARAMS
)
[2021-04-25 02:40:01,208 INFO] Counter vocab from -1 samples.
[2021-04-25 02:40:01,208 INFO] n_sample=-1: Build vocab on full datasets.
[2021-04-25 02:40:01,217 INFO] corpus_1's transforms: TransformPipe()
[2021-04-25 02:40:01,218 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 02:40:02,420 INFO] Counters src:429
[2021-04-25 02:40:02,420 INFO] Counters tgt:438
[2021-04-25 02:40:02,420 WARNING] path ../models/group1_params/basic_ops/save_data.vocab.src exists, may overwrite...
[2021-04-25 02:40:02,422 WARNING] path ../models/group1_params/basic_ops/save_data.vocab.tgt exists, may overwrite...
[2021-04-25 02:40:03,150 INFO] Parsed 2 corpora from -data.
[2021-04-25 02:40:03,151 INFO] Get special vocabs from Transforms: {'src': set(), 'tgt': set()}.
[2021-04-25 02:40:03,151 INFO] Loading vocab from text file...
[2021-04-25 02:40:03,151 INFO] Loading src vocabulary from ../models/group1_params/basic_ops/save_data.vocab.src
[2021-04-25 02:40:03,152 INFO] Loaded src vocab has 429 tokens.
[2021-04-25 02:40:03,152 INFO] Loading tgt vocabulary from ../models/group1_params/basic_ops/save_data.vocab.tgt
[2021-04-25 02:40:03,153 INFO] Loaded tgt vocab has 438 tokens.
[2021-04-25 02:40:03,153 INFO] Building fields with vocab in counters...
[2021-04-25 02:40:03,154 INFO]  * tgt vocab size: 442.
[2021-04-25 02:40:03,155 INFO]  * src vocab size: 431.
[2021-04-25 02:40:03,155 INFO]  * src vocab size = 431
[2021-04-25 02:40:03,155 INFO]  * tgt vocab size = 442
[2021-04-25 02:40:03,157 INFO] Building model...
[2021-04-25 02:40:05,658 INFO] NMTModel(
  (encoder): RNNEncoder(
    (embeddings): Embeddings(
      (make_embedding): Sequential(
        (emb_luts): Elementwise(
          (0): Embedding(431, 512, padding_idx=1)
        )
      )
    )
    (rnn): GRU(512, 256, num_layers=2, dropout=0.2)
  )
  (decoder): InputFeedRNNDecoder(
    (embeddings): Embeddings(
      (make_embedding): Sequential(
        (emb_luts): Elementwise(
          (0): Embedding(442, 512, padding_idx=1)
        )
      )
    )
    (dropout): Dropout(p=0.2, inplace=False)
    (rnn): StackedGRU(
      (dropout): Dropout(p=0.2, inplace=False)
      (layers): ModuleList(
        (0): GRUCell(768, 256)
        (1): GRUCell(256, 256)
      )
    )
    (attn): GlobalAttention(
      (linear_context): Linear(in_features=256, out_features=256, bias=False)
      (linear_query): Linear(in_features=256, out_features=256, bias=True)
      (v): Linear(in_features=256, out_features=1, bias=False)
      (linear_out): Linear(in_features=512, out_features=256, bias=True)
    )
  )
  (generator): Sequential(
    (0): Linear(in_features=256, out_features=442, bias=True)
    (1): Cast()
    (2): LogSoftmax(dim=-1)
  )
)
[2021-04-25 02:40:05,659 INFO] encoder: 1206784
[2021-04-25 02:40:05,659 INFO] decoder: 1785530
[2021-04-25 02:40:05,659 INFO] * number of parameters: 2992314
[2021-04-25 02:40:05,659 INFO] Starting training on GPU: [0]
[2021-04-25 02:40:05,660 INFO] Start training loop and validate every 5000 steps...
[2021-04-25 02:40:05,660 INFO] corpus_1's transforms: TransformPipe()
[2021-04-25 02:40:05,660 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 02:40:25,402 INFO] Step 50/50000; acc:  17.15; ppl: 77.08; xent: 4.34; lr: 0.00010; 5237/6572 tok/s;     20 sec
[2021-04-25 02:40:43,750 INFO] Step 100/50000; acc:  30.07; ppl: 22.57; xent: 3.12; lr: 0.00010; 5555/6852 tok/s;     38 sec
[2021-04-25 02:41:03,045 INFO] Step 150/50000; acc:  53.03; ppl: 10.35; xent: 2.34; lr: 0.00010; 5158/6490 tok/s;     57 sec
[2021-04-25 02:41:22,469 INFO] Step 200/50000; acc:  57.52; ppl:  6.44; xent: 1.86; lr: 0.00010; 5204/6646 tok/s;     77 sec
[2021-04-25 02:41:41,326 INFO] Step 250/50000; acc:  58.18; ppl:  5.73; xent: 1.75; lr: 0.00010; 5401/6685 tok/s;     96 sec
[2021-04-25 02:42:00,420 INFO] Step 300/50000; acc:  59.68; ppl:  5.16; xent: 1.64; lr: 0.00010; 5365/6623 tok/s;    115 sec
[2021-04-25 02:42:19,288 INFO] Step 350/50000; acc:  61.43; ppl:  4.66; xent: 1.54; lr: 0.00010; 5372/6506 tok/s;    134 sec
[2021-04-25 02:42:38,540 INFO] Step 400/50000; acc:  64.18; ppl:  4.15; xent: 1.42; lr: 0.00010; 5421/6723 tok/s;    153 sec
[2021-04-25 02:42:57,962 INFO] Step 450/50000; acc:  66.51; ppl:  3.66; xent: 1.30; lr: 0.00010; 5167/6513 tok/s;    172 sec
[2021-04-25 02:43:17,235 INFO] Step 500/50000; acc:  68.76; ppl:  3.34; xent: 1.21; lr: 0.00010; 5262/6786 tok/s;    192 sec
[2021-04-25 02:43:36,946 INFO] Step 550/50000; acc:  70.16; ppl:  3.11; xent: 1.13; lr: 0.00010; 5232/6427 tok/s;    211 sec
[2021-04-25 02:43:56,290 INFO] Step 600/50000; acc:  72.95; ppl:  2.77; xent: 1.02; lr: 0.00010; 5264/6539 tok/s;    231 sec
[2021-04-25 02:44:14,590 INFO] Step 650/50000; acc:  75.14; ppl:  2.55; xent: 0.94; lr: 0.00010; 5443/6973 tok/s;    249 sec
[2021-04-25 02:44:33,531 INFO] Step 700/50000; acc:  76.43; ppl:  2.44; xent: 0.89; lr: 0.00010; 5345/6708 tok/s;    268 sec
[2021-04-25 02:44:35,007 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 02:44:52,734 INFO] Step 750/50000; acc:  77.52; ppl:  2.35; xent: 0.86; lr: 0.00010; 5404/6798 tok/s;    287 sec
[2021-04-25 02:45:11,177 INFO] Step 800/50000; acc:  79.00; ppl:  2.25; xent: 0.81; lr: 0.00010; 5439/6780 tok/s;    306 sec
[2021-04-25 02:45:29,915 INFO] Step 850/50000; acc:  79.44; ppl:  2.22; xent: 0.80; lr: 0.00010; 5450/6776 tok/s;    324 sec
[2021-04-25 02:45:49,344 INFO] Step 900/50000; acc:  80.19; ppl:  2.17; xent: 0.77; lr: 0.00010; 5276/6599 tok/s;    344 sec
[2021-04-25 02:46:08,913 INFO] Step 950/50000; acc:  81.38; ppl:  2.09; xent: 0.74; lr: 0.00010; 5096/6352 tok/s;    363 sec
[2021-04-25 02:46:27,895 INFO] Step 1000/50000; acc:  81.86; ppl:  2.06; xent: 0.72; lr: 0.00010; 5301/6762 tok/s;    382 sec
[2021-04-25 02:46:46,745 INFO] Step 1050/50000; acc:  82.50; ppl:  2.02; xent: 0.70; lr: 0.00010; 5500/6596 tok/s;    401 sec
[2021-04-25 02:47:05,575 INFO] Step 1100/50000; acc:  82.85; ppl:  2.00; xent: 0.69; lr: 0.00010; 5468/6750 tok/s;    420 sec
[2021-04-25 02:47:25,239 INFO] Step 1150/50000; acc:  83.30; ppl:  1.97; xent: 0.68; lr: 0.00010; 5061/6363 tok/s;    440 sec
[2021-04-25 02:47:44,942 INFO] Step 1200/50000; acc:  84.13; ppl:  1.92; xent: 0.65; lr: 0.00010; 5311/6772 tok/s;    459 sec
[2021-04-25 02:48:03,598 INFO] Step 1250/50000; acc:  83.52; ppl:  1.96; xent: 0.67; lr: 0.00010; 5377/6633 tok/s;    478 sec
[2021-04-25 02:48:22,745 INFO] Step 1300/50000; acc:  84.24; ppl:  1.92; xent: 0.65; lr: 0.00010; 5311/6621 tok/s;    497 sec
[2021-04-25 02:48:41,654 INFO] Step 1350/50000; acc:  85.06; ppl:  1.86; xent: 0.62; lr: 0.00010; 5483/6973 tok/s;    516 sec
[2021-04-25 02:49:00,441 INFO] Step 1400/50000; acc:  84.94; ppl:  1.86; xent: 0.62; lr: 0.00010; 5332/6693 tok/s;    535 sec
[2021-04-25 02:49:15,560 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 02:49:19,448 INFO] Step 1450/50000; acc:  85.09; ppl:  1.85; xent: 0.62; lr: 0.00010; 5281/6791 tok/s;    554 sec
[2021-04-25 02:49:38,824 INFO] Step 1500/50000; acc:  84.90; ppl:  1.87; xent: 0.62; lr: 0.00010; 5236/6452 tok/s;    573 sec
[2021-04-25 02:49:57,689 INFO] Step 1550/50000; acc:  85.37; ppl:  1.84; xent: 0.61; lr: 0.00010; 5506/6877 tok/s;    592 sec
[2021-04-25 02:50:16,111 INFO] Step 1600/50000; acc:  84.26; ppl:  1.91; xent: 0.65; lr: 0.00010; 5372/6586 tok/s;    610 sec
[2021-04-25 02:50:36,177 INFO] Step 1650/50000; acc:  85.71; ppl:  1.81; xent: 0.59; lr: 0.00010; 5134/6568 tok/s;    631 sec
[2021-04-25 02:50:55,401 INFO] Step 1700/50000; acc:  85.59; ppl:  1.82; xent: 0.60; lr: 0.00010; 5320/6605 tok/s;    650 sec
[2021-04-25 02:51:14,590 INFO] Step 1750/50000; acc:  85.59; ppl:  1.81; xent: 0.59; lr: 0.00010; 5221/6665 tok/s;    669 sec
[2021-04-25 02:51:33,401 INFO] Step 1800/50000; acc:  85.53; ppl:  1.82; xent: 0.60; lr: 0.00010; 5441/6524 tok/s;    688 sec
[2021-04-25 02:51:52,830 INFO] Step 1850/50000; acc:  85.55; ppl:  1.82; xent: 0.60; lr: 0.00010; 5305/6506 tok/s;    707 sec
[2021-04-25 02:52:12,364 INFO] Step 1900/50000; acc:  86.07; ppl:  1.77; xent: 0.57; lr: 0.00010; 5199/6565 tok/s;    727 sec
[2021-04-25 02:52:32,472 INFO] Step 1950/50000; acc:  85.78; ppl:  1.80; xent: 0.59; lr: 0.00010; 4970/6354 tok/s;    747 sec
[2021-04-25 02:52:51,879 INFO] Step 2000/50000; acc:  86.29; ppl:  1.77; xent: 0.57; lr: 0.00010; 5383/6708 tok/s;    766 sec
[2021-04-25 02:53:10,843 INFO] Step 2050/50000; acc:  85.83; ppl:  1.79; xent: 0.58; lr: 0.00010; 5324/6521 tok/s;    785 sec
[2021-04-25 02:53:29,562 INFO] Step 2100/50000; acc:  86.52; ppl:  1.75; xent: 0.56; lr: 0.00010; 5390/6896 tok/s;    804 sec
[2021-04-25 02:53:48,538 INFO] Step 2150/50000; acc:  86.26; ppl:  1.77; xent: 0.57; lr: 0.00010; 5402/6751 tok/s;    823 sec
[2021-04-25 02:53:58,050 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 02:54:07,497 INFO] Step 2200/50000; acc:  86.53; ppl:  1.75; xent: 0.56; lr: 0.00010; 5349/6875 tok/s;    842 sec
[2021-04-25 02:54:26,234 INFO] Step 2250/50000; acc:  86.38; ppl:  1.76; xent: 0.57; lr: 0.00010; 5379/6770 tok/s;    861 sec
[2021-04-25 02:54:44,465 INFO] Step 2300/50000; acc:  85.62; ppl:  1.80; xent: 0.59; lr: 0.00010; 5538/6735 tok/s;    879 sec
[2021-04-25 02:55:04,102 INFO] Step 2350/50000; acc:  86.32; ppl:  1.76; xent: 0.57; lr: 0.00010; 5247/6630 tok/s;    898 sec
[2021-04-25 02:55:23,576 INFO] Step 2400/50000; acc:  86.24; ppl:  1.76; xent: 0.56; lr: 0.00010; 5124/6375 tok/s;    918 sec
[2021-04-25 02:55:42,475 INFO] Step 2450/50000; acc:  86.80; ppl:  1.72; xent: 0.54; lr: 0.00010; 5425/6861 tok/s;    937 sec
[2021-04-25 02:56:02,136 INFO] Step 2500/50000; acc:  86.46; ppl:  1.75; xent: 0.56; lr: 0.00010; 5267/6448 tok/s;    956 sec
[2021-04-25 02:56:20,542 INFO] Step 2550/50000; acc:  86.79; ppl:  1.73; xent: 0.55; lr: 0.00010; 5493/6728 tok/s;    975 sec
[2021-04-25 02:56:40,134 INFO] Step 2600/50000; acc:  86.43; ppl:  1.75; xent: 0.56; lr: 0.00010; 5170/6434 tok/s;    994 sec
[2021-04-25 02:56:59,882 INFO] Step 2650/50000; acc:  86.81; ppl:  1.72; xent: 0.54; lr: 0.00010; 5189/6540 tok/s;   1014 sec
[2021-04-25 02:57:19,443 INFO] Step 2700/50000; acc:  86.88; ppl:  1.72; xent: 0.54; lr: 0.00010; 5218/6529 tok/s;   1034 sec
[2021-04-25 02:57:38,505 INFO] Step 2750/50000; acc:  86.59; ppl:  1.74; xent: 0.55; lr: 0.00010; 5216/6604 tok/s;   1053 sec
[2021-04-25 02:57:57,719 INFO] Step 2800/50000; acc:  87.34; ppl:  1.68; xent: 0.52; lr: 0.00010; 5470/6727 tok/s;   1072 sec
[2021-04-25 02:58:16,094 INFO] Step 2850/50000; acc:  86.95; ppl:  1.71; xent: 0.54; lr: 0.00010; 5429/7001 tok/s;   1090 sec
[2021-04-25 02:58:34,753 INFO] Step 2900/50000; acc:  87.05; ppl:  1.70; xent: 0.53; lr: 0.00010; 5420/6779 tok/s;   1109 sec
[2021-04-25 02:58:39,125 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 02:58:53,954 INFO] Step 2950/50000; acc:  87.13; ppl:  1.71; xent: 0.53; lr: 0.00010; 5379/6732 tok/s;   1128 sec
[2021-04-25 02:59:12,593 INFO] Step 3000/50000; acc:  87.01; ppl:  1.71; xent: 0.53; lr: 0.00010; 5425/6817 tok/s;   1147 sec
[2021-04-25 02:59:31,200 INFO] Step 3050/50000; acc:  86.72; ppl:  1.72; xent: 0.54; lr: 0.00010; 5365/6682 tok/s;   1166 sec
[2021-04-25 02:59:50,805 INFO] Step 3100/50000; acc:  86.99; ppl:  1.70; xent: 0.53; lr: 0.00010; 5160/6622 tok/s;   1185 sec
[2021-04-25 03:00:10,073 INFO] Step 3150/50000; acc:  87.16; ppl:  1.69; xent: 0.53; lr: 0.00010; 5361/6590 tok/s;   1204 sec
[2021-04-25 03:00:28,762 INFO] Step 3200/50000; acc:  87.02; ppl:  1.70; xent: 0.53; lr: 0.00010; 5325/6624 tok/s;   1223 sec
[2021-04-25 03:00:48,089 INFO] Step 3250/50000; acc:  87.28; ppl:  1.69; xent: 0.52; lr: 0.00010; 5430/6574 tok/s;   1242 sec
[2021-04-25 03:01:06,816 INFO] Step 3300/50000; acc:  87.50; ppl:  1.67; xent: 0.52; lr: 0.00010; 5476/6835 tok/s;   1261 sec
[2021-04-25 03:01:26,123 INFO] Step 3350/50000; acc:  86.99; ppl:  1.71; xent: 0.53; lr: 0.00010; 5195/6432 tok/s;   1280 sec
[2021-04-25 03:01:45,064 INFO] Step 3400/50000; acc:  87.65; ppl:  1.66; xent: 0.51; lr: 0.00010; 5357/6863 tok/s;   1299 sec
[2021-04-25 03:02:04,641 INFO] Step 3450/50000; acc:  87.36; ppl:  1.69; xent: 0.52; lr: 0.00010; 5210/6581 tok/s;   1319 sec
[2021-04-25 03:02:23,518 INFO] Step 3500/50000; acc:  87.33; ppl:  1.68; xent: 0.52; lr: 0.00010; 5440/6577 tok/s;   1338 sec
[2021-04-25 03:02:42,157 INFO] Step 3550/50000; acc:  87.86; ppl:  1.65; xent: 0.50; lr: 0.00010; 5329/6912 tok/s;   1356 sec
[2021-04-25 03:03:01,389 INFO] Step 3600/50000; acc:  87.83; ppl:  1.65; xent: 0.50; lr: 0.00010; 5418/6746 tok/s;   1376 sec
[2021-04-25 03:03:07,349 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 03:03:20,119 INFO] Step 3650/50000; acc:  87.35; ppl:  1.68; xent: 0.52; lr: 0.00010; 5380/6813 tok/s;   1394 sec
[2021-04-25 03:03:39,286 INFO] Step 3700/50000; acc:  87.73; ppl:  1.66; xent: 0.50; lr: 0.00010; 5294/6661 tok/s;   1414 sec
[2021-04-25 03:03:57,999 INFO] Step 3750/50000; acc:  87.59; ppl:  1.67; xent: 0.51; lr: 0.00010; 5480/6801 tok/s;   1432 sec
[2021-04-25 03:04:17,839 INFO] Step 3800/50000; acc:  87.03; ppl:  1.70; xent: 0.53; lr: 0.00010; 5070/6334 tok/s;   1452 sec
[2021-04-25 03:04:36,807 INFO] Step 3850/50000; acc:  87.72; ppl:  1.65; xent: 0.50; lr: 0.00010; 5279/6753 tok/s;   1471 sec
[2021-04-25 03:04:55,942 INFO] Step 3900/50000; acc:  87.63; ppl:  1.66; xent: 0.51; lr: 0.00010; 5273/6611 tok/s;   1490 sec
[2021-04-25 03:05:15,065 INFO] Step 3950/50000; acc:  87.70; ppl:  1.65; xent: 0.50; lr: 0.00010; 5443/6596 tok/s;   1509 sec
[2021-04-25 03:05:33,825 INFO] Step 4000/50000; acc:  87.58; ppl:  1.66; xent: 0.51; lr: 0.00010; 5422/6565 tok/s;   1528 sec
[2021-04-25 03:05:52,798 INFO] Step 4050/50000; acc:  87.93; ppl:  1.64; xent: 0.50; lr: 0.00010; 5428/6804 tok/s;   1547 sec
[2021-04-25 03:06:12,355 INFO] Step 4100/50000; acc:  88.05; ppl:  1.63; xent: 0.49; lr: 0.00010; 5245/6659 tok/s;   1567 sec
[2021-04-25 03:06:31,731 INFO] Step 4150/50000; acc:  87.81; ppl:  1.65; xent: 0.50; lr: 0.00010; 5162/6527 tok/s;   1586 sec
[2021-04-25 03:06:51,069 INFO] Step 4200/50000; acc:  87.47; ppl:  1.67; xent: 0.51; lr: 0.00010; 5240/6509 tok/s;   1605 sec
[2021-04-25 03:07:10,417 INFO] Step 4250/50000; acc:  88.05; ppl:  1.63; xent: 0.49; lr: 0.00010; 5328/6618 tok/s;   1625 sec
[2021-04-25 03:07:29,085 INFO] Step 4300/50000; acc:  88.37; ppl:  1.61; xent: 0.48; lr: 0.00010; 5437/6865 tok/s;   1643 sec
[2021-04-25 03:07:47,760 INFO] Step 4350/50000; acc:  87.82; ppl:  1.64; xent: 0.49; lr: 0.00010; 5313/6770 tok/s;   1662 sec
[2021-04-25 03:07:48,547 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 03:08:07,027 INFO] Step 4400/50000; acc:  88.21; ppl:  1.62; xent: 0.48; lr: 0.00010; 5430/6791 tok/s;   1681 sec
[2021-04-25 03:08:25,583 INFO] Step 4450/50000; acc:  88.01; ppl:  1.63; xent: 0.49; lr: 0.00010; 5437/6788 tok/s;   1700 sec
[2021-04-25 03:08:44,056 INFO] Step 4500/50000; acc:  87.56; ppl:  1.66; xent: 0.50; lr: 0.00010; 5420/6753 tok/s;   1718 sec
[2021-04-25 03:09:04,147 INFO] Step 4550/50000; acc:  87.85; ppl:  1.64; xent: 0.49; lr: 0.00010; 5127/6424 tok/s;   1738 sec
[2021-04-25 03:09:23,656 INFO] Step 4600/50000; acc:  87.90; ppl:  1.63; xent: 0.49; lr: 0.00010; 5178/6428 tok/s;   1758 sec
[2021-04-25 03:09:42,692 INFO] Step 4650/50000; acc:  88.18; ppl:  1.62; xent: 0.48; lr: 0.00010; 5231/6768 tok/s;   1777 sec
[2021-04-25 03:10:02,110 INFO] Step 4700/50000; acc:  87.87; ppl:  1.64; xent: 0.50; lr: 0.00010; 5302/6325 tok/s;   1796 sec
[2021-04-25 03:10:21,218 INFO] Step 4750/50000; acc:  88.08; ppl:  1.63; xent: 0.49; lr: 0.00010; 5465/6666 tok/s;   1816 sec
[2021-04-25 03:10:40,817 INFO] Step 4800/50000; acc:  88.20; ppl:  1.61; xent: 0.48; lr: 0.00010; 5090/6451 tok/s;   1835 sec
[2021-04-25 03:11:00,652 INFO] Step 4850/50000; acc:  88.35; ppl:  1.61; xent: 0.48; lr: 0.00010; 5208/6631 tok/s;   1855 sec
[2021-04-25 03:11:19,767 INFO] Step 4900/50000; acc:  88.07; ppl:  1.62; xent: 0.48; lr: 0.00010; 5360/6654 tok/s;   1874 sec
[2021-04-25 03:11:38,966 INFO] Step 4950/50000; acc:  87.90; ppl:  1.63; xent: 0.49; lr: 0.00010; 5234/6454 tok/s;   1893 sec
[2021-04-25 03:11:57,652 INFO] Step 5000/50000; acc:  88.51; ppl:  1.60; xent: 0.47; lr: 0.00010; 5439/6979 tok/s;   1912 sec
[2021-04-25 03:11:57,654 INFO] valid's transforms: TransformPipe()
[2021-04-25 03:11:57,654 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/basic/valid.txt, align=None)...
[2021-04-25 03:12:24,352 INFO] Validation perplexity: 1.58487
[2021-04-25 03:12:24,352 INFO] Validation accuracy: 88.5795
[2021-04-25 03:12:24,356 INFO] Saving checkpoint ../models/group1_params/basic_ops/model_step_5000.pt
[2021-04-25 03:12:43,057 INFO] Step 5050/50000; acc:  88.35; ppl:  1.61; xent: 0.47; lr: 0.00010; 2233/2790 tok/s;   1957 sec
[2021-04-25 03:12:57,369 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 03:13:02,186 INFO] Step 5100/50000; acc:  88.59; ppl:  1.58; xent: 0.46; lr: 0.00010; 5354/6868 tok/s;   1977 sec
[2021-04-25 03:13:21,316 INFO] Step 5150/50000; acc:  88.01; ppl:  1.62; xent: 0.48; lr: 0.00010; 5192/6435 tok/s;   1996 sec
[2021-04-25 03:13:40,212 INFO] Step 5200/50000; acc:  88.28; ppl:  1.61; xent: 0.48; lr: 0.00010; 5537/6877 tok/s;   2015 sec
[2021-04-25 03:13:59,009 INFO] Step 5250/50000; acc:  87.64; ppl:  1.64; xent: 0.50; lr: 0.00010; 5298/6485 tok/s;   2033 sec
[2021-04-25 03:14:19,205 INFO] Step 5300/50000; acc:  88.37; ppl:  1.60; xent: 0.47; lr: 0.00010; 5010/6470 tok/s;   2054 sec
[2021-04-25 03:14:38,252 INFO] Step 5350/50000; acc:  88.56; ppl:  1.59; xent: 0.46; lr: 0.00010; 5398/6677 tok/s;   2073 sec
[2021-04-25 03:14:57,327 INFO] Step 5400/50000; acc:  88.43; ppl:  1.60; xent: 0.47; lr: 0.00010; 5319/6714 tok/s;   2092 sec
[2021-04-25 03:15:16,059 INFO] Step 5450/50000; acc:  88.09; ppl:  1.62; xent: 0.48; lr: 0.00010; 5417/6512 tok/s;   2110 sec
[2021-04-25 03:15:35,746 INFO] Step 5500/50000; acc:  88.06; ppl:  1.62; xent: 0.48; lr: 0.00010; 5181/6439 tok/s;   2130 sec
[2021-04-25 03:15:55,285 INFO] Step 5550/50000; acc:  88.74; ppl:  1.57; xent: 0.45; lr: 0.00010; 5283/6620 tok/s;   2150 sec
[2021-04-25 03:16:15,178 INFO] Step 5600/50000; acc:  88.33; ppl:  1.60; xent: 0.47; lr: 0.00010; 5042/6430 tok/s;   2170 sec
[2021-04-25 03:16:34,609 INFO] Step 5650/50000; acc:  88.44; ppl:  1.60; xent: 0.47; lr: 0.00010; 5297/6557 tok/s;   2189 sec
[2021-04-25 03:16:53,735 INFO] Step 5700/50000; acc:  88.39; ppl:  1.60; xent: 0.47; lr: 0.00010; 5378/6638 tok/s;   2208 sec
[2021-04-25 03:17:12,251 INFO] Step 5750/50000; acc:  88.87; ppl:  1.57; xent: 0.45; lr: 0.00010; 5393/6937 tok/s;   2227 sec
[2021-04-25 03:17:30,877 INFO] Step 5800/50000; acc:  88.42; ppl:  1.59; xent: 0.47; lr: 0.00010; 5411/6774 tok/s;   2245 sec
[2021-04-25 03:17:39,836 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 03:17:50,264 INFO] Step 5850/50000; acc:  88.66; ppl:  1.58; xent: 0.46; lr: 0.00010; 5287/6808 tok/s;   2265 sec
[2021-04-25 03:18:09,394 INFO] Step 5900/50000; acc:  88.63; ppl:  1.59; xent: 0.46; lr: 0.00010; 5378/6675 tok/s;   2284 sec
[2021-04-25 03:18:27,291 INFO] Step 5950/50000; acc:  87.73; ppl:  1.63; xent: 0.49; lr: 0.00010; 5513/6733 tok/s;   2302 sec
[2021-04-25 03:18:47,260 INFO] Step 6000/50000; acc:  88.49; ppl:  1.59; xent: 0.46; lr: 0.00010; 5209/6598 tok/s;   2322 sec
[2021-04-25 03:19:06,886 INFO] Step 6050/50000; acc:  88.35; ppl:  1.59; xent: 0.47; lr: 0.00010; 5110/6378 tok/s;   2341 sec
[2021-04-25 03:19:25,543 INFO] Step 6100/50000; acc:  88.70; ppl:  1.57; xent: 0.45; lr: 0.00010; 5387/6845 tok/s;   2360 sec
[2021-04-25 03:19:45,098 INFO] Step 6150/50000; acc:  88.50; ppl:  1.59; xent: 0.46; lr: 0.00010; 5337/6476 tok/s;   2379 sec
[2021-04-25 03:20:03,556 INFO] Step 6200/50000; acc:  88.71; ppl:  1.58; xent: 0.46; lr: 0.00010; 5536/6762 tok/s;   2398 sec
[2021-04-25 03:20:23,129 INFO] Step 6250/50000; acc:  88.24; ppl:  1.60; xent: 0.47; lr: 0.00010; 5124/6395 tok/s;   2417 sec
[2021-04-25 03:20:42,626 INFO] Step 6300/50000; acc:  88.68; ppl:  1.58; xent: 0.45; lr: 0.00010; 5211/6618 tok/s;   2437 sec
[2021-04-25 03:21:02,028 INFO] Step 6350/50000; acc:  88.86; ppl:  1.57; xent: 0.45; lr: 0.00010; 5335/6733 tok/s;   2456 sec
[2021-04-25 03:21:20,598 INFO] Step 6400/50000; acc:  88.29; ppl:  1.60; xent: 0.47; lr: 0.00010; 5388/6674 tok/s;   2475 sec
[2021-04-25 03:21:39,957 INFO] Step 6450/50000; acc:  88.99; ppl:  1.56; xent: 0.44; lr: 0.00010; 5345/6619 tok/s;   2494 sec
[2021-04-25 03:21:58,425 INFO] Step 6500/50000; acc:  89.03; ppl:  1.56; xent: 0.44; lr: 0.00010; 5510/7095 tok/s;   2513 sec
[2021-04-25 03:22:16,902 INFO] Step 6550/50000; acc:  88.37; ppl:  1.58; xent: 0.46; lr: 0.00010; 5425/6800 tok/s;   2531 sec
[2021-04-25 03:22:20,404 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 03:22:36,107 INFO] Step 6600/50000; acc:  88.72; ppl:  1.58; xent: 0.45; lr: 0.00010; 5279/6667 tok/s;   2550 sec
[2021-04-25 03:22:54,683 INFO] Step 6650/50000; acc:  88.61; ppl:  1.58; xent: 0.45; lr: 0.00010; 5499/6858 tok/s;   2569 sec
[2021-04-25 03:23:13,742 INFO] Step 6700/50000; acc:  88.43; ppl:  1.59; xent: 0.46; lr: 0.00010; 5336/6596 tok/s;   2588 sec
[2021-04-25 03:23:33,015 INFO] Step 6750/50000; acc:  88.42; ppl:  1.59; xent: 0.46; lr: 0.00010; 5150/6574 tok/s;   2607 sec
[2021-04-25 03:23:52,533 INFO] Step 6800/50000; acc:  88.72; ppl:  1.57; xent: 0.45; lr: 0.00010; 5334/6591 tok/s;   2627 sec
[2021-04-25 03:24:11,361 INFO] Step 6850/50000; acc:  88.66; ppl:  1.57; xent: 0.45; lr: 0.00010; 5319/6592 tok/s;   2646 sec
[2021-04-25 03:24:30,817 INFO] Step 6900/50000; acc:  88.70; ppl:  1.57; xent: 0.45; lr: 0.00010; 5296/6543 tok/s;   2665 sec
[2021-04-25 03:24:49,700 INFO] Step 6950/50000; acc:  88.86; ppl:  1.56; xent: 0.45; lr: 0.00010; 5464/6763 tok/s;   2684 sec
[2021-04-25 03:25:09,227 INFO] Step 7000/50000; acc:  88.51; ppl:  1.58; xent: 0.46; lr: 0.00010; 5193/6387 tok/s;   2704 sec
[2021-04-25 03:25:28,186 INFO] Step 7050/50000; acc:  88.99; ppl:  1.55; xent: 0.44; lr: 0.00010; 5303/6815 tok/s;   2723 sec
[2021-04-25 03:25:47,612 INFO] Step 7100/50000; acc:  88.57; ppl:  1.58; xent: 0.46; lr: 0.00010; 5200/6576 tok/s;   2742 sec
[2021-04-25 03:26:06,474 INFO] Step 7150/50000; acc:  88.74; ppl:  1.57; xent: 0.45; lr: 0.00010; 5534/6683 tok/s;   2761 sec
[2021-04-25 03:26:25,178 INFO] Step 7200/50000; acc:  89.18; ppl:  1.54; xent: 0.43; lr: 0.00010; 5329/6861 tok/s;   2780 sec
[2021-04-25 03:26:44,295 INFO] Step 7250/50000; acc:  88.95; ppl:  1.55; xent: 0.44; lr: 0.00010; 5363/6761 tok/s;   2799 sec
[2021-04-25 03:26:49,572 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 03:27:03,418 INFO] Step 7300/50000; acc:  88.96; ppl:  1.56; xent: 0.44; lr: 0.00010; 5392/6806 tok/s;   2818 sec
[2021-04-25 03:27:22,318 INFO] Step 7350/50000; acc:  88.87; ppl:  1.56; xent: 0.45; lr: 0.00010; 5300/6617 tok/s;   2837 sec
[2021-04-25 03:27:40,648 INFO] Step 7400/50000; acc:  88.58; ppl:  1.58; xent: 0.45; lr: 0.00010; 5487/6854 tok/s;   2855 sec
[2021-04-25 03:28:00,186 INFO] Step 7450/50000; acc:  88.47; ppl:  1.58; xent: 0.46; lr: 0.00010; 5215/6520 tok/s;   2875 sec
[2021-04-25 03:28:19,624 INFO] Step 7500/50000; acc:  89.08; ppl:  1.55; xent: 0.44; lr: 0.00010; 5253/6632 tok/s;   2894 sec
[2021-04-25 03:28:38,464 INFO] Step 7550/50000; acc:  88.72; ppl:  1.56; xent: 0.45; lr: 0.00010; 5238/6603 tok/s;   2913 sec
[2021-04-25 03:28:57,799 INFO] Step 7600/50000; acc:  88.89; ppl:  1.56; xent: 0.44; lr: 0.00010; 5443/6600 tok/s;   2932 sec
[2021-04-25 03:29:16,440 INFO] Step 7650/50000; acc:  88.69; ppl:  1.57; xent: 0.45; lr: 0.00010; 5476/6631 tok/s;   2951 sec
[2021-04-25 03:29:35,521 INFO] Step 7700/50000; acc:  88.97; ppl:  1.55; xent: 0.44; lr: 0.00010; 5299/6742 tok/s;   2970 sec
[2021-04-25 03:29:55,132 INFO] Step 7750/50000; acc:  89.29; ppl:  1.53; xent: 0.43; lr: 0.00010; 5264/6642 tok/s;   2989 sec
[2021-04-25 03:30:14,423 INFO] Step 7800/50000; acc:  88.79; ppl:  1.56; xent: 0.45; lr: 0.00010; 5239/6510 tok/s;   3009 sec
[2021-04-25 03:30:33,692 INFO] Step 7850/50000; acc:  88.65; ppl:  1.57; xent: 0.45; lr: 0.00010; 5214/6580 tok/s;   3028 sec
[2021-04-25 03:30:52,670 INFO] Step 7900/50000; acc:  89.07; ppl:  1.54; xent: 0.43; lr: 0.00010; 5391/6696 tok/s;   3047 sec
[2021-04-25 03:31:11,603 INFO] Step 7950/50000; acc:  89.35; ppl:  1.53; xent: 0.42; lr: 0.00010; 5426/6867 tok/s;   3066 sec
[2021-04-25 03:31:30,377 INFO] Step 8000/50000; acc:  88.90; ppl:  1.55; xent: 0.44; lr: 0.00010; 5323/6743 tok/s;   3085 sec
[2021-04-25 03:31:30,390 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 03:31:49,676 INFO] Step 8050/50000; acc:  89.04; ppl:  1.55; xent: 0.44; lr: 0.00010; 5336/6681 tok/s;   3104 sec
[2021-04-25 03:32:08,471 INFO] Step 8100/50000; acc:  89.06; ppl:  1.55; xent: 0.44; lr: 0.00010; 5480/6825 tok/s;   3123 sec
[2021-04-25 03:32:27,180 INFO] Step 8150/50000; acc:  88.51; ppl:  1.57; xent: 0.45; lr: 0.00010; 5290/6575 tok/s;   3142 sec
[2021-04-25 03:32:46,508 INFO] Step 8200/50000; acc:  88.84; ppl:  1.56; xent: 0.44; lr: 0.00010; 5238/6670 tok/s;   3161 sec
[2021-04-25 03:33:05,880 INFO] Step 8250/50000; acc:  88.89; ppl:  1.55; xent: 0.44; lr: 0.00010; 5275/6490 tok/s;   3180 sec
[2021-04-25 03:33:25,309 INFO] Step 8300/50000; acc:  89.19; ppl:  1.54; xent: 0.43; lr: 0.00010; 5228/6674 tok/s;   3200 sec
[2021-04-25 03:33:44,575 INFO] Step 8350/50000; acc:  88.73; ppl:  1.56; xent: 0.45; lr: 0.00010; 5225/6265 tok/s;   3219 sec
[2021-04-25 03:34:04,051 INFO] Step 8400/50000; acc:  88.96; ppl:  1.55; xent: 0.44; lr: 0.00010; 5413/6649 tok/s;   3238 sec
[2021-04-25 03:34:23,752 INFO] Step 8450/50000; acc:  89.10; ppl:  1.54; xent: 0.43; lr: 0.00010; 5076/6394 tok/s;   3258 sec
[2021-04-25 03:34:43,869 INFO] Step 8500/50000; acc:  89.21; ppl:  1.53; xent: 0.43; lr: 0.00010; 5056/6470 tok/s;   3278 sec
[2021-04-25 03:35:02,878 INFO] Step 8550/50000; acc:  88.92; ppl:  1.55; xent: 0.44; lr: 0.00010; 5421/6696 tok/s;   3297 sec
[2021-04-25 03:35:22,349 INFO] Step 8600/50000; acc:  88.93; ppl:  1.55; xent: 0.44; lr: 0.00010; 5221/6467 tok/s;   3317 sec
[2021-04-25 03:35:40,656 INFO] Step 8650/50000; acc:  89.31; ppl:  1.53; xent: 0.42; lr: 0.00010; 5474/7058 tok/s;   3335 sec
[2021-04-25 03:35:59,008 INFO] Step 8700/50000; acc:  89.14; ppl:  1.54; xent: 0.43; lr: 0.00010; 5489/6870 tok/s;   3353 sec
[2021-04-25 03:36:12,654 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 03:36:18,132 INFO] Step 8750/50000; acc:  89.51; ppl:  1.51; xent: 0.42; lr: 0.00010; 5441/6925 tok/s;   3372 sec
[2021-04-25 03:36:37,216 INFO] Step 8800/50000; acc:  88.91; ppl:  1.55; xent: 0.44; lr: 0.00010; 5233/6475 tok/s;   3392 sec
[2021-04-25 03:36:55,771 INFO] Step 8850/50000; acc:  88.92; ppl:  1.55; xent: 0.44; lr: 0.00010; 5548/6863 tok/s;   3410 sec
[2021-04-25 03:37:14,977 INFO] Step 8900/50000; acc:  88.81; ppl:  1.55; xent: 0.44; lr: 0.00010; 5302/6553 tok/s;   3429 sec
[2021-04-25 03:37:34,928 INFO] Step 8950/50000; acc:  89.05; ppl:  1.54; xent: 0.43; lr: 0.00010; 5004/6462 tok/s;   3449 sec
[2021-04-25 03:37:53,622 INFO] Step 9000/50000; acc:  89.19; ppl:  1.53; xent: 0.43; lr: 0.00010; 5403/6706 tok/s;   3468 sec
[2021-04-25 03:38:12,748 INFO] Step 9050/50000; acc:  89.22; ppl:  1.53; xent: 0.43; lr: 0.00010; 5381/6753 tok/s;   3487 sec
[2021-04-25 03:38:31,838 INFO] Step 9100/50000; acc:  89.09; ppl:  1.54; xent: 0.43; lr: 0.00010; 5413/6532 tok/s;   3506 sec
[2021-04-25 03:38:51,187 INFO] Step 9150/50000; acc:  88.62; ppl:  1.56; xent: 0.44; lr: 0.00010; 5158/6390 tok/s;   3526 sec
[2021-04-25 03:39:11,129 INFO] Step 9200/50000; acc:  89.46; ppl:  1.52; xent: 0.42; lr: 0.00010; 5226/6550 tok/s;   3545 sec
[2021-04-25 03:39:30,576 INFO] Step 9250/50000; acc:  89.09; ppl:  1.53; xent: 0.43; lr: 0.00010; 5174/6547 tok/s;   3565 sec
[2021-04-25 03:39:49,769 INFO] Step 9300/50000; acc:  89.22; ppl:  1.53; xent: 0.43; lr: 0.00010; 5270/6656 tok/s;   3584 sec
[2021-04-25 03:40:08,840 INFO] Step 9350/50000; acc:  89.19; ppl:  1.53; xent: 0.43; lr: 0.00010; 5432/6617 tok/s;   3603 sec
[2021-04-25 03:40:27,258 INFO] Step 9400/50000; acc:  89.52; ppl:  1.51; xent: 0.42; lr: 0.00010; 5481/7016 tok/s;   3622 sec
[2021-04-25 03:40:45,866 INFO] Step 9450/50000; acc:  89.09; ppl:  1.53; xent: 0.43; lr: 0.00010; 5361/6785 tok/s;   3640 sec
[2021-04-25 03:40:54,160 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 03:41:05,135 INFO] Step 9500/50000; acc:  89.37; ppl:  1.52; xent: 0.42; lr: 0.00010; 5273/6782 tok/s;   3659 sec
[2021-04-25 03:41:24,346 INFO] Step 9550/50000; acc:  89.49; ppl:  1.52; xent: 0.42; lr: 0.00010; 5428/6773 tok/s;   3679 sec
[2021-04-25 03:41:42,549 INFO] Step 9600/50000; acc:  88.41; ppl:  1.57; xent: 0.45; lr: 0.00010; 5458/6568 tok/s;   3697 sec
[2021-04-25 03:42:02,473 INFO] Step 9650/50000; acc:  89.26; ppl:  1.53; xent: 0.43; lr: 0.00010; 5140/6593 tok/s;   3717 sec
[2021-04-25 03:42:22,340 INFO] Step 9700/50000; acc:  89.26; ppl:  1.52; xent: 0.42; lr: 0.00010; 5153/6412 tok/s;   3737 sec
[2021-04-25 03:42:41,300 INFO] Step 9750/50000; acc:  89.15; ppl:  1.53; xent: 0.43; lr: 0.00010; 5242/6606 tok/s;   3756 sec
[2021-04-25 03:43:00,600 INFO] Step 9800/50000; acc:  89.19; ppl:  1.53; xent: 0.43; lr: 0.00010; 5316/6484 tok/s;   3775 sec
[2021-04-25 03:43:19,284 INFO] Step 9850/50000; acc:  89.28; ppl:  1.53; xent: 0.42; lr: 0.00010; 5527/6725 tok/s;   3794 sec
[2021-04-25 03:43:39,044 INFO] Step 9900/50000; acc:  89.13; ppl:  1.53; xent: 0.42; lr: 0.00010; 5167/6481 tok/s;   3813 sec
[2021-04-25 03:43:58,082 INFO] Step 9950/50000; acc:  89.34; ppl:  1.52; xent: 0.42; lr: 0.00010; 5229/6671 tok/s;   3832 sec
[2021-04-25 03:44:17,538 INFO] Step 10000/50000; acc:  89.46; ppl:  1.51; xent: 0.41; lr: 0.00010; 5369/6740 tok/s;   3852 sec
[2021-04-25 03:44:17,539 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/basic/valid.txt, align=None)...
[2021-04-25 03:44:44,276 INFO] Validation perplexity: 1.50801
[2021-04-25 03:44:44,277 INFO] Validation accuracy: 89.5488
[2021-04-25 03:44:44,281 INFO] Saving checkpoint ../models/group1_params/basic_ops/model_step_10000.pt
[2021-04-25 03:45:02,969 INFO] Step 10050/50000; acc:  88.88; ppl:  1.55; xent: 0.44; lr: 0.00010; 2214/2713 tok/s;   3897 sec
[2021-04-25 03:45:22,021 INFO] Step 10100/50000; acc:  89.61; ppl:  1.50; xent: 0.41; lr: 0.00010; 5330/6711 tok/s;   3916 sec
[2021-04-25 03:45:40,557 INFO] Step 10150/50000; acc:  89.64; ppl:  1.50; xent: 0.41; lr: 0.00010; 5529/7073 tok/s;   3935 sec
[2021-04-25 03:45:59,153 INFO] Step 10200/50000; acc:  89.12; ppl:  1.53; xent: 0.42; lr: 0.00010; 5453/6819 tok/s;   3953 sec
[2021-04-25 03:46:01,883 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 03:46:18,175 INFO] Step 10250/50000; acc:  89.23; ppl:  1.53; xent: 0.42; lr: 0.00010; 5281/6718 tok/s;   3973 sec
[2021-04-25 03:46:36,803 INFO] Step 10300/50000; acc:  89.27; ppl:  1.52; xent: 0.42; lr: 0.00010; 5430/6738 tok/s;   3991 sec
[2021-04-25 03:46:55,532 INFO] Step 10350/50000; acc:  89.09; ppl:  1.54; xent: 0.43; lr: 0.00010; 5513/6776 tok/s;   4010 sec
[2021-04-25 03:47:14,919 INFO] Step 10400/50000; acc:  89.24; ppl:  1.52; xent: 0.42; lr: 0.00010; 5139/6559 tok/s;   4029 sec
[2021-04-25 03:47:34,369 INFO] Step 10450/50000; acc:  89.25; ppl:  1.52; xent: 0.42; lr: 0.00010; 5275/6551 tok/s;   4049 sec
[2021-04-25 03:47:53,636 INFO] Step 10500/50000; acc:  89.50; ppl:  1.51; xent: 0.41; lr: 0.00010; 5313/6644 tok/s;   4068 sec
[2021-04-25 03:48:12,871 INFO] Step 10550/50000; acc:  89.09; ppl:  1.53; xent: 0.43; lr: 0.00010; 5300/6403 tok/s;   4087 sec
[2021-04-25 03:48:31,790 INFO] Step 10600/50000; acc:  89.39; ppl:  1.51; xent: 0.41; lr: 0.00010; 5344/6720 tok/s;   4106 sec
[2021-04-25 03:48:51,442 INFO] Step 10650/50000; acc:  89.28; ppl:  1.52; xent: 0.42; lr: 0.00010; 5227/6473 tok/s;   4126 sec
[2021-04-25 03:49:10,978 INFO] Step 10700/50000; acc:  89.65; ppl:  1.50; xent: 0.41; lr: 0.00010; 5240/6703 tok/s;   4145 sec
[2021-04-25 03:49:29,950 INFO] Step 10750/50000; acc:  89.00; ppl:  1.54; xent: 0.43; lr: 0.00010; 5211/6542 tok/s;   4164 sec
[2021-04-25 03:49:49,094 INFO] Step 10800/50000; acc:  89.49; ppl:  1.51; xent: 0.41; lr: 0.00010; 5509/6640 tok/s;   4183 sec
[2021-04-25 03:50:07,812 INFO] Step 10850/50000; acc:  89.85; ppl:  1.49; xent: 0.40; lr: 0.00010; 5351/6924 tok/s;   4202 sec
[2021-04-25 03:50:26,677 INFO] Step 10900/50000; acc:  89.42; ppl:  1.51; xent: 0.41; lr: 0.00010; 5340/6736 tok/s;   4221 sec
[2021-04-25 03:50:31,283 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 03:50:45,982 INFO] Step 10950/50000; acc:  89.46; ppl:  1.51; xent: 0.41; lr: 0.00010; 5364/6760 tok/s;   4240 sec
[2021-04-25 03:51:04,673 INFO] Step 11000/50000; acc:  89.57; ppl:  1.51; xent: 0.41; lr: 0.00010; 5424/6802 tok/s;   4259 sec
[2021-04-25 03:51:22,991 INFO] Step 11050/50000; acc:  88.98; ppl:  1.53; xent: 0.43; lr: 0.00010; 5427/6747 tok/s;   4277 sec
[2021-04-25 03:51:42,596 INFO] Step 11100/50000; acc:  89.15; ppl:  1.53; xent: 0.43; lr: 0.00010; 5160/6515 tok/s;   4297 sec
[2021-04-25 03:52:02,236 INFO] Step 11150/50000; acc:  89.61; ppl:  1.50; xent: 0.41; lr: 0.00010; 5275/6578 tok/s;   4317 sec
[2021-04-25 03:52:21,241 INFO] Step 11200/50000; acc:  89.23; ppl:  1.52; xent: 0.42; lr: 0.00010; 5218/6563 tok/s;   4336 sec
[2021-04-25 03:52:40,390 INFO] Step 11250/50000; acc:  89.42; ppl:  1.51; xent: 0.42; lr: 0.00010; 5420/6601 tok/s;   4355 sec
[2021-04-25 03:52:58,914 INFO] Step 11300/50000; acc:  89.41; ppl:  1.52; xent: 0.42; lr: 0.00010; 5621/6815 tok/s;   4373 sec
[2021-04-25 03:53:17,968 INFO] Step 11350/50000; acc:  89.38; ppl:  1.51; xent: 0.41; lr: 0.00010; 5239/6629 tok/s;   4392 sec
[2021-04-25 03:53:37,495 INFO] Step 11400/50000; acc:  89.64; ppl:  1.49; xent: 0.40; lr: 0.00010; 5197/6646 tok/s;   4412 sec
[2021-04-25 03:53:56,699 INFO] Step 11450/50000; acc:  89.28; ppl:  1.52; xent: 0.42; lr: 0.00010; 5326/6584 tok/s;   4431 sec
[2021-04-25 03:54:16,018 INFO] Step 11500/50000; acc:  89.28; ppl:  1.52; xent: 0.42; lr: 0.00010; 5302/6588 tok/s;   4450 sec
[2021-04-25 03:54:34,715 INFO] Step 11550/50000; acc:  89.77; ppl:  1.49; xent: 0.40; lr: 0.00010; 5352/6756 tok/s;   4469 sec
[2021-04-25 03:54:53,784 INFO] Step 11600/50000; acc:  89.98; ppl:  1.48; xent: 0.39; lr: 0.00010; 5438/6935 tok/s;   4488 sec
[2021-04-25 03:55:11,653 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 03:55:12,500 INFO] Step 11650/50000; acc:  89.29; ppl:  1.51; xent: 0.41; lr: 0.00010; 5373/6722 tok/s;   4507 sec
[2021-04-25 03:55:31,924 INFO] Step 11700/50000; acc:  89.46; ppl:  1.51; xent: 0.42; lr: 0.00010; 5200/6512 tok/s;   4526 sec
[2021-04-25 03:55:50,836 INFO] Step 11750/50000; acc:  89.71; ppl:  1.50; xent: 0.41; lr: 0.00010; 5474/6857 tok/s;   4545 sec
[2021-04-25 03:56:09,703 INFO] Step 11800/50000; acc:  89.06; ppl:  1.53; xent: 0.42; lr: 0.00010; 5310/6526 tok/s;   4564 sec
[2021-04-25 03:56:28,795 INFO] Step 11850/50000; acc:  89.42; ppl:  1.51; xent: 0.41; lr: 0.00010; 5244/6795 tok/s;   4583 sec
[2021-04-25 03:56:48,359 INFO] Step 11900/50000; acc:  89.45; ppl:  1.51; xent: 0.41; lr: 0.00010; 5185/6385 tok/s;   4603 sec
[2021-04-25 03:57:07,778 INFO] Step 11950/50000; acc:  89.72; ppl:  1.49; xent: 0.40; lr: 0.00010; 5320/6697 tok/s;   4622 sec
[2021-04-25 03:57:27,015 INFO] Step 12000/50000; acc:  89.21; ppl:  1.52; xent: 0.42; lr: 0.00010; 5260/6267 tok/s;   4641 sec
[2021-04-25 03:57:46,350 INFO] Step 12050/50000; acc:  89.50; ppl:  1.51; xent: 0.41; lr: 0.00010; 5364/6684 tok/s;   4661 sec
[2021-04-25 03:58:05,733 INFO] Step 12100/50000; acc:  89.74; ppl:  1.49; xent: 0.40; lr: 0.00010; 5262/6663 tok/s;   4680 sec
[2021-04-25 03:58:25,459 INFO] Step 12150/50000; acc:  89.51; ppl:  1.50; xent: 0.41; lr: 0.00010; 5104/6423 tok/s;   4700 sec
[2021-04-25 03:58:44,619 INFO] Step 12200/50000; acc:  89.37; ppl:  1.51; xent: 0.41; lr: 0.00010; 5276/6615 tok/s;   4719 sec
[2021-04-25 03:59:04,026 INFO] Step 12250/50000; acc:  89.47; ppl:  1.50; xent: 0.41; lr: 0.00010; 5310/6499 tok/s;   4738 sec
[2021-04-25 03:59:22,826 INFO] Step 12300/50000; acc:  89.96; ppl:  1.48; xent: 0.39; lr: 0.00010; 5422/6954 tok/s;   4757 sec
[2021-04-25 03:59:41,090 INFO] Step 12350/50000; acc:  89.49; ppl:  1.50; xent: 0.41; lr: 0.00010; 5398/6763 tok/s;   4775 sec
[2021-04-25 03:59:54,029 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 04:00:00,386 INFO] Step 12400/50000; acc:  89.95; ppl:  1.48; xent: 0.39; lr: 0.00010; 5437/6927 tok/s;   4795 sec
[2021-04-25 04:00:19,623 INFO] Step 12450/50000; acc:  89.56; ppl:  1.50; xent: 0.41; lr: 0.00010; 5230/6542 tok/s;   4814 sec
[2021-04-25 04:00:37,959 INFO] Step 12500/50000; acc:  89.28; ppl:  1.52; xent: 0.42; lr: 0.00010; 5505/6820 tok/s;   4832 sec
[2021-04-25 04:00:57,325 INFO] Step 12550/50000; acc:  89.26; ppl:  1.52; xent: 0.42; lr: 0.00010; 5296/6542 tok/s;   4852 sec
[2021-04-25 04:01:17,190 INFO] Step 12600/50000; acc:  89.62; ppl:  1.49; xent: 0.40; lr: 0.00010; 5078/6489 tok/s;   4872 sec
[2021-04-25 04:01:35,969 INFO] Step 12650/50000; acc:  89.66; ppl:  1.50; xent: 0.40; lr: 0.00010; 5325/6705 tok/s;   4890 sec
[2021-04-25 04:01:55,086 INFO] Step 12700/50000; acc:  89.55; ppl:  1.50; xent: 0.40; lr: 0.00010; 5349/6649 tok/s;   4909 sec
[2021-04-25 04:02:14,219 INFO] Step 12750/50000; acc:  89.67; ppl:  1.49; xent: 0.40; lr: 0.00010; 5469/6570 tok/s;   4929 sec
[2021-04-25 04:02:33,663 INFO] Step 12800/50000; acc:  89.25; ppl:  1.52; xent: 0.42; lr: 0.00010; 5159/6407 tok/s;   4948 sec
[2021-04-25 04:02:53,451 INFO] Step 12850/50000; acc:  89.83; ppl:  1.49; xent: 0.40; lr: 0.00010; 5190/6521 tok/s;   4968 sec
[2021-04-25 04:03:12,770 INFO] Step 12900/50000; acc:  89.68; ppl:  1.49; xent: 0.40; lr: 0.00010; 5312/6637 tok/s;   4987 sec
[2021-04-25 04:03:32,130 INFO] Step 12950/50000; acc:  89.63; ppl:  1.50; xent: 0.40; lr: 0.00010; 5174/6578 tok/s;   5006 sec
[2021-04-25 04:03:50,962 INFO] Step 13000/50000; acc:  89.67; ppl:  1.49; xent: 0.40; lr: 0.00010; 5398/6689 tok/s;   5025 sec
[2021-04-25 04:04:09,142 INFO] Step 13050/50000; acc:  90.00; ppl:  1.48; xent: 0.39; lr: 0.00010; 5614/7155 tok/s;   5043 sec
[2021-04-25 04:04:28,010 INFO] Step 13100/50000; acc:  89.57; ppl:  1.50; xent: 0.40; lr: 0.00010; 5396/6714 tok/s;   5062 sec
[2021-04-25 04:04:35,480 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 04:04:47,202 INFO] Step 13150/50000; acc:  89.58; ppl:  1.49; xent: 0.40; lr: 0.00010; 5179/6665 tok/s;   5082 sec
[2021-04-25 04:05:06,299 INFO] Step 13200/50000; acc:  90.02; ppl:  1.48; xent: 0.39; lr: 0.00010; 5506/6900 tok/s;   5101 sec
[2021-04-25 04:05:24,426 INFO] Step 13250/50000; acc:  89.04; ppl:  1.52; xent: 0.42; lr: 0.00010; 5515/6644 tok/s;   5119 sec
[2021-04-25 04:05:44,016 INFO] Step 13300/50000; acc:  89.69; ppl:  1.49; xent: 0.40; lr: 0.00010; 5135/6675 tok/s;   5138 sec
[2021-04-25 04:06:03,939 INFO] Step 13350/50000; acc:  89.65; ppl:  1.49; xent: 0.40; lr: 0.00010; 5166/6382 tok/s;   5158 sec
[2021-04-25 04:06:22,862 INFO] Step 13400/50000; acc:  89.72; ppl:  1.49; xent: 0.40; lr: 0.00010; 5316/6644 tok/s;   5177 sec
[2021-04-25 04:06:42,097 INFO] Step 13450/50000; acc:  89.46; ppl:  1.50; xent: 0.41; lr: 0.00010; 5284/6453 tok/s;   5196 sec
[2021-04-25 04:07:00,803 INFO] Step 13500/50000; acc:  89.67; ppl:  1.49; xent: 0.40; lr: 0.00010; 5460/6756 tok/s;   5215 sec
[2021-04-25 04:07:20,606 INFO] Step 13550/50000; acc:  89.61; ppl:  1.50; xent: 0.40; lr: 0.00010; 5239/6470 tok/s;   5235 sec
[2021-04-25 04:07:39,678 INFO] Step 13600/50000; acc:  89.85; ppl:  1.48; xent: 0.39; lr: 0.00010; 5251/6708 tok/s;   5254 sec
[2021-04-25 04:07:58,891 INFO] Step 13650/50000; acc:  89.82; ppl:  1.48; xent: 0.39; lr: 0.00010; 5349/6731 tok/s;   5273 sec
[2021-04-25 04:08:18,163 INFO] Step 13700/50000; acc:  89.46; ppl:  1.51; xent: 0.41; lr: 0.00010; 5336/6475 tok/s;   5293 sec
[2021-04-25 04:08:37,165 INFO] Step 13750/50000; acc:  89.99; ppl:  1.47; xent: 0.38; lr: 0.00010; 5268/6743 tok/s;   5312 sec
[2021-04-25 04:08:55,580 INFO] Step 13800/50000; acc:  90.10; ppl:  1.47; xent: 0.38; lr: 0.00010; 5475/7016 tok/s;   5330 sec
[2021-04-25 04:09:14,458 INFO] Step 13850/50000; acc:  89.56; ppl:  1.49; xent: 0.40; lr: 0.00010; 5433/6820 tok/s;   5349 sec
[2021-04-25 04:09:16,586 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 04:09:33,827 INFO] Step 13900/50000; acc:  89.79; ppl:  1.49; xent: 0.40; lr: 0.00010; 5286/6589 tok/s;   5368 sec
[2021-04-25 04:09:52,190 INFO] Step 13950/50000; acc:  89.55; ppl:  1.50; xent: 0.40; lr: 0.00010; 5390/6728 tok/s;   5387 sec
[2021-04-25 04:10:11,480 INFO] Step 14000/50000; acc:  89.61; ppl:  1.49; xent: 0.40; lr: 0.00010; 5393/6728 tok/s;   5406 sec
[2021-04-25 04:10:30,875 INFO] Step 14050/50000; acc:  89.61; ppl:  1.49; xent: 0.40; lr: 0.00010; 5178/6516 tok/s;   5425 sec
[2021-04-25 04:10:49,860 INFO] Step 14100/50000; acc:  89.59; ppl:  1.49; xent: 0.40; lr: 0.00010; 5299/6568 tok/s;   5444 sec
[2021-04-25 04:11:09,176 INFO] Step 14150/50000; acc:  89.97; ppl:  1.47; xent: 0.39; lr: 0.00010; 5339/6732 tok/s;   5464 sec
[2021-04-25 04:11:28,744 INFO] Step 14200/50000; acc:  89.55; ppl:  1.50; xent: 0.40; lr: 0.00010; 5267/6328 tok/s;   5483 sec
[2021-04-25 04:11:47,430 INFO] Step 14250/50000; acc:  89.59; ppl:  1.49; xent: 0.40; lr: 0.00010; 5361/6751 tok/s;   5502 sec
[2021-04-25 04:12:07,013 INFO] Step 14300/50000; acc:  89.73; ppl:  1.49; xent: 0.40; lr: 0.00010; 5191/6489 tok/s;   5521 sec
[2021-04-25 04:12:26,531 INFO] Step 14350/50000; acc:  90.10; ppl:  1.47; xent: 0.38; lr: 0.00010; 5322/6787 tok/s;   5541 sec
[2021-04-25 04:12:45,531 INFO] Step 14400/50000; acc:  89.34; ppl:  1.50; xent: 0.41; lr: 0.00010; 5239/6465 tok/s;   5560 sec
[2021-04-25 04:13:04,839 INFO] Step 14450/50000; acc:  90.02; ppl:  1.47; xent: 0.38; lr: 0.00010; 5381/6634 tok/s;   5579 sec
[2021-04-25 04:13:23,480 INFO] Step 14500/50000; acc:  90.22; ppl:  1.46; xent: 0.38; lr: 0.00010; 5474/6972 tok/s;   5598 sec
[2021-04-25 04:13:42,547 INFO] Step 14550/50000; acc:  89.84; ppl:  1.48; xent: 0.39; lr: 0.00010; 5222/6642 tok/s;   5617 sec
[2021-04-25 04:13:46,149 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 04:14:01,499 INFO] Step 14600/50000; acc:  89.68; ppl:  1.49; xent: 0.40; lr: 0.00010; 5366/6729 tok/s;   5636 sec
[2021-04-25 04:14:20,330 INFO] Step 14650/50000; acc:  90.01; ppl:  1.47; xent: 0.39; lr: 0.00010; 5452/6842 tok/s;   5655 sec
[2021-04-25 04:14:38,905 INFO] Step 14700/50000; acc:  89.54; ppl:  1.50; xent: 0.40; lr: 0.00010; 5458/6711 tok/s;   5673 sec
[2021-04-25 04:14:58,347 INFO] Step 14750/50000; acc:  89.49; ppl:  1.49; xent: 0.40; lr: 0.00010; 5095/6470 tok/s;   5693 sec
[2021-04-25 04:15:18,103 INFO] Step 14800/50000; acc:  89.96; ppl:  1.47; xent: 0.39; lr: 0.00010; 5287/6576 tok/s;   5712 sec
[2021-04-25 04:15:37,133 INFO] Step 14850/50000; acc:  89.57; ppl:  1.49; xent: 0.40; lr: 0.00010; 5236/6535 tok/s;   5731 sec
[2021-04-25 04:15:56,326 INFO] Step 14900/50000; acc:  89.85; ppl:  1.48; xent: 0.39; lr: 0.00010; 5321/6561 tok/s;   5751 sec
[2021-04-25 04:16:15,028 INFO] Step 14950/50000; acc:  89.71; ppl:  1.49; xent: 0.40; lr: 0.00010; 5587/6811 tok/s;   5769 sec
[2021-04-25 04:16:34,409 INFO] Step 15000/50000; acc:  89.96; ppl:  1.47; xent: 0.39; lr: 0.00010; 5214/6527 tok/s;   5789 sec
[2021-04-25 04:16:34,410 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/basic/valid.txt, align=None)...
[2021-04-25 04:17:00,797 INFO] Validation perplexity: 1.47235
[2021-04-25 04:17:00,797 INFO] Validation accuracy: 89.9899
[2021-04-25 04:17:00,801 INFO] Saving checkpoint ../models/group1_params/basic_ops/model_step_15000.pt
[2021-04-25 04:17:20,121 INFO] Step 15050/50000; acc:  90.09; ppl:  1.47; xent: 0.38; lr: 0.00010; 2197/2835 tok/s;   5834 sec
[2021-04-25 04:17:39,286 INFO] Step 15100/50000; acc:  89.67; ppl:  1.48; xent: 0.39; lr: 0.00010; 5295/6625 tok/s;   5854 sec
[2021-04-25 04:17:58,687 INFO] Step 15150/50000; acc:  89.70; ppl:  1.49; xent: 0.40; lr: 0.00010; 5359/6538 tok/s;   5873 sec
[2021-04-25 04:18:17,417 INFO] Step 15200/50000; acc:  90.13; ppl:  1.46; xent: 0.38; lr: 0.00010; 5372/6796 tok/s;   5892 sec
[2021-04-25 04:18:36,453 INFO] Step 15250/50000; acc:  90.21; ppl:  1.46; xent: 0.38; lr: 0.00010; 5358/6884 tok/s;   5911 sec
[2021-04-25 04:18:53,801 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 04:18:55,569 INFO] Step 15300/50000; acc:  89.98; ppl:  1.47; xent: 0.38; lr: 0.00010; 5368/6726 tok/s;   5930 sec
[2021-04-25 04:19:14,798 INFO] Step 15350/50000; acc:  89.71; ppl:  1.49; xent: 0.40; lr: 0.00010; 5195/6517 tok/s;   5949 sec
[2021-04-25 04:19:33,583 INFO] Step 15400/50000; acc:  89.86; ppl:  1.48; xent: 0.39; lr: 0.00010; 5418/6751 tok/s;   5968 sec
[2021-04-25 04:19:52,188 INFO] Step 15450/50000; acc:  89.55; ppl:  1.49; xent: 0.40; lr: 0.00010; 5446/6713 tok/s;   5987 sec
[2021-04-25 04:20:11,767 INFO] Step 15500/50000; acc:  90.03; ppl:  1.47; xent: 0.39; lr: 0.00010; 5215/6674 tok/s;   6006 sec
[2021-04-25 04:20:30,847 INFO] Step 15550/50000; acc:  89.68; ppl:  1.48; xent: 0.39; lr: 0.00010; 5199/6455 tok/s;   6025 sec
[2021-04-25 04:20:50,441 INFO] Step 15600/50000; acc:  90.12; ppl:  1.46; xent: 0.38; lr: 0.00010; 5329/6681 tok/s;   6045 sec
[2021-04-25 04:21:09,477 INFO] Step 15650/50000; acc:  89.65; ppl:  1.49; xent: 0.40; lr: 0.00010; 5338/6386 tok/s;   6064 sec
[2021-04-25 04:21:28,544 INFO] Step 15700/50000; acc:  89.72; ppl:  1.48; xent: 0.39; lr: 0.00010; 5351/6606 tok/s;   6083 sec
[2021-04-25 04:21:48,294 INFO] Step 15750/50000; acc:  90.15; ppl:  1.46; xent: 0.38; lr: 0.00010; 5185/6662 tok/s;   6103 sec
[2021-04-25 04:22:07,880 INFO] Step 15800/50000; acc:  90.05; ppl:  1.47; xent: 0.38; lr: 0.00010; 5200/6513 tok/s;   6122 sec
[2021-04-25 04:22:27,080 INFO] Step 15850/50000; acc:  89.63; ppl:  1.48; xent: 0.40; lr: 0.00010; 5208/6520 tok/s;   6141 sec
[2021-04-25 04:22:46,433 INFO] Step 15900/50000; acc:  89.92; ppl:  1.47; xent: 0.38; lr: 0.00010; 5272/6487 tok/s;   6161 sec
[2021-04-25 04:23:05,319 INFO] Step 15950/50000; acc:  90.35; ppl:  1.45; xent: 0.37; lr: 0.00010; 5486/6999 tok/s;   6180 sec
[2021-04-25 04:23:23,612 INFO] Step 16000/50000; acc:  89.84; ppl:  1.47; xent: 0.39; lr: 0.00010; 5420/6771 tok/s;   6198 sec
[2021-04-25 04:23:35,739 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 04:23:42,823 INFO] Step 16050/50000; acc:  90.14; ppl:  1.46; xent: 0.38; lr: 0.00010; 5381/6889 tok/s;   6217 sec
[2021-04-25 04:24:02,181 INFO] Step 16100/50000; acc:  90.11; ppl:  1.47; xent: 0.38; lr: 0.00010; 5312/6653 tok/s;   6237 sec
[2021-04-25 04:24:20,434 INFO] Step 16150/50000; acc:  89.47; ppl:  1.49; xent: 0.40; lr: 0.00010; 5460/6734 tok/s;   6255 sec
[2021-04-25 04:24:39,706 INFO] Step 16200/50000; acc:  89.61; ppl:  1.49; xent: 0.40; lr: 0.00010; 5225/6511 tok/s;   6274 sec
[2021-04-25 04:24:59,422 INFO] Step 16250/50000; acc:  90.05; ppl:  1.46; xent: 0.38; lr: 0.00010; 5178/6564 tok/s;   6294 sec
[2021-04-25 04:25:18,622 INFO] Step 16300/50000; acc:  90.00; ppl:  1.47; xent: 0.38; lr: 0.00010; 5302/6655 tok/s;   6313 sec
[2021-04-25 04:25:37,693 INFO] Step 16350/50000; acc:  89.74; ppl:  1.48; xent: 0.39; lr: 0.00010; 5250/6520 tok/s;   6332 sec
[2021-04-25 04:25:56,591 INFO] Step 16400/50000; acc:  90.06; ppl:  1.46; xent: 0.38; lr: 0.00010; 5597/6734 tok/s;   6351 sec
[2021-04-25 04:26:16,188 INFO] Step 16450/50000; acc:  89.63; ppl:  1.48; xent: 0.39; lr: 0.00010; 5137/6379 tok/s;   6371 sec
[2021-04-25 04:26:35,526 INFO] Step 16500/50000; acc:  90.03; ppl:  1.46; xent: 0.38; lr: 0.00010; 5221/6547 tok/s;   6390 sec
[2021-04-25 04:26:55,028 INFO] Step 16550/50000; acc:  90.18; ppl:  1.46; xent: 0.38; lr: 0.00010; 5291/6712 tok/s;   6409 sec
[2021-04-25 04:27:14,274 INFO] Step 16600/50000; acc:  89.88; ppl:  1.47; xent: 0.39; lr: 0.00010; 5265/6583 tok/s;   6429 sec
[2021-04-25 04:27:32,793 INFO] Step 16650/50000; acc:  90.04; ppl:  1.46; xent: 0.38; lr: 0.00010; 5431/6831 tok/s;   6447 sec
[2021-04-25 04:27:50,894 INFO] Step 16700/50000; acc:  90.37; ppl:  1.45; xent: 0.37; lr: 0.00010; 5589/7145 tok/s;   6465 sec
[2021-04-25 04:28:09,903 INFO] Step 16750/50000; acc:  89.80; ppl:  1.47; xent: 0.39; lr: 0.00010; 5438/6665 tok/s;   6484 sec
[2021-04-25 04:28:16,497 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 04:28:29,099 INFO] Step 16800/50000; acc:  90.03; ppl:  1.46; xent: 0.38; lr: 0.00010; 5217/6681 tok/s;   6503 sec
[2021-04-25 04:28:48,121 INFO] Step 16850/50000; acc:  90.20; ppl:  1.46; xent: 0.38; lr: 0.00010; 5424/6926 tok/s;   6522 sec
[2021-04-25 04:29:06,525 INFO] Step 16900/50000; acc:  89.66; ppl:  1.49; xent: 0.40; lr: 0.00010; 5549/6637 tok/s;   6541 sec
[2021-04-25 04:29:26,150 INFO] Step 16950/50000; acc:  89.94; ppl:  1.47; xent: 0.39; lr: 0.00010; 5068/6602 tok/s;   6560 sec
[2021-04-25 04:29:45,263 INFO] Step 17000/50000; acc:  89.89; ppl:  1.47; xent: 0.38; lr: 0.00010; 5285/6525 tok/s;   6580 sec
[2021-04-25 04:30:04,293 INFO] Step 17050/50000; acc:  90.00; ppl:  1.46; xent: 0.38; lr: 0.00010; 5357/6690 tok/s;   6599 sec
[2021-04-25 04:30:23,819 INFO] Step 17100/50000; acc:  89.94; ppl:  1.47; xent: 0.38; lr: 0.00010; 5306/6422 tok/s;   6618 sec
[2021-04-25 04:30:42,136 INFO] Step 17150/50000; acc:  89.92; ppl:  1.47; xent: 0.38; lr: 0.00010; 5453/6729 tok/s;   6636 sec
[2021-04-25 04:31:02,031 INFO] Step 17200/50000; acc:  89.97; ppl:  1.47; xent: 0.38; lr: 0.00010; 5264/6527 tok/s;   6656 sec
[2021-04-25 04:31:21,072 INFO] Step 17250/50000; acc:  90.17; ppl:  1.45; xent: 0.37; lr: 0.00010; 5288/6699 tok/s;   6675 sec
[2021-04-25 04:31:40,530 INFO] Step 17300/50000; acc:  90.15; ppl:  1.46; xent: 0.38; lr: 0.00010; 5186/6628 tok/s;   6695 sec
[2021-04-25 04:31:59,642 INFO] Step 17350/50000; acc:  89.89; ppl:  1.47; xent: 0.39; lr: 0.00010; 5417/6515 tok/s;   6714 sec
[2021-04-25 04:32:18,743 INFO] Step 17400/50000; acc:  90.48; ppl:  1.44; xent: 0.36; lr: 0.00010; 5294/6848 tok/s;   6733 sec
[2021-04-25 04:32:37,110 INFO] Step 17450/50000; acc:  90.33; ppl:  1.45; xent: 0.37; lr: 0.00010; 5439/6938 tok/s;   6751 sec
[2021-04-25 04:32:55,748 INFO] Step 17500/50000; acc:  89.88; ppl:  1.46; xent: 0.38; lr: 0.00010; 5456/6862 tok/s;   6770 sec
[2021-04-25 04:32:57,206 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 04:33:15,348 INFO] Step 17550/50000; acc:  90.15; ppl:  1.46; xent: 0.38; lr: 0.00010; 5308/6564 tok/s;   6790 sec
[2021-04-25 04:33:33,623 INFO] Step 17600/50000; acc:  89.93; ppl:  1.47; xent: 0.38; lr: 0.00010; 5440/6810 tok/s;   6808 sec
[2021-04-25 04:33:52,749 INFO] Step 17650/50000; acc:  89.78; ppl:  1.47; xent: 0.39; lr: 0.00010; 5358/6699 tok/s;   6827 sec
[2021-04-25 04:34:12,459 INFO] Step 17700/50000; acc:  90.21; ppl:  1.45; xent: 0.37; lr: 0.00010; 5194/6562 tok/s;   6847 sec
[2021-04-25 04:34:31,569 INFO] Step 17750/50000; acc:  89.78; ppl:  1.47; xent: 0.39; lr: 0.00010; 5204/6463 tok/s;   6866 sec
[2021-04-25 04:34:50,506 INFO] Step 17800/50000; acc:  90.17; ppl:  1.46; xent: 0.38; lr: 0.00010; 5355/6723 tok/s;   6885 sec
[2021-04-25 04:35:09,950 INFO] Step 17850/50000; acc:  89.91; ppl:  1.47; xent: 0.38; lr: 0.00010; 5366/6428 tok/s;   6904 sec
[2021-04-25 04:35:28,838 INFO] Step 17900/50000; acc:  90.06; ppl:  1.46; xent: 0.38; lr: 0.00010; 5401/6762 tok/s;   6923 sec
[2021-04-25 04:35:48,025 INFO] Step 17950/50000; acc:  90.02; ppl:  1.46; xent: 0.38; lr: 0.00010; 5176/6576 tok/s;   6942 sec
[2021-04-25 04:36:07,663 INFO] Step 18000/50000; acc:  90.35; ppl:  1.45; xent: 0.37; lr: 0.00010; 5343/6751 tok/s;   6962 sec
[2021-04-25 04:36:27,034 INFO] Step 18050/50000; acc:  89.67; ppl:  1.48; xent: 0.39; lr: 0.00010; 5171/6353 tok/s;   6981 sec
[2021-04-25 04:36:46,238 INFO] Step 18100/50000; acc:  90.36; ppl:  1.44; xent: 0.36; lr: 0.00010; 5317/6650 tok/s;   7001 sec
[2021-04-25 04:37:05,087 INFO] Step 18150/50000; acc:  90.51; ppl:  1.44; xent: 0.36; lr: 0.00010; 5446/6884 tok/s;   7019 sec
[2021-04-25 04:37:24,273 INFO] Step 18200/50000; acc:  90.11; ppl:  1.45; xent: 0.37; lr: 0.00010; 5249/6654 tok/s;   7039 sec
[2021-04-25 04:37:27,019 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 04:37:42,939 INFO] Step 18250/50000; acc:  89.99; ppl:  1.46; xent: 0.38; lr: 0.00010; 5386/6798 tok/s;   7057 sec
[2021-04-25 04:38:01,816 INFO] Step 18300/50000; acc:  90.26; ppl:  1.45; xent: 0.37; lr: 0.00010; 5397/6755 tok/s;   7076 sec
[2021-04-25 04:38:20,159 INFO] Step 18350/50000; acc:  89.67; ppl:  1.48; xent: 0.39; lr: 0.00010; 5599/6798 tok/s;   7094 sec
[2021-04-25 04:38:39,645 INFO] Step 18400/50000; acc:  89.91; ppl:  1.46; xent: 0.38; lr: 0.00010; 5123/6543 tok/s;   7114 sec
[2021-04-25 04:38:59,317 INFO] Step 18450/50000; acc:  90.28; ppl:  1.45; xent: 0.37; lr: 0.00010; 5220/6522 tok/s;   7134 sec
[2021-04-25 04:39:18,589 INFO] Step 18500/50000; acc:  90.08; ppl:  1.46; xent: 0.38; lr: 0.00010; 5286/6638 tok/s;   7153 sec
[2021-04-25 04:39:37,806 INFO] Step 18550/50000; acc:  90.06; ppl:  1.46; xent: 0.38; lr: 0.00010; 5255/6363 tok/s;   7172 sec
[2021-04-25 04:39:56,308 INFO] Step 18600/50000; acc:  89.89; ppl:  1.47; xent: 0.38; lr: 0.00010; 5533/6826 tok/s;   7191 sec
[2021-04-25 04:40:15,940 INFO] Step 18650/50000; acc:  90.28; ppl:  1.45; xent: 0.37; lr: 0.00010; 5218/6556 tok/s;   7210 sec
[2021-04-25 04:40:35,424 INFO] Step 18700/50000; acc:  90.40; ppl:  1.44; xent: 0.36; lr: 0.00010; 5244/6654 tok/s;   7230 sec
[2021-04-25 04:40:54,381 INFO] Step 18750/50000; acc:  90.00; ppl:  1.46; xent: 0.38; lr: 0.00010; 5246/6633 tok/s;   7249 sec
[2021-04-25 04:41:13,786 INFO] Step 18800/50000; acc:  89.99; ppl:  1.46; xent: 0.38; lr: 0.00010; 5410/6580 tok/s;   7268 sec
[2021-04-25 04:41:32,581 INFO] Step 18850/50000; acc:  90.47; ppl:  1.44; xent: 0.36; lr: 0.00010; 5383/6827 tok/s;   7287 sec
[2021-04-25 04:41:51,491 INFO] Step 18900/50000; acc:  90.46; ppl:  1.44; xent: 0.36; lr: 0.00010; 5292/6844 tok/s;   7306 sec
[2021-04-25 04:42:08,115 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 04:42:10,794 INFO] Step 18950/50000; acc:  90.26; ppl:  1.44; xent: 0.37; lr: 0.00010; 5348/6729 tok/s;   7325 sec
[2021-04-25 04:42:29,790 INFO] Step 19000/50000; acc:  90.04; ppl:  1.46; xent: 0.38; lr: 0.00010; 5315/6528 tok/s;   7344 sec
[2021-04-25 04:42:48,248 INFO] Step 19050/50000; acc:  90.27; ppl:  1.45; xent: 0.37; lr: 0.00010; 5453/6945 tok/s;   7363 sec
[2021-04-25 04:43:06,913 INFO] Step 19100/50000; acc:  89.55; ppl:  1.48; xent: 0.39; lr: 0.00010; 5394/6603 tok/s;   7381 sec
[2021-04-25 04:43:26,669 INFO] Step 19150/50000; acc:  90.29; ppl:  1.44; xent: 0.37; lr: 0.00010; 5240/6634 tok/s;   7401 sec
[2021-04-25 04:43:45,901 INFO] Step 19200/50000; acc:  90.05; ppl:  1.46; xent: 0.38; lr: 0.00010; 5185/6471 tok/s;   7420 sec
[2021-04-25 04:44:05,373 INFO] Step 19250/50000; acc:  90.39; ppl:  1.44; xent: 0.37; lr: 0.00010; 5290/6692 tok/s;   7440 sec
[2021-04-25 04:44:24,631 INFO] Step 19300/50000; acc:  90.01; ppl:  1.46; xent: 0.38; lr: 0.00010; 5378/6420 tok/s;   7459 sec
[2021-04-25 04:44:43,775 INFO] Step 19350/50000; acc:  89.88; ppl:  1.46; xent: 0.38; lr: 0.00010; 5271/6515 tok/s;   7478 sec
[2021-04-25 04:45:03,019 INFO] Step 19400/50000; acc:  90.33; ppl:  1.44; xent: 0.36; lr: 0.00010; 5222/6698 tok/s;   7497 sec
[2021-04-25 04:45:23,146 INFO] Step 19450/50000; acc:  90.33; ppl:  1.45; xent: 0.37; lr: 0.00010; 5120/6444 tok/s;   7517 sec
[2021-04-25 04:45:42,734 INFO] Step 19500/50000; acc:  90.08; ppl:  1.45; xent: 0.37; lr: 0.00010; 5203/6505 tok/s;   7537 sec
[2021-04-25 04:46:01,570 INFO] Step 19550/50000; acc:  90.11; ppl:  1.45; xent: 0.37; lr: 0.00010; 5295/6504 tok/s;   7556 sec
[2021-04-25 04:46:20,719 INFO] Step 19600/50000; acc:  90.75; ppl:  1.43; xent: 0.35; lr: 0.00010; 5466/6945 tok/s;   7575 sec
[2021-04-25 04:46:39,065 INFO] Step 19650/50000; acc:  90.10; ppl:  1.45; xent: 0.37; lr: 0.00010; 5434/6804 tok/s;   7593 sec
[2021-04-25 04:46:50,296 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 04:46:58,135 INFO] Step 19700/50000; acc:  90.32; ppl:  1.44; xent: 0.37; lr: 0.00010; 5321/6876 tok/s;   7612 sec
[2021-04-25 04:47:17,470 INFO] Step 19750/50000; acc:  90.34; ppl:  1.45; xent: 0.37; lr: 0.00010; 5362/6631 tok/s;   7632 sec
[2021-04-25 04:47:35,560 INFO] Step 19800/50000; acc:  89.84; ppl:  1.47; xent: 0.38; lr: 0.00010; 5563/6801 tok/s;   7650 sec
[2021-04-25 04:47:54,952 INFO] Step 19850/50000; acc:  89.86; ppl:  1.46; xent: 0.38; lr: 0.00010; 5134/6471 tok/s;   7669 sec
[2021-04-25 04:48:14,301 INFO] Step 19900/50000; acc:  90.39; ppl:  1.44; xent: 0.36; lr: 0.00010; 5241/6605 tok/s;   7689 sec
[2021-04-25 04:48:33,829 INFO] Step 19950/50000; acc:  90.34; ppl:  1.45; xent: 0.37; lr: 0.00010; 5289/6707 tok/s;   7708 sec
[2021-04-25 04:48:53,097 INFO] Step 20000/50000; acc:  90.00; ppl:  1.46; xent: 0.38; lr: 0.00010; 5221/6374 tok/s;   7727 sec
[2021-04-25 04:48:53,098 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/basic/valid.txt, align=None)...
[2021-04-25 04:49:19,680 INFO] Validation perplexity: 1.4531
[2021-04-25 04:49:19,680 INFO] Validation accuracy: 90.1975
[2021-04-25 04:49:19,684 INFO] Saving checkpoint ../models/group1_params/basic_ops/model_step_20000.pt
[2021-04-25 04:49:38,309 INFO] Step 20050/50000; acc:  90.44; ppl:  1.44; xent: 0.36; lr: 0.00010; 2306/2818 tok/s;   7773 sec
[2021-04-25 04:49:58,302 INFO] Step 20100/50000; acc:  90.16; ppl:  1.45; xent: 0.37; lr: 0.00010; 5134/6367 tok/s;   7793 sec
[2021-04-25 04:50:18,068 INFO] Step 20150/50000; acc:  90.19; ppl:  1.44; xent: 0.37; lr: 0.00010; 5058/6355 tok/s;   7812 sec
[2021-04-25 04:50:37,292 INFO] Step 20200/50000; acc:  90.34; ppl:  1.44; xent: 0.36; lr: 0.00010; 5271/6697 tok/s;   7832 sec
[2021-04-25 04:50:56,531 INFO] Step 20250/50000; acc:  90.27; ppl:  1.45; xent: 0.37; lr: 0.00010; 5335/6613 tok/s;   7851 sec
[2021-04-25 04:51:15,288 INFO] Step 20300/50000; acc:  90.39; ppl:  1.44; xent: 0.36; lr: 0.00010; 5457/6754 tok/s;   7870 sec
[2021-04-25 04:51:33,599 INFO] Step 20350/50000; acc:  90.57; ppl:  1.43; xent: 0.36; lr: 0.00010; 5395/7013 tok/s;   7888 sec
[2021-04-25 04:51:52,654 INFO] Step 20400/50000; acc:  90.29; ppl:  1.44; xent: 0.36; lr: 0.00010; 5484/6728 tok/s;   7907 sec
[2021-04-25 04:51:58,374 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 04:52:11,800 INFO] Step 20450/50000; acc:  90.37; ppl:  1.45; xent: 0.37; lr: 0.00010; 5249/6724 tok/s;   7926 sec
[2021-04-25 04:52:30,262 INFO] Step 20500/50000; acc:  90.34; ppl:  1.44; xent: 0.36; lr: 0.00010; 5491/7047 tok/s;   7945 sec
[2021-04-25 04:52:48,994 INFO] Step 20550/50000; acc:  90.00; ppl:  1.46; xent: 0.38; lr: 0.00010; 5489/6567 tok/s;   7963 sec
[2021-04-25 04:53:08,774 INFO] Step 20600/50000; acc:  90.13; ppl:  1.45; xent: 0.37; lr: 0.00010; 5089/6543 tok/s;   7983 sec
[2021-04-25 04:53:27,779 INFO] Step 20650/50000; acc:  90.16; ppl:  1.45; xent: 0.37; lr: 0.00010; 5258/6546 tok/s;   8002 sec
[2021-04-25 04:53:46,787 INFO] Step 20700/50000; acc:  90.36; ppl:  1.44; xent: 0.36; lr: 0.00010; 5320/6662 tok/s;   8021 sec
[2021-04-25 04:54:06,143 INFO] Step 20750/50000; acc:  90.27; ppl:  1.45; xent: 0.37; lr: 0.00010; 5438/6578 tok/s;   8040 sec
[2021-04-25 04:54:24,641 INFO] Step 20800/50000; acc:  90.15; ppl:  1.45; xent: 0.37; lr: 0.00010; 5412/6687 tok/s;   8059 sec
[2021-04-25 04:54:43,971 INFO] Step 20850/50000; acc:  90.11; ppl:  1.45; xent: 0.37; lr: 0.00010; 5346/6569 tok/s;   8078 sec
[2021-04-25 04:55:03,272 INFO] Step 20900/50000; acc:  90.65; ppl:  1.42; xent: 0.35; lr: 0.00010; 5323/6805 tok/s;   8098 sec
[2021-04-25 04:55:22,702 INFO] Step 20950/50000; acc:  90.27; ppl:  1.44; xent: 0.37; lr: 0.00010; 5131/6523 tok/s;   8117 sec
[2021-04-25 04:55:41,479 INFO] Step 21000/50000; acc:  90.14; ppl:  1.45; xent: 0.37; lr: 0.00010; 5420/6606 tok/s;   8136 sec
[2021-04-25 04:56:00,714 INFO] Step 21050/50000; acc:  90.79; ppl:  1.42; xent: 0.35; lr: 0.00010; 5321/6822 tok/s;   8155 sec
[2021-04-25 04:56:19,451 INFO] Step 21100/50000; acc:  90.39; ppl:  1.43; xent: 0.36; lr: 0.00010; 5432/6826 tok/s;   8174 sec
[2021-04-25 04:56:26,876 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 04:56:37,951 INFO] Step 21150/50000; acc:  90.14; ppl:  1.45; xent: 0.37; lr: 0.00010; 5383/6811 tok/s;   8192 sec
[2021-04-25 04:56:57,761 INFO] Step 21200/50000; acc:  90.62; ppl:  1.43; xent: 0.36; lr: 0.00010; 5298/6636 tok/s;   8212 sec
[2021-04-25 04:57:15,912 INFO] Step 21250/50000; acc:  90.06; ppl:  1.45; xent: 0.37; lr: 0.00010; 5499/6820 tok/s;   8230 sec
[2021-04-25 04:57:35,240 INFO] Step 21300/50000; acc:  90.14; ppl:  1.45; xent: 0.37; lr: 0.00010; 5210/6589 tok/s;   8250 sec
[2021-04-25 04:57:54,822 INFO] Step 21350/50000; acc:  90.40; ppl:  1.44; xent: 0.36; lr: 0.00010; 5261/6605 tok/s;   8269 sec
[2021-04-25 04:58:13,913 INFO] Step 21400/50000; acc:  90.21; ppl:  1.44; xent: 0.37; lr: 0.00010; 5268/6561 tok/s;   8288 sec
[2021-04-25 04:58:32,630 INFO] Step 21450/50000; acc:  90.37; ppl:  1.43; xent: 0.36; lr: 0.00010; 5367/6729 tok/s;   8307 sec
[2021-04-25 04:58:51,881 INFO] Step 21500/50000; acc:  90.22; ppl:  1.45; xent: 0.37; lr: 0.00010; 5381/6465 tok/s;   8326 sec
[2021-04-25 04:59:10,998 INFO] Step 21550/50000; acc:  90.23; ppl:  1.44; xent: 0.37; lr: 0.00010; 5409/6683 tok/s;   8345 sec
[2021-04-25 04:59:30,359 INFO] Step 21600/50000; acc:  90.48; ppl:  1.43; xent: 0.36; lr: 0.00010; 5157/6533 tok/s;   8365 sec
[2021-04-25 04:59:49,783 INFO] Step 21650/50000; acc:  90.46; ppl:  1.43; xent: 0.36; lr: 0.00010; 5317/6772 tok/s;   8384 sec
[2021-04-25 05:00:09,163 INFO] Step 21700/50000; acc:  90.26; ppl:  1.44; xent: 0.37; lr: 0.00010; 5290/6522 tok/s;   8404 sec
[2021-04-25 05:00:28,361 INFO] Step 21750/50000; acc:  90.44; ppl:  1.43; xent: 0.36; lr: 0.00010; 5244/6524 tok/s;   8423 sec
[2021-04-25 05:00:46,940 INFO] Step 21800/50000; acc:  90.80; ppl:  1.41; xent: 0.35; lr: 0.00010; 5424/6951 tok/s;   8441 sec
[2021-04-25 05:01:06,015 INFO] Step 21850/50000; acc:  90.27; ppl:  1.44; xent: 0.36; lr: 0.00010; 5346/6714 tok/s;   8460 sec
[2021-04-25 05:01:08,068 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 05:01:25,073 INFO] Step 21900/50000; acc:  90.46; ppl:  1.43; xent: 0.36; lr: 0.00010; 5368/6767 tok/s;   8479 sec
[2021-04-25 05:01:43,552 INFO] Step 21950/50000; acc:  90.47; ppl:  1.43; xent: 0.36; lr: 0.00010; 5398/6746 tok/s;   8498 sec
[2021-04-25 05:02:02,336 INFO] Step 22000/50000; acc:  90.07; ppl:  1.45; xent: 0.37; lr: 0.00010; 5523/6756 tok/s;   8517 sec
[2021-04-25 05:02:21,669 INFO] Step 22050/50000; acc:  90.16; ppl:  1.45; xent: 0.37; lr: 0.00010; 5191/6573 tok/s;   8536 sec
[2021-04-25 05:02:41,355 INFO] Step 22100/50000; acc:  90.40; ppl:  1.43; xent: 0.36; lr: 0.00010; 5124/6404 tok/s;   8556 sec
[2021-04-25 05:03:00,764 INFO] Step 22150/50000; acc:  90.43; ppl:  1.44; xent: 0.36; lr: 0.00010; 5277/6676 tok/s;   8575 sec
[2021-04-25 05:03:19,684 INFO] Step 22200/50000; acc:  90.43; ppl:  1.43; xent: 0.36; lr: 0.00010; 5416/6558 tok/s;   8594 sec
[2021-04-25 05:03:38,464 INFO] Step 22250/50000; acc:  90.12; ppl:  1.45; xent: 0.37; lr: 0.00010; 5386/6668 tok/s;   8613 sec
[2021-04-25 05:03:58,415 INFO] Step 22300/50000; acc:  90.34; ppl:  1.44; xent: 0.36; lr: 0.00010; 5096/6369 tok/s;   8633 sec
[2021-04-25 05:04:17,705 INFO] Step 22350/50000; acc:  90.68; ppl:  1.42; xent: 0.35; lr: 0.00010; 5372/6791 tok/s;   8652 sec
[2021-04-25 05:04:36,508 INFO] Step 22400/50000; acc:  90.32; ppl:  1.44; xent: 0.36; lr: 0.00010; 5309/6635 tok/s;   8671 sec
[2021-04-25 05:04:56,110 INFO] Step 22450/50000; acc:  90.29; ppl:  1.44; xent: 0.36; lr: 0.00010; 5283/6541 tok/s;   8690 sec
[2021-04-25 05:05:14,800 INFO] Step 22500/50000; acc:  90.80; ppl:  1.41; xent: 0.34; lr: 0.00010; 5517/6991 tok/s;   8709 sec
[2021-04-25 05:05:33,703 INFO] Step 22550/50000; acc:  90.62; ppl:  1.42; xent: 0.35; lr: 0.00010; 5238/6701 tok/s;   8728 sec
[2021-04-25 05:05:49,475 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 05:05:52,726 INFO] Step 22600/50000; acc:  90.44; ppl:  1.43; xent: 0.35; lr: 0.00010; 5328/6763 tok/s;   8747 sec
[2021-04-25 05:06:12,121 INFO] Step 22650/50000; acc:  90.26; ppl:  1.44; xent: 0.37; lr: 0.00010; 5275/6478 tok/s;   8766 sec
[2021-04-25 05:06:30,903 INFO] Step 22700/50000; acc:  90.48; ppl:  1.43; xent: 0.36; lr: 0.00010; 5451/6816 tok/s;   8785 sec
[2021-04-25 05:06:49,440 INFO] Step 22750/50000; acc:  89.86; ppl:  1.46; xent: 0.38; lr: 0.00010; 5313/6588 tok/s;   8804 sec
[2021-04-25 05:07:09,552 INFO] Step 22800/50000; acc:  90.60; ppl:  1.42; xent: 0.35; lr: 0.00010; 5196/6625 tok/s;   8824 sec
[2021-04-25 05:07:28,611 INFO] Step 22850/50000; acc:  90.34; ppl:  1.44; xent: 0.36; lr: 0.00010; 5259/6491 tok/s;   8843 sec
[2021-04-25 05:07:47,820 INFO] Step 22900/50000; acc:  90.70; ppl:  1.42; xent: 0.35; lr: 0.00010; 5273/6785 tok/s;   8862 sec
[2021-04-25 05:08:06,911 INFO] Step 22950/50000; acc:  90.23; ppl:  1.44; xent: 0.37; lr: 0.00010; 5459/6464 tok/s;   8881 sec
[2021-04-25 05:08:26,370 INFO] Step 23000/50000; acc:  90.23; ppl:  1.44; xent: 0.36; lr: 0.00010; 5233/6440 tok/s;   8901 sec
[2021-04-25 05:08:45,525 INFO] Step 23050/50000; acc:  90.46; ppl:  1.43; xent: 0.35; lr: 0.00010; 5203/6645 tok/s;   8920 sec
[2021-04-25 05:09:05,565 INFO] Step 23100/50000; acc:  90.59; ppl:  1.42; xent: 0.35; lr: 0.00010; 5098/6476 tok/s;   8940 sec
[2021-04-25 05:09:25,313 INFO] Step 23150/50000; acc:  90.51; ppl:  1.43; xent: 0.36; lr: 0.00010; 5240/6508 tok/s;   8960 sec
[2021-04-25 05:09:44,046 INFO] Step 23200/50000; acc:  90.41; ppl:  1.43; xent: 0.36; lr: 0.00010; 5357/6584 tok/s;   8978 sec
[2021-04-25 05:10:02,985 INFO] Step 23250/50000; acc:  90.84; ppl:  1.41; xent: 0.35; lr: 0.00010; 5433/6904 tok/s;   8997 sec
[2021-04-25 05:10:21,760 INFO] Step 23300/50000; acc:  90.50; ppl:  1.43; xent: 0.35; lr: 0.00010; 5424/6836 tok/s;   9016 sec
[2021-04-25 05:10:32,003 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 05:10:40,773 INFO] Step 23350/50000; acc:  90.55; ppl:  1.43; xent: 0.35; lr: 0.00010; 5273/6813 tok/s;   9035 sec
[2021-04-25 05:10:59,761 INFO] Step 23400/50000; acc:  90.55; ppl:  1.43; xent: 0.36; lr: 0.00010; 5365/6661 tok/s;   9054 sec
[2021-04-25 05:11:17,862 INFO] Step 23450/50000; acc:  90.01; ppl:  1.45; xent: 0.37; lr: 0.00010; 5625/6853 tok/s;   9072 sec
[2021-04-25 05:11:37,325 INFO] Step 23500/50000; acc:  90.28; ppl:  1.44; xent: 0.36; lr: 0.00010; 5215/6543 tok/s;   9092 sec
[2021-04-25 05:11:56,871 INFO] Step 23550/50000; acc:  90.45; ppl:  1.43; xent: 0.35; lr: 0.00010; 5077/6421 tok/s;   9111 sec
[2021-04-25 05:12:16,097 INFO] Step 23600/50000; acc:  90.59; ppl:  1.43; xent: 0.36; lr: 0.00010; 5417/6763 tok/s;   9130 sec
[2021-04-25 05:12:35,363 INFO] Step 23650/50000; acc:  90.34; ppl:  1.43; xent: 0.36; lr: 0.00010; 5261/6457 tok/s;   9150 sec
[2021-04-25 05:12:53,918 INFO] Step 23700/50000; acc:  90.64; ppl:  1.42; xent: 0.35; lr: 0.00010; 5514/6839 tok/s;   9168 sec
[2021-04-25 05:13:13,649 INFO] Step 23750/50000; acc:  90.29; ppl:  1.44; xent: 0.36; lr: 0.00010; 5230/6431 tok/s;   9188 sec
[2021-04-25 05:13:33,279 INFO] Step 23800/50000; acc:  90.60; ppl:  1.42; xent: 0.35; lr: 0.00010; 5156/6473 tok/s;   9208 sec
[2021-04-25 05:13:52,552 INFO] Step 23850/50000; acc:  90.56; ppl:  1.42; xent: 0.35; lr: 0.00010; 5198/6588 tok/s;   9227 sec
[2021-04-25 05:14:12,011 INFO] Step 23900/50000; acc:  90.48; ppl:  1.43; xent: 0.36; lr: 0.00010; 5223/6589 tok/s;   9246 sec
[2021-04-25 05:14:30,979 INFO] Step 23950/50000; acc:  90.69; ppl:  1.42; xent: 0.35; lr: 0.00010; 5487/6681 tok/s;   9265 sec
[2021-04-25 05:14:49,440 INFO] Step 24000/50000; acc:  90.78; ppl:  1.41; xent: 0.35; lr: 0.00010; 5375/7030 tok/s;   9284 sec
[2021-04-25 05:15:08,297 INFO] Step 24050/50000; acc:  90.48; ppl:  1.42; xent: 0.35; lr: 0.00010; 5463/6749 tok/s;   9303 sec
[2021-04-25 05:15:13,364 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 05:15:27,721 INFO] Step 24100/50000; acc:  90.71; ppl:  1.42; xent: 0.35; lr: 0.00010; 5284/6693 tok/s;   9322 sec
[2021-04-25 05:15:46,085 INFO] Step 24150/50000; acc:  90.45; ppl:  1.43; xent: 0.36; lr: 0.00010; 5446/6880 tok/s;   9340 sec
[2021-04-25 05:16:04,958 INFO] Step 24200/50000; acc:  90.24; ppl:  1.44; xent: 0.36; lr: 0.00010; 5350/6578 tok/s;   9359 sec
[2021-04-25 05:16:24,528 INFO] Step 24250/50000; acc:  90.41; ppl:  1.43; xent: 0.36; lr: 0.00010; 5207/6671 tok/s;   9379 sec
[2021-04-25 05:16:43,913 INFO] Step 24300/50000; acc:  90.42; ppl:  1.43; xent: 0.36; lr: 0.00010; 5255/6444 tok/s;   9398 sec
[2021-04-25 05:17:02,626 INFO] Step 24350/50000; acc:  90.64; ppl:  1.42; xent: 0.35; lr: 0.00010; 5288/6655 tok/s;   9417 sec
[2021-04-25 05:17:22,330 INFO] Step 24400/50000; acc:  90.50; ppl:  1.43; xent: 0.36; lr: 0.00010; 5399/6504 tok/s;   9437 sec
[2021-04-25 05:17:40,923 INFO] Step 24450/50000; acc:  90.48; ppl:  1.43; xent: 0.35; lr: 0.00010; 5408/6761 tok/s;   9455 sec
[2021-04-25 05:18:00,082 INFO] Step 24500/50000; acc:  90.32; ppl:  1.43; xent: 0.36; lr: 0.00010; 5295/6522 tok/s;   9474 sec
[2021-04-25 05:18:19,476 INFO] Step 24550/50000; acc:  90.92; ppl:  1.40; xent: 0.34; lr: 0.00010; 5330/6823 tok/s;   9494 sec
[2021-04-25 05:18:38,931 INFO] Step 24600/50000; acc:  90.57; ppl:  1.42; xent: 0.35; lr: 0.00010; 5179/6539 tok/s;   9513 sec
[2021-04-25 05:18:57,635 INFO] Step 24650/50000; acc:  90.50; ppl:  1.43; xent: 0.35; lr: 0.00010; 5387/6617 tok/s;   9532 sec
[2021-04-25 05:19:16,664 INFO] Step 24700/50000; acc:  90.99; ppl:  1.40; xent: 0.34; lr: 0.00010; 5336/6839 tok/s;   9551 sec
[2021-04-25 05:19:35,664 INFO] Step 24750/50000; acc:  90.66; ppl:  1.42; xent: 0.35; lr: 0.00010; 5433/6764 tok/s;   9570 sec
[2021-04-25 05:19:42,284 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 05:19:54,137 INFO] Step 24800/50000; acc:  90.44; ppl:  1.42; xent: 0.35; lr: 0.00010; 5429/6924 tok/s;   9588 sec
[2021-04-25 05:20:13,601 INFO] Step 24850/50000; acc:  90.69; ppl:  1.42; xent: 0.35; lr: 0.00010; 5301/6598 tok/s;   9608 sec
[2021-04-25 05:20:32,156 INFO] Step 24900/50000; acc:  90.50; ppl:  1.42; xent: 0.35; lr: 0.00010; 5501/6859 tok/s;   9626 sec
[2021-04-25 05:20:51,691 INFO] Step 24950/50000; acc:  90.14; ppl:  1.44; xent: 0.37; lr: 0.00010; 5088/6394 tok/s;   9646 sec
[2021-04-25 05:21:11,005 INFO] Step 25000/50000; acc:  90.59; ppl:  1.42; xent: 0.35; lr: 0.00010; 5242/6627 tok/s;   9665 sec
[2021-04-25 05:21:11,005 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/basic/valid.txt, align=None)...
[2021-04-25 05:21:37,524 INFO] Validation perplexity: 1.43913
[2021-04-25 05:21:37,524 INFO] Validation accuracy: 90.4644
[2021-04-25 05:21:37,528 INFO] Saving checkpoint ../models/group1_params/basic_ops/model_step_25000.pt
[2021-04-25 05:21:56,754 INFO] Step 25050/50000; acc:  90.56; ppl:  1.42; xent: 0.35; lr: 0.00010; 2223/2769 tok/s;   9711 sec
[2021-04-25 05:22:15,890 INFO] Step 25100/50000; acc:  90.69; ppl:  1.41; xent: 0.35; lr: 0.00010; 5353/6577 tok/s;   9730 sec
[2021-04-25 05:22:34,714 INFO] Step 25150/50000; acc:  90.35; ppl:  1.43; xent: 0.36; lr: 0.00010; 5377/6524 tok/s;   9749 sec
[2021-04-25 05:22:53,863 INFO] Step 25200/50000; acc:  90.62; ppl:  1.42; xent: 0.35; lr: 0.00010; 5463/6767 tok/s;   9768 sec
[2021-04-25 05:23:13,235 INFO] Step 25250/50000; acc:  90.59; ppl:  1.42; xent: 0.35; lr: 0.00010; 5187/6555 tok/s;   9788 sec
[2021-04-25 05:23:32,716 INFO] Step 25300/50000; acc:  90.73; ppl:  1.41; xent: 0.35; lr: 0.00010; 5194/6670 tok/s;   9807 sec
[2021-04-25 05:23:52,417 INFO] Step 25350/50000; acc:  90.43; ppl:  1.43; xent: 0.36; lr: 0.00010; 5239/6460 tok/s;   9827 sec
[2021-04-25 05:24:11,765 INFO] Step 25400/50000; acc:  90.81; ppl:  1.41; xent: 0.34; lr: 0.00010; 5265/6515 tok/s;   9846 sec
[2021-04-25 05:24:30,086 INFO] Step 25450/50000; acc:  90.94; ppl:  1.40; xent: 0.34; lr: 0.00010; 5433/6996 tok/s;   9864 sec
[2021-04-25 05:24:49,172 INFO] Step 25500/50000; acc:  90.54; ppl:  1.42; xent: 0.35; lr: 0.00010; 5318/6701 tok/s;   9884 sec
[2021-04-25 05:24:50,599 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 05:25:08,293 INFO] Step 25550/50000; acc:  90.68; ppl:  1.42; xent: 0.35; lr: 0.00010; 5418/6781 tok/s;   9903 sec
[2021-04-25 05:25:26,929 INFO] Step 25600/50000; acc:  90.61; ppl:  1.42; xent: 0.35; lr: 0.00010; 5386/6756 tok/s;   9921 sec
[2021-04-25 05:25:45,531 INFO] Step 25650/50000; acc:  90.33; ppl:  1.43; xent: 0.36; lr: 0.00010; 5482/6762 tok/s;   9940 sec
[2021-04-25 05:26:05,169 INFO] Step 25700/50000; acc:  90.52; ppl:  1.42; xent: 0.35; lr: 0.00010; 5215/6554 tok/s;   9960 sec
[2021-04-25 05:26:24,825 INFO] Step 25750/50000; acc:  90.57; ppl:  1.42; xent: 0.35; lr: 0.00010; 5079/6346 tok/s;   9979 sec
[2021-04-25 05:26:44,043 INFO] Step 25800/50000; acc:  90.59; ppl:  1.42; xent: 0.35; lr: 0.00010; 5232/6662 tok/s;   9998 sec
[2021-04-25 05:27:03,385 INFO] Step 25850/50000; acc:  90.57; ppl:  1.42; xent: 0.35; lr: 0.00010; 5366/6403 tok/s;  10018 sec
[2021-04-25 05:27:22,321 INFO] Step 25900/50000; acc:  90.52; ppl:  1.42; xent: 0.35; lr: 0.00010; 5435/6701 tok/s;  10037 sec
[2021-04-25 05:27:41,906 INFO] Step 25950/50000; acc:  90.59; ppl:  1.42; xent: 0.35; lr: 0.00010; 5072/6391 tok/s;  10056 sec
[2021-04-25 05:28:01,837 INFO] Step 26000/50000; acc:  90.97; ppl:  1.40; xent: 0.34; lr: 0.00010; 5261/6710 tok/s;  10076 sec
[2021-04-25 05:28:20,785 INFO] Step 26050/50000; acc:  90.44; ppl:  1.42; xent: 0.35; lr: 0.00010; 5297/6556 tok/s;  10095 sec
[2021-04-25 05:28:39,810 INFO] Step 26100/50000; acc:  90.54; ppl:  1.42; xent: 0.35; lr: 0.00010; 5343/6614 tok/s;  10114 sec
[2021-04-25 05:28:58,663 INFO] Step 26150/50000; acc:  91.05; ppl:  1.40; xent: 0.33; lr: 0.00010; 5489/7022 tok/s;  10133 sec
[2021-04-25 05:29:17,212 INFO] Step 26200/50000; acc:  90.72; ppl:  1.41; xent: 0.35; lr: 0.00010; 5402/6759 tok/s;  10152 sec
[2021-04-25 05:29:32,508 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 05:29:36,300 INFO] Step 26250/50000; acc:  90.74; ppl:  1.41; xent: 0.34; lr: 0.00010; 5263/6777 tok/s;  10171 sec
[2021-04-25 05:29:55,764 INFO] Step 26300/50000; acc:  90.61; ppl:  1.42; xent: 0.35; lr: 0.00010; 5217/6465 tok/s;  10190 sec
[2021-04-25 05:30:14,509 INFO] Step 26350/50000; acc:  90.57; ppl:  1.42; xent: 0.35; lr: 0.00010; 5531/6857 tok/s;  10209 sec
[2021-04-25 05:30:33,232 INFO] Step 26400/50000; acc:  90.19; ppl:  1.44; xent: 0.36; lr: 0.00010; 5292/6513 tok/s;  10228 sec
[2021-04-25 05:30:53,520 INFO] Step 26450/50000; acc:  90.74; ppl:  1.41; xent: 0.34; lr: 0.00010; 5080/6508 tok/s;  10248 sec
[2021-04-25 05:31:12,464 INFO] Step 26500/50000; acc:  90.69; ppl:  1.41; xent: 0.35; lr: 0.00010; 5394/6691 tok/s;  10267 sec
[2021-04-25 05:31:31,593 INFO] Step 26550/50000; acc:  90.73; ppl:  1.41; xent: 0.34; lr: 0.00010; 5238/6671 tok/s;  10286 sec
[2021-04-25 05:31:50,583 INFO] Step 26600/50000; acc:  90.52; ppl:  1.42; xent: 0.35; lr: 0.00010; 5403/6424 tok/s;  10305 sec
[2021-04-25 05:32:10,037 INFO] Step 26650/50000; acc:  90.45; ppl:  1.42; xent: 0.35; lr: 0.00010; 5286/6558 tok/s;  10324 sec
[2021-04-25 05:32:29,762 INFO] Step 26700/50000; acc:  90.82; ppl:  1.41; xent: 0.34; lr: 0.00010; 5157/6500 tok/s;  10344 sec
[2021-04-25 05:32:49,539 INFO] Step 26750/50000; acc:  90.73; ppl:  1.41; xent: 0.35; lr: 0.00010; 5046/6439 tok/s;  10364 sec
[2021-04-25 05:33:09,109 INFO] Step 26800/50000; acc:  90.84; ppl:  1.41; xent: 0.34; lr: 0.00010; 5341/6633 tok/s;  10383 sec
[2021-04-25 05:33:27,906 INFO] Step 26850/50000; acc:  90.62; ppl:  1.41; xent: 0.35; lr: 0.00010; 5361/6618 tok/s;  10402 sec
[2021-04-25 05:33:46,443 INFO] Step 26900/50000; acc:  91.10; ppl:  1.39; xent: 0.33; lr: 0.00010; 5449/6955 tok/s;  10421 sec
[2021-04-25 05:34:05,283 INFO] Step 26950/50000; acc:  90.72; ppl:  1.41; xent: 0.34; lr: 0.00010; 5444/6818 tok/s;  10440 sec
[2021-04-25 05:34:14,821 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 05:34:24,479 INFO] Step 27000/50000; acc:  90.83; ppl:  1.41; xent: 0.34; lr: 0.00010; 5277/6796 tok/s;  10459 sec
[2021-04-25 05:34:43,306 INFO] Step 27050/50000; acc:  90.72; ppl:  1.41; xent: 0.35; lr: 0.00010; 5367/6676 tok/s;  10478 sec
[2021-04-25 05:35:01,632 INFO] Step 27100/50000; acc:  90.26; ppl:  1.43; xent: 0.36; lr: 0.00010; 5502/6714 tok/s;  10496 sec
[2021-04-25 05:35:21,394 INFO] Step 27150/50000; acc:  90.60; ppl:  1.42; xent: 0.35; lr: 0.00010; 5211/6593 tok/s;  10516 sec
[2021-04-25 05:35:41,000 INFO] Step 27200/50000; acc:  90.59; ppl:  1.41; xent: 0.35; lr: 0.00010; 5088/6360 tok/s;  10535 sec
[2021-04-25 05:35:59,677 INFO] Step 27250/50000; acc:  90.84; ppl:  1.41; xent: 0.34; lr: 0.00010; 5485/6938 tok/s;  10554 sec
[2021-04-25 05:36:19,191 INFO] Step 27300/50000; acc:  90.68; ppl:  1.41; xent: 0.34; lr: 0.00010; 5311/6472 tok/s;  10574 sec
[2021-04-25 05:36:37,600 INFO] Step 27350/50000; acc:  90.73; ppl:  1.41; xent: 0.34; lr: 0.00010; 5494/6724 tok/s;  10592 sec
[2021-04-25 05:36:57,101 INFO] Step 27400/50000; acc:  90.52; ppl:  1.42; xent: 0.35; lr: 0.00010; 5200/6479 tok/s;  10611 sec
[2021-04-25 05:37:16,581 INFO] Step 27450/50000; acc:  90.82; ppl:  1.41; xent: 0.34; lr: 0.00010; 5253/6635 tok/s;  10631 sec
[2021-04-25 05:37:35,788 INFO] Step 27500/50000; acc:  90.81; ppl:  1.40; xent: 0.34; lr: 0.00010; 5313/6694 tok/s;  10650 sec
[2021-04-25 05:37:54,507 INFO] Step 27550/50000; acc:  90.62; ppl:  1.41; xent: 0.35; lr: 0.00010; 5315/6649 tok/s;  10669 sec
[2021-04-25 05:38:13,882 INFO] Step 27600/50000; acc:  91.11; ppl:  1.39; xent: 0.33; lr: 0.00010; 5424/6685 tok/s;  10688 sec
[2021-04-25 05:38:32,104 INFO] Step 27650/50000; acc:  90.90; ppl:  1.40; xent: 0.34; lr: 0.00010; 5470/7057 tok/s;  10706 sec
[2021-04-25 05:38:50,500 INFO] Step 27700/50000; acc:  90.62; ppl:  1.41; xent: 0.34; lr: 0.00010; 5511/6893 tok/s;  10725 sec
[2021-04-25 05:38:54,917 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 05:39:09,961 INFO] Step 27750/50000; acc:  90.92; ppl:  1.41; xent: 0.34; lr: 0.00010; 5303/6678 tok/s;  10744 sec
[2021-04-25 05:39:28,475 INFO] Step 27800/50000; acc:  90.78; ppl:  1.41; xent: 0.34; lr: 0.00010; 5458/6840 tok/s;  10763 sec
[2021-04-25 05:39:47,222 INFO] Step 27850/50000; acc:  90.44; ppl:  1.42; xent: 0.35; lr: 0.00010; 5323/6595 tok/s;  10782 sec
[2021-04-25 05:40:06,688 INFO] Step 27900/50000; acc:  90.63; ppl:  1.41; xent: 0.34; lr: 0.00010; 5204/6669 tok/s;  10801 sec
[2021-04-25 05:40:26,235 INFO] Step 27950/50000; acc:  90.66; ppl:  1.41; xent: 0.34; lr: 0.00010; 5281/6521 tok/s;  10821 sec
[2021-04-25 05:40:45,053 INFO] Step 28000/50000; acc:  90.71; ppl:  1.41; xent: 0.34; lr: 0.00010; 5291/6575 tok/s;  10839 sec
[2021-04-25 05:41:04,576 INFO] Step 28050/50000; acc:  90.66; ppl:  1.41; xent: 0.34; lr: 0.00010; 5372/6526 tok/s;  10859 sec
[2021-04-25 05:41:23,446 INFO] Step 28100/50000; acc:  90.93; ppl:  1.40; xent: 0.34; lr: 0.00010; 5437/6794 tok/s;  10878 sec
[2021-04-25 05:41:43,083 INFO] Step 28150/50000; acc:  90.48; ppl:  1.42; xent: 0.35; lr: 0.00010; 5109/6276 tok/s;  10897 sec
[2021-04-25 05:42:02,465 INFO] Step 28200/50000; acc:  91.02; ppl:  1.39; xent: 0.33; lr: 0.00010; 5238/6726 tok/s;  10917 sec
[2021-04-25 05:42:21,847 INFO] Step 28250/50000; acc:  90.75; ppl:  1.41; xent: 0.34; lr: 0.00010; 5256/6625 tok/s;  10936 sec
[2021-04-25 05:42:40,683 INFO] Step 28300/50000; acc:  90.82; ppl:  1.40; xent: 0.34; lr: 0.00010; 5458/6625 tok/s;  10955 sec
[2021-04-25 05:42:59,351 INFO] Step 28350/50000; acc:  91.08; ppl:  1.39; xent: 0.33; lr: 0.00010; 5319/6879 tok/s;  10974 sec
[2021-04-25 05:43:18,689 INFO] Step 28400/50000; acc:  90.89; ppl:  1.40; xent: 0.34; lr: 0.00010; 5379/6683 tok/s;  10993 sec
[2021-04-25 05:43:24,602 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 05:43:37,394 INFO] Step 28450/50000; acc:  90.68; ppl:  1.41; xent: 0.34; lr: 0.00010; 5400/6883 tok/s;  11012 sec
[2021-04-25 05:43:56,472 INFO] Step 28500/50000; acc:  90.97; ppl:  1.40; xent: 0.34; lr: 0.00010; 5308/6687 tok/s;  11031 sec
[2021-04-25 05:44:14,824 INFO] Step 28550/50000; acc:  90.58; ppl:  1.42; xent: 0.35; lr: 0.00010; 5585/6861 tok/s;  11049 sec
[2021-04-25 05:44:34,591 INFO] Step 28600/50000; acc:  90.43; ppl:  1.42; xent: 0.35; lr: 0.00010; 5092/6413 tok/s;  11069 sec
[2021-04-25 05:44:53,791 INFO] Step 28650/50000; acc:  90.93; ppl:  1.40; xent: 0.33; lr: 0.00010; 5220/6669 tok/s;  11088 sec
[2021-04-25 05:45:13,015 INFO] Step 28700/50000; acc:  90.71; ppl:  1.41; xent: 0.34; lr: 0.00010; 5245/6586 tok/s;  11107 sec
[2021-04-25 05:45:31,975 INFO] Step 28750/50000; acc:  90.87; ppl:  1.40; xent: 0.34; lr: 0.00010; 5497/6653 tok/s;  11126 sec
[2021-04-25 05:45:50,839 INFO] Step 28800/50000; acc:  90.56; ppl:  1.41; xent: 0.35; lr: 0.00010; 5388/6529 tok/s;  11145 sec
[2021-04-25 05:46:09,898 INFO] Step 28850/50000; acc:  90.82; ppl:  1.40; xent: 0.34; lr: 0.00010; 5399/6786 tok/s;  11164 sec
[2021-04-25 05:46:29,409 INFO] Step 28900/50000; acc:  91.03; ppl:  1.39; xent: 0.33; lr: 0.00010; 5258/6649 tok/s;  11184 sec
[2021-04-25 05:46:48,643 INFO] Step 28950/50000; acc:  90.73; ppl:  1.40; xent: 0.34; lr: 0.00010; 5200/6551 tok/s;  11203 sec
[2021-04-25 05:47:08,067 INFO] Step 29000/50000; acc:  90.67; ppl:  1.41; xent: 0.34; lr: 0.00010; 5222/6483 tok/s;  11222 sec
[2021-04-25 05:47:27,361 INFO] Step 29050/50000; acc:  91.03; ppl:  1.39; xent: 0.33; lr: 0.00010; 5347/6669 tok/s;  11242 sec
[2021-04-25 05:47:46,091 INFO] Step 29100/50000; acc:  91.13; ppl:  1.39; xent: 0.33; lr: 0.00010; 5406/6831 tok/s;  11260 sec
[2021-04-25 05:48:04,802 INFO] Step 29150/50000; acc:  90.75; ppl:  1.40; xent: 0.34; lr: 0.00010; 5313/6774 tok/s;  11279 sec
[2021-04-25 05:48:05,579 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 05:48:24,098 INFO] Step 29200/50000; acc:  90.81; ppl:  1.41; xent: 0.34; lr: 0.00010; 5419/6712 tok/s;  11298 sec
[2021-04-25 05:48:42,774 INFO] Step 29250/50000; acc:  90.95; ppl:  1.40; xent: 0.33; lr: 0.00010; 5402/6800 tok/s;  11317 sec
[2021-04-25 05:49:01,320 INFO] Step 29300/50000; acc:  90.48; ppl:  1.42; xent: 0.35; lr: 0.00010; 5397/6675 tok/s;  11336 sec
[2021-04-25 05:49:21,215 INFO] Step 29350/50000; acc:  90.80; ppl:  1.41; xent: 0.34; lr: 0.00010; 5184/6563 tok/s;  11356 sec
[2021-04-25 05:49:40,510 INFO] Step 29400/50000; acc:  90.74; ppl:  1.41; xent: 0.34; lr: 0.00010; 5233/6489 tok/s;  11375 sec
[2021-04-25 05:49:59,626 INFO] Step 29450/50000; acc:  90.86; ppl:  1.40; xent: 0.34; lr: 0.00010; 5209/6748 tok/s;  11394 sec
[2021-04-25 05:50:19,228 INFO] Step 29500/50000; acc:  90.74; ppl:  1.41; xent: 0.34; lr: 0.00010; 5253/6226 tok/s;  11414 sec
[2021-04-25 05:50:38,358 INFO] Step 29550/50000; acc:  90.69; ppl:  1.41; xent: 0.34; lr: 0.00010; 5454/6667 tok/s;  11433 sec
[2021-04-25 05:50:57,957 INFO] Step 29600/50000; acc:  90.96; ppl:  1.39; xent: 0.33; lr: 0.00010; 5078/6447 tok/s;  11452 sec
[2021-04-25 05:51:17,773 INFO] Step 29650/50000; acc:  91.09; ppl:  1.39; xent: 0.33; lr: 0.00010; 5224/6609 tok/s;  11472 sec
[2021-04-25 05:51:37,090 INFO] Step 29700/50000; acc:  90.81; ppl:  1.40; xent: 0.34; lr: 0.00010; 5304/6619 tok/s;  11491 sec
[2021-04-25 05:51:56,233 INFO] Step 29750/50000; acc:  90.81; ppl:  1.40; xent: 0.34; lr: 0.00010; 5251/6476 tok/s;  11511 sec
[2021-04-25 05:52:14,804 INFO] Step 29800/50000; acc:  91.20; ppl:  1.39; xent: 0.33; lr: 0.00010; 5461/7042 tok/s;  11529 sec
[2021-04-25 05:52:33,190 INFO] Step 29850/50000; acc:  90.90; ppl:  1.40; xent: 0.34; lr: 0.00010; 5519/6876 tok/s;  11548 sec
[2021-04-25 05:52:47,439 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 05:52:52,219 INFO] Step 29900/50000; acc:  91.15; ppl:  1.38; xent: 0.32; lr: 0.00010; 5387/6920 tok/s;  11567 sec
[2021-04-25 05:53:11,227 INFO] Step 29950/50000; acc:  90.76; ppl:  1.41; xent: 0.34; lr: 0.00010; 5229/6496 tok/s;  11586 sec
[2021-04-25 05:53:29,922 INFO] Step 30000/50000; acc:  90.76; ppl:  1.41; xent: 0.34; lr: 0.00010; 5593/6882 tok/s;  11604 sec
[2021-04-25 05:53:29,924 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/basic/valid.txt, align=None)...
[2021-04-25 05:53:56,395 INFO] Validation perplexity: 1.43001
[2021-04-25 05:53:56,396 INFO] Validation accuracy: 90.5263
[2021-04-25 05:53:56,400 INFO] Saving checkpoint ../models/group1_params/basic_ops/model_step_30000.pt
[2021-04-25 05:54:15,455 INFO] Step 30050/50000; acc:  90.38; ppl:  1.42; xent: 0.35; lr: 0.00010; 2189/2679 tok/s;  11650 sec
[2021-04-25 05:54:35,452 INFO] Step 30100/50000; acc:  90.98; ppl:  1.39; xent: 0.33; lr: 0.00010; 5050/6561 tok/s;  11670 sec
[2021-04-25 05:54:54,416 INFO] Step 30150/50000; acc:  90.93; ppl:  1.40; xent: 0.34; lr: 0.00010; 5425/6684 tok/s;  11689 sec
[2021-04-25 05:55:13,342 INFO] Step 30200/50000; acc:  90.93; ppl:  1.40; xent: 0.34; lr: 0.00010; 5367/6753 tok/s;  11708 sec
[2021-04-25 05:55:32,376 INFO] Step 30250/50000; acc:  90.76; ppl:  1.40; xent: 0.34; lr: 0.00010; 5335/6499 tok/s;  11727 sec
[2021-04-25 05:55:51,896 INFO] Step 30300/50000; acc:  90.47; ppl:  1.41; xent: 0.35; lr: 0.00010; 5223/6423 tok/s;  11746 sec
[2021-04-25 05:56:11,543 INFO] Step 30350/50000; acc:  91.15; ppl:  1.39; xent: 0.33; lr: 0.00010; 5252/6624 tok/s;  11766 sec
[2021-04-25 05:56:30,961 INFO] Step 30400/50000; acc:  90.96; ppl:  1.39; xent: 0.33; lr: 0.00010; 5157/6568 tok/s;  11785 sec
[2021-04-25 05:56:50,359 INFO] Step 30450/50000; acc:  90.91; ppl:  1.40; xent: 0.33; lr: 0.00010; 5312/6584 tok/s;  11805 sec
[2021-04-25 05:57:09,427 INFO] Step 30500/50000; acc:  90.96; ppl:  1.39; xent: 0.33; lr: 0.00010; 5399/6656 tok/s;  11824 sec
[2021-04-25 05:57:27,861 INFO] Step 30550/50000; acc:  91.26; ppl:  1.38; xent: 0.32; lr: 0.00010; 5417/6950 tok/s;  11842 sec
[2021-04-25 05:57:46,596 INFO] Step 30600/50000; acc:  90.78; ppl:  1.40; xent: 0.34; lr: 0.00010; 5377/6766 tok/s;  11861 sec
[2021-04-25 05:57:55,580 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 05:58:05,766 INFO] Step 30650/50000; acc:  90.99; ppl:  1.39; xent: 0.33; lr: 0.00010; 5344/6806 tok/s;  11880 sec
[2021-04-25 05:58:25,049 INFO] Step 30700/50000; acc:  91.09; ppl:  1.39; xent: 0.33; lr: 0.00010; 5333/6703 tok/s;  11899 sec
[2021-04-25 05:58:43,182 INFO] Step 30750/50000; acc:  90.36; ppl:  1.42; xent: 0.35; lr: 0.00010; 5452/6642 tok/s;  11918 sec
[2021-04-25 05:59:02,930 INFO] Step 30800/50000; acc:  90.91; ppl:  1.40; xent: 0.34; lr: 0.00010; 5261/6659 tok/s;  11937 sec
[2021-04-25 05:59:22,486 INFO] Step 30850/50000; acc:  90.84; ppl:  1.40; xent: 0.33; lr: 0.00010; 5132/6399 tok/s;  11957 sec
[2021-04-25 05:59:41,371 INFO] Step 30900/50000; acc:  90.97; ppl:  1.39; xent: 0.33; lr: 0.00010; 5322/6768 tok/s;  11976 sec
[2021-04-25 06:00:00,900 INFO] Step 30950/50000; acc:  90.85; ppl:  1.40; xent: 0.34; lr: 0.00010; 5345/6433 tok/s;  11995 sec
[2021-04-25 06:00:19,582 INFO] Step 31000/50000; acc:  90.87; ppl:  1.40; xent: 0.33; lr: 0.00010; 5466/6677 tok/s;  12014 sec
[2021-04-25 06:00:38,966 INFO] Step 31050/50000; acc:  90.82; ppl:  1.40; xent: 0.34; lr: 0.00010; 5169/6509 tok/s;  12033 sec
[2021-04-25 06:00:58,320 INFO] Step 31100/50000; acc:  91.08; ppl:  1.39; xent: 0.33; lr: 0.00010; 5253/6677 tok/s;  12053 sec
[2021-04-25 06:01:17,753 INFO] Step 31150/50000; acc:  91.04; ppl:  1.39; xent: 0.33; lr: 0.00010; 5330/6680 tok/s;  12072 sec
[2021-04-25 06:01:36,394 INFO] Step 31200/50000; acc:  90.81; ppl:  1.40; xent: 0.34; lr: 0.00010; 5360/6646 tok/s;  12091 sec
[2021-04-25 06:01:55,607 INFO] Step 31250/50000; acc:  91.25; ppl:  1.38; xent: 0.32; lr: 0.00010; 5387/6702 tok/s;  12110 sec
[2021-04-25 06:02:14,301 INFO] Step 31300/50000; acc:  91.32; ppl:  1.38; xent: 0.32; lr: 0.00010; 5451/6978 tok/s;  12129 sec
[2021-04-25 06:02:32,799 INFO] Step 31350/50000; acc:  90.70; ppl:  1.40; xent: 0.34; lr: 0.00010; 5416/6781 tok/s;  12147 sec
[2021-04-25 06:02:36,252 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 06:02:52,054 INFO] Step 31400/50000; acc:  90.98; ppl:  1.39; xent: 0.33; lr: 0.00010; 5272/6696 tok/s;  12166 sec
[2021-04-25 06:03:10,401 INFO] Step 31450/50000; acc:  90.89; ppl:  1.40; xent: 0.33; lr: 0.00010; 5562/6888 tok/s;  12185 sec
[2021-04-25 06:03:29,278 INFO] Step 31500/50000; acc:  90.75; ppl:  1.40; xent: 0.34; lr: 0.00010; 5388/6690 tok/s;  12204 sec
[2021-04-25 06:03:48,642 INFO] Step 31550/50000; acc:  90.89; ppl:  1.40; xent: 0.33; lr: 0.00010; 5121/6524 tok/s;  12223 sec
[2021-04-25 06:04:08,110 INFO] Step 31600/50000; acc:  90.91; ppl:  1.40; xent: 0.33; lr: 0.00010; 5351/6608 tok/s;  12242 sec
[2021-04-25 06:04:27,176 INFO] Step 31650/50000; acc:  90.92; ppl:  1.39; xent: 0.33; lr: 0.00010; 5257/6544 tok/s;  12262 sec
[2021-04-25 06:04:46,748 INFO] Step 31700/50000; acc:  90.94; ppl:  1.39; xent: 0.33; lr: 0.00010; 5265/6468 tok/s;  12281 sec
[2021-04-25 06:05:05,722 INFO] Step 31750/50000; acc:  91.05; ppl:  1.39; xent: 0.33; lr: 0.00010; 5428/6722 tok/s;  12300 sec
[2021-04-25 06:05:25,566 INFO] Step 31800/50000; acc:  90.91; ppl:  1.40; xent: 0.33; lr: 0.00010; 5117/6361 tok/s;  12320 sec
[2021-04-25 06:05:44,861 INFO] Step 31850/50000; acc:  91.16; ppl:  1.38; xent: 0.32; lr: 0.00010; 5207/6663 tok/s;  12339 sec
[2021-04-25 06:06:04,091 INFO] Step 31900/50000; acc:  90.89; ppl:  1.40; xent: 0.33; lr: 0.00010; 5257/6626 tok/s;  12358 sec
[2021-04-25 06:06:23,108 INFO] Step 31950/50000; acc:  91.07; ppl:  1.39; xent: 0.33; lr: 0.00010; 5491/6625 tok/s;  12377 sec
[2021-04-25 06:06:41,695 INFO] Step 32000/50000; acc:  91.43; ppl:  1.37; xent: 0.32; lr: 0.00010; 5362/6913 tok/s;  12396 sec
[2021-04-25 06:07:00,742 INFO] Step 32050/50000; acc:  91.04; ppl:  1.39; xent: 0.33; lr: 0.00010; 5385/6776 tok/s;  12415 sec
[2021-04-25 06:07:06,047 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 06:07:19,879 INFO] Step 32100/50000; acc:  90.98; ppl:  1.39; xent: 0.33; lr: 0.00010; 5380/6787 tok/s;  12434 sec
[2021-04-25 06:07:38,641 INFO] Step 32150/50000; acc:  91.06; ppl:  1.39; xent: 0.33; lr: 0.00010; 5341/6691 tok/s;  12453 sec
[2021-04-25 06:07:57,115 INFO] Step 32200/50000; acc:  90.79; ppl:  1.40; xent: 0.34; lr: 0.00010; 5442/6801 tok/s;  12471 sec
[2021-04-25 06:08:16,415 INFO] Step 32250/50000; acc:  90.77; ppl:  1.40; xent: 0.34; lr: 0.00010; 5285/6658 tok/s;  12491 sec
[2021-04-25 06:08:36,283 INFO] Step 32300/50000; acc:  91.07; ppl:  1.39; xent: 0.33; lr: 0.00010; 5136/6404 tok/s;  12511 sec
[2021-04-25 06:08:55,036 INFO] Step 32350/50000; acc:  90.95; ppl:  1.39; xent: 0.33; lr: 0.00010; 5260/6666 tok/s;  12529 sec
[2021-04-25 06:09:14,331 INFO] Step 32400/50000; acc:  90.97; ppl:  1.40; xent: 0.33; lr: 0.00010; 5459/6563 tok/s;  12549 sec
[2021-04-25 06:09:32,663 INFO] Step 32450/50000; acc:  90.86; ppl:  1.40; xent: 0.33; lr: 0.00010; 5567/6784 tok/s;  12567 sec
[2021-04-25 06:09:51,932 INFO] Step 32500/50000; acc:  91.11; ppl:  1.38; xent: 0.33; lr: 0.00010; 5240/6717 tok/s;  12586 sec
[2021-04-25 06:10:11,475 INFO] Step 32550/50000; acc:  91.22; ppl:  1.38; xent: 0.32; lr: 0.00010; 5286/6633 tok/s;  12606 sec
[2021-04-25 06:10:30,633 INFO] Step 32600/50000; acc:  90.89; ppl:  1.39; xent: 0.33; lr: 0.00010; 5278/6565 tok/s;  12625 sec
[2021-04-25 06:10:49,888 INFO] Step 32650/50000; acc:  90.87; ppl:  1.40; xent: 0.33; lr: 0.00010; 5216/6563 tok/s;  12644 sec
[2021-04-25 06:11:08,766 INFO] Step 32700/50000; acc:  91.33; ppl:  1.37; xent: 0.32; lr: 0.00010; 5414/6788 tok/s;  12663 sec
[2021-04-25 06:11:27,632 INFO] Step 32750/50000; acc:  91.38; ppl:  1.37; xent: 0.32; lr: 0.00010; 5451/6862 tok/s;  12682 sec
[2021-04-25 06:11:46,424 INFO] Step 32800/50000; acc:  90.98; ppl:  1.39; xent: 0.33; lr: 0.00010; 5319/6727 tok/s;  12701 sec
[2021-04-25 06:11:46,438 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 06:12:05,901 INFO] Step 32850/50000; acc:  91.00; ppl:  1.39; xent: 0.33; lr: 0.00010; 5283/6587 tok/s;  12720 sec
[2021-04-25 06:12:24,880 INFO] Step 32900/50000; acc:  91.12; ppl:  1.39; xent: 0.33; lr: 0.00010; 5427/6772 tok/s;  12739 sec
[2021-04-25 06:12:43,595 INFO] Step 32950/50000; acc:  90.74; ppl:  1.40; xent: 0.34; lr: 0.00010; 5292/6573 tok/s;  12758 sec
[2021-04-25 06:13:03,003 INFO] Step 33000/50000; acc:  91.00; ppl:  1.39; xent: 0.33; lr: 0.00010; 5210/6689 tok/s;  12777 sec
[2021-04-25 06:13:22,444 INFO] Step 33050/50000; acc:  90.92; ppl:  1.39; xent: 0.33; lr: 0.00010; 5258/6469 tok/s;  12797 sec
[2021-04-25 06:13:41,718 INFO] Step 33100/50000; acc:  91.18; ppl:  1.38; xent: 0.32; lr: 0.00010; 5281/6678 tok/s;  12816 sec
[2021-04-25 06:14:00,872 INFO] Step 33150/50000; acc:  90.86; ppl:  1.40; xent: 0.33; lr: 0.00010; 5256/6331 tok/s;  12835 sec
[2021-04-25 06:14:20,155 INFO] Step 33200/50000; acc:  91.01; ppl:  1.39; xent: 0.33; lr: 0.00010; 5462/6718 tok/s;  12854 sec
[2021-04-25 06:14:39,524 INFO] Step 33250/50000; acc:  91.07; ppl:  1.39; xent: 0.33; lr: 0.00010; 5159/6486 tok/s;  12874 sec
[2021-04-25 06:14:59,336 INFO] Step 33300/50000; acc:  91.17; ppl:  1.38; xent: 0.32; lr: 0.00010; 5137/6497 tok/s;  12894 sec
[2021-04-25 06:15:18,919 INFO] Step 33350/50000; acc:  91.10; ppl:  1.39; xent: 0.33; lr: 0.00010; 5261/6598 tok/s;  12913 sec
[2021-04-25 06:15:38,310 INFO] Step 33400/50000; acc:  91.00; ppl:  1.39; xent: 0.33; lr: 0.00010; 5248/6461 tok/s;  12933 sec
[2021-04-25 06:15:56,692 INFO] Step 33450/50000; acc:  91.36; ppl:  1.37; xent: 0.32; lr: 0.00010; 5448/7061 tok/s;  12951 sec
[2021-04-25 06:16:15,022 INFO] Step 33500/50000; acc:  91.03; ppl:  1.39; xent: 0.33; lr: 0.00010; 5494/6849 tok/s;  12969 sec
[2021-04-25 06:16:28,657 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 06:16:34,239 INFO] Step 33550/50000; acc:  91.23; ppl:  1.38; xent: 0.32; lr: 0.00010; 5413/6860 tok/s;  12989 sec
[2021-04-25 06:16:53,372 INFO] Step 33600/50000; acc:  91.11; ppl:  1.39; xent: 0.33; lr: 0.00010; 5227/6504 tok/s;  13008 sec
[2021-04-25 06:17:11,751 INFO] Step 33650/50000; acc:  90.80; ppl:  1.40; xent: 0.33; lr: 0.00010; 5596/6921 tok/s;  13026 sec
[2021-04-25 06:17:31,028 INFO] Step 33700/50000; acc:  90.80; ppl:  1.40; xent: 0.33; lr: 0.00010; 5288/6514 tok/s;  13045 sec
[2021-04-25 06:17:50,964 INFO] Step 33750/50000; acc:  91.10; ppl:  1.38; xent: 0.32; lr: 0.00010; 5001/6470 tok/s;  13065 sec
[2021-04-25 06:18:09,833 INFO] Step 33800/50000; acc:  91.06; ppl:  1.39; xent: 0.33; lr: 0.00010; 5357/6677 tok/s;  13084 sec
[2021-04-25 06:18:28,924 INFO] Step 33850/50000; acc:  91.11; ppl:  1.39; xent: 0.33; lr: 0.00010; 5394/6757 tok/s;  13103 sec
[2021-04-25 06:18:47,919 INFO] Step 33900/50000; acc:  91.14; ppl:  1.38; xent: 0.32; lr: 0.00010; 5433/6559 tok/s;  13122 sec
[2021-04-25 06:19:07,366 INFO] Step 33950/50000; acc:  90.70; ppl:  1.40; xent: 0.34; lr: 0.00010; 5135/6345 tok/s;  13142 sec
[2021-04-25 06:19:27,324 INFO] Step 34000/50000; acc:  91.25; ppl:  1.38; xent: 0.32; lr: 0.00010; 5223/6527 tok/s;  13162 sec
[2021-04-25 06:19:46,639 INFO] Step 34050/50000; acc:  91.20; ppl:  1.38; xent: 0.32; lr: 0.00010; 5205/6585 tok/s;  13181 sec
[2021-04-25 06:20:05,777 INFO] Step 34100/50000; acc:  91.21; ppl:  1.38; xent: 0.32; lr: 0.00010; 5295/6714 tok/s;  13200 sec
[2021-04-25 06:20:24,795 INFO] Step 34150/50000; acc:  91.16; ppl:  1.38; xent: 0.32; lr: 0.00010; 5440/6642 tok/s;  13219 sec
[2021-04-25 06:20:43,220 INFO] Step 34200/50000; acc:  91.44; ppl:  1.37; xent: 0.31; lr: 0.00010; 5479/7004 tok/s;  13238 sec
[2021-04-25 06:21:01,806 INFO] Step 34250/50000; acc:  90.96; ppl:  1.39; xent: 0.33; lr: 0.00010; 5368/6773 tok/s;  13256 sec
[2021-04-25 06:21:10,174 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 06:21:21,270 INFO] Step 34300/50000; acc:  91.16; ppl:  1.38; xent: 0.32; lr: 0.00010; 5223/6727 tok/s;  13276 sec
[2021-04-25 06:21:40,334 INFO] Step 34350/50000; acc:  91.40; ppl:  1.38; xent: 0.32; lr: 0.00010; 5466/6817 tok/s;  13295 sec
[2021-04-25 06:21:58,355 INFO] Step 34400/50000; acc:  90.45; ppl:  1.41; xent: 0.34; lr: 0.00010; 5515/6648 tok/s;  13313 sec
[2021-04-25 06:22:18,045 INFO] Step 34450/50000; acc:  91.15; ppl:  1.38; xent: 0.32; lr: 0.00010; 5201/6652 tok/s;  13332 sec
[2021-04-25 06:22:37,593 INFO] Step 34500/50000; acc:  91.20; ppl:  1.38; xent: 0.32; lr: 0.00010; 5237/6526 tok/s;  13352 sec
[2021-04-25 06:22:56,604 INFO] Step 34550/50000; acc:  91.12; ppl:  1.39; xent: 0.33; lr: 0.00010; 5231/6602 tok/s;  13371 sec
[2021-04-25 06:23:15,937 INFO] Step 34600/50000; acc:  91.12; ppl:  1.38; xent: 0.32; lr: 0.00010; 5305/6458 tok/s;  13390 sec
[2021-04-25 06:23:34,534 INFO] Step 34650/50000; acc:  91.11; ppl:  1.38; xent: 0.32; lr: 0.00010; 5546/6795 tok/s;  13409 sec
[2021-04-25 06:23:54,363 INFO] Step 34700/50000; acc:  90.98; ppl:  1.39; xent: 0.33; lr: 0.00010; 5156/6422 tok/s;  13429 sec
[2021-04-25 06:24:13,404 INFO] Step 34750/50000; acc:  91.28; ppl:  1.37; xent: 0.32; lr: 0.00010; 5230/6682 tok/s;  13448 sec
[2021-04-25 06:24:32,823 INFO] Step 34800/50000; acc:  91.35; ppl:  1.37; xent: 0.32; lr: 0.00010; 5373/6705 tok/s;  13467 sec
[2021-04-25 06:24:51,889 INFO] Step 34850/50000; acc:  90.86; ppl:  1.39; xent: 0.33; lr: 0.00010; 5277/6490 tok/s;  13486 sec
[2021-04-25 06:25:11,085 INFO] Step 34900/50000; acc:  91.54; ppl:  1.36; xent: 0.31; lr: 0.00010; 5284/6749 tok/s;  13505 sec
[2021-04-25 06:25:29,454 INFO] Step 34950/50000; acc:  91.39; ppl:  1.37; xent: 0.32; lr: 0.00010; 5590/7046 tok/s;  13524 sec
[2021-04-25 06:25:48,353 INFO] Step 35000/50000; acc:  91.06; ppl:  1.38; xent: 0.32; lr: 0.00010; 5360/6770 tok/s;  13543 sec
[2021-04-25 06:25:48,354 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/basic/valid.txt, align=None)...
[2021-04-25 06:26:15,158 INFO] Validation perplexity: 1.42499
[2021-04-25 06:26:15,158 INFO] Validation accuracy: 90.6046
[2021-04-25 06:26:15,162 INFO] Saving checkpoint ../models/group1_params/basic_ops/model_step_35000.pt
[2021-04-25 06:26:18,124 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 06:26:34,637 INFO] Step 35050/50000; acc:  91.06; ppl:  1.39; xent: 0.33; lr: 0.00010; 2171/2741 tok/s;  13589 sec
[2021-04-25 06:26:53,102 INFO] Step 35100/50000; acc:  91.20; ppl:  1.38; xent: 0.32; lr: 0.00010; 5480/6763 tok/s;  13607 sec
[2021-04-25 06:27:12,166 INFO] Step 35150/50000; acc:  91.08; ppl:  1.39; xent: 0.33; lr: 0.00010; 5406/6764 tok/s;  13627 sec
[2021-04-25 06:27:31,500 INFO] Step 35200/50000; acc:  91.08; ppl:  1.38; xent: 0.32; lr: 0.00010; 5166/6513 tok/s;  13646 sec
[2021-04-25 06:27:50,661 INFO] Step 35250/50000; acc:  91.02; ppl:  1.39; xent: 0.33; lr: 0.00010; 5349/6591 tok/s;  13665 sec
[2021-04-25 06:28:09,925 INFO] Step 35300/50000; acc:  91.32; ppl:  1.37; xent: 0.32; lr: 0.00010; 5318/6698 tok/s;  13684 sec
[2021-04-25 06:28:29,433 INFO] Step 35350/50000; acc:  90.98; ppl:  1.39; xent: 0.33; lr: 0.00010; 5225/6304 tok/s;  13704 sec
[2021-04-25 06:28:48,314 INFO] Step 35400/50000; acc:  91.21; ppl:  1.38; xent: 0.32; lr: 0.00010; 5360/6743 tok/s;  13723 sec
[2021-04-25 06:29:07,693 INFO] Step 35450/50000; acc:  91.02; ppl:  1.39; xent: 0.33; lr: 0.00010; 5292/6561 tok/s;  13742 sec
[2021-04-25 06:29:27,247 INFO] Step 35500/50000; acc:  91.49; ppl:  1.37; xent: 0.31; lr: 0.00010; 5239/6724 tok/s;  13762 sec
[2021-04-25 06:29:46,315 INFO] Step 35550/50000; acc:  90.92; ppl:  1.39; xent: 0.33; lr: 0.00010; 5189/6483 tok/s;  13781 sec
[2021-04-25 06:30:05,601 INFO] Step 35600/50000; acc:  91.30; ppl:  1.37; xent: 0.31; lr: 0.00010; 5470/6633 tok/s;  13800 sec
[2021-04-25 06:30:24,168 INFO] Step 35650/50000; acc:  91.57; ppl:  1.36; xent: 0.31; lr: 0.00010; 5383/6942 tok/s;  13819 sec
[2021-04-25 06:30:43,064 INFO] Step 35700/50000; acc:  91.28; ppl:  1.37; xent: 0.32; lr: 0.00010; 5332/6711 tok/s;  13837 sec
[2021-04-25 06:30:47,696 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 06:31:02,416 INFO] Step 35750/50000; acc:  91.16; ppl:  1.38; xent: 0.32; lr: 0.00010; 5351/6718 tok/s;  13857 sec
[2021-04-25 06:31:21,323 INFO] Step 35800/50000; acc:  91.35; ppl:  1.37; xent: 0.32; lr: 0.00010; 5361/6781 tok/s;  13876 sec
[2021-04-25 06:31:39,700 INFO] Step 35850/50000; acc:  90.91; ppl:  1.39; xent: 0.33; lr: 0.00010; 5415/6723 tok/s;  13894 sec
[2021-04-25 06:31:59,342 INFO] Step 35900/50000; acc:  90.88; ppl:  1.39; xent: 0.33; lr: 0.00010; 5157/6476 tok/s;  13914 sec
[2021-04-25 06:32:18,931 INFO] Step 35950/50000; acc:  91.41; ppl:  1.37; xent: 0.31; lr: 0.00010; 5281/6603 tok/s;  13933 sec
[2021-04-25 06:32:37,910 INFO] Step 36000/50000; acc:  91.08; ppl:  1.38; xent: 0.33; lr: 0.00010; 5219/6573 tok/s;  13952 sec
[2021-04-25 06:32:57,143 INFO] Step 36050/50000; acc:  91.22; ppl:  1.38; xent: 0.32; lr: 0.00010; 5406/6527 tok/s;  13971 sec
[2021-04-25 06:33:15,834 INFO] Step 36100/50000; acc:  91.00; ppl:  1.38; xent: 0.32; lr: 0.00010; 5560/6801 tok/s;  13990 sec
[2021-04-25 06:33:35,079 INFO] Step 36150/50000; acc:  91.25; ppl:  1.37; xent: 0.32; lr: 0.00010; 5192/6580 tok/s;  14009 sec
[2021-04-25 06:33:54,254 INFO] Step 36200/50000; acc:  91.45; ppl:  1.37; xent: 0.31; lr: 0.00010; 5291/6745 tok/s;  14029 sec
[2021-04-25 06:34:13,395 INFO] Step 36250/50000; acc:  91.15; ppl:  1.38; xent: 0.32; lr: 0.00010; 5348/6681 tok/s;  14048 sec
[2021-04-25 06:34:32,602 INFO] Step 36300/50000; acc:  91.05; ppl:  1.39; xent: 0.33; lr: 0.00010; 5331/6502 tok/s;  14067 sec
[2021-04-25 06:34:51,406 INFO] Step 36350/50000; acc:  91.52; ppl:  1.36; xent: 0.31; lr: 0.00010; 5324/6738 tok/s;  14086 sec
[2021-04-25 06:35:10,599 INFO] Step 36400/50000; acc:  91.64; ppl:  1.36; xent: 0.31; lr: 0.00010; 5398/6942 tok/s;  14105 sec
[2021-04-25 06:35:28,602 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 06:35:29,443 INFO] Step 36450/50000; acc:  91.10; ppl:  1.38; xent: 0.32; lr: 0.00010; 5331/6664 tok/s;  14124 sec
[2021-04-25 06:35:48,833 INFO] Step 36500/50000; acc:  91.17; ppl:  1.38; xent: 0.32; lr: 0.00010; 5216/6518 tok/s;  14143 sec
[2021-04-25 06:36:07,920 INFO] Step 36550/50000; acc:  91.40; ppl:  1.37; xent: 0.32; lr: 0.00010; 5429/6795 tok/s;  14162 sec
[2021-04-25 06:36:26,693 INFO] Step 36600/50000; acc:  90.90; ppl:  1.39; xent: 0.33; lr: 0.00010; 5336/6558 tok/s;  14181 sec
[2021-04-25 06:36:46,010 INFO] Step 36650/50000; acc:  91.26; ppl:  1.37; xent: 0.32; lr: 0.00010; 5182/6724 tok/s;  14200 sec
[2021-04-25 06:37:05,296 INFO] Step 36700/50000; acc:  91.15; ppl:  1.38; xent: 0.32; lr: 0.00010; 5253/6481 tok/s;  14220 sec
[2021-04-25 06:37:24,748 INFO] Step 36750/50000; acc:  91.41; ppl:  1.37; xent: 0.31; lr: 0.00010; 5318/6667 tok/s;  14239 sec
[2021-04-25 06:37:43,911 INFO] Step 36800/50000; acc:  91.01; ppl:  1.38; xent: 0.32; lr: 0.00010; 5275/6293 tok/s;  14258 sec
[2021-04-25 06:38:03,405 INFO] Step 36850/50000; acc:  91.12; ppl:  1.38; xent: 0.32; lr: 0.00010; 5328/6609 tok/s;  14278 sec
[2021-04-25 06:38:22,931 INFO] Step 36900/50000; acc:  91.41; ppl:  1.36; xent: 0.31; lr: 0.00010; 5211/6652 tok/s;  14297 sec
[2021-04-25 06:38:42,730 INFO] Step 36950/50000; acc:  91.31; ppl:  1.37; xent: 0.32; lr: 0.00010; 5088/6384 tok/s;  14317 sec
[2021-04-25 06:39:02,078 INFO] Step 37000/50000; acc:  91.14; ppl:  1.38; xent: 0.32; lr: 0.00010; 5221/6538 tok/s;  14336 sec
[2021-04-25 06:39:21,582 INFO] Step 37050/50000; acc:  91.30; ppl:  1.37; xent: 0.31; lr: 0.00010; 5277/6494 tok/s;  14356 sec
[2021-04-25 06:39:40,331 INFO] Step 37100/50000; acc:  91.68; ppl:  1.36; xent: 0.30; lr: 0.00010; 5449/6969 tok/s;  14375 sec
[2021-04-25 06:39:58,525 INFO] Step 37150/50000; acc:  91.07; ppl:  1.38; xent: 0.32; lr: 0.00010; 5421/6756 tok/s;  14393 sec
[2021-04-25 06:40:11,535 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 06:40:17,862 INFO] Step 37200/50000; acc:  91.56; ppl:  1.36; xent: 0.31; lr: 0.00010; 5429/6949 tok/s;  14412 sec
[2021-04-25 06:40:37,097 INFO] Step 37250/50000; acc:  91.33; ppl:  1.37; xent: 0.32; lr: 0.00010; 5233/6546 tok/s;  14431 sec
[2021-04-25 06:40:55,451 INFO] Step 37300/50000; acc:  91.02; ppl:  1.38; xent: 0.32; lr: 0.00010; 5492/6826 tok/s;  14450 sec
[2021-04-25 06:41:14,918 INFO] Step 37350/50000; acc:  91.01; ppl:  1.38; xent: 0.32; lr: 0.00010; 5266/6496 tok/s;  14469 sec
[2021-04-25 06:41:34,417 INFO] Step 37400/50000; acc:  91.39; ppl:  1.37; xent: 0.31; lr: 0.00010; 5177/6553 tok/s;  14489 sec
[2021-04-25 06:41:53,595 INFO] Step 37450/50000; acc:  91.31; ppl:  1.37; xent: 0.32; lr: 0.00010; 5209/6644 tok/s;  14508 sec
[2021-04-25 06:42:12,562 INFO] Step 37500/50000; acc:  91.19; ppl:  1.38; xent: 0.32; lr: 0.00010; 5393/6667 tok/s;  14527 sec
[2021-04-25 06:42:31,482 INFO] Step 37550/50000; acc:  91.45; ppl:  1.37; xent: 0.31; lr: 0.00010; 5541/6646 tok/s;  14546 sec
[2021-04-25 06:42:51,078 INFO] Step 37600/50000; acc:  90.97; ppl:  1.38; xent: 0.32; lr: 0.00010; 5113/6357 tok/s;  14565 sec
[2021-04-25 06:43:10,794 INFO] Step 37650/50000; acc:  91.42; ppl:  1.37; xent: 0.31; lr: 0.00010; 5208/6542 tok/s;  14585 sec
[2021-04-25 06:43:30,062 INFO] Step 37700/50000; acc:  91.41; ppl:  1.37; xent: 0.31; lr: 0.00010; 5326/6678 tok/s;  14604 sec
[2021-04-25 06:43:49,092 INFO] Step 37750/50000; acc:  91.33; ppl:  1.37; xent: 0.31; lr: 0.00010; 5265/6675 tok/s;  14623 sec
[2021-04-25 06:44:07,824 INFO] Step 37800/50000; acc:  91.49; ppl:  1.36; xent: 0.31; lr: 0.00010; 5425/6780 tok/s;  14642 sec
[2021-04-25 06:44:26,012 INFO] Step 37850/50000; acc:  91.57; ppl:  1.36; xent: 0.31; lr: 0.00010; 5611/7094 tok/s;  14660 sec
[2021-04-25 06:44:44,985 INFO] Step 37900/50000; acc:  91.18; ppl:  1.37; xent: 0.32; lr: 0.00010; 5362/6662 tok/s;  14679 sec
[2021-04-25 06:44:52,348 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 06:45:04,155 INFO] Step 37950/50000; acc:  91.40; ppl:  1.36; xent: 0.31; lr: 0.00010; 5202/6724 tok/s;  14698 sec
[2021-04-25 06:45:23,262 INFO] Step 38000/50000; acc:  91.46; ppl:  1.36; xent: 0.31; lr: 0.00010; 5483/6869 tok/s;  14718 sec
[2021-04-25 06:45:41,477 INFO] Step 38050/50000; acc:  90.87; ppl:  1.39; xent: 0.33; lr: 0.00010; 5491/6581 tok/s;  14736 sec
[2021-04-25 06:46:01,067 INFO] Step 38100/50000; acc:  91.28; ppl:  1.37; xent: 0.31; lr: 0.00010; 5134/6697 tok/s;  14755 sec
[2021-04-25 06:46:20,669 INFO] Step 38150/50000; acc:  91.37; ppl:  1.37; xent: 0.31; lr: 0.00010; 5253/6493 tok/s;  14775 sec
[2021-04-25 06:46:39,496 INFO] Step 38200/50000; acc:  91.22; ppl:  1.38; xent: 0.32; lr: 0.00010; 5345/6662 tok/s;  14794 sec
[2021-04-25 06:46:58,723 INFO] Step 38250/50000; acc:  91.22; ppl:  1.37; xent: 0.31; lr: 0.00010; 5288/6503 tok/s;  14813 sec
[2021-04-25 06:47:17,537 INFO] Step 38300/50000; acc:  91.42; ppl:  1.37; xent: 0.31; lr: 0.00010; 5433/6710 tok/s;  14832 sec
[2021-04-25 06:47:37,221 INFO] Step 38350/50000; acc:  91.15; ppl:  1.38; xent: 0.32; lr: 0.00010; 5267/6485 tok/s;  14852 sec
[2021-04-25 06:47:56,187 INFO] Step 38400/50000; acc:  91.47; ppl:  1.36; xent: 0.31; lr: 0.00010; 5284/6716 tok/s;  14871 sec
[2021-04-25 06:48:15,695 INFO] Step 38450/50000; acc:  91.38; ppl:  1.37; xent: 0.31; lr: 0.00010; 5265/6635 tok/s;  14890 sec
[2021-04-25 06:48:34,850 INFO] Step 38500/50000; acc:  91.19; ppl:  1.38; xent: 0.32; lr: 0.00010; 5369/6520 tok/s;  14909 sec
[2021-04-25 06:48:53,983 INFO] Step 38550/50000; acc:  91.69; ppl:  1.35; xent: 0.30; lr: 0.00010; 5231/6733 tok/s;  14928 sec
[2021-04-25 06:49:12,490 INFO] Step 38600/50000; acc:  91.56; ppl:  1.36; xent: 0.31; lr: 0.00010; 5450/6961 tok/s;  14947 sec
[2021-04-25 06:49:31,267 INFO] Step 38650/50000; acc:  91.24; ppl:  1.37; xent: 0.31; lr: 0.00010; 5461/6853 tok/s;  14966 sec
[2021-04-25 06:49:33,392 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 06:49:50,907 INFO] Step 38700/50000; acc:  91.35; ppl:  1.37; xent: 0.31; lr: 0.00010; 5218/6485 tok/s;  14985 sec
[2021-04-25 06:50:09,011 INFO] Step 38750/50000; acc:  91.30; ppl:  1.37; xent: 0.31; lr: 0.00010; 5467/6828 tok/s;  15003 sec
[2021-04-25 06:50:28,231 INFO] Step 38800/50000; acc:  91.22; ppl:  1.37; xent: 0.32; lr: 0.00010; 5411/6761 tok/s;  15023 sec
[2021-04-25 06:50:47,719 INFO] Step 38850/50000; acc:  91.29; ppl:  1.37; xent: 0.31; lr: 0.00010; 5145/6480 tok/s;  15042 sec
[2021-04-25 06:51:06,891 INFO] Step 38900/50000; acc:  91.19; ppl:  1.37; xent: 0.32; lr: 0.00010; 5252/6518 tok/s;  15061 sec
[2021-04-25 06:51:26,001 INFO] Step 38950/50000; acc:  91.47; ppl:  1.36; xent: 0.31; lr: 0.00010; 5399/6757 tok/s;  15080 sec
[2021-04-25 06:51:45,306 INFO] Step 39000/50000; acc:  91.31; ppl:  1.37; xent: 0.31; lr: 0.00010; 5335/6453 tok/s;  15100 sec
[2021-04-25 06:52:04,000 INFO] Step 39050/50000; acc:  91.23; ppl:  1.37; xent: 0.32; lr: 0.00010; 5363/6755 tok/s;  15118 sec
[2021-04-25 06:52:23,249 INFO] Step 39100/50000; acc:  91.30; ppl:  1.37; xent: 0.31; lr: 0.00010; 5276/6587 tok/s;  15138 sec
[2021-04-25 06:52:43,114 INFO] Step 39150/50000; acc:  91.57; ppl:  1.36; xent: 0.31; lr: 0.00010; 5229/6642 tok/s;  15157 sec
[2021-04-25 06:53:02,308 INFO] Step 39200/50000; acc:  91.06; ppl:  1.38; xent: 0.32; lr: 0.00010; 5187/6424 tok/s;  15177 sec
[2021-04-25 06:53:21,667 INFO] Step 39250/50000; acc:  91.71; ppl:  1.35; xent: 0.30; lr: 0.00010; 5370/6647 tok/s;  15196 sec
[2021-04-25 06:53:40,489 INFO] Step 39300/50000; acc:  91.72; ppl:  1.35; xent: 0.30; lr: 0.00010; 5429/6870 tok/s;  15215 sec
[2021-04-25 06:53:59,659 INFO] Step 39350/50000; acc:  91.44; ppl:  1.36; xent: 0.31; lr: 0.00010; 5190/6625 tok/s;  15234 sec
[2021-04-25 06:54:03,160 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 06:54:18,376 INFO] Step 39400/50000; acc:  91.34; ppl:  1.37; xent: 0.31; lr: 0.00010; 5429/6790 tok/s;  15253 sec
[2021-04-25 06:54:37,181 INFO] Step 39450/50000; acc:  91.55; ppl:  1.36; xent: 0.31; lr: 0.00010; 5457/6871 tok/s;  15272 sec
[2021-04-25 06:54:55,543 INFO] Step 39500/50000; acc:  91.12; ppl:  1.38; xent: 0.32; lr: 0.00010; 5518/6740 tok/s;  15290 sec
[2021-04-25 06:55:15,127 INFO] Step 39550/50000; acc:  91.27; ppl:  1.37; xent: 0.32; lr: 0.00010; 5068/6476 tok/s;  15309 sec
[2021-04-25 06:55:34,704 INFO] Step 39600/50000; acc:  91.45; ppl:  1.36; xent: 0.31; lr: 0.00010; 5330/6620 tok/s;  15329 sec
[2021-04-25 06:55:53,751 INFO] Step 39650/50000; acc:  91.20; ppl:  1.37; xent: 0.32; lr: 0.00010; 5232/6542 tok/s;  15348 sec
[2021-04-25 06:56:12,961 INFO] Step 39700/50000; acc:  91.46; ppl:  1.36; xent: 0.31; lr: 0.00010; 5319/6545 tok/s;  15367 sec
[2021-04-25 06:56:31,718 INFO] Step 39750/50000; acc:  91.27; ppl:  1.37; xent: 0.31; lr: 0.00010; 5559/6788 tok/s;  15386 sec
[2021-04-25 06:56:51,347 INFO] Step 39800/50000; acc:  91.51; ppl:  1.36; xent: 0.31; lr: 0.00010; 5159/6483 tok/s;  15406 sec
[2021-04-25 06:57:10,558 INFO] Step 39850/50000; acc:  91.46; ppl:  1.36; xent: 0.31; lr: 0.00010; 5216/6707 tok/s;  15425 sec
[2021-04-25 06:57:29,735 INFO] Step 39900/50000; acc:  91.45; ppl:  1.36; xent: 0.31; lr: 0.00010; 5302/6605 tok/s;  15444 sec
[2021-04-25 06:57:49,266 INFO] Step 39950/50000; acc:  91.30; ppl:  1.37; xent: 0.31; lr: 0.00010; 5324/6495 tok/s;  15464 sec
[2021-04-25 06:58:07,983 INFO] Step 40000/50000; acc:  91.73; ppl:  1.35; xent: 0.30; lr: 0.00010; 5378/6790 tok/s;  15482 sec
[2021-04-25 06:58:07,985 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/basic/valid.txt, align=None)...
[2021-04-25 06:58:34,743 INFO] Validation perplexity: 1.42332
[2021-04-25 06:58:34,743 INFO] Validation accuracy: 90.7051
[2021-04-25 06:58:34,747 INFO] Saving checkpoint ../models/group1_params/basic_ops/model_step_40000.pt
[2021-04-25 06:58:53,898 INFO] Step 40050/50000; acc:  91.79; ppl:  1.35; xent: 0.30; lr: 0.00010; 2219/2873 tok/s;  15528 sec
[2021-04-25 06:59:11,195 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 06:59:12,980 INFO] Step 40100/50000; acc:  91.41; ppl:  1.36; xent: 0.31; lr: 0.00010; 5380/6719 tok/s;  15547 sec
[2021-04-25 06:59:32,186 INFO] Step 40150/50000; acc:  91.36; ppl:  1.37; xent: 0.31; lr: 0.00010; 5200/6506 tok/s;  15567 sec
[2021-04-25 06:59:50,995 INFO] Step 40200/50000; acc:  91.51; ppl:  1.36; xent: 0.31; lr: 0.00010; 5406/6759 tok/s;  15585 sec
[2021-04-25 07:00:09,585 INFO] Step 40250/50000; acc:  91.08; ppl:  1.38; xent: 0.32; lr: 0.00010; 5460/6701 tok/s;  15604 sec
[2021-04-25 07:00:29,191 INFO] Step 40300/50000; acc:  91.52; ppl:  1.36; xent: 0.31; lr: 0.00010; 5200/6671 tok/s;  15624 sec
[2021-04-25 07:00:48,200 INFO] Step 40350/50000; acc:  91.34; ppl:  1.37; xent: 0.31; lr: 0.00010; 5219/6503 tok/s;  15643 sec
[2021-04-25 07:01:07,545 INFO] Step 40400/50000; acc:  91.54; ppl:  1.36; xent: 0.31; lr: 0.00010; 5406/6760 tok/s;  15662 sec
[2021-04-25 07:01:26,625 INFO] Step 40450/50000; acc:  91.28; ppl:  1.37; xent: 0.31; lr: 0.00010; 5318/6357 tok/s;  15681 sec
[2021-04-25 07:01:45,906 INFO] Step 40500/50000; acc:  91.34; ppl:  1.37; xent: 0.31; lr: 0.00010; 5290/6585 tok/s;  15700 sec
[2021-04-25 07:02:05,582 INFO] Step 40550/50000; acc:  91.55; ppl:  1.35; xent: 0.30; lr: 0.00010; 5206/6631 tok/s;  15720 sec
[2021-04-25 07:02:25,507 INFO] Step 40600/50000; acc:  91.58; ppl:  1.36; xent: 0.31; lr: 0.00010; 5110/6420 tok/s;  15740 sec
[2021-04-25 07:02:44,586 INFO] Step 40650/50000; acc:  91.35; ppl:  1.36; xent: 0.31; lr: 0.00010; 5244/6584 tok/s;  15759 sec
[2021-04-25 07:03:03,727 INFO] Step 40700/50000; acc:  91.54; ppl:  1.35; xent: 0.30; lr: 0.00010; 5327/6539 tok/s;  15778 sec
[2021-04-25 07:03:22,932 INFO] Step 40750/50000; acc:  91.83; ppl:  1.35; xent: 0.30; lr: 0.00010; 5397/6863 tok/s;  15797 sec
[2021-04-25 07:03:41,382 INFO] Step 40800/50000; acc:  91.43; ppl:  1.36; xent: 0.31; lr: 0.00010; 5370/6744 tok/s;  15816 sec
[2021-04-25 07:03:53,484 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 07:04:00,553 INFO] Step 40850/50000; acc:  91.63; ppl:  1.35; xent: 0.30; lr: 0.00010; 5394/6921 tok/s;  15835 sec
[2021-04-25 07:04:19,973 INFO] Step 40900/50000; acc:  91.58; ppl:  1.36; xent: 0.31; lr: 0.00010; 5308/6578 tok/s;  15854 sec
[2021-04-25 07:04:38,055 INFO] Step 40950/50000; acc:  91.07; ppl:  1.38; xent: 0.32; lr: 0.00010; 5502/6808 tok/s;  15872 sec
[2021-04-25 07:04:57,443 INFO] Step 41000/50000; acc:  91.13; ppl:  1.37; xent: 0.31; lr: 0.00010; 5189/6497 tok/s;  15892 sec
[2021-04-25 07:05:16,824 INFO] Step 41050/50000; acc:  91.57; ppl:  1.36; xent: 0.30; lr: 0.00010; 5274/6627 tok/s;  15911 sec
[2021-04-25 07:05:36,200 INFO] Step 41100/50000; acc:  91.57; ppl:  1.36; xent: 0.31; lr: 0.00010; 5254/6649 tok/s;  15931 sec
[2021-04-25 07:05:55,383 INFO] Step 41150/50000; acc:  91.33; ppl:  1.36; xent: 0.31; lr: 0.00010; 5213/6461 tok/s;  15950 sec
[2021-04-25 07:06:14,270 INFO] Step 41200/50000; acc:  91.63; ppl:  1.35; xent: 0.30; lr: 0.00010; 5610/6772 tok/s;  15969 sec
[2021-04-25 07:06:33,945 INFO] Step 41250/50000; acc:  91.15; ppl:  1.37; xent: 0.32; lr: 0.00010; 5108/6331 tok/s;  15988 sec
[2021-04-25 07:06:53,743 INFO] Step 41300/50000; acc:  91.63; ppl:  1.35; xent: 0.30; lr: 0.00010; 5103/6431 tok/s;  16008 sec
[2021-04-25 07:07:13,240 INFO] Step 41350/50000; acc:  91.65; ppl:  1.35; xent: 0.30; lr: 0.00010; 5292/6697 tok/s;  16028 sec
[2021-04-25 07:07:32,361 INFO] Step 41400/50000; acc:  91.52; ppl:  1.36; xent: 0.31; lr: 0.00010; 5304/6597 tok/s;  16047 sec
[2021-04-25 07:07:50,984 INFO] Step 41450/50000; acc:  91.62; ppl:  1.35; xent: 0.30; lr: 0.00010; 5397/6800 tok/s;  16065 sec
[2021-04-25 07:08:09,342 INFO] Step 41500/50000; acc:  91.83; ppl:  1.34; xent: 0.30; lr: 0.00010; 5503/7015 tok/s;  16084 sec
[2021-04-25 07:08:28,408 INFO] Step 41550/50000; acc:  91.36; ppl:  1.36; xent: 0.31; lr: 0.00010; 5427/6654 tok/s;  16103 sec
[2021-04-25 07:08:34,959 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 07:08:47,526 INFO] Step 41600/50000; acc:  91.55; ppl:  1.35; xent: 0.30; lr: 0.00010; 5228/6718 tok/s;  16122 sec
[2021-04-25 07:09:06,250 INFO] Step 41650/50000; acc:  91.60; ppl:  1.35; xent: 0.30; lr: 0.00010; 5519/6969 tok/s;  16141 sec
[2021-04-25 07:09:24,839 INFO] Step 41700/50000; acc:  91.30; ppl:  1.37; xent: 0.31; lr: 0.00010; 5495/6668 tok/s;  16159 sec
[2021-04-25 07:09:44,402 INFO] Step 41750/50000; acc:  91.44; ppl:  1.36; xent: 0.31; lr: 0.00010; 5085/6574 tok/s;  16179 sec
[2021-04-25 07:10:03,552 INFO] Step 41800/50000; acc:  91.47; ppl:  1.36; xent: 0.31; lr: 0.00010; 5279/6519 tok/s;  16198 sec
[2021-04-25 07:10:22,504 INFO] Step 41850/50000; acc:  91.46; ppl:  1.36; xent: 0.31; lr: 0.00010; 5376/6732 tok/s;  16217 sec
[2021-04-25 07:10:41,847 INFO] Step 41900/50000; acc:  91.50; ppl:  1.35; xent: 0.30; lr: 0.00010; 5360/6496 tok/s;  16236 sec
[2021-04-25 07:11:00,062 INFO] Step 41950/50000; acc:  91.38; ppl:  1.36; xent: 0.31; lr: 0.00010; 5473/6759 tok/s;  16254 sec
[2021-04-25 07:11:19,704 INFO] Step 42000/50000; acc:  91.34; ppl:  1.36; xent: 0.31; lr: 0.00010; 5344/6583 tok/s;  16274 sec
[2021-04-25 07:11:38,670 INFO] Step 42050/50000; acc:  91.68; ppl:  1.35; xent: 0.30; lr: 0.00010; 5301/6746 tok/s;  16293 sec
[2021-04-25 07:11:58,082 INFO] Step 42100/50000; acc:  91.68; ppl:  1.35; xent: 0.30; lr: 0.00010; 5195/6671 tok/s;  16312 sec
[2021-04-25 07:12:17,174 INFO] Step 42150/50000; acc:  91.37; ppl:  1.36; xent: 0.31; lr: 0.00010; 5428/6522 tok/s;  16332 sec
[2021-04-25 07:12:36,499 INFO] Step 42200/50000; acc:  91.95; ppl:  1.34; xent: 0.29; lr: 0.00010; 5236/6735 tok/s;  16351 sec
[2021-04-25 07:12:54,977 INFO] Step 42250/50000; acc:  91.80; ppl:  1.34; xent: 0.30; lr: 0.00010; 5400/6933 tok/s;  16369 sec
[2021-04-25 07:13:03,251 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 07:13:13,673 INFO] Step 42300/50000; acc:  91.35; ppl:  1.36; xent: 0.31; lr: 0.00010; 5445/6797 tok/s;  16388 sec
[2021-04-25 07:13:33,277 INFO] Step 42350/50000; acc:  91.73; ppl:  1.35; xent: 0.30; lr: 0.00010; 5301/6639 tok/s;  16408 sec
[2021-04-25 07:13:51,538 INFO] Step 42400/50000; acc:  91.40; ppl:  1.36; xent: 0.31; lr: 0.00010; 5442/6756 tok/s;  16426 sec
[2021-04-25 07:14:10,831 INFO] Step 42450/50000; acc:  91.35; ppl:  1.36; xent: 0.31; lr: 0.00010; 5313/6646 tok/s;  16445 sec
[2021-04-25 07:14:30,393 INFO] Step 42500/50000; acc:  91.63; ppl:  1.35; xent: 0.30; lr: 0.00010; 5236/6594 tok/s;  16465 sec
[2021-04-25 07:14:49,496 INFO] Step 42550/50000; acc:  91.31; ppl:  1.36; xent: 0.31; lr: 0.00010; 5207/6469 tok/s;  16484 sec
[2021-04-25 07:15:08,323 INFO] Step 42600/50000; acc:  91.64; ppl:  1.35; xent: 0.30; lr: 0.00010; 5383/6771 tok/s;  16503 sec
[2021-04-25 07:15:27,771 INFO] Step 42650/50000; acc:  91.49; ppl:  1.36; xent: 0.31; lr: 0.00010; 5370/6416 tok/s;  16522 sec
[2021-04-25 07:15:46,564 INFO] Step 42700/50000; acc:  91.53; ppl:  1.36; xent: 0.30; lr: 0.00010; 5427/6774 tok/s;  16541 sec
[2021-04-25 07:16:05,899 INFO] Step 42750/50000; acc:  91.64; ppl:  1.35; xent: 0.30; lr: 0.00010; 5140/6546 tok/s;  16560 sec
[2021-04-25 07:16:25,467 INFO] Step 42800/50000; acc:  91.69; ppl:  1.35; xent: 0.30; lr: 0.00010; 5359/6748 tok/s;  16580 sec
[2021-04-25 07:16:44,769 INFO] Step 42850/50000; acc:  91.38; ppl:  1.36; xent: 0.31; lr: 0.00010; 5198/6402 tok/s;  16599 sec
[2021-04-25 07:17:03,953 INFO] Step 42900/50000; acc:  91.85; ppl:  1.34; xent: 0.29; lr: 0.00010; 5310/6666 tok/s;  16618 sec
[2021-04-25 07:17:22,695 INFO] Step 42950/50000; acc:  91.94; ppl:  1.34; xent: 0.29; lr: 0.00010; 5479/6930 tok/s;  16637 sec
[2021-04-25 07:17:41,710 INFO] Step 43000/50000; acc:  91.53; ppl:  1.35; xent: 0.30; lr: 0.00010; 5298/6706 tok/s;  16656 sec
[2021-04-25 07:17:44,536 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 07:18:00,637 INFO] Step 43050/50000; acc:  91.53; ppl:  1.35; xent: 0.30; lr: 0.00010; 5306/6717 tok/s;  16675 sec
[2021-04-25 07:18:19,438 INFO] Step 43100/50000; acc:  91.68; ppl:  1.35; xent: 0.30; lr: 0.00010; 5417/6786 tok/s;  16694 sec
[2021-04-25 07:18:37,863 INFO] Step 43150/50000; acc:  91.29; ppl:  1.37; xent: 0.31; lr: 0.00010; 5581/6780 tok/s;  16712 sec
[2021-04-25 07:18:57,167 INFO] Step 43200/50000; acc:  91.41; ppl:  1.36; xent: 0.31; lr: 0.00010; 5169/6568 tok/s;  16732 sec
[2021-04-25 07:19:16,936 INFO] Step 43250/50000; acc:  91.64; ppl:  1.35; xent: 0.30; lr: 0.00010; 5199/6479 tok/s;  16751 sec
[2021-04-25 07:19:36,117 INFO] Step 43300/50000; acc:  91.61; ppl:  1.36; xent: 0.30; lr: 0.00010; 5304/6712 tok/s;  16770 sec
[2021-04-25 07:19:55,703 INFO] Step 43350/50000; acc:  91.54; ppl:  1.36; xent: 0.30; lr: 0.00010; 5173/6218 tok/s;  16790 sec
[2021-04-25 07:20:14,415 INFO] Step 43400/50000; acc:  91.45; ppl:  1.36; xent: 0.30; lr: 0.00010; 5459/6841 tok/s;  16809 sec
[2021-04-25 07:20:34,176 INFO] Step 43450/50000; acc:  91.68; ppl:  1.35; xent: 0.30; lr: 0.00010; 5190/6438 tok/s;  16829 sec
[2021-04-25 07:20:53,577 INFO] Step 43500/50000; acc:  91.85; ppl:  1.34; xent: 0.29; lr: 0.00010; 5264/6711 tok/s;  16848 sec
[2021-04-25 07:21:12,362 INFO] Step 43550/50000; acc:  91.59; ppl:  1.35; xent: 0.30; lr: 0.00010; 5286/6657 tok/s;  16867 sec
[2021-04-25 07:21:32,030 INFO] Step 43600/50000; acc:  91.48; ppl:  1.36; xent: 0.30; lr: 0.00010; 5345/6501 tok/s;  16886 sec
[2021-04-25 07:21:50,543 INFO] Step 43650/50000; acc:  91.88; ppl:  1.34; xent: 0.29; lr: 0.00010; 5457/6932 tok/s;  16905 sec
[2021-04-25 07:22:09,539 INFO] Step 43700/50000; acc:  91.92; ppl:  1.34; xent: 0.29; lr: 0.00010; 5273/6794 tok/s;  16924 sec
[2021-04-25 07:22:26,209 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 07:22:28,916 INFO] Step 43750/50000; acc:  91.67; ppl:  1.34; xent: 0.30; lr: 0.00010; 5328/6694 tok/s;  16943 sec
[2021-04-25 07:22:48,053 INFO] Step 43800/50000; acc:  91.49; ppl:  1.36; xent: 0.30; lr: 0.00010; 5281/6494 tok/s;  16962 sec
[2021-04-25 07:23:06,591 INFO] Step 43850/50000; acc:  91.64; ppl:  1.35; xent: 0.30; lr: 0.00010; 5420/6884 tok/s;  16981 sec
[2021-04-25 07:23:25,173 INFO] Step 43900/50000; acc:  91.15; ppl:  1.37; xent: 0.31; lr: 0.00010; 5416/6645 tok/s;  17000 sec
[2021-04-25 07:23:45,254 INFO] Step 43950/50000; acc:  91.70; ppl:  1.35; xent: 0.30; lr: 0.00010; 5158/6564 tok/s;  17020 sec
[2021-04-25 07:24:04,267 INFO] Step 44000/50000; acc:  91.57; ppl:  1.35; xent: 0.30; lr: 0.00010; 5242/6511 tok/s;  17039 sec
[2021-04-25 07:24:23,678 INFO] Step 44050/50000; acc:  91.82; ppl:  1.34; xent: 0.30; lr: 0.00010; 5309/6771 tok/s;  17058 sec
[2021-04-25 07:24:42,691 INFO] Step 44100/50000; acc:  91.47; ppl:  1.36; xent: 0.30; lr: 0.00010; 5451/6451 tok/s;  17077 sec
[2021-04-25 07:25:01,967 INFO] Step 44150/50000; acc:  91.58; ppl:  1.35; xent: 0.30; lr: 0.00010; 5225/6496 tok/s;  17096 sec
[2021-04-25 07:25:21,345 INFO] Step 44200/50000; acc:  91.69; ppl:  1.34; xent: 0.30; lr: 0.00010; 5199/6618 tok/s;  17116 sec
[2021-04-25 07:25:41,435 INFO] Step 44250/50000; acc:  91.72; ppl:  1.35; xent: 0.30; lr: 0.00010; 5127/6475 tok/s;  17136 sec
[2021-04-25 07:26:01,020 INFO] Step 44300/50000; acc:  91.70; ppl:  1.35; xent: 0.30; lr: 0.00010; 5205/6537 tok/s;  17155 sec
[2021-04-25 07:26:19,760 INFO] Step 44350/50000; acc:  91.68; ppl:  1.34; xent: 0.30; lr: 0.00010; 5327/6506 tok/s;  17174 sec
[2021-04-25 07:26:38,927 INFO] Step 44400/50000; acc:  92.08; ppl:  1.33; xent: 0.29; lr: 0.00010; 5455/6924 tok/s;  17193 sec
[2021-04-25 07:26:57,506 INFO] Step 44450/50000; acc:  91.53; ppl:  1.35; xent: 0.30; lr: 0.00010; 5365/6765 tok/s;  17212 sec
[2021-04-25 07:27:08,681 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 07:27:16,656 INFO] Step 44500/50000; acc:  91.79; ppl:  1.34; xent: 0.29; lr: 0.00010; 5295/6872 tok/s;  17231 sec
[2021-04-25 07:27:35,834 INFO] Step 44550/50000; acc:  91.69; ppl:  1.35; xent: 0.30; lr: 0.00010; 5415/6621 tok/s;  17250 sec
[2021-04-25 07:27:54,210 INFO] Step 44600/50000; acc:  91.31; ppl:  1.36; xent: 0.31; lr: 0.00010; 5468/6703 tok/s;  17269 sec
[2021-04-25 07:28:13,390 INFO] Step 44650/50000; acc:  91.37; ppl:  1.36; xent: 0.30; lr: 0.00010; 5194/6540 tok/s;  17288 sec
[2021-04-25 07:28:32,745 INFO] Step 44700/50000; acc:  91.78; ppl:  1.34; xent: 0.29; lr: 0.00010; 5239/6597 tok/s;  17307 sec
[2021-04-25 07:28:52,362 INFO] Step 44750/50000; acc:  91.76; ppl:  1.35; xent: 0.30; lr: 0.00010; 5257/6637 tok/s;  17327 sec
[2021-04-25 07:29:11,457 INFO] Step 44800/50000; acc:  91.61; ppl:  1.35; xent: 0.30; lr: 0.00010; 5281/6495 tok/s;  17346 sec
[2021-04-25 07:29:30,156 INFO] Step 44850/50000; acc:  91.85; ppl:  1.34; xent: 0.29; lr: 0.00010; 5574/6783 tok/s;  17364 sec
[2021-04-25 07:29:50,021 INFO] Step 44900/50000; acc:  91.58; ppl:  1.35; xent: 0.30; lr: 0.00010; 5165/6426 tok/s;  17384 sec
[2021-04-25 07:30:09,551 INFO] Step 44950/50000; acc:  91.74; ppl:  1.34; xent: 0.30; lr: 0.00010; 5119/6442 tok/s;  17404 sec
[2021-04-25 07:30:29,023 INFO] Step 45000/50000; acc:  91.86; ppl:  1.34; xent: 0.29; lr: 0.00010; 5201/6590 tok/s;  17423 sec
[2021-04-25 07:30:29,024 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/basic/valid.txt, align=None)...
[2021-04-25 07:30:55,910 INFO] Validation perplexity: 1.42447
[2021-04-25 07:30:55,911 INFO] Validation accuracy: 90.6467
[2021-04-25 07:30:55,915 INFO] Saving checkpoint ../models/group1_params/basic_ops/model_step_45000.pt
[2021-04-25 07:31:15,110 INFO] Step 45050/50000; acc:  91.75; ppl:  1.34; xent: 0.29; lr: 0.00010; 2223/2771 tok/s;  17469 sec
[2021-04-25 07:31:34,191 INFO] Step 45100/50000; acc:  91.88; ppl:  1.34; xent: 0.29; lr: 0.00010; 5376/6654 tok/s;  17489 sec
[2021-04-25 07:31:52,480 INFO] Step 45150/50000; acc:  91.99; ppl:  1.33; xent: 0.29; lr: 0.00010; 5404/7034 tok/s;  17507 sec
[2021-04-25 07:32:11,518 INFO] Step 45200/50000; acc:  91.60; ppl:  1.35; xent: 0.30; lr: 0.00010; 5490/6735 tok/s;  17526 sec
[2021-04-25 07:32:17,195 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 07:32:30,744 INFO] Step 45250/50000; acc:  91.75; ppl:  1.34; xent: 0.29; lr: 0.00010; 5227/6675 tok/s;  17545 sec
[2021-04-25 07:32:49,326 INFO] Step 45300/50000; acc:  91.70; ppl:  1.35; xent: 0.30; lr: 0.00010; 5449/6925 tok/s;  17564 sec
[2021-04-25 07:33:08,104 INFO] Step 45350/50000; acc:  91.51; ppl:  1.35; xent: 0.30; lr: 0.00010; 5472/6633 tok/s;  17582 sec
[2021-04-25 07:33:27,790 INFO] Step 45400/50000; acc:  91.64; ppl:  1.35; xent: 0.30; lr: 0.00010; 5112/6569 tok/s;  17602 sec
[2021-04-25 07:33:47,008 INFO] Step 45450/50000; acc:  91.63; ppl:  1.35; xent: 0.30; lr: 0.00010; 5207/6463 tok/s;  17621 sec
[2021-04-25 07:34:05,947 INFO] Step 45500/50000; acc:  91.79; ppl:  1.34; xent: 0.30; lr: 0.00010; 5335/6674 tok/s;  17640 sec
[2021-04-25 07:34:25,417 INFO] Step 45550/50000; acc:  91.64; ppl:  1.35; xent: 0.30; lr: 0.00010; 5415/6538 tok/s;  17660 sec
[2021-04-25 07:34:43,872 INFO] Step 45600/50000; acc:  91.65; ppl:  1.35; xent: 0.30; lr: 0.00010; 5423/6730 tok/s;  17678 sec
[2021-04-25 07:35:03,144 INFO] Step 45650/50000; acc:  91.52; ppl:  1.35; xent: 0.30; lr: 0.00010; 5359/6562 tok/s;  17697 sec
[2021-04-25 07:35:22,585 INFO] Step 45700/50000; acc:  92.08; ppl:  1.33; xent: 0.28; lr: 0.00010; 5287/6770 tok/s;  17717 sec
[2021-04-25 07:35:41,970 INFO] Step 45750/50000; acc:  91.85; ppl:  1.34; xent: 0.29; lr: 0.00010; 5139/6525 tok/s;  17736 sec
[2021-04-25 07:36:00,605 INFO] Step 45800/50000; acc:  91.61; ppl:  1.34; xent: 0.30; lr: 0.00010; 5460/6672 tok/s;  17755 sec
[2021-04-25 07:36:19,698 INFO] Step 45850/50000; acc:  92.10; ppl:  1.33; xent: 0.28; lr: 0.00010; 5364/6828 tok/s;  17774 sec
[2021-04-25 07:36:38,471 INFO] Step 45900/50000; acc:  91.94; ppl:  1.33; xent: 0.29; lr: 0.00010; 5413/6838 tok/s;  17793 sec
[2021-04-25 07:36:45,866 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 07:36:57,076 INFO] Step 45950/50000; acc:  91.54; ppl:  1.35; xent: 0.30; lr: 0.00010; 5363/6801 tok/s;  17811 sec
[2021-04-25 07:37:16,517 INFO] Step 46000/50000; acc:  91.89; ppl:  1.34; xent: 0.29; lr: 0.00010; 5390/6707 tok/s;  17831 sec
[2021-04-25 07:37:34,879 INFO] Step 46050/50000; acc:  91.53; ppl:  1.35; xent: 0.30; lr: 0.00010; 5448/6748 tok/s;  17849 sec
[2021-04-25 07:37:54,186 INFO] Step 46100/50000; acc:  91.55; ppl:  1.35; xent: 0.30; lr: 0.00010; 5207/6644 tok/s;  17869 sec
[2021-04-25 07:38:13,833 INFO] Step 46150/50000; acc:  91.80; ppl:  1.34; xent: 0.29; lr: 0.00010; 5251/6547 tok/s;  17888 sec
[2021-04-25 07:38:33,208 INFO] Step 46200/50000; acc:  91.66; ppl:  1.35; xent: 0.30; lr: 0.00010; 5185/6495 tok/s;  17908 sec
[2021-04-25 07:38:51,950 INFO] Step 46250/50000; acc:  91.77; ppl:  1.34; xent: 0.29; lr: 0.00010; 5362/6680 tok/s;  17926 sec
[2021-04-25 07:39:11,075 INFO] Step 46300/50000; acc:  91.72; ppl:  1.34; xent: 0.29; lr: 0.00010; 5404/6528 tok/s;  17945 sec
[2021-04-25 07:39:30,049 INFO] Step 46350/50000; acc:  91.62; ppl:  1.35; xent: 0.30; lr: 0.00010; 5463/6730 tok/s;  17964 sec
[2021-04-25 07:39:49,303 INFO] Step 46400/50000; acc:  91.89; ppl:  1.33; xent: 0.29; lr: 0.00010; 5192/6578 tok/s;  17984 sec
[2021-04-25 07:40:08,867 INFO] Step 46450/50000; acc:  91.88; ppl:  1.34; xent: 0.29; lr: 0.00010; 5270/6703 tok/s;  18003 sec
[2021-04-25 07:40:28,401 INFO] Step 46500/50000; acc:  91.72; ppl:  1.34; xent: 0.30; lr: 0.00010; 5251/6477 tok/s;  18023 sec
[2021-04-25 07:40:47,594 INFO] Step 46550/50000; acc:  91.92; ppl:  1.33; xent: 0.29; lr: 0.00010; 5245/6512 tok/s;  18042 sec
[2021-04-25 07:41:06,143 INFO] Step 46600/50000; acc:  92.19; ppl:  1.32; xent: 0.28; lr: 0.00010; 5428/6998 tok/s;  18060 sec
[2021-04-25 07:41:25,329 INFO] Step 46650/50000; acc:  91.69; ppl:  1.34; xent: 0.29; lr: 0.00010; 5328/6700 tok/s;  18080 sec
[2021-04-25 07:41:27,399 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 07:41:44,446 INFO] Step 46700/50000; acc:  91.81; ppl:  1.34; xent: 0.29; lr: 0.00010; 5344/6701 tok/s;  18099 sec
[2021-04-25 07:42:02,923 INFO] Step 46750/50000; acc:  91.91; ppl:  1.33; xent: 0.29; lr: 0.00010; 5402/6794 tok/s;  18117 sec
[2021-04-25 07:42:21,836 INFO] Step 46800/50000; acc:  91.42; ppl:  1.36; xent: 0.30; lr: 0.00010; 5478/6676 tok/s;  18136 sec
[2021-04-25 07:42:41,308 INFO] Step 46850/50000; acc:  91.60; ppl:  1.34; xent: 0.30; lr: 0.00010; 5153/6548 tok/s;  18156 sec
[2021-04-25 07:43:00,857 INFO] Step 46900/50000; acc:  91.82; ppl:  1.34; xent: 0.29; lr: 0.00010; 5163/6470 tok/s;  18175 sec
[2021-04-25 07:43:20,341 INFO] Step 46950/50000; acc:  91.78; ppl:  1.34; xent: 0.29; lr: 0.00010; 5256/6629 tok/s;  18195 sec
[2021-04-25 07:43:39,644 INFO] Step 47000/50000; acc:  91.78; ppl:  1.34; xent: 0.29; lr: 0.00010; 5310/6404 tok/s;  18214 sec
[2021-04-25 07:43:58,290 INFO] Step 47050/50000; acc:  91.62; ppl:  1.34; xent: 0.29; lr: 0.00010; 5420/6728 tok/s;  18233 sec
[2021-04-25 07:44:18,131 INFO] Step 47100/50000; acc:  91.85; ppl:  1.34; xent: 0.29; lr: 0.00010; 5117/6377 tok/s;  18252 sec
[2021-04-25 07:44:37,934 INFO] Step 47150/50000; acc:  92.02; ppl:  1.33; xent: 0.29; lr: 0.00010; 5244/6644 tok/s;  18272 sec
[2021-04-25 07:44:56,588 INFO] Step 47200/50000; acc:  91.71; ppl:  1.34; xent: 0.29; lr: 0.00010; 5350/6686 tok/s;  18291 sec
[2021-04-25 07:45:16,004 INFO] Step 47250/50000; acc:  91.81; ppl:  1.34; xent: 0.29; lr: 0.00010; 5334/6583 tok/s;  18310 sec
[2021-04-25 07:45:34,642 INFO] Step 47300/50000; acc:  92.13; ppl:  1.32; xent: 0.28; lr: 0.00010; 5522/7036 tok/s;  18329 sec
[2021-04-25 07:45:53,386 INFO] Step 47350/50000; acc:  91.92; ppl:  1.33; xent: 0.29; lr: 0.00010; 5283/6724 tok/s;  18348 sec
[2021-04-25 07:46:09,250 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 07:46:12,492 INFO] Step 47400/50000; acc:  91.82; ppl:  1.33; xent: 0.29; lr: 0.00010; 5312/6759 tok/s;  18367 sec
[2021-04-25 07:46:31,879 INFO] Step 47450/50000; acc:  91.76; ppl:  1.34; xent: 0.29; lr: 0.00010; 5280/6510 tok/s;  18386 sec
[2021-04-25 07:46:50,435 INFO] Step 47500/50000; acc:  91.76; ppl:  1.34; xent: 0.30; lr: 0.00010; 5511/6822 tok/s;  18405 sec
[2021-04-25 07:47:09,301 INFO] Step 47550/50000; acc:  91.42; ppl:  1.35; xent: 0.30; lr: 0.00010; 5227/6471 tok/s;  18424 sec
[2021-04-25 07:47:29,753 INFO] Step 47600/50000; acc:  91.88; ppl:  1.33; xent: 0.29; lr: 0.00010; 5111/6555 tok/s;  18444 sec
[2021-04-25 07:47:48,580 INFO] Step 47650/50000; acc:  91.68; ppl:  1.34; xent: 0.29; lr: 0.00010; 5319/6590 tok/s;  18463 sec
[2021-04-25 07:48:07,847 INFO] Step 47700/50000; acc:  91.95; ppl:  1.33; xent: 0.29; lr: 0.00010; 5259/6732 tok/s;  18482 sec
[2021-04-25 07:48:26,866 INFO] Step 47750/50000; acc:  91.62; ppl:  1.34; xent: 0.29; lr: 0.00010; 5493/6444 tok/s;  18501 sec
[2021-04-25 07:48:46,329 INFO] Step 47800/50000; acc:  91.65; ppl:  1.34; xent: 0.29; lr: 0.00010; 5216/6524 tok/s;  18521 sec
[2021-04-25 07:49:05,657 INFO] Step 47850/50000; acc:  91.82; ppl:  1.33; xent: 0.29; lr: 0.00010; 5165/6574 tok/s;  18540 sec
[2021-04-25 07:49:25,493 INFO] Step 47900/50000; acc:  91.97; ppl:  1.33; xent: 0.29; lr: 0.00010; 5142/6537 tok/s;  18560 sec
[2021-04-25 07:49:45,481 INFO] Step 47950/50000; acc:  91.81; ppl:  1.33; xent: 0.29; lr: 0.00010; 5181/6384 tok/s;  18580 sec
[2021-04-25 07:50:04,085 INFO] Step 48000/50000; acc:  91.84; ppl:  1.33; xent: 0.29; lr: 0.00010; 5386/6671 tok/s;  18598 sec
[2021-04-25 07:50:22,807 INFO] Step 48050/50000; acc:  92.24; ppl:  1.32; xent: 0.28; lr: 0.00010; 5502/7006 tok/s;  18617 sec
[2021-04-25 07:50:41,425 INFO] Step 48100/50000; acc:  91.80; ppl:  1.34; xent: 0.29; lr: 0.00010; 5472/6864 tok/s;  18636 sec
[2021-04-25 07:50:51,669 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 07:51:00,501 INFO] Step 48150/50000; acc:  91.90; ppl:  1.33; xent: 0.29; lr: 0.00010; 5252/6809 tok/s;  18655 sec
[2021-04-25 07:51:19,470 INFO] Step 48200/50000; acc:  91.94; ppl:  1.33; xent: 0.29; lr: 0.00010; 5382/6642 tok/s;  18674 sec
[2021-04-25 07:51:37,690 INFO] Step 48250/50000; acc:  91.49; ppl:  1.35; xent: 0.30; lr: 0.00010; 5581/6808 tok/s;  18692 sec
[2021-04-25 07:51:57,186 INFO] Step 48300/50000; acc:  91.66; ppl:  1.34; xent: 0.30; lr: 0.00010; 5201/6549 tok/s;  18712 sec
[2021-04-25 07:52:16,776 INFO] Step 48350/50000; acc:  91.93; ppl:  1.33; xent: 0.29; lr: 0.00010; 5068/6405 tok/s;  18731 sec
[2021-04-25 07:52:35,769 INFO] Step 48400/50000; acc:  91.86; ppl:  1.34; xent: 0.29; lr: 0.00010; 5478/6875 tok/s;  18750 sec
[2021-04-25 07:52:55,107 INFO] Step 48450/50000; acc:  91.79; ppl:  1.34; xent: 0.29; lr: 0.00010; 5246/6397 tok/s;  18769 sec
[2021-04-25 07:53:13,659 INFO] Step 48500/50000; acc:  91.99; ppl:  1.33; xent: 0.28; lr: 0.00010; 5517/6834 tok/s;  18788 sec
[2021-04-25 07:53:33,450 INFO] Step 48550/50000; acc:  91.79; ppl:  1.34; xent: 0.29; lr: 0.00010; 5218/6424 tok/s;  18808 sec
[2021-04-25 07:53:52,889 INFO] Step 48600/50000; acc:  91.91; ppl:  1.34; xent: 0.29; lr: 0.00010; 5201/6558 tok/s;  18827 sec
[2021-04-25 07:54:11,845 INFO] Step 48650/50000; acc:  91.99; ppl:  1.33; xent: 0.29; lr: 0.00010; 5284/6694 tok/s;  18846 sec
[2021-04-25 07:54:30,949 INFO] Step 48700/50000; acc:  92.01; ppl:  1.33; xent: 0.28; lr: 0.00010; 5321/6684 tok/s;  18865 sec
[2021-04-25 07:54:50,195 INFO] Step 48750/50000; acc:  92.04; ppl:  1.33; xent: 0.28; lr: 0.00010; 5407/6564 tok/s;  18885 sec
[2021-04-25 07:55:08,455 INFO] Step 48800/50000; acc:  92.25; ppl:  1.32; xent: 0.28; lr: 0.00010; 5427/7123 tok/s;  18903 sec
[2021-04-25 07:55:27,224 INFO] Step 48850/50000; acc:  91.71; ppl:  1.34; xent: 0.29; lr: 0.00010; 5501/6783 tok/s;  18922 sec
[2021-04-25 07:55:32,234 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 07:55:46,561 INFO] Step 48900/50000; acc:  91.93; ppl:  1.33; xent: 0.28; lr: 0.00010; 5307/6765 tok/s;  18941 sec
[2021-04-25 07:56:05,053 INFO] Step 48950/50000; acc:  91.92; ppl:  1.33; xent: 0.29; lr: 0.00010; 5404/6810 tok/s;  18959 sec
[2021-04-25 07:56:23,775 INFO] Step 49000/50000; acc:  91.60; ppl:  1.34; xent: 0.30; lr: 0.00010; 5388/6625 tok/s;  18978 sec
[2021-04-25 07:56:43,274 INFO] Step 49050/50000; acc:  91.88; ppl:  1.33; xent: 0.29; lr: 0.00010; 5233/6658 tok/s;  18998 sec
[2021-04-25 07:57:02,793 INFO] Step 49100/50000; acc:  91.81; ppl:  1.34; xent: 0.29; lr: 0.00010; 5215/6416 tok/s;  19017 sec
[2021-04-25 07:57:21,430 INFO] Step 49150/50000; acc:  92.10; ppl:  1.33; xent: 0.28; lr: 0.00010; 5314/6698 tok/s;  19036 sec
[2021-04-25 07:57:41,077 INFO] Step 49200/50000; acc:  91.84; ppl:  1.33; xent: 0.29; lr: 0.00010; 5413/6524 tok/s;  19055 sec
[2021-04-25 07:57:59,647 INFO] Step 49250/50000; acc:  91.85; ppl:  1.33; xent: 0.29; lr: 0.00010; 5416/6795 tok/s;  19074 sec
[2021-04-25 07:58:18,956 INFO] Step 49300/50000; acc:  91.63; ppl:  1.34; xent: 0.29; lr: 0.00010; 5257/6415 tok/s;  19093 sec
[2021-04-25 07:58:38,650 INFO] Step 49350/50000; acc:  92.19; ppl:  1.32; xent: 0.28; lr: 0.00010; 5250/6769 tok/s;  19113 sec
[2021-04-25 07:58:57,977 INFO] Step 49400/50000; acc:  91.97; ppl:  1.33; xent: 0.28; lr: 0.00010; 5207/6549 tok/s;  19132 sec
[2021-04-25 07:59:16,649 INFO] Step 49450/50000; acc:  91.98; ppl:  1.33; xent: 0.28; lr: 0.00010; 5400/6662 tok/s;  19151 sec
[2021-04-25 07:59:35,517 INFO] Step 49500/50000; acc:  92.27; ppl:  1.32; xent: 0.28; lr: 0.00010; 5384/6888 tok/s;  19170 sec
[2021-04-25 07:59:54,794 INFO] Step 49550/50000; acc:  91.93; ppl:  1.33; xent: 0.29; lr: 0.00010; 5345/6615 tok/s;  19189 sec
[2021-04-25 08:00:01,297 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/basic/train.txt, align=None)...
[2021-04-25 08:00:13,179 INFO] Step 49600/50000; acc:  91.92; ppl:  1.33; xent: 0.29; lr: 0.00010; 5465/7022 tok/s;  19208 sec
[2021-04-25 08:00:32,667 INFO] Step 49650/50000; acc:  92.03; ppl:  1.33; xent: 0.28; lr: 0.00010; 5290/6579 tok/s;  19227 sec
[2021-04-25 08:00:50,946 INFO] Step 49700/50000; acc:  91.86; ppl:  1.33; xent: 0.29; lr: 0.00010; 5573/6884 tok/s;  19245 sec
[2021-04-25 08:01:10,373 INFO] Step 49750/50000; acc:  91.72; ppl:  1.34; xent: 0.29; lr: 0.00010; 5127/6528 tok/s;  19265 sec
[2021-04-25 08:01:29,791 INFO] Step 49800/50000; acc:  91.98; ppl:  1.33; xent: 0.28; lr: 0.00010; 5215/6534 tok/s;  19284 sec
[2021-04-25 08:01:48,949 INFO] Step 49850/50000; acc:  91.84; ppl:  1.33; xent: 0.29; lr: 0.00010; 5310/6649 tok/s;  19303 sec
[2021-04-25 08:02:07,868 INFO] Step 49900/50000; acc:  92.06; ppl:  1.33; xent: 0.28; lr: 0.00010; 5419/6651 tok/s;  19322 sec
[2021-04-25 08:02:26,598 INFO] Step 49950/50000; acc:  91.80; ppl:  1.34; xent: 0.29; lr: 0.00010; 5402/6586 tok/s;  19341 sec
[2021-04-25 08:02:45,734 INFO] Step 50000/50000; acc:  91.94; ppl:  1.33; xent: 0.29; lr: 0.00005; 5464/6759 tok/s;  19360 sec
[2021-04-25 08:02:45,735 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/basic/valid.txt, align=None)...
[2021-04-25 08:03:12,633 INFO] Validation perplexity: 1.42827
[2021-04-25 08:03:12,633 INFO] Validation accuracy: 90.7228
[2021-04-25 08:03:12,637 INFO] Saving checkpoint ../models/group1_params/basic_ops/model_step_50000.pt

Strictly condensed EditOperations:

modelGroup1Strict = HephaestusModel(MODEL_GROUP1_STRICT)
modelGroup1Strict.train(
    DATA_SMALL_METHODS_TRAIN_BUGGY,
    DATA_SMALL_OPS_GENERAL_STRICT_TRAIN,
    DATA_SMALL_METHODS_VALID_BUGGY,
    DATA_SMALL_OPS_GENERAL_STRICT_VALID,
    **GROUP1_PARAMS
)
[2021-04-25 01:08:32,213 INFO] Counter vocab from -1 samples.
[2021-04-25 01:08:32,213 INFO] n_sample=-1: Build vocab on full datasets.
[2021-04-25 01:08:32,218 INFO] corpus_1's transforms: TransformPipe()
[2021-04-25 01:08:32,218 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 01:08:32,819 INFO] Counters src:429
[2021-04-25 01:08:32,819 INFO] Counters tgt:444
[2021-04-25 01:08:32,819 WARNING] path ../models/group1_params/strict_ops/save_data.vocab.src exists, may overwrite...
[2021-04-25 01:08:32,821 WARNING] path ../models/group1_params/strict_ops/save_data.vocab.tgt exists, may overwrite...
[2021-04-25 01:08:33,483 INFO] Parsed 2 corpora from -data.
[2021-04-25 01:08:33,484 INFO] Get special vocabs from Transforms: {'src': set(), 'tgt': set()}.
[2021-04-25 01:08:33,484 INFO] Loading vocab from text file...
[2021-04-25 01:08:33,484 INFO] Loading src vocabulary from ../models/group1_params/strict_ops/save_data.vocab.src
[2021-04-25 01:08:33,486 INFO] Loaded src vocab has 429 tokens.
[2021-04-25 01:08:33,486 INFO] Loading tgt vocabulary from ../models/group1_params/strict_ops/save_data.vocab.tgt
[2021-04-25 01:08:33,488 INFO] Loaded tgt vocab has 444 tokens.
[2021-04-25 01:08:33,488 INFO] Building fields with vocab in counters...
[2021-04-25 01:08:33,488 INFO]  * tgt vocab size: 448.
[2021-04-25 01:08:33,489 INFO]  * src vocab size: 431.
[2021-04-25 01:08:33,489 INFO]  * src vocab size = 431
[2021-04-25 01:08:33,489 INFO]  * tgt vocab size = 448
[2021-04-25 01:08:33,490 INFO] Building model...
[2021-04-25 01:08:34,629 INFO] NMTModel(
  (encoder): RNNEncoder(
    (embeddings): Embeddings(
      (make_embedding): Sequential(
        (emb_luts): Elementwise(
          (0): Embedding(431, 512, padding_idx=1)
        )
      )
    )
    (rnn): GRU(512, 256, num_layers=2, dropout=0.2)
  )
  (decoder): InputFeedRNNDecoder(
    (embeddings): Embeddings(
      (make_embedding): Sequential(
        (emb_luts): Elementwise(
          (0): Embedding(448, 512, padding_idx=1)
        )
      )
    )
    (dropout): Dropout(p=0.2, inplace=False)
    (rnn): StackedGRU(
      (dropout): Dropout(p=0.2, inplace=False)
      (layers): ModuleList(
        (0): GRUCell(768, 256)
        (1): GRUCell(256, 256)
      )
    )
    (attn): GlobalAttention(
      (linear_context): Linear(in_features=256, out_features=256, bias=False)
      (linear_query): Linear(in_features=256, out_features=256, bias=True)
      (v): Linear(in_features=256, out_features=1, bias=False)
      (linear_out): Linear(in_features=512, out_features=256, bias=True)
    )
  )
  (generator): Sequential(
    (0): Linear(in_features=256, out_features=448, bias=True)
    (1): Cast()
    (2): LogSoftmax(dim=-1)
  )
)
[2021-04-25 01:08:34,629 INFO] encoder: 1206784
[2021-04-25 01:08:34,629 INFO] decoder: 1790144
[2021-04-25 01:08:34,629 INFO] * number of parameters: 2996928
[2021-04-25 01:08:34,630 INFO] Starting training on GPU: [0]
[2021-04-25 01:08:34,630 INFO] Start training loop and validate every 5000 steps...
[2021-04-25 01:08:34,630 INFO] corpus_1's transforms: TransformPipe()
[2021-04-25 01:08:34,630 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 01:08:44,609 INFO] Step 50/50000; acc:  20.18; ppl: 97.76; xent: 4.58; lr: 0.00010; 10489/4277 tok/s;     10 sec
[2021-04-25 01:08:54,468 INFO] Step 100/50000; acc:  28.82; ppl: 29.99; xent: 3.40; lr: 0.00010; 9889/4324 tok/s;     20 sec
[2021-04-25 01:09:04,530 INFO] Step 150/50000; acc:  41.11; ppl: 16.75; xent: 2.82; lr: 0.00010; 10225/4329 tok/s;     30 sec
[2021-04-25 01:09:14,282 INFO] Step 200/50000; acc:  45.22; ppl: 10.78; xent: 2.38; lr: 0.00010; 10414/4379 tok/s;     40 sec
[2021-04-25 01:09:23,822 INFO] Step 250/50000; acc:  45.56; ppl:  9.78; xent: 2.28; lr: 0.00010; 10662/4443 tok/s;     49 sec
[2021-04-25 01:09:33,653 INFO] Step 300/50000; acc:  45.63; ppl:  9.55; xent: 2.26; lr: 0.00010; 10368/4415 tok/s;     59 sec
[2021-04-25 01:09:43,684 INFO] Step 350/50000; acc:  46.22; ppl:  9.10; xent: 2.21; lr: 0.00010; 10173/4194 tok/s;     69 sec
[2021-04-25 01:09:54,128 INFO] Step 400/50000; acc:  46.34; ppl:  8.90; xent: 2.19; lr: 0.00010; 9973/4084 tok/s;     79 sec
[2021-04-25 01:10:03,648 INFO] Step 450/50000; acc:  46.15; ppl:  8.73; xent: 2.17; lr: 0.00010; 10444/4453 tok/s;     89 sec
[2021-04-25 01:10:13,434 INFO] Step 500/50000; acc:  46.64; ppl:  8.60; xent: 2.15; lr: 0.00010; 10554/4403 tok/s;     99 sec
[2021-04-25 01:10:23,214 INFO] Step 550/50000; acc:  47.14; ppl:  8.37; xent: 2.12; lr: 0.00010; 10113/4327 tok/s;    109 sec
[2021-04-25 01:10:33,748 INFO] Step 600/50000; acc:  47.95; ppl:  7.93; xent: 2.07; lr: 0.00010; 9926/4101 tok/s;    119 sec
[2021-04-25 01:10:43,013 INFO] Step 650/50000; acc:  48.79; ppl:  7.75; xent: 2.05; lr: 0.00010; 10815/4500 tok/s;    128 sec
[2021-04-25 01:10:52,889 INFO] Step 700/50000; acc:  49.33; ppl:  7.39; xent: 2.00; lr: 0.00010; 10382/4342 tok/s;    138 sec
[2021-04-25 01:10:53,623 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 01:11:02,966 INFO] Step 750/50000; acc:  50.15; ppl:  7.21; xent: 1.98; lr: 0.00010; 10114/4313 tok/s;    148 sec
[2021-04-25 01:11:12,420 INFO] Step 800/50000; acc:  50.78; ppl:  7.03; xent: 1.95; lr: 0.00010; 10612/4435 tok/s;    158 sec
[2021-04-25 01:11:23,127 INFO] Step 850/50000; acc:  50.85; ppl:  6.92; xent: 1.94; lr: 0.00010; 9648/4103 tok/s;    168 sec
[2021-04-25 01:11:32,628 INFO] Step 900/50000; acc:  51.66; ppl:  6.57; xent: 1.88; lr: 0.00010; 10337/4501 tok/s;    178 sec
[2021-04-25 01:11:42,116 INFO] Step 950/50000; acc:  52.14; ppl:  6.47; xent: 1.87; lr: 0.00010; 10863/4461 tok/s;    187 sec
[2021-04-25 01:11:51,841 INFO] Step 1000/50000; acc:  52.34; ppl:  6.32; xent: 1.84; lr: 0.00010; 10393/4447 tok/s;    197 sec
[2021-04-25 01:12:01,698 INFO] Step 1050/50000; acc:  53.05; ppl:  6.16; xent: 1.82; lr: 0.00010; 10499/4295 tok/s;    207 sec
[2021-04-25 01:12:12,283 INFO] Step 1100/50000; acc:  53.46; ppl:  5.92; xent: 1.78; lr: 0.00010; 9677/4061 tok/s;    218 sec
[2021-04-25 01:12:21,891 INFO] Step 1150/50000; acc:  53.99; ppl:  5.82; xent: 1.76; lr: 0.00010; 10430/4382 tok/s;    227 sec
[2021-04-25 01:12:32,062 INFO] Step 1200/50000; acc:  54.02; ppl:  5.78; xent: 1.75; lr: 0.00010; 10273/4263 tok/s;    237 sec
[2021-04-25 01:12:41,571 INFO] Step 1250/50000; acc:  54.40; ppl:  5.69; xent: 1.74; lr: 0.00010; 10455/4420 tok/s;    247 sec
[2021-04-25 01:12:51,736 INFO] Step 1300/50000; acc:  54.53; ppl:  5.57; xent: 1.72; lr: 0.00010; 10191/4228 tok/s;    257 sec
[2021-04-25 01:13:01,011 INFO] Step 1350/50000; acc:  55.16; ppl:  5.41; xent: 1.69; lr: 0.00010; 10707/4486 tok/s;    266 sec
[2021-04-25 01:13:10,688 INFO] Step 1400/50000; acc:  55.39; ppl:  5.39; xent: 1.69; lr: 0.00010; 10643/4523 tok/s;    276 sec
[2021-04-25 01:13:18,518 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 01:13:20,351 INFO] Step 1450/50000; acc:  55.90; ppl:  5.22; xent: 1.65; lr: 0.00010; 10453/4351 tok/s;    286 sec
[2021-04-25 01:13:30,233 INFO] Step 1500/50000; acc:  55.91; ppl:  5.23; xent: 1.65; lr: 0.00010; 10390/4358 tok/s;    296 sec
[2021-04-25 01:13:40,569 INFO] Step 1550/50000; acc:  55.80; ppl:  5.24; xent: 1.66; lr: 0.00010; 9861/4149 tok/s;    306 sec
[2021-04-25 01:13:50,603 INFO] Step 1600/50000; acc:  56.41; ppl:  5.10; xent: 1.63; lr: 0.00010; 9872/4325 tok/s;    316 sec
[2021-04-25 01:14:00,599 INFO] Step 1650/50000; acc:  56.37; ppl:  5.09; xent: 1.63; lr: 0.00010; 10431/4321 tok/s;    326 sec
[2021-04-25 01:14:09,955 INFO] Step 1700/50000; acc:  56.56; ppl:  5.05; xent: 1.62; lr: 0.00010; 10461/4526 tok/s;    335 sec
[2021-04-25 01:14:19,743 INFO] Step 1750/50000; acc:  56.69; ppl:  5.03; xent: 1.62; lr: 0.00010; 10582/4437 tok/s;    345 sec
[2021-04-25 01:14:29,743 INFO] Step 1800/50000; acc:  57.27; ppl:  4.90; xent: 1.59; lr: 0.00010; 10272/4107 tok/s;    355 sec
[2021-04-25 01:14:40,314 INFO] Step 1850/50000; acc:  57.31; ppl:  4.88; xent: 1.59; lr: 0.00010; 9731/4139 tok/s;    366 sec
[2021-04-25 01:14:49,654 INFO] Step 1900/50000; acc:  57.75; ppl:  4.78; xent: 1.57; lr: 0.00010; 10818/4501 tok/s;    375 sec
[2021-04-25 01:14:59,667 INFO] Step 1950/50000; acc:  57.52; ppl:  4.85; xent: 1.58; lr: 0.00010; 10055/4314 tok/s;    385 sec
[2021-04-25 01:15:09,985 INFO] Step 2000/50000; acc:  57.72; ppl:  4.86; xent: 1.58; lr: 0.00010; 10113/4123 tok/s;    395 sec
[2021-04-25 01:15:19,890 INFO] Step 2050/50000; acc:  58.53; ppl:  4.66; xent: 1.54; lr: 0.00010; 10096/4272 tok/s;    405 sec
[2021-04-25 01:15:29,281 INFO] Step 2100/50000; acc:  58.58; ppl:  4.71; xent: 1.55; lr: 0.00010; 10934/4504 tok/s;    415 sec
[2021-04-25 01:15:39,133 INFO] Step 2150/50000; acc:  58.90; ppl:  4.59; xent: 1.52; lr: 0.00010; 9981/4358 tok/s;    425 sec
[2021-04-25 01:15:44,159 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 01:15:48,893 INFO] Step 2200/50000; acc:  58.48; ppl:  4.65; xent: 1.54; lr: 0.00010; 10685/4413 tok/s;    434 sec
[2021-04-25 01:15:59,217 INFO] Step 2250/50000; acc:  58.99; ppl:  4.60; xent: 1.53; lr: 0.00010; 9817/4138 tok/s;    445 sec
[2021-04-25 01:16:09,429 INFO] Step 2300/50000; acc:  59.00; ppl:  4.55; xent: 1.52; lr: 0.00010; 10000/4259 tok/s;    455 sec
[2021-04-25 01:16:19,323 INFO] Step 2350/50000; acc:  59.30; ppl:  4.54; xent: 1.51; lr: 0.00010; 10232/4271 tok/s;    465 sec
[2021-04-25 01:16:29,111 INFO] Step 2400/50000; acc:  59.00; ppl:  4.54; xent: 1.51; lr: 0.00010; 10196/4418 tok/s;    474 sec
[2021-04-25 01:16:38,702 INFO] Step 2450/50000; acc:  59.40; ppl:  4.53; xent: 1.51; lr: 0.00010; 10819/4451 tok/s;    484 sec
[2021-04-25 01:16:48,077 INFO] Step 2500/50000; acc:  59.75; ppl:  4.43; xent: 1.49; lr: 0.00010; 10590/4552 tok/s;    493 sec
[2021-04-25 01:16:58,675 INFO] Step 2550/50000; acc:  59.66; ppl:  4.46; xent: 1.49; lr: 0.00010; 9844/3960 tok/s;    504 sec
[2021-04-25 01:17:08,666 INFO] Step 2600/50000; acc:  59.47; ppl:  4.46; xent: 1.49; lr: 0.00010; 10180/4301 tok/s;    514 sec
[2021-04-25 01:17:18,856 INFO] Step 2650/50000; acc:  59.93; ppl:  4.37; xent: 1.48; lr: 0.00010; 10040/4235 tok/s;    524 sec
[2021-04-25 01:17:28,376 INFO] Step 2700/50000; acc:  59.96; ppl:  4.43; xent: 1.49; lr: 0.00010; 10672/4433 tok/s;    534 sec
[2021-04-25 01:17:38,496 INFO] Step 2750/50000; acc:  59.92; ppl:  4.41; xent: 1.48; lr: 0.00010; 9903/4221 tok/s;    544 sec
[2021-04-25 01:17:48,772 INFO] Step 2800/50000; acc:  60.32; ppl:  4.34; xent: 1.47; lr: 0.00010; 10202/4095 tok/s;    554 sec
[2021-04-25 01:17:57,882 INFO] Step 2850/50000; acc:  60.35; ppl:  4.30; xent: 1.46; lr: 0.00010; 10854/4714 tok/s;    563 sec
[2021-04-25 01:18:07,912 INFO] Step 2900/50000; acc:  60.79; ppl:  4.25; xent: 1.45; lr: 0.00010; 10270/4298 tok/s;    573 sec
[2021-04-25 01:18:10,088 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 01:18:17,390 INFO] Step 2950/50000; acc:  60.45; ppl:  4.26; xent: 1.45; lr: 0.00010; 10451/4547 tok/s;    583 sec
[2021-04-25 01:18:27,364 INFO] Step 3000/50000; acc:  60.68; ppl:  4.30; xent: 1.46; lr: 0.00010; 10421/4218 tok/s;    593 sec
[2021-04-25 01:18:37,745 INFO] Step 3050/50000; acc:  60.79; ppl:  4.23; xent: 1.44; lr: 0.00010; 9663/4220 tok/s;    603 sec
[2021-04-25 01:18:47,628 INFO] Step 3100/50000; acc:  60.57; ppl:  4.26; xent: 1.45; lr: 0.00010; 10365/4336 tok/s;    613 sec
[2021-04-25 01:18:57,337 INFO] Step 3150/50000; acc:  60.86; ppl:  4.26; xent: 1.45; lr: 0.00010; 10440/4386 tok/s;    623 sec
[2021-04-25 01:19:06,806 INFO] Step 3200/50000; acc:  61.10; ppl:  4.14; xent: 1.42; lr: 0.00010; 10519/4491 tok/s;    632 sec
[2021-04-25 01:19:17,194 INFO] Step 3250/50000; acc:  60.94; ppl:  4.26; xent: 1.45; lr: 0.00010; 10223/4108 tok/s;    643 sec
[2021-04-25 01:19:26,911 INFO] Step 3300/50000; acc:  61.43; ppl:  4.12; xent: 1.42; lr: 0.00010; 10114/4363 tok/s;    652 sec
[2021-04-25 01:19:36,821 INFO] Step 3350/50000; acc:  61.17; ppl:  4.20; xent: 1.43; lr: 0.00010; 10447/4283 tok/s;    662 sec
[2021-04-25 01:19:46,701 INFO] Step 3400/50000; acc:  60.94; ppl:  4.19; xent: 1.43; lr: 0.00010; 10315/4345 tok/s;    672 sec
[2021-04-25 01:19:56,497 INFO] Step 3450/50000; acc:  61.34; ppl:  4.18; xent: 1.43; lr: 0.00010; 10401/4363 tok/s;    682 sec
[2021-04-25 01:20:06,657 INFO] Step 3500/50000; acc:  61.85; ppl:  4.08; xent: 1.41; lr: 0.00010; 10050/4209 tok/s;    692 sec
[2021-04-25 01:20:16,031 INFO] Step 3550/50000; acc:  61.61; ppl:  4.08; xent: 1.41; lr: 0.00010; 10678/4470 tok/s;    701 sec
[2021-04-25 01:20:25,923 INFO] Step 3600/50000; acc:  61.49; ppl:  4.10; xent: 1.41; lr: 0.00010; 10516/4360 tok/s;    711 sec
[2021-04-25 01:20:28,931 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 01:20:35,887 INFO] Step 3650/50000; acc:  61.95; ppl:  4.05; xent: 1.40; lr: 0.00010; 10018/4315 tok/s;    721 sec
[2021-04-25 01:20:45,795 INFO] Step 3700/50000; acc:  62.10; ppl:  4.04; xent: 1.40; lr: 0.00010; 10434/4291 tok/s;    731 sec
[2021-04-25 01:20:55,442 INFO] Step 3750/50000; acc:  61.65; ppl:  4.08; xent: 1.41; lr: 0.00010; 10180/4445 tok/s;    741 sec
[2021-04-25 01:21:05,797 INFO] Step 3800/50000; acc:  62.01; ppl:  4.03; xent: 1.39; lr: 0.00010; 9985/4205 tok/s;    751 sec
[2021-04-25 01:21:15,363 INFO] Step 3850/50000; acc:  61.94; ppl:  4.05; xent: 1.40; lr: 0.00010; 10532/4477 tok/s;    761 sec
[2021-04-25 01:21:25,007 INFO] Step 3900/50000; acc:  62.02; ppl:  4.03; xent: 1.39; lr: 0.00010; 10597/4444 tok/s;    770 sec
[2021-04-25 01:21:34,646 INFO] Step 3950/50000; acc:  62.19; ppl:  3.99; xent: 1.38; lr: 0.00010; 10591/4448 tok/s;    780 sec
[2021-04-25 01:21:44,604 INFO] Step 4000/50000; acc:  62.19; ppl:  4.00; xent: 1.39; lr: 0.00010; 10220/4178 tok/s;    790 sec
[2021-04-25 01:21:54,980 INFO] Step 4050/50000; acc:  61.99; ppl:  4.04; xent: 1.40; lr: 0.00010; 10042/4159 tok/s;    800 sec
[2021-04-25 01:22:04,397 INFO] Step 4100/50000; acc:  62.66; ppl:  3.88; xent: 1.36; lr: 0.00010; 10434/4482 tok/s;    810 sec
[2021-04-25 01:22:14,349 INFO] Step 4150/50000; acc:  61.77; ppl:  4.05; xent: 1.40; lr: 0.00010; 10379/4299 tok/s;    820 sec
[2021-04-25 01:22:24,402 INFO] Step 4200/50000; acc:  62.28; ppl:  4.00; xent: 1.39; lr: 0.00010; 10133/4279 tok/s;    830 sec
[2021-04-25 01:22:34,706 INFO] Step 4250/50000; acc:  62.58; ppl:  3.91; xent: 1.36; lr: 0.00010; 9980/4169 tok/s;    840 sec
[2021-04-25 01:22:44,022 INFO] Step 4300/50000; acc:  63.05; ppl:  3.88; xent: 1.36; lr: 0.00010; 10834/4469 tok/s;    849 sec
[2021-04-25 01:22:53,735 INFO] Step 4350/50000; acc:  62.91; ppl:  3.87; xent: 1.35; lr: 0.00010; 10299/4406 tok/s;    859 sec
[2021-04-25 01:22:54,220 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 01:23:03,945 INFO] Step 4400/50000; acc:  62.59; ppl:  3.93; xent: 1.37; lr: 0.00010; 10234/4245 tok/s;    869 sec
[2021-04-25 01:23:13,592 INFO] Step 4450/50000; acc:  62.91; ppl:  3.89; xent: 1.36; lr: 0.00010; 10356/4371 tok/s;    879 sec
[2021-04-25 01:23:24,116 INFO] Step 4500/50000; acc:  62.50; ppl:  3.91; xent: 1.36; lr: 0.00010; 9688/4145 tok/s;    889 sec
[2021-04-25 01:23:33,765 INFO] Step 4550/50000; acc:  62.91; ppl:  3.87; xent: 1.35; lr: 0.00010; 10244/4422 tok/s;    899 sec
[2021-04-25 01:23:43,491 INFO] Step 4600/50000; acc:  62.91; ppl:  3.90; xent: 1.36; lr: 0.00010; 10670/4411 tok/s;    909 sec
[2021-04-25 01:23:53,242 INFO] Step 4650/50000; acc:  62.94; ppl:  3.86; xent: 1.35; lr: 0.00010; 10275/4394 tok/s;    919 sec
[2021-04-25 01:24:03,236 INFO] Step 4700/50000; acc:  62.79; ppl:  3.90; xent: 1.36; lr: 0.00010; 10425/4260 tok/s;    929 sec
[2021-04-25 01:24:13,828 INFO] Step 4750/50000; acc:  63.03; ppl:  3.83; xent: 1.34; lr: 0.00010; 9674/4049 tok/s;    939 sec
[2021-04-25 01:24:23,226 INFO] Step 4800/50000; acc:  63.50; ppl:  3.80; xent: 1.33; lr: 0.00010; 10624/4476 tok/s;    949 sec
[2021-04-25 01:24:33,532 INFO] Step 4850/50000; acc:  62.75; ppl:  3.89; xent: 1.36; lr: 0.00010; 10144/4217 tok/s;    959 sec
[2021-04-25 01:24:42,956 INFO] Step 4900/50000; acc:  63.24; ppl:  3.82; xent: 1.34; lr: 0.00010; 10411/4439 tok/s;    968 sec
[2021-04-25 01:24:53,258 INFO] Step 4950/50000; acc:  63.24; ppl:  3.82; xent: 1.34; lr: 0.00010; 10082/4173 tok/s;    979 sec
[2021-04-25 01:25:02,627 INFO] Step 5000/50000; acc:  63.27; ppl:  3.80; xent: 1.33; lr: 0.00010; 10882/4495 tok/s;    988 sec
[2021-04-25 01:25:02,628 INFO] valid's transforms: TransformPipe()
[2021-04-25 01:25:02,630 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/strict/valid.txt, align=None)...
[2021-04-25 01:25:10,618 INFO] Validation perplexity: 3.62367
[2021-04-25 01:25:10,618 INFO] Validation accuracy: 64.6613
[2021-04-25 01:25:10,620 INFO] Saving checkpoint ../models/group1_params/strict_ops/model_step_5000.pt
[2021-04-25 01:25:20,698 INFO] Step 5050/50000; acc:  63.64; ppl:  3.77; xent: 1.33; lr: 0.00010; 5604/2384 tok/s;   1006 sec
[2021-04-25 01:25:27,960 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 01:25:30,294 INFO] Step 5100/50000; acc:  63.39; ppl:  3.77; xent: 1.33; lr: 0.00010; 10620/4424 tok/s;   1016 sec
[2021-04-25 01:25:40,083 INFO] Step 5150/50000; acc:  63.76; ppl:  3.74; xent: 1.32; lr: 0.00010; 10221/4366 tok/s;   1025 sec
[2021-04-25 01:25:50,671 INFO] Step 5200/50000; acc:  63.28; ppl:  3.83; xent: 1.34; lr: 0.00010; 9863/4111 tok/s;   1036 sec
[2021-04-25 01:26:00,669 INFO] Step 5250/50000; acc:  63.48; ppl:  3.73; xent: 1.32; lr: 0.00010; 9872/4310 tok/s;   1046 sec
[2021-04-25 01:26:10,501 INFO] Step 5300/50000; acc:  63.47; ppl:  3.79; xent: 1.33; lr: 0.00010; 10485/4370 tok/s;   1056 sec
[2021-04-25 01:26:19,840 INFO] Step 5350/50000; acc:  63.70; ppl:  3.75; xent: 1.32; lr: 0.00010; 10548/4508 tok/s;   1065 sec
[2021-04-25 01:26:29,696 INFO] Step 5400/50000; acc:  63.17; ppl:  3.81; xent: 1.34; lr: 0.00010; 10585/4444 tok/s;   1075 sec
[2021-04-25 01:26:39,679 INFO] Step 5450/50000; acc:  63.81; ppl:  3.72; xent: 1.31; lr: 0.00010; 10219/4111 tok/s;   1085 sec
[2021-04-25 01:26:50,159 INFO] Step 5500/50000; acc:  63.62; ppl:  3.73; xent: 1.32; lr: 0.00010; 9850/4180 tok/s;   1096 sec
[2021-04-25 01:26:59,909 INFO] Step 5550/50000; acc:  63.97; ppl:  3.70; xent: 1.31; lr: 0.00010; 10387/4368 tok/s;   1105 sec
[2021-04-25 01:27:09,456 INFO] Step 5600/50000; acc:  63.80; ppl:  3.71; xent: 1.31; lr: 0.00010; 10521/4440 tok/s;   1115 sec
[2021-04-25 01:27:19,754 INFO] Step 5650/50000; acc:  63.68; ppl:  3.77; xent: 1.33; lr: 0.00010; 10115/4122 tok/s;   1125 sec
[2021-04-25 01:27:29,546 INFO] Step 5700/50000; acc:  64.36; ppl:  3.62; xent: 1.29; lr: 0.00010; 10060/4351 tok/s;   1135 sec
[2021-04-25 01:27:39,016 INFO] Step 5750/50000; acc:  64.01; ppl:  3.70; xent: 1.31; lr: 0.00010; 10879/4499 tok/s;   1144 sec
[2021-04-25 01:27:49,020 INFO] Step 5800/50000; acc:  64.20; ppl:  3.65; xent: 1.29; lr: 0.00010; 10131/4263 tok/s;   1154 sec
[2021-04-25 01:27:53,612 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 01:27:58,790 INFO] Step 5850/50000; acc:  63.87; ppl:  3.69; xent: 1.31; lr: 0.00010; 10477/4404 tok/s;   1164 sec
[2021-04-25 01:28:09,303 INFO] Step 5900/50000; acc:  64.18; ppl:  3.69; xent: 1.31; lr: 0.00010; 9736/4077 tok/s;   1175 sec
[2021-04-25 01:28:19,244 INFO] Step 5950/50000; acc:  64.45; ppl:  3.61; xent: 1.28; lr: 0.00010; 9990/4312 tok/s;   1185 sec
[2021-04-25 01:28:29,409 INFO] Step 6000/50000; acc:  63.93; ppl:  3.69; xent: 1.31; lr: 0.00010; 10221/4232 tok/s;   1195 sec
[2021-04-25 01:28:39,286 INFO] Step 6050/50000; acc:  64.24; ppl:  3.68; xent: 1.30; lr: 0.00010; 10055/4362 tok/s;   1205 sec
[2021-04-25 01:28:48,796 INFO] Step 6100/50000; acc:  64.25; ppl:  3.65; xent: 1.29; lr: 0.00010; 10771/4505 tok/s;   1214 sec
[2021-04-25 01:28:58,120 INFO] Step 6150/50000; acc:  64.47; ppl:  3.62; xent: 1.29; lr: 0.00010; 10749/4501 tok/s;   1223 sec
[2021-04-25 01:29:08,842 INFO] Step 6200/50000; acc:  64.21; ppl:  3.66; xent: 1.30; lr: 0.00010; 9778/3989 tok/s;   1234 sec
[2021-04-25 01:29:18,655 INFO] Step 6250/50000; acc:  64.00; ppl:  3.66; xent: 1.30; lr: 0.00010; 10278/4363 tok/s;   1244 sec
[2021-04-25 01:29:28,636 INFO] Step 6300/50000; acc:  64.74; ppl:  3.60; xent: 1.28; lr: 0.00010; 10312/4288 tok/s;   1254 sec
[2021-04-25 01:29:38,236 INFO] Step 6350/50000; acc:  64.45; ppl:  3.66; xent: 1.30; lr: 0.00010; 10584/4397 tok/s;   1264 sec
[2021-04-25 01:29:48,311 INFO] Step 6400/50000; acc:  64.49; ppl:  3.61; xent: 1.28; lr: 0.00010; 9933/4251 tok/s;   1274 sec
[2021-04-25 01:29:58,325 INFO] Step 6450/50000; acc:  64.55; ppl:  3.58; xent: 1.28; lr: 0.00010; 10456/4196 tok/s;   1284 sec
[2021-04-25 01:30:07,596 INFO] Step 6500/50000; acc:  64.73; ppl:  3.55; xent: 1.27; lr: 0.00010; 10512/4608 tok/s;   1293 sec
[2021-04-25 01:30:17,756 INFO] Step 6550/50000; acc:  64.81; ppl:  3.59; xent: 1.28; lr: 0.00010; 10195/4284 tok/s;   1303 sec
[2021-04-25 01:30:19,572 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 01:30:27,357 INFO] Step 6600/50000; acc:  64.71; ppl:  3.60; xent: 1.28; lr: 0.00010; 10603/4467 tok/s;   1313 sec
[2021-04-25 01:30:37,373 INFO] Step 6650/50000; acc:  64.90; ppl:  3.59; xent: 1.28; lr: 0.00010; 10184/4228 tok/s;   1323 sec
[2021-04-25 01:30:47,686 INFO] Step 6700/50000; acc:  64.66; ppl:  3.58; xent: 1.28; lr: 0.00010; 9805/4225 tok/s;   1333 sec
[2021-04-25 01:30:57,413 INFO] Step 6750/50000; acc:  64.79; ppl:  3.57; xent: 1.27; lr: 0.00010; 10281/4408 tok/s;   1343 sec
[2021-04-25 01:31:07,405 INFO] Step 6800/50000; acc:  64.48; ppl:  3.61; xent: 1.28; lr: 0.00010; 10402/4286 tok/s;   1353 sec
[2021-04-25 01:31:16,831 INFO] Step 6850/50000; acc:  64.88; ppl:  3.51; xent: 1.26; lr: 0.00010; 10527/4499 tok/s;   1362 sec
[2021-04-25 01:31:27,184 INFO] Step 6900/50000; acc:  64.51; ppl:  3.60; xent: 1.28; lr: 0.00010; 10131/4143 tok/s;   1373 sec
[2021-04-25 01:31:36,774 INFO] Step 6950/50000; acc:  65.22; ppl:  3.50; xent: 1.25; lr: 0.00010; 10322/4372 tok/s;   1382 sec
[2021-04-25 01:31:46,931 INFO] Step 7000/50000; acc:  64.30; ppl:  3.59; xent: 1.28; lr: 0.00010; 10243/4220 tok/s;   1392 sec
[2021-04-25 01:31:56,584 INFO] Step 7050/50000; acc:  64.85; ppl:  3.53; xent: 1.26; lr: 0.00010; 10482/4457 tok/s;   1402 sec
[2021-04-25 01:32:06,542 INFO] Step 7100/50000; acc:  64.57; ppl:  3.60; xent: 1.28; lr: 0.00010; 10276/4282 tok/s;   1412 sec
[2021-04-25 01:32:16,757 INFO] Step 7150/50000; acc:  65.54; ppl:  3.47; xent: 1.25; lr: 0.00010; 10023/4176 tok/s;   1422 sec
[2021-04-25 01:32:25,935 INFO] Step 7200/50000; acc:  65.31; ppl:  3.48; xent: 1.25; lr: 0.00010; 10872/4550 tok/s;   1431 sec
[2021-04-25 01:32:36,134 INFO] Step 7250/50000; acc:  64.76; ppl:  3.52; xent: 1.26; lr: 0.00010; 10175/4225 tok/s;   1442 sec
[2021-04-25 01:32:38,713 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 01:32:45,914 INFO] Step 7300/50000; acc:  65.23; ppl:  3.50; xent: 1.25; lr: 0.00010; 10099/4427 tok/s;   1451 sec
[2021-04-25 01:32:55,568 INFO] Step 7350/50000; acc:  65.26; ppl:  3.49; xent: 1.25; lr: 0.00010; 10722/4388 tok/s;   1461 sec
[2021-04-25 01:33:05,585 INFO] Step 7400/50000; acc:  64.97; ppl:  3.56; xent: 1.27; lr: 0.00010; 10074/4273 tok/s;   1471 sec
[2021-04-25 01:33:15,744 INFO] Step 7450/50000; acc:  65.40; ppl:  3.47; xent: 1.24; lr: 0.00010; 10013/4294 tok/s;   1481 sec
[2021-04-25 01:33:25,325 INFO] Step 7500/50000; acc:  65.06; ppl:  3.53; xent: 1.26; lr: 0.00010; 10608/4451 tok/s;   1491 sec
[2021-04-25 01:33:34,899 INFO] Step 7550/50000; acc:  65.40; ppl:  3.47; xent: 1.24; lr: 0.00010; 10388/4465 tok/s;   1500 sec
[2021-04-25 01:33:44,777 INFO] Step 7600/50000; acc:  65.05; ppl:  3.49; xent: 1.25; lr: 0.00010; 10634/4352 tok/s;   1510 sec
[2021-04-25 01:33:55,135 INFO] Step 7650/50000; acc:  65.11; ppl:  3.50; xent: 1.25; lr: 0.00010; 9766/4056 tok/s;   1521 sec
[2021-04-25 01:34:05,015 INFO] Step 7700/50000; acc:  65.21; ppl:  3.50; xent: 1.25; lr: 0.00010; 10417/4306 tok/s;   1530 sec
[2021-04-25 01:34:14,770 INFO] Step 7750/50000; acc:  65.94; ppl:  3.39; xent: 1.22; lr: 0.00010; 10145/4369 tok/s;   1540 sec
[2021-04-25 01:34:24,657 INFO] Step 7800/50000; acc:  64.86; ppl:  3.53; xent: 1.26; lr: 0.00010; 10502/4324 tok/s;   1550 sec
[2021-04-25 01:34:34,625 INFO] Step 7850/50000; acc:  65.49; ppl:  3.49; xent: 1.25; lr: 0.00010; 10148/4305 tok/s;   1560 sec
[2021-04-25 01:34:45,096 INFO] Step 7900/50000; acc:  65.39; ppl:  3.43; xent: 1.23; lr: 0.00010; 9889/4100 tok/s;   1570 sec
[2021-04-25 01:34:54,145 INFO] Step 7950/50000; acc:  65.68; ppl:  3.41; xent: 1.23; lr: 0.00010; 11127/4619 tok/s;   1580 sec
[2021-04-25 01:35:04,053 INFO] Step 8000/50000; acc:  65.69; ppl:  3.42; xent: 1.23; lr: 0.00010; 10102/4307 tok/s;   1589 sec
[2021-04-25 01:35:04,061 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 01:35:14,199 INFO] Step 8050/50000; acc:  65.42; ppl:  3.45; xent: 1.24; lr: 0.00010; 10275/4275 tok/s;   1600 sec
[2021-04-25 01:35:23,749 INFO] Step 8100/50000; acc:  65.73; ppl:  3.43; xent: 1.23; lr: 0.00010; 10324/4407 tok/s;   1609 sec
[2021-04-25 01:35:34,348 INFO] Step 8150/50000; acc:  65.46; ppl:  3.45; xent: 1.24; lr: 0.00010; 9644/4091 tok/s;   1620 sec
[2021-04-25 01:35:44,151 INFO] Step 8200/50000; acc:  65.56; ppl:  3.46; xent: 1.24; lr: 0.00010; 10384/4384 tok/s;   1630 sec
[2021-04-25 01:35:53,803 INFO] Step 8250/50000; acc:  65.69; ppl:  3.45; xent: 1.24; lr: 0.00010; 10571/4444 tok/s;   1639 sec
[2021-04-25 01:36:03,676 INFO] Step 8300/50000; acc:  65.52; ppl:  3.44; xent: 1.24; lr: 0.00010; 10237/4348 tok/s;   1649 sec
[2021-04-25 01:36:13,606 INFO] Step 8350/50000; acc:  65.73; ppl:  3.42; xent: 1.23; lr: 0.00010; 10207/4250 tok/s;   1659 sec
[2021-04-25 01:36:24,378 INFO] Step 8400/50000; acc:  65.54; ppl:  3.43; xent: 1.23; lr: 0.00010; 9765/4006 tok/s;   1670 sec
[2021-04-25 01:36:33,470 INFO] Step 8450/50000; acc:  66.25; ppl:  3.36; xent: 1.21; lr: 0.00010; 10897/4609 tok/s;   1679 sec
[2021-04-25 01:36:43,889 INFO] Step 8500/50000; acc:  65.37; ppl:  3.44; xent: 1.24; lr: 0.00010; 9939/4202 tok/s;   1689 sec
[2021-04-25 01:36:53,259 INFO] Step 8550/50000; acc:  66.21; ppl:  3.39; xent: 1.22; lr: 0.00010; 10546/4424 tok/s;   1699 sec
[2021-04-25 01:37:03,806 INFO] Step 8600/50000; acc:  65.68; ppl:  3.42; xent: 1.23; lr: 0.00010; 9913/4092 tok/s;   1709 sec
[2021-04-25 01:37:13,116 INFO] Step 8650/50000; acc:  66.08; ppl:  3.37; xent: 1.21; lr: 0.00010; 10811/4559 tok/s;   1718 sec
[2021-04-25 01:37:22,705 INFO] Step 8700/50000; acc:  66.26; ppl:  3.36; xent: 1.21; lr: 0.00010; 10647/4440 tok/s;   1728 sec
[2021-04-25 01:37:29,670 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 01:37:32,485 INFO] Step 8750/50000; acc:  66.12; ppl:  3.35; xent: 1.21; lr: 0.00010; 10440/4335 tok/s;   1738 sec
[2021-04-25 01:37:42,364 INFO] Step 8800/50000; acc:  66.43; ppl:  3.36; xent: 1.21; lr: 0.00010; 10117/4352 tok/s;   1748 sec
[2021-04-25 01:37:52,950 INFO] Step 8850/50000; acc:  65.59; ppl:  3.44; xent: 1.24; lr: 0.00010; 9840/4111 tok/s;   1758 sec
[2021-04-25 01:38:02,651 INFO] Step 8900/50000; acc:  66.31; ppl:  3.33; xent: 1.20; lr: 0.00010; 10055/4443 tok/s;   1768 sec
[2021-04-25 01:38:12,485 INFO] Step 8950/50000; acc:  66.04; ppl:  3.40; xent: 1.22; lr: 0.00010; 10499/4364 tok/s;   1778 sec
[2021-04-25 01:38:22,034 INFO] Step 9000/50000; acc:  65.97; ppl:  3.38; xent: 1.22; lr: 0.00010; 10607/4432 tok/s;   1787 sec
[2021-04-25 01:38:31,840 INFO] Step 9050/50000; acc:  65.59; ppl:  3.41; xent: 1.23; lr: 0.00010; 10482/4426 tok/s;   1797 sec
[2021-04-25 01:38:41,883 INFO] Step 9100/50000; acc:  66.09; ppl:  3.34; xent: 1.21; lr: 0.00010; 10237/4123 tok/s;   1807 sec
[2021-04-25 01:38:52,117 INFO] Step 9150/50000; acc:  65.95; ppl:  3.34; xent: 1.21; lr: 0.00010; 9822/4245 tok/s;   1817 sec
[2021-04-25 01:39:02,299 INFO] Step 9200/50000; acc:  66.19; ppl:  3.36; xent: 1.21; lr: 0.00010; 10215/4233 tok/s;   1828 sec
[2021-04-25 01:39:11,812 INFO] Step 9250/50000; acc:  66.21; ppl:  3.33; xent: 1.20; lr: 0.00010; 10486/4431 tok/s;   1837 sec
[2021-04-25 01:39:22,079 INFO] Step 9300/50000; acc:  66.16; ppl:  3.38; xent: 1.22; lr: 0.00010; 10037/4143 tok/s;   1847 sec
[2021-04-25 01:39:31,789 INFO] Step 9350/50000; acc:  66.63; ppl:  3.28; xent: 1.19; lr: 0.00010; 10225/4361 tok/s;   1857 sec
[2021-04-25 01:39:41,448 INFO] Step 9400/50000; acc:  66.16; ppl:  3.35; xent: 1.21; lr: 0.00010; 10732/4460 tok/s;   1867 sec
[2021-04-25 01:39:51,099 INFO] Step 9450/50000; acc:  66.51; ppl:  3.28; xent: 1.19; lr: 0.00010; 10406/4411 tok/s;   1876 sec
[2021-04-25 01:39:55,386 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 01:40:00,790 INFO] Step 9500/50000; acc:  66.26; ppl:  3.35; xent: 1.21; lr: 0.00010; 10616/4430 tok/s;   1886 sec
[2021-04-25 01:40:11,287 INFO] Step 9550/50000; acc:  66.41; ppl:  3.34; xent: 1.21; lr: 0.00010; 9755/4056 tok/s;   1897 sec
[2021-04-25 01:40:21,593 INFO] Step 9600/50000; acc:  66.68; ppl:  3.30; xent: 1.19; lr: 0.00010; 9643/4187 tok/s;   1907 sec
[2021-04-25 01:40:31,481 INFO] Step 9650/50000; acc:  66.39; ppl:  3.33; xent: 1.20; lr: 0.00010; 10482/4371 tok/s;   1917 sec
[2021-04-25 01:40:41,121 INFO] Step 9700/50000; acc:  66.47; ppl:  3.32; xent: 1.20; lr: 0.00010; 10159/4442 tok/s;   1926 sec
[2021-04-25 01:40:50,678 INFO] Step 9750/50000; acc:  66.38; ppl:  3.31; xent: 1.20; lr: 0.00010; 10755/4461 tok/s;   1936 sec
[2021-04-25 01:41:00,484 INFO] Step 9800/50000; acc:  66.40; ppl:  3.33; xent: 1.20; lr: 0.00010; 10513/4307 tok/s;   1946 sec
[2021-04-25 01:41:10,979 INFO] Step 9850/50000; acc:  66.29; ppl:  3.31; xent: 1.20; lr: 0.00010; 9812/4079 tok/s;   1956 sec
[2021-04-25 01:41:20,781 INFO] Step 9900/50000; acc:  66.08; ppl:  3.32; xent: 1.20; lr: 0.00010; 10366/4326 tok/s;   1966 sec
[2021-04-25 01:41:30,446 INFO] Step 9950/50000; acc:  66.76; ppl:  3.24; xent: 1.18; lr: 0.00010; 10375/4419 tok/s;   1976 sec
[2021-04-25 01:41:40,331 INFO] Step 10000/50000; acc:  66.11; ppl:  3.36; xent: 1.21; lr: 0.00010; 10554/4307 tok/s;   1986 sec
[2021-04-25 01:41:40,336 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/strict/valid.txt, align=None)...
[2021-04-25 01:41:48,304 INFO] Validation perplexity: 3.20322
[2021-04-25 01:41:48,304 INFO] Validation accuracy: 67.3673
[2021-04-25 01:41:48,306 INFO] Saving checkpoint ../models/group1_params/strict_ops/model_step_10000.pt
[2021-04-25 01:41:58,823 INFO] Step 10050/50000; acc:  66.74; ppl:  3.27; xent: 1.18; lr: 0.00010; 5387/2326 tok/s;   2004 sec
[2021-04-25 01:42:08,633 INFO] Step 10100/50000; acc:  66.57; ppl:  3.26; xent: 1.18; lr: 0.00010; 10538/4279 tok/s;   2014 sec
[2021-04-25 01:42:17,885 INFO] Step 10150/50000; acc:  67.00; ppl:  3.24; xent: 1.18; lr: 0.00010; 10620/4590 tok/s;   2023 sec
[2021-04-25 01:42:28,073 INFO] Step 10200/50000; acc:  66.37; ppl:  3.30; xent: 1.19; lr: 0.00010; 10230/4288 tok/s;   2033 sec
[2021-04-25 01:42:29,514 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 01:42:37,690 INFO] Step 10250/50000; acc:  66.89; ppl:  3.27; xent: 1.18; lr: 0.00010; 10507/4449 tok/s;   2043 sec
[2021-04-25 01:42:47,715 INFO] Step 10300/50000; acc:  66.66; ppl:  3.30; xent: 1.19; lr: 0.00010; 10218/4236 tok/s;   2053 sec
[2021-04-25 01:42:58,024 INFO] Step 10350/50000; acc:  66.68; ppl:  3.27; xent: 1.19; lr: 0.00010; 9824/4214 tok/s;   2063 sec
[2021-04-25 01:43:07,523 INFO] Step 10400/50000; acc:  66.62; ppl:  3.27; xent: 1.19; lr: 0.00010; 10496/4504 tok/s;   2073 sec
[2021-04-25 01:43:17,493 INFO] Step 10450/50000; acc:  66.41; ppl:  3.31; xent: 1.20; lr: 0.00010; 10416/4301 tok/s;   2083 sec
[2021-04-25 01:43:26,834 INFO] Step 10500/50000; acc:  67.21; ppl:  3.21; xent: 1.17; lr: 0.00010; 10503/4550 tok/s;   2092 sec
[2021-04-25 01:43:37,181 INFO] Step 10550/50000; acc:  66.44; ppl:  3.30; xent: 1.19; lr: 0.00010; 10166/4115 tok/s;   2103 sec
[2021-04-25 01:43:47,015 INFO] Step 10600/50000; acc:  66.87; ppl:  3.25; xent: 1.18; lr: 0.00010; 10324/4310 tok/s;   2112 sec
[2021-04-25 01:43:57,052 INFO] Step 10650/50000; acc:  66.53; ppl:  3.27; xent: 1.19; lr: 0.00010; 10212/4260 tok/s;   2122 sec
[2021-04-25 01:44:06,648 INFO] Step 10700/50000; acc:  66.95; ppl:  3.26; xent: 1.18; lr: 0.00010; 10619/4467 tok/s;   2132 sec
[2021-04-25 01:44:16,449 INFO] Step 10750/50000; acc:  66.92; ppl:  3.26; xent: 1.18; lr: 0.00010; 10163/4345 tok/s;   2142 sec
[2021-04-25 01:44:26,812 INFO] Step 10800/50000; acc:  67.17; ppl:  3.22; xent: 1.17; lr: 0.00010; 10158/4135 tok/s;   2152 sec
[2021-04-25 01:44:35,859 INFO] Step 10850/50000; acc:  67.16; ppl:  3.19; xent: 1.16; lr: 0.00010; 10966/4574 tok/s;   2161 sec
[2021-04-25 01:44:45,898 INFO] Step 10900/50000; acc:  66.95; ppl:  3.24; xent: 1.18; lr: 0.00010; 10224/4325 tok/s;   2171 sec
[2021-04-25 01:44:48,074 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 01:44:55,790 INFO] Step 10950/50000; acc:  66.98; ppl:  3.22; xent: 1.17; lr: 0.00010; 10043/4360 tok/s;   2181 sec
[2021-04-25 01:45:05,471 INFO] Step 11000/50000; acc:  66.99; ppl:  3.22; xent: 1.17; lr: 0.00010; 10763/4397 tok/s;   2191 sec
[2021-04-25 01:45:15,460 INFO] Step 11050/50000; acc:  67.00; ppl:  3.26; xent: 1.18; lr: 0.00010; 9996/4309 tok/s;   2201 sec
[2021-04-25 01:45:25,831 INFO] Step 11100/50000; acc:  67.16; ppl:  3.23; xent: 1.17; lr: 0.00010; 9878/4194 tok/s;   2211 sec
[2021-04-25 01:45:35,252 INFO] Step 11150/50000; acc:  66.88; ppl:  3.24; xent: 1.17; lr: 0.00010; 10797/4512 tok/s;   2221 sec
[2021-04-25 01:45:44,807 INFO] Step 11200/50000; acc:  67.17; ppl:  3.20; xent: 1.16; lr: 0.00010; 10385/4463 tok/s;   2230 sec
[2021-04-25 01:45:54,599 INFO] Step 11250/50000; acc:  66.85; ppl:  3.24; xent: 1.18; lr: 0.00010; 10727/4392 tok/s;   2240 sec
[2021-04-25 01:46:04,846 INFO] Step 11300/50000; acc:  67.11; ppl:  3.19; xent: 1.16; lr: 0.00010; 9741/4109 tok/s;   2250 sec
[2021-04-25 01:46:14,815 INFO] Step 11350/50000; acc:  66.71; ppl:  3.24; xent: 1.18; lr: 0.00010; 10338/4249 tok/s;   2260 sec
[2021-04-25 01:46:24,633 INFO] Step 11400/50000; acc:  67.28; ppl:  3.18; xent: 1.16; lr: 0.00010; 10378/4395 tok/s;   2270 sec
[2021-04-25 01:46:34,511 INFO] Step 11450/50000; acc:  66.94; ppl:  3.23; xent: 1.17; lr: 0.00010; 10337/4293 tok/s;   2280 sec
[2021-04-25 01:46:44,801 INFO] Step 11500/50000; acc:  67.23; ppl:  3.23; xent: 1.17; lr: 0.00010; 9910/4201 tok/s;   2290 sec
[2021-04-25 01:46:54,685 INFO] Step 11550/50000; acc:  67.69; ppl:  3.12; xent: 1.14; lr: 0.00010; 10195/4260 tok/s;   2300 sec
[2021-04-25 01:47:04,045 INFO] Step 11600/50000; acc:  67.08; ppl:  3.21; xent: 1.17; lr: 0.00010; 11057/4545 tok/s;   2309 sec
[2021-04-25 01:47:13,438 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 01:47:13,902 INFO] Step 11650/50000; acc:  67.55; ppl:  3.14; xent: 1.14; lr: 0.00010; 10108/4302 tok/s;   2319 sec
[2021-04-25 01:47:24,098 INFO] Step 11700/50000; acc:  67.40; ppl:  3.19; xent: 1.16; lr: 0.00010; 10091/4278 tok/s;   2329 sec
[2021-04-25 01:47:33,784 INFO] Step 11750/50000; acc:  67.23; ppl:  3.21; xent: 1.17; lr: 0.00010; 10248/4306 tok/s;   2339 sec
[2021-04-25 01:47:44,368 INFO] Step 11800/50000; acc:  67.36; ppl:  3.19; xent: 1.16; lr: 0.00010; 9720/4143 tok/s;   2350 sec
[2021-04-25 01:47:54,207 INFO] Step 11850/50000; acc:  67.09; ppl:  3.20; xent: 1.16; lr: 0.00010; 10247/4352 tok/s;   2360 sec
[2021-04-25 01:48:03,997 INFO] Step 11900/50000; acc:  67.12; ppl:  3.20; xent: 1.16; lr: 0.00010; 10493/4407 tok/s;   2369 sec
[2021-04-25 01:48:13,788 INFO] Step 11950/50000; acc:  67.08; ppl:  3.20; xent: 1.16; lr: 0.00010; 10348/4380 tok/s;   2379 sec
[2021-04-25 01:48:23,459 INFO] Step 12000/50000; acc:  67.58; ppl:  3.17; xent: 1.15; lr: 0.00010; 10474/4278 tok/s;   2389 sec
[2021-04-25 01:48:34,464 INFO] Step 12050/50000; acc:  67.06; ppl:  3.22; xent: 1.17; lr: 0.00010; 9536/3974 tok/s;   2400 sec
[2021-04-25 01:48:43,376 INFO] Step 12100/50000; acc:  68.02; ppl:  3.08; xent: 1.13; lr: 0.00010; 10957/4702 tok/s;   2409 sec
[2021-04-25 01:48:53,735 INFO] Step 12150/50000; acc:  67.04; ppl:  3.21; xent: 1.17; lr: 0.00010; 10037/4169 tok/s;   2419 sec
[2021-04-25 01:49:03,544 INFO] Step 12200/50000; acc:  67.48; ppl:  3.18; xent: 1.16; lr: 0.00010; 10353/4331 tok/s;   2429 sec
[2021-04-25 01:49:13,768 INFO] Step 12250/50000; acc:  67.57; ppl:  3.15; xent: 1.15; lr: 0.00010; 10067/4159 tok/s;   2439 sec
[2021-04-25 01:49:23,206 INFO] Step 12300/50000; acc:  67.85; ppl:  3.14; xent: 1.14; lr: 0.00010; 10726/4508 tok/s;   2449 sec
[2021-04-25 01:49:32,646 INFO] Step 12350/50000; acc:  67.90; ppl:  3.11; xent: 1.13; lr: 0.00010; 10536/4471 tok/s;   2458 sec
[2021-04-25 01:49:39,427 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 01:49:42,665 INFO] Step 12400/50000; acc:  67.45; ppl:  3.17; xent: 1.16; lr: 0.00010; 10457/4320 tok/s;   2468 sec
[2021-04-25 01:49:52,627 INFO] Step 12450/50000; acc:  68.00; ppl:  3.12; xent: 1.14; lr: 0.00010; 10004/4283 tok/s;   2478 sec
[2021-04-25 01:50:03,130 INFO] Step 12500/50000; acc:  67.14; ppl:  3.20; xent: 1.16; lr: 0.00010; 9786/4134 tok/s;   2489 sec
[2021-04-25 01:50:13,008 INFO] Step 12550/50000; acc:  67.90; ppl:  3.11; xent: 1.14; lr: 0.00010; 9962/4360 tok/s;   2498 sec
[2021-04-25 01:50:22,946 INFO] Step 12600/50000; acc:  67.38; ppl:  3.17; xent: 1.16; lr: 0.00010; 10437/4313 tok/s;   2508 sec
[2021-04-25 01:50:32,600 INFO] Step 12650/50000; acc:  67.66; ppl:  3.14; xent: 1.15; lr: 0.00010; 10400/4428 tok/s;   2518 sec
[2021-04-25 01:50:42,468 INFO] Step 12700/50000; acc:  66.99; ppl:  3.18; xent: 1.16; lr: 0.00010; 10499/4353 tok/s;   2528 sec
[2021-04-25 01:50:52,571 INFO] Step 12750/50000; acc:  67.80; ppl:  3.12; xent: 1.14; lr: 0.00010; 10162/4093 tok/s;   2538 sec
[2021-04-25 01:51:02,708 INFO] Step 12800/50000; acc:  67.63; ppl:  3.10; xent: 1.13; lr: 0.00010; 9904/4283 tok/s;   2548 sec
[2021-04-25 01:51:13,065 INFO] Step 12850/50000; acc:  67.67; ppl:  3.15; xent: 1.15; lr: 0.00010; 10033/4175 tok/s;   2558 sec
[2021-04-25 01:51:22,343 INFO] Step 12900/50000; acc:  67.95; ppl:  3.09; xent: 1.13; lr: 0.00010; 10596/4509 tok/s;   2568 sec
[2021-04-25 01:51:32,540 INFO] Step 12950/50000; acc:  67.50; ppl:  3.16; xent: 1.15; lr: 0.00010; 10157/4182 tok/s;   2578 sec
[2021-04-25 01:51:42,593 INFO] Step 13000/50000; acc:  67.95; ppl:  3.10; xent: 1.13; lr: 0.00010; 10139/4235 tok/s;   2588 sec
[2021-04-25 01:51:52,099 INFO] Step 13050/50000; acc:  67.82; ppl:  3.11; xent: 1.13; lr: 0.00010; 10721/4505 tok/s;   2597 sec
[2021-04-25 01:52:01,826 INFO] Step 13100/50000; acc:  68.04; ppl:  3.09; xent: 1.13; lr: 0.00010; 10413/4415 tok/s;   2607 sec
[2021-04-25 01:52:05,627 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 01:52:11,401 INFO] Step 13150/50000; acc:  67.88; ppl:  3.10; xent: 1.13; lr: 0.00010; 10459/4451 tok/s;   2617 sec
[2021-04-25 01:52:22,012 INFO] Step 13200/50000; acc:  67.66; ppl:  3.16; xent: 1.15; lr: 0.00010; 9900/4023 tok/s;   2627 sec
[2021-04-25 01:52:32,355 INFO] Step 13250/50000; acc:  68.10; ppl:  3.08; xent: 1.13; lr: 0.00010; 9568/4197 tok/s;   2638 sec
[2021-04-25 01:52:42,144 INFO] Step 13300/50000; acc:  67.84; ppl:  3.12; xent: 1.14; lr: 0.00010; 10470/4390 tok/s;   2648 sec
[2021-04-25 01:52:51,946 INFO] Step 13350/50000; acc:  67.82; ppl:  3.11; xent: 1.13; lr: 0.00010; 10062/4335 tok/s;   2657 sec
[2021-04-25 01:53:01,646 INFO] Step 13400/50000; acc:  67.78; ppl:  3.13; xent: 1.14; lr: 0.00010; 10662/4418 tok/s;   2667 sec
[2021-04-25 01:53:11,317 INFO] Step 13450/50000; acc:  67.81; ppl:  3.10; xent: 1.13; lr: 0.00010; 10575/4374 tok/s;   2677 sec
[2021-04-25 01:53:21,754 INFO] Step 13500/50000; acc:  67.70; ppl:  3.11; xent: 1.14; lr: 0.00010; 9904/4102 tok/s;   2687 sec
[2021-04-25 01:53:31,502 INFO] Step 13550/50000; acc:  67.76; ppl:  3.12; xent: 1.14; lr: 0.00010; 10447/4352 tok/s;   2697 sec
[2021-04-25 01:53:41,071 INFO] Step 13600/50000; acc:  68.31; ppl:  3.05; xent: 1.12; lr: 0.00010; 10471/4467 tok/s;   2706 sec
[2021-04-25 01:53:50,970 INFO] Step 13650/50000; acc:  67.83; ppl:  3.14; xent: 1.14; lr: 0.00010; 10510/4318 tok/s;   2716 sec
[2021-04-25 01:54:01,084 INFO] Step 13700/50000; acc:  68.26; ppl:  3.05; xent: 1.12; lr: 0.00010; 9735/4217 tok/s;   2726 sec
[2021-04-25 01:54:10,685 INFO] Step 13750/50000; acc:  68.22; ppl:  3.06; xent: 1.12; lr: 0.00010; 10776/4379 tok/s;   2736 sec
[2021-04-25 01:54:20,063 INFO] Step 13800/50000; acc:  68.17; ppl:  3.08; xent: 1.13; lr: 0.00010; 10796/4554 tok/s;   2745 sec
[2021-04-25 01:54:30,044 INFO] Step 13850/50000; acc:  67.97; ppl:  3.11; xent: 1.13; lr: 0.00010; 10264/4330 tok/s;   2755 sec
[2021-04-25 01:54:31,185 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 01:54:39,933 INFO] Step 13900/50000; acc:  68.16; ppl:  3.06; xent: 1.12; lr: 0.00010; 10300/4358 tok/s;   2765 sec
[2021-04-25 01:54:49,760 INFO] Step 13950/50000; acc:  68.18; ppl:  3.08; xent: 1.12; lr: 0.00010; 10144/4289 tok/s;   2775 sec
[2021-04-25 01:55:00,309 INFO] Step 14000/50000; acc:  68.17; ppl:  3.08; xent: 1.13; lr: 0.00010; 9844/4162 tok/s;   2786 sec
[2021-04-25 01:55:09,775 INFO] Step 14050/50000; acc:  68.01; ppl:  3.08; xent: 1.12; lr: 0.00010; 10511/4509 tok/s;   2795 sec
[2021-04-25 01:55:19,458 INFO] Step 14100/50000; acc:  67.99; ppl:  3.10; xent: 1.13; lr: 0.00010; 10581/4355 tok/s;   2805 sec
[2021-04-25 01:55:29,090 INFO] Step 14150/50000; acc:  68.44; ppl:  3.03; xent: 1.11; lr: 0.00010; 10275/4475 tok/s;   2814 sec
[2021-04-25 01:55:39,220 INFO] Step 14200/50000; acc:  67.84; ppl:  3.11; xent: 1.14; lr: 0.00010; 10445/4181 tok/s;   2825 sec
[2021-04-25 01:55:49,434 INFO] Step 14250/50000; acc:  68.11; ppl:  3.06; xent: 1.12; lr: 0.00010; 9859/4199 tok/s;   2835 sec
[2021-04-25 01:55:59,539 INFO] Step 14300/50000; acc:  67.95; ppl:  3.08; xent: 1.13; lr: 0.00010; 10182/4210 tok/s;   2845 sec
[2021-04-25 01:56:09,295 INFO] Step 14350/50000; acc:  68.25; ppl:  3.08; xent: 1.13; lr: 0.00010; 10461/4396 tok/s;   2855 sec
[2021-04-25 01:56:18,964 INFO] Step 14400/50000; acc:  68.25; ppl:  3.07; xent: 1.12; lr: 0.00010; 10297/4408 tok/s;   2864 sec
[2021-04-25 01:56:29,395 INFO] Step 14450/50000; acc:  68.19; ppl:  3.04; xent: 1.11; lr: 0.00010; 10079/4091 tok/s;   2875 sec
[2021-04-25 01:56:38,335 INFO] Step 14500/50000; acc:  68.93; ppl:  2.98; xent: 1.09; lr: 0.00010; 10920/4633 tok/s;   2884 sec
[2021-04-25 01:56:48,189 INFO] Step 14550/50000; acc:  68.04; ppl:  3.07; xent: 1.12; lr: 0.00010; 10452/4393 tok/s;   2894 sec
[2021-04-25 01:56:50,126 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 01:56:58,369 INFO] Step 14600/50000; acc:  68.14; ppl:  3.07; xent: 1.12; lr: 0.00010; 10028/4281 tok/s;   2904 sec
[2021-04-25 01:57:08,100 INFO] Step 14650/50000; acc:  68.52; ppl:  3.04; xent: 1.11; lr: 0.00010; 10537/4351 tok/s;   2913 sec
[2021-04-25 01:57:18,348 INFO] Step 14700/50000; acc:  67.85; ppl:  3.08; xent: 1.13; lr: 0.00010; 9828/4191 tok/s;   2924 sec
[2021-04-25 01:57:28,166 INFO] Step 14750/50000; acc:  68.49; ppl:  3.02; xent: 1.11; lr: 0.00010; 10176/4405 tok/s;   2934 sec
[2021-04-25 01:57:37,631 INFO] Step 14800/50000; acc:  68.19; ppl:  3.07; xent: 1.12; lr: 0.00010; 11019/4529 tok/s;   2943 sec
[2021-04-25 01:57:47,251 INFO] Step 14850/50000; acc:  68.25; ppl:  3.04; xent: 1.11; lr: 0.00010; 10259/4428 tok/s;   2953 sec
[2021-04-25 01:57:57,144 INFO] Step 14900/50000; acc:  68.04; ppl:  3.07; xent: 1.12; lr: 0.00010; 10515/4326 tok/s;   2963 sec
[2021-04-25 01:58:07,402 INFO] Step 14950/50000; acc:  68.79; ppl:  3.01; xent: 1.10; lr: 0.00010; 9775/4121 tok/s;   2973 sec
[2021-04-25 01:58:17,503 INFO] Step 15000/50000; acc:  68.32; ppl:  3.05; xent: 1.11; lr: 0.00010; 10271/4217 tok/s;   2983 sec
[2021-04-25 01:58:17,506 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/strict/valid.txt, align=None)...
[2021-04-25 01:58:25,474 INFO] Validation perplexity: 3.02127
[2021-04-25 01:58:25,474 INFO] Validation accuracy: 68.8037
[2021-04-25 01:58:25,476 INFO] Saving checkpoint ../models/group1_params/strict_ops/model_step_15000.pt
[2021-04-25 01:58:35,677 INFO] Step 15050/50000; acc:  68.73; ppl:  3.00; xent: 1.10; lr: 0.00010; 5554/2361 tok/s;   3001 sec
[2021-04-25 01:58:45,524 INFO] Step 15100/50000; acc:  67.86; ppl:  3.07; xent: 1.12; lr: 0.00010; 10439/4346 tok/s;   3011 sec
[2021-04-25 01:58:55,805 INFO] Step 15150/50000; acc:  68.59; ppl:  3.05; xent: 1.11; lr: 0.00010; 9925/4145 tok/s;   3021 sec
[2021-04-25 01:59:05,617 INFO] Step 15200/50000; acc:  68.77; ppl:  2.96; xent: 1.09; lr: 0.00010; 10264/4321 tok/s;   3031 sec
[2021-04-25 01:59:14,942 INFO] Step 15250/50000; acc:  68.34; ppl:  3.04; xent: 1.11; lr: 0.00010; 11071/4593 tok/s;   3040 sec
[2021-04-25 01:59:24,030 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 01:59:24,848 INFO] Step 15300/50000; acc:  68.91; ppl:  2.97; xent: 1.09; lr: 0.00010; 9920/4270 tok/s;   3050 sec
[2021-04-25 01:59:34,772 INFO] Step 15350/50000; acc:  68.45; ppl:  3.02; xent: 1.11; lr: 0.00010; 10409/4345 tok/s;   3060 sec
[2021-04-25 01:59:44,654 INFO] Step 15400/50000; acc:  68.29; ppl:  3.07; xent: 1.12; lr: 0.00010; 10332/4280 tok/s;   3070 sec
[2021-04-25 01:59:54,878 INFO] Step 15450/50000; acc:  68.42; ppl:  3.00; xent: 1.10; lr: 0.00010; 9889/4273 tok/s;   3080 sec
[2021-04-25 02:00:04,886 INFO] Step 15500/50000; acc:  68.23; ppl:  3.04; xent: 1.11; lr: 0.00010; 10154/4287 tok/s;   3090 sec
[2021-04-25 02:00:14,358 INFO] Step 15550/50000; acc:  68.65; ppl:  3.01; xent: 1.10; lr: 0.00010; 10548/4503 tok/s;   3100 sec
[2021-04-25 02:00:24,211 INFO] Step 15600/50000; acc:  68.19; ppl:  3.04; xent: 1.11; lr: 0.00010; 10580/4370 tok/s;   3110 sec
[2021-04-25 02:00:34,038 INFO] Step 15650/50000; acc:  68.82; ppl:  3.00; xent: 1.10; lr: 0.00010; 10252/4217 tok/s;   3119 sec
[2021-04-25 02:00:44,818 INFO] Step 15700/50000; acc:  68.31; ppl:  3.04; xent: 1.11; lr: 0.00010; 9634/4047 tok/s;   3130 sec
[2021-04-25 02:00:53,855 INFO] Step 15750/50000; acc:  68.96; ppl:  2.95; xent: 1.08; lr: 0.00010; 10860/4675 tok/s;   3139 sec
[2021-04-25 02:01:04,119 INFO] Step 15800/50000; acc:  68.22; ppl:  3.04; xent: 1.11; lr: 0.00010; 10190/4213 tok/s;   3149 sec
[2021-04-25 02:01:13,960 INFO] Step 15850/50000; acc:  68.81; ppl:  3.01; xent: 1.10; lr: 0.00010; 10224/4301 tok/s;   3159 sec
[2021-04-25 02:01:24,365 INFO] Step 15900/50000; acc:  68.54; ppl:  2.98; xent: 1.09; lr: 0.00010; 9936/4088 tok/s;   3170 sec
[2021-04-25 02:01:33,639 INFO] Step 15950/50000; acc:  69.05; ppl:  2.97; xent: 1.09; lr: 0.00010; 10942/4562 tok/s;   3179 sec
[2021-04-25 02:01:43,018 INFO] Step 16000/50000; acc:  69.03; ppl:  2.96; xent: 1.08; lr: 0.00010; 10595/4531 tok/s;   3188 sec
[2021-04-25 02:01:49,522 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 02:01:53,267 INFO] Step 16050/50000; acc:  68.63; ppl:  3.02; xent: 1.11; lr: 0.00010; 10212/4211 tok/s;   3199 sec
[2021-04-25 02:02:03,071 INFO] Step 16100/50000; acc:  69.16; ppl:  2.95; xent: 1.08; lr: 0.00010; 10038/4330 tok/s;   3208 sec
[2021-04-25 02:02:13,551 INFO] Step 16150/50000; acc:  68.44; ppl:  3.03; xent: 1.11; lr: 0.00010; 9823/4157 tok/s;   3219 sec
[2021-04-25 02:02:23,513 INFO] Step 16200/50000; acc:  68.74; ppl:  2.99; xent: 1.10; lr: 0.00010; 10159/4330 tok/s;   3229 sec
[2021-04-25 02:02:33,396 INFO] Step 16250/50000; acc:  68.65; ppl:  2.99; xent: 1.09; lr: 0.00010; 10322/4338 tok/s;   3239 sec
[2021-04-25 02:02:43,102 INFO] Step 16300/50000; acc:  68.52; ppl:  3.00; xent: 1.10; lr: 0.00010; 10421/4404 tok/s;   3248 sec
[2021-04-25 02:02:52,761 INFO] Step 16350/50000; acc:  68.56; ppl:  3.00; xent: 1.10; lr: 0.00010; 10449/4434 tok/s;   3258 sec
[2021-04-25 02:03:03,072 INFO] Step 16400/50000; acc:  68.60; ppl:  3.01; xent: 1.10; lr: 0.00010; 10239/4056 tok/s;   3268 sec
[2021-04-25 02:03:13,198 INFO] Step 16450/50000; acc:  68.83; ppl:  2.96; xent: 1.09; lr: 0.00010; 9849/4243 tok/s;   3279 sec
[2021-04-25 02:03:23,493 INFO] Step 16500/50000; acc:  68.78; ppl:  2.98; xent: 1.09; lr: 0.00010; 9981/4187 tok/s;   3289 sec
[2021-04-25 02:03:32,785 INFO] Step 16550/50000; acc:  69.22; ppl:  2.95; xent: 1.08; lr: 0.00010; 10658/4527 tok/s;   3298 sec
[2021-04-25 02:03:43,046 INFO] Step 16600/50000; acc:  68.74; ppl:  3.00; xent: 1.10; lr: 0.00010; 10154/4140 tok/s;   3308 sec
[2021-04-25 02:03:53,065 INFO] Step 16650/50000; acc:  68.85; ppl:  2.95; xent: 1.08; lr: 0.00010; 10082/4253 tok/s;   3318 sec
[2021-04-25 02:04:02,462 INFO] Step 16700/50000; acc:  68.90; ppl:  2.96; xent: 1.09; lr: 0.00010; 10906/4562 tok/s;   3328 sec
[2021-04-25 02:04:12,169 INFO] Step 16750/50000; acc:  69.37; ppl:  2.94; xent: 1.08; lr: 0.00010; 10438/4419 tok/s;   3338 sec
[2021-04-25 02:04:15,697 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 02:04:21,883 INFO] Step 16800/50000; acc:  68.93; ppl:  2.97; xent: 1.09; lr: 0.00010; 10326/4406 tok/s;   3347 sec
[2021-04-25 02:04:32,348 INFO] Step 16850/50000; acc:  68.82; ppl:  3.00; xent: 1.10; lr: 0.00010; 9986/4107 tok/s;   3358 sec
[2021-04-25 02:04:42,584 INFO] Step 16900/50000; acc:  69.31; ppl:  2.93; xent: 1.07; lr: 0.00010; 9542/4196 tok/s;   3368 sec
[2021-04-25 02:04:52,327 INFO] Step 16950/50000; acc:  68.85; ppl:  2.98; xent: 1.09; lr: 0.00010; 10554/4411 tok/s;   3378 sec
[2021-04-25 02:05:02,293 INFO] Step 17000/50000; acc:  68.78; ppl:  2.98; xent: 1.09; lr: 0.00010; 10180/4277 tok/s;   3388 sec
[2021-04-25 02:05:12,077 INFO] Step 17050/50000; acc:  68.99; ppl:  2.96; xent: 1.08; lr: 0.00010; 10399/4427 tok/s;   3397 sec
[2021-04-25 02:05:21,644 INFO] Step 17100/50000; acc:  68.83; ppl:  2.96; xent: 1.09; lr: 0.00010; 10782/4394 tok/s;   3407 sec
[2021-04-25 02:05:31,922 INFO] Step 17150/50000; acc:  69.08; ppl:  2.94; xent: 1.08; lr: 0.00010; 9783/4112 tok/s;   3417 sec
[2021-04-25 02:05:41,831 INFO] Step 17200/50000; acc:  68.58; ppl:  3.01; xent: 1.10; lr: 0.00010; 10548/4329 tok/s;   3427 sec
[2021-04-25 02:05:51,455 INFO] Step 17250/50000; acc:  69.46; ppl:  2.91; xent: 1.07; lr: 0.00010; 10371/4423 tok/s;   3437 sec
[2021-04-25 02:06:01,342 INFO] Step 17300/50000; acc:  69.08; ppl:  2.96; xent: 1.09; lr: 0.00010; 10392/4328 tok/s;   3447 sec
[2021-04-25 02:06:11,406 INFO] Step 17350/50000; acc:  69.45; ppl:  2.91; xent: 1.07; lr: 0.00010; 9861/4226 tok/s;   3457 sec
[2021-04-25 02:06:21,165 INFO] Step 17400/50000; acc:  68.98; ppl:  2.93; xent: 1.07; lr: 0.00010; 10653/4321 tok/s;   3467 sec
[2021-04-25 02:06:30,590 INFO] Step 17450/50000; acc:  69.31; ppl:  2.93; xent: 1.07; lr: 0.00010; 10664/4538 tok/s;   3476 sec
[2021-04-25 02:06:40,507 INFO] Step 17500/50000; acc:  68.81; ppl:  2.97; xent: 1.09; lr: 0.00010; 10383/4350 tok/s;   3486 sec
[2021-04-25 02:06:41,352 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 02:06:50,420 INFO] Step 17550/50000; acc:  69.12; ppl:  2.93; xent: 1.08; lr: 0.00010; 10298/4316 tok/s;   3496 sec
[2021-04-25 02:07:00,265 INFO] Step 17600/50000; acc:  69.30; ppl:  2.94; xent: 1.08; lr: 0.00010; 10103/4306 tok/s;   3506 sec
[2021-04-25 02:07:10,647 INFO] Step 17650/50000; acc:  69.16; ppl:  2.94; xent: 1.08; lr: 0.00010; 9990/4233 tok/s;   3516 sec
[2021-04-25 02:07:20,136 INFO] Step 17700/50000; acc:  69.27; ppl:  2.92; xent: 1.07; lr: 0.00010; 10327/4475 tok/s;   3526 sec
[2021-04-25 02:07:29,866 INFO] Step 17750/50000; acc:  68.96; ppl:  2.95; xent: 1.08; lr: 0.00010; 10567/4359 tok/s;   3535 sec
[2021-04-25 02:07:39,553 INFO] Step 17800/50000; acc:  69.24; ppl:  2.93; xent: 1.07; lr: 0.00010; 10513/4473 tok/s;   3545 sec
[2021-04-25 02:07:49,765 INFO] Step 17850/50000; acc:  68.87; ppl:  2.97; xent: 1.09; lr: 0.00010; 10202/4164 tok/s;   3555 sec
[2021-04-25 02:07:59,811 INFO] Step 17900/50000; acc:  68.92; ppl:  2.94; xent: 1.08; lr: 0.00010; 10099/4251 tok/s;   3565 sec
[2021-04-25 02:08:09,378 INFO] Step 17950/50000; acc:  69.51; ppl:  2.90; xent: 1.06; lr: 0.00010; 10451/4394 tok/s;   3575 sec
[2021-04-25 02:08:19,452 INFO] Step 18000/50000; acc:  68.95; ppl:  2.96; xent: 1.09; lr: 0.00010; 10406/4290 tok/s;   3585 sec
[2021-04-25 02:08:29,104 INFO] Step 18050/50000; acc:  69.39; ppl:  2.93; xent: 1.07; lr: 0.00010; 10279/4434 tok/s;   3594 sec
[2021-04-25 02:08:39,398 INFO] Step 18100/50000; acc:  69.78; ppl:  2.87; xent: 1.05; lr: 0.00010; 10097/4107 tok/s;   3605 sec
[2021-04-25 02:08:48,608 INFO] Step 18150/50000; acc:  69.85; ppl:  2.85; xent: 1.05; lr: 0.00010; 10684/4524 tok/s;   3614 sec
[2021-04-25 02:08:58,662 INFO] Step 18200/50000; acc:  68.96; ppl:  2.95; xent: 1.08; lr: 0.00010; 10300/4345 tok/s;   3624 sec
[2021-04-25 02:09:00,079 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 02:09:08,572 INFO] Step 18250/50000; acc:  69.13; ppl:  2.92; xent: 1.07; lr: 0.00010; 10203/4345 tok/s;   3634 sec
[2021-04-25 02:09:18,304 INFO] Step 18300/50000; acc:  69.28; ppl:  2.92; xent: 1.07; lr: 0.00010; 10601/4381 tok/s;   3644 sec
[2021-04-25 02:09:28,401 INFO] Step 18350/50000; acc:  69.37; ppl:  2.92; xent: 1.07; lr: 0.00010; 9974/4223 tok/s;   3654 sec
[2021-04-25 02:09:38,286 INFO] Step 18400/50000; acc:  69.45; ppl:  2.91; xent: 1.07; lr: 0.00010; 10110/4419 tok/s;   3664 sec
[2021-04-25 02:09:47,911 INFO] Step 18450/50000; acc:  69.24; ppl:  2.93; xent: 1.08; lr: 0.00010; 10804/4416 tok/s;   3673 sec
[2021-04-25 02:09:57,170 INFO] Step 18500/50000; acc:  69.18; ppl:  2.89; xent: 1.06; lr: 0.00010; 10529/4592 tok/s;   3683 sec
[2021-04-25 02:10:07,101 INFO] Step 18550/50000; acc:  69.02; ppl:  2.93; xent: 1.08; lr: 0.00010; 10507/4294 tok/s;   3692 sec
[2021-04-25 02:10:17,664 INFO] Step 18600/50000; acc:  69.32; ppl:  2.92; xent: 1.07; lr: 0.00010; 9730/4044 tok/s;   3703 sec
[2021-04-25 02:10:27,638 INFO] Step 18650/50000; acc:  69.19; ppl:  2.92; xent: 1.07; lr: 0.00010; 10246/4273 tok/s;   3713 sec
[2021-04-25 02:10:37,442 INFO] Step 18700/50000; acc:  69.49; ppl:  2.89; xent: 1.06; lr: 0.00010; 10367/4360 tok/s;   3723 sec
[2021-04-25 02:10:47,163 INFO] Step 18750/50000; acc:  69.45; ppl:  2.91; xent: 1.07; lr: 0.00010; 10310/4355 tok/s;   3733 sec
[2021-04-25 02:10:57,585 INFO] Step 18800/50000; acc:  69.30; ppl:  2.93; xent: 1.07; lr: 0.00010; 10060/4138 tok/s;   3743 sec
[2021-04-25 02:11:07,185 INFO] Step 18850/50000; acc:  69.84; ppl:  2.84; xent: 1.04; lr: 0.00010; 10435/4368 tok/s;   3753 sec
[2021-04-25 02:11:16,672 INFO] Step 18900/50000; acc:  69.40; ppl:  2.90; xent: 1.06; lr: 0.00010; 10747/4535 tok/s;   3762 sec
[2021-04-25 02:11:25,292 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 02:11:26,343 INFO] Step 18950/50000; acc:  69.97; ppl:  2.85; xent: 1.05; lr: 0.00010; 10238/4376 tok/s;   3772 sec
[2021-04-25 02:11:36,266 INFO] Step 19000/50000; acc:  69.49; ppl:  2.90; xent: 1.07; lr: 0.00010; 10459/4352 tok/s;   3782 sec
[2021-04-25 02:11:46,395 INFO] Step 19050/50000; acc:  69.40; ppl:  2.93; xent: 1.07; lr: 0.00010; 9991/4189 tok/s;   3792 sec
[2021-04-25 02:11:56,745 INFO] Step 19100/50000; acc:  69.58; ppl:  2.88; xent: 1.06; lr: 0.00010; 9840/4235 tok/s;   3802 sec
[2021-04-25 02:12:06,532 INFO] Step 19150/50000; acc:  69.26; ppl:  2.91; xent: 1.07; lr: 0.00010; 10388/4359 tok/s;   3812 sec
[2021-04-25 02:12:16,158 INFO] Step 19200/50000; acc:  69.64; ppl:  2.89; xent: 1.06; lr: 0.00010; 10367/4432 tok/s;   3822 sec
[2021-04-25 02:12:25,879 INFO] Step 19250/50000; acc:  68.93; ppl:  2.94; xent: 1.08; lr: 0.00010; 10723/4452 tok/s;   3831 sec
[2021-04-25 02:12:35,470 INFO] Step 19300/50000; acc:  69.91; ppl:  2.83; xent: 1.04; lr: 0.00010; 10349/4278 tok/s;   3841 sec
[2021-04-25 02:12:46,390 INFO] Step 19350/50000; acc:  69.35; ppl:  2.90; xent: 1.07; lr: 0.00010; 9541/4022 tok/s;   3852 sec
[2021-04-25 02:12:55,688 INFO] Step 19400/50000; acc:  70.01; ppl:  2.87; xent: 1.05; lr: 0.00010; 10855/4557 tok/s;   3861 sec
[2021-04-25 02:13:05,734 INFO] Step 19450/50000; acc:  69.22; ppl:  2.90; xent: 1.06; lr: 0.00010; 10239/4278 tok/s;   3871 sec
[2021-04-25 02:13:15,655 INFO] Step 19500/50000; acc:  69.60; ppl:  2.90; xent: 1.07; lr: 0.00010; 10227/4279 tok/s;   3881 sec
[2021-04-25 02:13:25,724 INFO] Step 19550/50000; acc:  69.95; ppl:  2.83; xent: 1.04; lr: 0.00010; 9976/4222 tok/s;   3891 sec
[2021-04-25 02:13:35,253 INFO] Step 19600/50000; acc:  69.23; ppl:  2.90; xent: 1.07; lr: 0.00010; 10954/4471 tok/s;   3901 sec
[2021-04-25 02:13:44,866 INFO] Step 19650/50000; acc:  70.13; ppl:  2.82; xent: 1.04; lr: 0.00010; 10282/4419 tok/s;   3910 sec
[2021-04-25 02:13:50,793 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 02:13:54,911 INFO] Step 19700/50000; acc:  69.59; ppl:  2.88; xent: 1.06; lr: 0.00010; 10294/4291 tok/s;   3920 sec
[2021-04-25 02:14:04,813 INFO] Step 19750/50000; acc:  69.94; ppl:  2.87; xent: 1.05; lr: 0.00010; 10035/4280 tok/s;   3930 sec
[2021-04-25 02:14:15,162 INFO] Step 19800/50000; acc:  69.31; ppl:  2.90; xent: 1.07; lr: 0.00010; 9984/4192 tok/s;   3941 sec
[2021-04-25 02:14:25,107 INFO] Step 19850/50000; acc:  69.82; ppl:  2.86; xent: 1.05; lr: 0.00010; 10079/4299 tok/s;   3950 sec
[2021-04-25 02:14:35,073 INFO] Step 19900/50000; acc:  69.25; ppl:  2.90; xent: 1.06; lr: 0.00010; 10317/4331 tok/s;   3960 sec
[2021-04-25 02:14:44,814 INFO] Step 19950/50000; acc:  69.63; ppl:  2.87; xent: 1.05; lr: 0.00010; 10385/4370 tok/s;   3970 sec
[2021-04-25 02:14:54,312 INFO] Step 20000/50000; acc:  69.53; ppl:  2.86; xent: 1.05; lr: 0.00010; 10604/4487 tok/s;   3980 sec
[2021-04-25 02:14:54,316 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/strict/valid.txt, align=None)...
[2021-04-25 02:15:02,301 INFO] Validation perplexity: 2.93815
[2021-04-25 02:15:02,301 INFO] Validation accuracy: 69.2615
[2021-04-25 02:15:02,304 INFO] Saving checkpoint ../models/group1_params/strict_ops/model_step_20000.pt
[2021-04-25 02:15:13,357 INFO] Step 20050/50000; acc:  69.38; ppl:  2.90; xent: 1.06; lr: 0.00010; 5541/2234 tok/s;   3999 sec
[2021-04-25 02:15:23,070 INFO] Step 20100/50000; acc:  69.76; ppl:  2.83; xent: 1.04; lr: 0.00010; 10119/4390 tok/s;   4008 sec
[2021-04-25 02:15:33,423 INFO] Step 20150/50000; acc:  69.81; ppl:  2.86; xent: 1.05; lr: 0.00010; 9967/4165 tok/s;   4019 sec
[2021-04-25 02:15:43,086 INFO] Step 20200/50000; acc:  69.74; ppl:  2.87; xent: 1.06; lr: 0.00010; 10541/4379 tok/s;   4028 sec
[2021-04-25 02:15:53,182 INFO] Step 20250/50000; acc:  69.69; ppl:  2.87; xent: 1.05; lr: 0.00010; 10151/4230 tok/s;   4039 sec
[2021-04-25 02:16:03,220 INFO] Step 20300/50000; acc:  69.82; ppl:  2.83; xent: 1.04; lr: 0.00010; 10137/4218 tok/s;   4049 sec
[2021-04-25 02:16:12,461 INFO] Step 20350/50000; acc:  70.07; ppl:  2.81; xent: 1.03; lr: 0.00010; 10774/4597 tok/s;   4058 sec
[2021-04-25 02:16:22,562 INFO] Step 20400/50000; acc:  69.96; ppl:  2.85; xent: 1.05; lr: 0.00010; 10326/4289 tok/s;   4068 sec
[2021-04-25 02:16:25,474 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 02:16:32,109 INFO] Step 20450/50000; acc:  69.77; ppl:  2.85; xent: 1.05; lr: 0.00010; 10435/4479 tok/s;   4077 sec
[2021-04-25 02:16:42,341 INFO] Step 20500/50000; acc:  69.72; ppl:  2.87; xent: 1.06; lr: 0.00010; 10098/4165 tok/s;   4088 sec
[2021-04-25 02:16:52,666 INFO] Step 20550/50000; acc:  70.02; ppl:  2.83; xent: 1.04; lr: 0.00010; 9536/4182 tok/s;   4098 sec
[2021-04-25 02:17:02,395 INFO] Step 20600/50000; acc:  69.73; ppl:  2.86; xent: 1.05; lr: 0.00010; 10635/4404 tok/s;   4108 sec
[2021-04-25 02:17:12,331 INFO] Step 20650/50000; acc:  69.70; ppl:  2.86; xent: 1.05; lr: 0.00010; 10114/4317 tok/s;   4118 sec
[2021-04-25 02:17:22,047 INFO] Step 20700/50000; acc:  69.93; ppl:  2.86; xent: 1.05; lr: 0.00010; 10540/4417 tok/s;   4127 sec
[2021-04-25 02:17:31,757 INFO] Step 20750/50000; acc:  69.83; ppl:  2.85; xent: 1.05; lr: 0.00010; 10642/4336 tok/s;   4137 sec
[2021-04-25 02:17:41,706 INFO] Step 20800/50000; acc:  69.89; ppl:  2.83; xent: 1.04; lr: 0.00010; 10067/4277 tok/s;   4147 sec
[2021-04-25 02:17:51,679 INFO] Step 20850/50000; acc:  69.41; ppl:  2.91; xent: 1.07; lr: 0.00010; 10485/4282 tok/s;   4157 sec
[2021-04-25 02:18:01,148 INFO] Step 20900/50000; acc:  70.08; ppl:  2.80; xent: 1.03; lr: 0.00010; 10392/4507 tok/s;   4167 sec
[2021-04-25 02:18:11,005 INFO] Step 20950/50000; acc:  69.79; ppl:  2.85; xent: 1.05; lr: 0.00010; 10454/4326 tok/s;   4176 sec
[2021-04-25 02:18:21,366 INFO] Step 21000/50000; acc:  70.17; ppl:  2.82; xent: 1.04; lr: 0.00010; 9861/4133 tok/s;   4187 sec
[2021-04-25 02:18:30,978 INFO] Step 21050/50000; acc:  69.83; ppl:  2.81; xent: 1.03; lr: 0.00010; 10632/4355 tok/s;   4196 sec
[2021-04-25 02:18:40,632 INFO] Step 21100/50000; acc:  70.29; ppl:  2.83; xent: 1.04; lr: 0.00010; 10488/4494 tok/s;   4206 sec
[2021-04-25 02:18:44,623 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 02:18:50,465 INFO] Step 21150/50000; acc:  70.02; ppl:  2.82; xent: 1.04; lr: 0.00010; 10202/4328 tok/s;   4216 sec
[2021-04-25 02:19:00,704 INFO] Step 21200/50000; acc:  69.84; ppl:  2.86; xent: 1.05; lr: 0.00010; 10232/4199 tok/s;   4226 sec
[2021-04-25 02:19:10,514 INFO] Step 21250/50000; acc:  70.15; ppl:  2.83; xent: 1.04; lr: 0.00010; 10074/4344 tok/s;   4236 sec
[2021-04-25 02:19:20,581 INFO] Step 21300/50000; acc:  70.03; ppl:  2.83; xent: 1.04; lr: 0.00010; 10192/4316 tok/s;   4246 sec
[2021-04-25 02:19:30,162 INFO] Step 21350/50000; acc:  70.05; ppl:  2.82; xent: 1.04; lr: 0.00010; 10310/4453 tok/s;   4256 sec
[2021-04-25 02:19:39,662 INFO] Step 21400/50000; acc:  70.13; ppl:  2.85; xent: 1.05; lr: 0.00010; 10886/4483 tok/s;   4265 sec
[2021-04-25 02:19:49,458 INFO] Step 21450/50000; acc:  70.21; ppl:  2.81; xent: 1.03; lr: 0.00010; 10312/4408 tok/s;   4275 sec
[2021-04-25 02:19:59,761 INFO] Step 21500/50000; acc:  69.66; ppl:  2.86; xent: 1.05; lr: 0.00010; 10178/4139 tok/s;   4285 sec
[2021-04-25 02:20:09,844 INFO] Step 21550/50000; acc:  69.96; ppl:  2.83; xent: 1.04; lr: 0.00010; 10059/4194 tok/s;   4295 sec
[2021-04-25 02:20:19,612 INFO] Step 21600/50000; acc:  70.53; ppl:  2.78; xent: 1.02; lr: 0.00010; 10231/4323 tok/s;   4305 sec
[2021-04-25 02:20:29,495 INFO] Step 21650/50000; acc:  69.83; ppl:  2.85; xent: 1.05; lr: 0.00010; 10576/4370 tok/s;   4315 sec
[2021-04-25 02:20:39,174 INFO] Step 21700/50000; acc:  70.35; ppl:  2.80; xent: 1.03; lr: 0.00010; 10138/4399 tok/s;   4325 sec
[2021-04-25 02:20:49,676 INFO] Step 21750/50000; acc:  70.19; ppl:  2.79; xent: 1.03; lr: 0.00010; 9902/4085 tok/s;   4335 sec
[2021-04-25 02:20:59,012 INFO] Step 21800/50000; acc:  70.26; ppl:  2.79; xent: 1.03; lr: 0.00010; 10841/4459 tok/s;   4344 sec
[2021-04-25 02:21:08,758 INFO] Step 21850/50000; acc:  69.92; ppl:  2.82; xent: 1.04; lr: 0.00010; 10448/4399 tok/s;   4354 sec
[2021-04-25 02:21:09,934 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 02:21:18,759 INFO] Step 21900/50000; acc:  70.27; ppl:  2.80; xent: 1.03; lr: 0.00010; 10181/4361 tok/s;   4364 sec
[2021-04-25 02:21:28,488 INFO] Step 21950/50000; acc:  70.30; ppl:  2.80; xent: 1.03; lr: 0.00010; 10325/4344 tok/s;   4374 sec
[2021-04-25 02:21:39,019 INFO] Step 22000/50000; acc:  69.69; ppl:  2.85; xent: 1.05; lr: 0.00010; 9827/4129 tok/s;   4384 sec
[2021-04-25 02:21:48,799 INFO] Step 22050/50000; acc:  70.14; ppl:  2.78; xent: 1.02; lr: 0.00010; 10173/4398 tok/s;   4394 sec
[2021-04-25 02:21:58,259 INFO] Step 22100/50000; acc:  70.10; ppl:  2.82; xent: 1.04; lr: 0.00010; 10865/4489 tok/s;   4404 sec
[2021-04-25 02:22:07,644 INFO] Step 22150/50000; acc:  70.24; ppl:  2.80; xent: 1.03; lr: 0.00010; 10459/4546 tok/s;   4413 sec
[2021-04-25 02:22:17,610 INFO] Step 22200/50000; acc:  69.82; ppl:  2.84; xent: 1.04; lr: 0.00010; 10563/4258 tok/s;   4423 sec
[2021-04-25 02:22:28,387 INFO] Step 22250/50000; acc:  70.24; ppl:  2.79; xent: 1.03; lr: 0.00010; 9432/4005 tok/s;   4434 sec
[2021-04-25 02:22:38,135 INFO] Step 22300/50000; acc:  70.14; ppl:  2.82; xent: 1.04; lr: 0.00010; 10555/4342 tok/s;   4444 sec
[2021-04-25 02:22:48,238 INFO] Step 22350/50000; acc:  70.37; ppl:  2.79; xent: 1.03; lr: 0.00010; 10074/4249 tok/s;   4454 sec
[2021-04-25 02:22:57,733 INFO] Step 22400/50000; acc:  70.38; ppl:  2.79; xent: 1.03; lr: 0.00010; 10518/4445 tok/s;   4463 sec
[2021-04-25 02:23:08,123 INFO] Step 22450/50000; acc:  70.22; ppl:  2.81; xent: 1.03; lr: 0.00010; 10088/4174 tok/s;   4473 sec
[2021-04-25 02:23:17,322 INFO] Step 22500/50000; acc:  70.66; ppl:  2.72; xent: 1.00; lr: 0.00010; 10725/4521 tok/s;   4483 sec
[2021-04-25 02:23:26,978 INFO] Step 22550/50000; acc:  70.22; ppl:  2.80; xent: 1.03; lr: 0.00010; 10604/4481 tok/s;   4492 sec
[2021-04-25 02:23:35,209 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 02:23:36,705 INFO] Step 22600/50000; acc:  70.31; ppl:  2.79; xent: 1.03; lr: 0.00010; 10471/4338 tok/s;   4502 sec
[2021-04-25 02:23:46,614 INFO] Step 22650/50000; acc:  70.23; ppl:  2.78; xent: 1.02; lr: 0.00010; 10308/4361 tok/s;   4512 sec
[2021-04-25 02:23:56,843 INFO] Step 22700/50000; acc:  70.18; ppl:  2.82; xent: 1.04; lr: 0.00010; 9950/4169 tok/s;   4522 sec
[2021-04-25 02:24:06,733 INFO] Step 22750/50000; acc:  70.53; ppl:  2.76; xent: 1.01; lr: 0.00010; 10027/4400 tok/s;   4532 sec
[2021-04-25 02:24:16,835 INFO] Step 22800/50000; acc:  69.87; ppl:  2.83; xent: 1.04; lr: 0.00010; 10342/4258 tok/s;   4542 sec
[2021-04-25 02:24:26,417 INFO] Step 22850/50000; acc:  70.32; ppl:  2.79; xent: 1.03; lr: 0.00010; 10356/4430 tok/s;   4552 sec
[2021-04-25 02:24:36,269 INFO] Step 22900/50000; acc:  69.80; ppl:  2.82; xent: 1.04; lr: 0.00010; 10474/4436 tok/s;   4562 sec
[2021-04-25 02:24:45,819 INFO] Step 22950/50000; acc:  70.59; ppl:  2.75; xent: 1.01; lr: 0.00010; 10466/4250 tok/s;   4571 sec
[2021-04-25 02:24:56,758 INFO] Step 23000/50000; acc:  70.20; ppl:  2.81; xent: 1.03; lr: 0.00010; 9559/4027 tok/s;   4582 sec
[2021-04-25 02:25:06,027 INFO] Step 23050/50000; acc:  70.50; ppl:  2.77; xent: 1.02; lr: 0.00010; 10819/4541 tok/s;   4591 sec
[2021-04-25 02:25:16,219 INFO] Step 23100/50000; acc:  70.04; ppl:  2.81; xent: 1.03; lr: 0.00010; 10147/4242 tok/s;   4602 sec
[2021-04-25 02:25:26,453 INFO] Step 23150/50000; acc:  70.67; ppl:  2.78; xent: 1.02; lr: 0.00010; 9925/4139 tok/s;   4612 sec
[2021-04-25 02:25:36,284 INFO] Step 23200/50000; acc:  70.84; ppl:  2.72; xent: 1.00; lr: 0.00010; 10216/4306 tok/s;   4622 sec
[2021-04-25 02:25:45,815 INFO] Step 23250/50000; acc:  70.25; ppl:  2.80; xent: 1.03; lr: 0.00010; 10920/4457 tok/s;   4631 sec
[2021-04-25 02:25:55,480 INFO] Step 23300/50000; acc:  70.94; ppl:  2.72; xent: 1.00; lr: 0.00010; 10091/4431 tok/s;   4641 sec
[2021-04-25 02:26:01,001 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 02:26:05,320 INFO] Step 23350/50000; acc:  70.25; ppl:  2.79; xent: 1.03; lr: 0.00010; 10534/4385 tok/s;   4651 sec
[2021-04-25 02:26:15,666 INFO] Step 23400/50000; acc:  70.35; ppl:  2.80; xent: 1.03; lr: 0.00010; 9887/4115 tok/s;   4661 sec
[2021-04-25 02:26:25,873 INFO] Step 23450/50000; acc:  70.15; ppl:  2.78; xent: 1.02; lr: 0.00010; 9944/4240 tok/s;   4671 sec
[2021-04-25 02:26:35,808 INFO] Step 23500/50000; acc:  70.39; ppl:  2.77; xent: 1.02; lr: 0.00010; 10176/4307 tok/s;   4681 sec
[2021-04-25 02:26:45,431 INFO] Step 23550/50000; acc:  70.42; ppl:  2.77; xent: 1.02; lr: 0.00010; 10392/4463 tok/s;   4691 sec
[2021-04-25 02:26:55,274 INFO] Step 23600/50000; acc:  70.08; ppl:  2.81; xent: 1.03; lr: 0.00010; 10553/4358 tok/s;   4701 sec
[2021-04-25 02:27:04,661 INFO] Step 23650/50000; acc:  70.59; ppl:  2.75; xent: 1.01; lr: 0.00010; 10705/4516 tok/s;   4710 sec
[2021-04-25 02:27:15,167 INFO] Step 23700/50000; acc:  70.41; ppl:  2.78; xent: 1.02; lr: 0.00010; 9914/4012 tok/s;   4721 sec
[2021-04-25 02:27:25,009 INFO] Step 23750/50000; acc:  70.49; ppl:  2.76; xent: 1.01; lr: 0.00010; 10053/4338 tok/s;   4730 sec
[2021-04-25 02:27:35,311 INFO] Step 23800/50000; acc:  70.52; ppl:  2.78; xent: 1.02; lr: 0.00010; 10087/4212 tok/s;   4741 sec
[2021-04-25 02:27:44,843 INFO] Step 23850/50000; acc:  70.62; ppl:  2.75; xent: 1.01; lr: 0.00010; 10579/4435 tok/s;   4750 sec
[2021-04-25 02:27:55,069 INFO] Step 23900/50000; acc:  70.55; ppl:  2.78; xent: 1.02; lr: 0.00010; 10073/4169 tok/s;   4760 sec
[2021-04-25 02:28:05,138 INFO] Step 23950/50000; acc:  70.64; ppl:  2.73; xent: 1.00; lr: 0.00010; 10127/4190 tok/s;   4771 sec
[2021-04-25 02:28:14,240 INFO] Step 24000/50000; acc:  70.74; ppl:  2.73; xent: 1.00; lr: 0.00010; 10918/4712 tok/s;   4780 sec
[2021-04-25 02:28:24,399 INFO] Step 24050/50000; acc:  70.29; ppl:  2.77; xent: 1.02; lr: 0.00010; 10263/4240 tok/s;   4790 sec
[2021-04-25 02:28:26,915 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 02:28:33,856 INFO] Step 24100/50000; acc:  70.89; ppl:  2.73; xent: 1.00; lr: 0.00010; 10398/4542 tok/s;   4799 sec
[2021-04-25 02:28:43,843 INFO] Step 24150/50000; acc:  70.38; ppl:  2.77; xent: 1.02; lr: 0.00010; 10354/4215 tok/s;   4809 sec
[2021-04-25 02:28:54,415 INFO] Step 24200/50000; acc:  70.36; ppl:  2.78; xent: 1.02; lr: 0.00010; 9582/4138 tok/s;   4820 sec
[2021-04-25 02:29:04,018 INFO] Step 24250/50000; acc:  70.63; ppl:  2.74; xent: 1.01; lr: 0.00010; 10593/4464 tok/s;   4829 sec
[2021-04-25 02:29:13,937 INFO] Step 24300/50000; acc:  70.19; ppl:  2.78; xent: 1.02; lr: 0.00010; 10214/4299 tok/s;   4839 sec
[2021-04-25 02:29:23,390 INFO] Step 24350/50000; acc:  70.92; ppl:  2.72; xent: 1.00; lr: 0.00010; 10546/4512 tok/s;   4849 sec
[2021-04-25 02:29:33,708 INFO] Step 24400/50000; acc:  70.40; ppl:  2.78; xent: 1.02; lr: 0.00010; 10296/4133 tok/s;   4859 sec
[2021-04-25 02:29:43,617 INFO] Step 24450/50000; acc:  70.90; ppl:  2.72; xent: 1.00; lr: 0.00010; 10052/4289 tok/s;   4869 sec
[2021-04-25 02:29:53,707 INFO] Step 24500/50000; acc:  70.58; ppl:  2.78; xent: 1.02; lr: 0.00010; 10239/4215 tok/s;   4879 sec
[2021-04-25 02:30:03,380 INFO] Step 24550/50000; acc:  70.95; ppl:  2.71; xent: 1.00; lr: 0.00010; 10250/4401 tok/s;   4889 sec
[2021-04-25 02:30:13,273 INFO] Step 24600/50000; acc:  70.46; ppl:  2.76; xent: 1.02; lr: 0.00010; 10469/4325 tok/s;   4899 sec
[2021-04-25 02:30:23,447 INFO] Step 24650/50000; acc:  71.18; ppl:  2.70; xent: 1.00; lr: 0.00010; 9957/4215 tok/s;   4909 sec
[2021-04-25 02:30:33,037 INFO] Step 24700/50000; acc:  70.65; ppl:  2.74; xent: 1.01; lr: 0.00010; 10721/4384 tok/s;   4918 sec
[2021-04-25 02:30:42,709 INFO] Step 24750/50000; acc:  70.88; ppl:  2.72; xent: 1.00; lr: 0.00010; 10467/4447 tok/s;   4928 sec
[2021-04-25 02:30:46,186 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 02:30:52,719 INFO] Step 24800/50000; acc:  70.70; ppl:  2.73; xent: 1.00; lr: 0.00010; 10027/4295 tok/s;   4938 sec
[2021-04-25 02:31:02,710 INFO] Step 24850/50000; acc:  70.70; ppl:  2.74; xent: 1.01; lr: 0.00010; 10456/4267 tok/s;   4948 sec
[2021-04-25 02:31:12,607 INFO] Step 24900/50000; acc:  70.70; ppl:  2.73; xent: 1.00; lr: 0.00010; 9863/4344 tok/s;   4958 sec
[2021-04-25 02:31:22,722 INFO] Step 24950/50000; acc:  70.87; ppl:  2.74; xent: 1.01; lr: 0.00010; 10161/4274 tok/s;   4968 sec
[2021-04-25 02:31:32,545 INFO] Step 25000/50000; acc:  70.59; ppl:  2.75; xent: 1.01; lr: 0.00010; 10356/4372 tok/s;   4978 sec
[2021-04-25 02:31:32,547 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/strict/valid.txt, align=None)...
[2021-04-25 02:31:40,535 INFO] Validation perplexity: 2.86402
[2021-04-25 02:31:40,536 INFO] Validation accuracy: 70.2401
[2021-04-25 02:31:40,538 INFO] Saving checkpoint ../models/group1_params/strict_ops/model_step_25000.pt
[2021-04-25 02:31:50,659 INFO] Step 25050/50000; acc:  70.91; ppl:  2.74; xent: 1.01; lr: 0.00010; 5606/2332 tok/s;   4996 sec
[2021-04-25 02:32:00,289 INFO] Step 25100/50000; acc:  70.64; ppl:  2.73; xent: 1.00; lr: 0.00010; 10582/4508 tok/s;   5006 sec
[2021-04-25 02:32:10,505 INFO] Step 25150/50000; acc:  70.94; ppl:  2.73; xent: 1.00; lr: 0.00010; 9978/4105 tok/s;   5016 sec
[2021-04-25 02:32:20,917 INFO] Step 25200/50000; acc:  70.53; ppl:  2.78; xent: 1.02; lr: 0.00010; 10022/4117 tok/s;   5026 sec
[2021-04-25 02:32:30,514 INFO] Step 25250/50000; acc:  71.32; ppl:  2.68; xent: 0.99; lr: 0.00010; 10375/4418 tok/s;   5036 sec
[2021-04-25 02:32:40,401 INFO] Step 25300/50000; acc:  70.91; ppl:  2.74; xent: 1.01; lr: 0.00010; 10426/4327 tok/s;   5046 sec
[2021-04-25 02:32:50,232 INFO] Step 25350/50000; acc:  71.32; ppl:  2.71; xent: 1.00; lr: 0.00010; 10067/4317 tok/s;   5056 sec
[2021-04-25 02:33:00,752 INFO] Step 25400/50000; acc:  70.79; ppl:  2.71; xent: 1.00; lr: 0.00010; 9944/4117 tok/s;   5066 sec
[2021-04-25 02:33:09,933 INFO] Step 25450/50000; acc:  71.18; ppl:  2.69; xent: 0.99; lr: 0.00010; 10907/4523 tok/s;   5075 sec
[2021-04-25 02:33:19,902 INFO] Step 25500/50000; acc:  70.85; ppl:  2.73; xent: 1.01; lr: 0.00010; 10309/4347 tok/s;   5085 sec
[2021-04-25 02:33:20,640 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 02:33:29,928 INFO] Step 25550/50000; acc:  70.87; ppl:  2.71; xent: 1.00; lr: 0.00010; 10147/4289 tok/s;   5095 sec
[2021-04-25 02:33:39,441 INFO] Step 25600/50000; acc:  70.94; ppl:  2.72; xent: 1.00; lr: 0.00010; 10553/4421 tok/s;   5105 sec
[2021-04-25 02:33:49,969 INFO] Step 25650/50000; acc:  70.49; ppl:  2.75; xent: 1.01; lr: 0.00010; 9801/4148 tok/s;   5115 sec
[2021-04-25 02:33:59,588 INFO] Step 25700/50000; acc:  71.03; ppl:  2.69; xent: 0.99; lr: 0.00010; 10201/4450 tok/s;   5125 sec
[2021-04-25 02:34:09,339 INFO] Step 25750/50000; acc:  70.88; ppl:  2.73; xent: 1.01; lr: 0.00010; 10582/4382 tok/s;   5135 sec
[2021-04-25 02:34:19,103 INFO] Step 25800/50000; acc:  70.82; ppl:  2.73; xent: 1.01; lr: 0.00010; 10341/4404 tok/s;   5144 sec
[2021-04-25 02:34:29,078 INFO] Step 25850/50000; acc:  70.47; ppl:  2.74; xent: 1.01; lr: 0.00010; 10385/4238 tok/s;   5154 sec
[2021-04-25 02:34:39,548 INFO] Step 25900/50000; acc:  70.97; ppl:  2.71; xent: 1.00; lr: 0.00010; 9775/4108 tok/s;   5165 sec
[2021-04-25 02:34:48,922 INFO] Step 25950/50000; acc:  71.12; ppl:  2.71; xent: 1.00; lr: 0.00010; 10677/4475 tok/s;   5174 sec
[2021-04-25 02:34:59,102 INFO] Step 26000/50000; acc:  70.85; ppl:  2.74; xent: 1.01; lr: 0.00010; 10285/4261 tok/s;   5184 sec
[2021-04-25 02:35:08,724 INFO] Step 26050/50000; acc:  71.20; ppl:  2.69; xent: 0.99; lr: 0.00010; 10337/4392 tok/s;   5194 sec
[2021-04-25 02:35:18,913 INFO] Step 26100/50000; acc:  71.12; ppl:  2.71; xent: 1.00; lr: 0.00010; 10163/4205 tok/s;   5204 sec
[2021-04-25 02:35:28,114 INFO] Step 26150/50000; acc:  71.23; ppl:  2.66; xent: 0.98; lr: 0.00010; 10769/4545 tok/s;   5213 sec
[2021-04-25 02:35:37,794 INFO] Step 26200/50000; acc:  70.89; ppl:  2.72; xent: 1.00; lr: 0.00010; 10645/4506 tok/s;   5223 sec
[2021-04-25 02:35:45,414 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 02:35:47,273 INFO] Step 26250/50000; acc:  70.98; ppl:  2.69; xent: 0.99; lr: 0.00010; 10663/4434 tok/s;   5233 sec
[2021-04-25 02:35:57,276 INFO] Step 26300/50000; acc:  71.25; ppl:  2.69; xent: 0.99; lr: 0.00010; 10273/4325 tok/s;   5243 sec
[2021-04-25 02:36:07,672 INFO] Step 26350/50000; acc:  70.83; ppl:  2.74; xent: 1.01; lr: 0.00010; 9788/4119 tok/s;   5253 sec
[2021-04-25 02:36:17,709 INFO] Step 26400/50000; acc:  71.20; ppl:  2.68; xent: 0.98; lr: 0.00010; 9882/4308 tok/s;   5263 sec
[2021-04-25 02:36:27,662 INFO] Step 26450/50000; acc:  70.88; ppl:  2.73; xent: 1.00; lr: 0.00010; 10482/4343 tok/s;   5273 sec
[2021-04-25 02:36:36,968 INFO] Step 26500/50000; acc:  71.40; ppl:  2.69; xent: 0.99; lr: 0.00010; 10507/4543 tok/s;   5282 sec
[2021-04-25 02:36:46,798 INFO] Step 26550/50000; acc:  70.49; ppl:  2.73; xent: 1.01; lr: 0.00010; 10539/4434 tok/s;   5292 sec
[2021-04-25 02:36:56,864 INFO] Step 26600/50000; acc:  71.12; ppl:  2.70; xent: 0.99; lr: 0.00010; 10231/4084 tok/s;   5302 sec
[2021-04-25 02:37:07,459 INFO] Step 26650/50000; acc:  71.04; ppl:  2.71; xent: 1.00; lr: 0.00010; 9686/4109 tok/s;   5313 sec
[2021-04-25 02:37:16,932 INFO] Step 26700/50000; acc:  71.27; ppl:  2.69; xent: 0.99; lr: 0.00010; 10682/4486 tok/s;   5322 sec
[2021-04-25 02:37:26,844 INFO] Step 26750/50000; acc:  71.15; ppl:  2.69; xent: 0.99; lr: 0.00010; 10144/4321 tok/s;   5332 sec
[2021-04-25 02:37:37,295 INFO] Step 26800/50000; acc:  70.82; ppl:  2.73; xent: 1.00; lr: 0.00010; 9990/4073 tok/s;   5343 sec
[2021-04-25 02:37:47,052 INFO] Step 26850/50000; acc:  71.71; ppl:  2.63; xent: 0.97; lr: 0.00010; 10231/4347 tok/s;   5352 sec
[2021-04-25 02:37:56,486 INFO] Step 26900/50000; acc:  71.13; ppl:  2.69; xent: 0.99; lr: 0.00010; 10896/4483 tok/s;   5362 sec
[2021-04-25 02:38:06,350 INFO] Step 26950/50000; acc:  71.52; ppl:  2.65; xent: 0.97; lr: 0.00010; 9976/4331 tok/s;   5372 sec
[2021-04-25 02:38:11,401 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 02:38:16,155 INFO] Step 27000/50000; acc:  70.82; ppl:  2.71; xent: 1.00; lr: 0.00010; 10621/4408 tok/s;   5382 sec
[2021-04-25 02:38:26,517 INFO] Step 27050/50000; acc:  71.30; ppl:  2.68; xent: 0.99; lr: 0.00010; 9807/4119 tok/s;   5392 sec
[2021-04-25 02:38:36,749 INFO] Step 27100/50000; acc:  70.83; ppl:  2.69; xent: 0.99; lr: 0.00010; 9967/4223 tok/s;   5402 sec
[2021-04-25 02:38:46,779 INFO] Step 27150/50000; acc:  71.15; ppl:  2.69; xent: 0.99; lr: 0.00010; 10084/4264 tok/s;   5412 sec
[2021-04-25 02:38:56,576 INFO] Step 27200/50000; acc:  71.06; ppl:  2.69; xent: 0.99; lr: 0.00010; 10185/4401 tok/s;   5422 sec
[2021-04-25 02:39:06,212 INFO] Step 27250/50000; acc:  71.23; ppl:  2.71; xent: 1.00; lr: 0.00010; 10762/4429 tok/s;   5432 sec
[2021-04-25 02:39:15,495 INFO] Step 27300/50000; acc:  71.40; ppl:  2.65; xent: 0.98; lr: 0.00010; 10704/4566 tok/s;   5441 sec
[2021-04-25 02:39:26,149 INFO] Step 27350/50000; acc:  70.96; ppl:  2.71; xent: 1.00; lr: 0.00010; 9794/3978 tok/s;   5452 sec
[2021-04-25 02:39:36,040 INFO] Step 27400/50000; acc:  70.76; ppl:  2.71; xent: 1.00; lr: 0.00010; 10297/4326 tok/s;   5461 sec
[2021-04-25 02:39:46,026 INFO] Step 27450/50000; acc:  71.41; ppl:  2.66; xent: 0.98; lr: 0.00010; 10230/4325 tok/s;   5471 sec
[2021-04-25 02:39:55,522 INFO] Step 27500/50000; acc:  71.09; ppl:  2.70; xent: 0.99; lr: 0.00010; 10693/4439 tok/s;   5481 sec
[2021-04-25 02:40:05,550 INFO] Step 27550/50000; acc:  71.63; ppl:  2.66; xent: 0.98; lr: 0.00010; 10000/4244 tok/s;   5491 sec
[2021-04-25 02:40:15,740 INFO] Step 27600/50000; acc:  71.13; ppl:  2.67; xent: 0.98; lr: 0.00010; 10285/4155 tok/s;   5501 sec
[2021-04-25 02:40:24,970 INFO] Step 27650/50000; acc:  71.51; ppl:  2.64; xent: 0.97; lr: 0.00010; 10705/4622 tok/s;   5510 sec
[2021-04-25 02:40:35,137 INFO] Step 27700/50000; acc:  71.40; ppl:  2.67; xent: 0.98; lr: 0.00010; 10155/4261 tok/s;   5521 sec
[2021-04-25 02:40:37,285 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 02:40:44,509 INFO] Step 27750/50000; acc:  71.43; ppl:  2.65; xent: 0.98; lr: 0.00010; 10563/4568 tok/s;   5530 sec
[2021-04-25 02:40:54,558 INFO] Step 27800/50000; acc:  71.11; ppl:  2.70; xent: 0.99; lr: 0.00010; 10335/4202 tok/s;   5540 sec
[2021-04-25 02:41:04,854 INFO] Step 27850/50000; acc:  71.18; ppl:  2.68; xent: 0.99; lr: 0.00010; 9738/4260 tok/s;   5550 sec
[2021-04-25 02:41:14,712 INFO] Step 27900/50000; acc:  71.23; ppl:  2.67; xent: 0.98; lr: 0.00010; 10409/4345 tok/s;   5560 sec
[2021-04-25 02:41:24,433 INFO] Step 27950/50000; acc:  71.19; ppl:  2.68; xent: 0.99; lr: 0.00010; 10417/4372 tok/s;   5570 sec
[2021-04-25 02:41:33,854 INFO] Step 28000/50000; acc:  71.64; ppl:  2.63; xent: 0.97; lr: 0.00010; 10577/4513 tok/s;   5579 sec
[2021-04-25 02:41:44,282 INFO] Step 28050/50000; acc:  70.90; ppl:  2.72; xent: 1.00; lr: 0.00010; 10176/4118 tok/s;   5590 sec
[2021-04-25 02:41:53,849 INFO] Step 28100/50000; acc:  71.91; ppl:  2.63; xent: 0.97; lr: 0.00010; 10275/4415 tok/s;   5599 sec
[2021-04-25 02:42:03,935 INFO] Step 28150/50000; acc:  71.22; ppl:  2.69; xent: 0.99; lr: 0.00010; 10265/4200 tok/s;   5609 sec
[2021-04-25 02:42:13,721 INFO] Step 28200/50000; acc:  71.38; ppl:  2.66; xent: 0.98; lr: 0.00010; 10423/4434 tok/s;   5619 sec
[2021-04-25 02:42:23,706 INFO] Step 28250/50000; acc:  71.36; ppl:  2.68; xent: 0.98; lr: 0.00010; 10192/4266 tok/s;   5629 sec
[2021-04-25 02:42:33,816 INFO] Step 28300/50000; acc:  72.07; ppl:  2.62; xent: 0.96; lr: 0.00010; 10113/4217 tok/s;   5639 sec
[2021-04-25 02:42:43,107 INFO] Step 28350/50000; acc:  71.60; ppl:  2.63; xent: 0.97; lr: 0.00010; 10767/4488 tok/s;   5648 sec
[2021-04-25 02:42:53,180 INFO] Step 28400/50000; acc:  71.28; ppl:  2.68; xent: 0.98; lr: 0.00010; 10310/4298 tok/s;   5659 sec
[2021-04-25 02:42:56,142 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 02:43:03,001 INFO] Step 28450/50000; acc:  71.66; ppl:  2.64; xent: 0.97; lr: 0.00010; 10188/4390 tok/s;   5668 sec
[2021-04-25 02:43:12,708 INFO] Step 28500/50000; acc:  71.52; ppl:  2.65; xent: 0.97; lr: 0.00010; 10630/4379 tok/s;   5678 sec
[2021-04-25 02:43:22,428 INFO] Step 28550/50000; acc:  71.34; ppl:  2.65; xent: 0.98; lr: 0.00010; 10100/4366 tok/s;   5688 sec
[2021-04-25 02:43:32,783 INFO] Step 28600/50000; acc:  71.32; ppl:  2.66; xent: 0.98; lr: 0.00010; 9991/4233 tok/s;   5698 sec
[2021-04-25 02:43:42,348 INFO] Step 28650/50000; acc:  71.40; ppl:  2.66; xent: 0.98; lr: 0.00010; 10545/4481 tok/s;   5708 sec
[2021-04-25 02:43:52,153 INFO] Step 28700/50000; acc:  71.61; ppl:  2.66; xent: 0.98; lr: 0.00010; 10417/4376 tok/s;   5718 sec
[2021-04-25 02:44:01,736 INFO] Step 28750/50000; acc:  71.52; ppl:  2.63; xent: 0.97; lr: 0.00010; 10670/4458 tok/s;   5727 sec
[2021-04-25 02:44:12,085 INFO] Step 28800/50000; acc:  71.40; ppl:  2.66; xent: 0.98; lr: 0.00010; 9824/4047 tok/s;   5737 sec
[2021-04-25 02:44:22,061 INFO] Step 28850/50000; acc:  71.24; ppl:  2.70; xent: 0.99; lr: 0.00010; 10435/4273 tok/s;   5747 sec
[2021-04-25 02:44:31,630 INFO] Step 28900/50000; acc:  72.26; ppl:  2.59; xent: 0.95; lr: 0.00010; 10270/4434 tok/s;   5757 sec
[2021-04-25 02:44:41,668 INFO] Step 28950/50000; acc:  71.35; ppl:  2.66; xent: 0.98; lr: 0.00010; 10292/4289 tok/s;   5767 sec
[2021-04-25 02:44:51,650 INFO] Step 29000/50000; acc:  71.24; ppl:  2.67; xent: 0.98; lr: 0.00010; 10214/4282 tok/s;   5777 sec
[2021-04-25 02:45:01,816 INFO] Step 29050/50000; acc:  71.99; ppl:  2.59; xent: 0.95; lr: 0.00010; 10123/4229 tok/s;   5787 sec
[2021-04-25 02:45:11,188 INFO] Step 29100/50000; acc:  71.68; ppl:  2.63; xent: 0.97; lr: 0.00010; 10740/4499 tok/s;   5797 sec
[2021-04-25 02:45:21,082 INFO] Step 29150/50000; acc:  71.74; ppl:  2.62; xent: 0.96; lr: 0.00010; 10130/4280 tok/s;   5806 sec
[2021-04-25 02:45:21,451 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 02:45:31,229 INFO] Step 29200/50000; acc:  71.35; ppl:  2.66; xent: 0.98; lr: 0.00010; 10291/4270 tok/s;   5817 sec
[2021-04-25 02:45:40,923 INFO] Step 29250/50000; acc:  71.45; ppl:  2.65; xent: 0.97; lr: 0.00010; 10307/4336 tok/s;   5826 sec
[2021-04-25 02:45:51,499 INFO] Step 29300/50000; acc:  71.34; ppl:  2.65; xent: 0.97; lr: 0.00010; 9637/4128 tok/s;   5837 sec
[2021-04-25 02:46:01,178 INFO] Step 29350/50000; acc:  71.69; ppl:  2.62; xent: 0.96; lr: 0.00010; 10228/4404 tok/s;   5847 sec
[2021-04-25 02:46:10,872 INFO] Step 29400/50000; acc:  71.55; ppl:  2.64; xent: 0.97; lr: 0.00010; 10696/4422 tok/s;   5856 sec
[2021-04-25 02:46:20,646 INFO] Step 29450/50000; acc:  71.62; ppl:  2.65; xent: 0.97; lr: 0.00010; 10254/4394 tok/s;   5866 sec
[2021-04-25 02:46:30,813 INFO] Step 29500/50000; acc:  71.52; ppl:  2.66; xent: 0.98; lr: 0.00010; 10248/4199 tok/s;   5876 sec
[2021-04-25 02:46:41,257 INFO] Step 29550/50000; acc:  71.60; ppl:  2.63; xent: 0.97; lr: 0.00010; 9804/4099 tok/s;   5887 sec
[2021-04-25 02:46:50,418 INFO] Step 29600/50000; acc:  72.00; ppl:  2.62; xent: 0.96; lr: 0.00010; 10872/4579 tok/s;   5896 sec
[2021-04-25 02:47:00,703 INFO] Step 29650/50000; acc:  71.26; ppl:  2.68; xent: 0.98; lr: 0.00010; 10185/4224 tok/s;   5906 sec
[2021-04-25 02:47:10,127 INFO] Step 29700/50000; acc:  72.09; ppl:  2.59; xent: 0.95; lr: 0.00010; 10413/4451 tok/s;   5915 sec
[2021-04-25 02:47:20,462 INFO] Step 29750/50000; acc:  71.60; ppl:  2.63; xent: 0.97; lr: 0.00010; 10053/4143 tok/s;   5926 sec
[2021-04-25 02:47:29,950 INFO] Step 29800/50000; acc:  71.78; ppl:  2.63; xent: 0.97; lr: 0.00010; 10721/4483 tok/s;   5935 sec
[2021-04-25 02:47:39,417 INFO] Step 29850/50000; acc:  71.72; ppl:  2.61; xent: 0.96; lr: 0.00010; 10704/4495 tok/s;   5945 sec
[2021-04-25 02:47:46,789 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 02:47:49,074 INFO] Step 29900/50000; acc:  71.73; ppl:  2.62; xent: 0.96; lr: 0.00010; 10565/4406 tok/s;   5954 sec
[2021-04-25 02:47:58,805 INFO] Step 29950/50000; acc:  71.74; ppl:  2.61; xent: 0.96; lr: 0.00010; 10288/4430 tok/s;   5964 sec
[2021-04-25 02:48:09,406 INFO] Step 30000/50000; acc:  71.17; ppl:  2.68; xent: 0.98; lr: 0.00010; 9844/4089 tok/s;   5975 sec
[2021-04-25 02:48:09,408 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/strict/valid.txt, align=None)...
[2021-04-25 02:48:17,377 INFO] Validation perplexity: 2.83637
[2021-04-25 02:48:17,377 INFO] Validation accuracy: 70.5389
[2021-04-25 02:48:17,379 INFO] Saving checkpoint ../models/group1_params/strict_ops/model_step_30000.pt
[2021-04-25 02:48:27,790 INFO] Step 30050/50000; acc:  71.90; ppl:  2.60; xent: 0.96; lr: 0.00010; 5374/2343 tok/s;   5993 sec
[2021-04-25 02:48:37,590 INFO] Step 30100/50000; acc:  71.69; ppl:  2.63; xent: 0.97; lr: 0.00010; 10500/4384 tok/s;   6003 sec
[2021-04-25 02:48:47,065 INFO] Step 30150/50000; acc:  71.86; ppl:  2.61; xent: 0.96; lr: 0.00010; 10401/4457 tok/s;   6012 sec
[2021-04-25 02:48:56,955 INFO] Step 30200/50000; acc:  71.23; ppl:  2.68; xent: 0.99; lr: 0.00010; 10561/4406 tok/s;   6022 sec
[2021-04-25 02:49:07,017 INFO] Step 30250/50000; acc:  71.85; ppl:  2.61; xent: 0.96; lr: 0.00010; 10145/4091 tok/s;   6032 sec
[2021-04-25 02:49:17,428 INFO] Step 30300/50000; acc:  71.38; ppl:  2.64; xent: 0.97; lr: 0.00010; 9910/4215 tok/s;   6043 sec
[2021-04-25 02:49:27,355 INFO] Step 30350/50000; acc:  71.98; ppl:  2.61; xent: 0.96; lr: 0.00010; 10199/4292 tok/s;   6053 sec
[2021-04-25 02:49:36,833 INFO] Step 30400/50000; acc:  72.02; ppl:  2.60; xent: 0.95; lr: 0.00010; 10579/4457 tok/s;   6062 sec
[2021-04-25 02:49:47,286 INFO] Step 30450/50000; acc:  71.62; ppl:  2.65; xent: 0.98; lr: 0.00010; 9978/4093 tok/s;   6073 sec
[2021-04-25 02:49:56,891 INFO] Step 30500/50000; acc:  72.42; ppl:  2.55; xent: 0.94; lr: 0.00010; 10264/4387 tok/s;   6082 sec
[2021-04-25 02:50:06,411 INFO] Step 30550/50000; acc:  71.80; ppl:  2.62; xent: 0.96; lr: 0.00010; 10826/4501 tok/s;   6092 sec
[2021-04-25 02:50:16,252 INFO] Step 30600/50000; acc:  71.98; ppl:  2.60; xent: 0.95; lr: 0.00010; 10292/4340 tok/s;   6102 sec
[2021-04-25 02:50:20,877 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 02:50:25,879 INFO] Step 30650/50000; acc:  71.51; ppl:  2.61; xent: 0.96; lr: 0.00010; 10629/4476 tok/s;   6111 sec
[2021-04-25 02:50:36,441 INFO] Step 30700/50000; acc:  71.63; ppl:  2.62; xent: 0.96; lr: 0.00010; 9689/4038 tok/s;   6122 sec
[2021-04-25 02:50:46,639 INFO] Step 30750/50000; acc:  71.98; ppl:  2.59; xent: 0.95; lr: 0.00010; 9752/4238 tok/s;   6132 sec
[2021-04-25 02:50:56,625 INFO] Step 30800/50000; acc:  71.66; ppl:  2.63; xent: 0.97; lr: 0.00010; 10392/4293 tok/s;   6142 sec
[2021-04-25 02:51:06,393 INFO] Step 30850/50000; acc:  72.03; ppl:  2.60; xent: 0.95; lr: 0.00010; 10175/4392 tok/s;   6152 sec
[2021-04-25 02:51:15,866 INFO] Step 30900/50000; acc:  71.92; ppl:  2.62; xent: 0.96; lr: 0.00010; 10811/4534 tok/s;   6161 sec
[2021-04-25 02:51:25,218 INFO] Step 30950/50000; acc:  72.16; ppl:  2.57; xent: 0.94; lr: 0.00010; 10717/4472 tok/s;   6171 sec
[2021-04-25 02:51:35,839 INFO] Step 31000/50000; acc:  71.65; ppl:  2.65; xent: 0.97; lr: 0.00010; 9866/4023 tok/s;   6181 sec
[2021-04-25 02:51:45,534 INFO] Step 31050/50000; acc:  71.73; ppl:  2.62; xent: 0.96; lr: 0.00010; 10394/4410 tok/s;   6191 sec
[2021-04-25 02:51:55,508 INFO] Step 31100/50000; acc:  72.03; ppl:  2.59; xent: 0.95; lr: 0.00010; 10326/4307 tok/s;   6201 sec
[2021-04-25 02:52:05,204 INFO] Step 31150/50000; acc:  71.99; ppl:  2.62; xent: 0.96; lr: 0.00010; 10481/4360 tok/s;   6211 sec
[2021-04-25 02:52:15,278 INFO] Step 31200/50000; acc:  72.09; ppl:  2.58; xent: 0.95; lr: 0.00010; 9925/4242 tok/s;   6221 sec
[2021-04-25 02:52:25,241 INFO] Step 31250/50000; acc:  71.82; ppl:  2.60; xent: 0.95; lr: 0.00010; 10510/4226 tok/s;   6231 sec
[2021-04-25 02:52:34,498 INFO] Step 31300/50000; acc:  72.21; ppl:  2.56; xent: 0.94; lr: 0.00010; 10540/4615 tok/s;   6240 sec
[2021-04-25 02:52:44,550 INFO] Step 31350/50000; acc:  71.86; ppl:  2.61; xent: 0.96; lr: 0.00010; 10301/4299 tok/s;   6250 sec
[2021-04-25 02:52:46,405 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 02:52:54,256 INFO] Step 31400/50000; acc:  71.81; ppl:  2.60; xent: 0.96; lr: 0.00010; 10502/4443 tok/s;   6260 sec
[2021-04-25 02:53:04,315 INFO] Step 31450/50000; acc:  72.04; ppl:  2.61; xent: 0.96; lr: 0.00010; 10131/4214 tok/s;   6270 sec
[2021-04-25 02:53:14,532 INFO] Step 31500/50000; acc:  71.88; ppl:  2.60; xent: 0.95; lr: 0.00010; 9897/4258 tok/s;   6280 sec
[2021-04-25 02:53:24,239 INFO] Step 31550/50000; acc:  71.96; ppl:  2.59; xent: 0.95; lr: 0.00010; 10293/4397 tok/s;   6290 sec
[2021-04-25 02:53:34,229 INFO] Step 31600/50000; acc:  71.59; ppl:  2.62; xent: 0.96; lr: 0.00010; 10406/4292 tok/s;   6300 sec
[2021-04-25 02:53:43,713 INFO] Step 31650/50000; acc:  72.40; ppl:  2.56; xent: 0.94; lr: 0.00010; 10477/4485 tok/s;   6309 sec
[2021-04-25 02:53:54,091 INFO] Step 31700/50000; acc:  71.60; ppl:  2.61; xent: 0.96; lr: 0.00010; 10103/4130 tok/s;   6319 sec
[2021-04-25 02:54:03,564 INFO] Step 31750/50000; acc:  72.32; ppl:  2.57; xent: 0.95; lr: 0.00010; 10430/4429 tok/s;   6329 sec
[2021-04-25 02:54:13,789 INFO] Step 31800/50000; acc:  71.77; ppl:  2.63; xent: 0.97; lr: 0.00010; 10190/4228 tok/s;   6339 sec
[2021-04-25 02:54:23,447 INFO] Step 31850/50000; acc:  72.13; ppl:  2.58; xent: 0.95; lr: 0.00010; 10471/4423 tok/s;   6349 sec
[2021-04-25 02:54:33,390 INFO] Step 31900/50000; acc:  71.77; ppl:  2.62; xent: 0.96; lr: 0.00010; 10297/4300 tok/s;   6359 sec
[2021-04-25 02:54:43,494 INFO] Step 31950/50000; acc:  72.62; ppl:  2.54; xent: 0.93; lr: 0.00010; 10137/4204 tok/s;   6369 sec
[2021-04-25 02:54:52,605 INFO] Step 32000/50000; acc:  72.50; ppl:  2.55; xent: 0.93; lr: 0.00010; 10950/4573 tok/s;   6378 sec
[2021-04-25 02:55:02,582 INFO] Step 32050/50000; acc:  71.87; ppl:  2.60; xent: 0.95; lr: 0.00010; 10402/4333 tok/s;   6388 sec
[2021-04-25 02:55:05,080 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 02:55:12,246 INFO] Step 32100/50000; acc:  72.35; ppl:  2.56; xent: 0.94; lr: 0.00010; 10210/4482 tok/s;   6398 sec
[2021-04-25 02:55:21,963 INFO] Step 32150/50000; acc:  72.17; ppl:  2.58; xent: 0.95; lr: 0.00010; 10657/4348 tok/s;   6407 sec
[2021-04-25 02:55:31,896 INFO] Step 32200/50000; acc:  71.81; ppl:  2.61; xent: 0.96; lr: 0.00010; 10155/4312 tok/s;   6417 sec
[2021-04-25 02:55:42,024 INFO] Step 32250/50000; acc:  72.28; ppl:  2.58; xent: 0.95; lr: 0.00010; 10056/4314 tok/s;   6427 sec
[2021-04-25 02:55:51,661 INFO] Step 32300/50000; acc:  71.90; ppl:  2.60; xent: 0.96; lr: 0.00010; 10534/4437 tok/s;   6437 sec
[2021-04-25 02:56:01,218 INFO] Step 32350/50000; acc:  72.45; ppl:  2.56; xent: 0.94; lr: 0.00010; 10404/4462 tok/s;   6447 sec
[2021-04-25 02:56:10,874 INFO] Step 32400/50000; acc:  71.89; ppl:  2.60; xent: 0.95; lr: 0.00010; 10890/4435 tok/s;   6456 sec
[2021-04-25 02:56:21,296 INFO] Step 32450/50000; acc:  71.94; ppl:  2.58; xent: 0.95; lr: 0.00010; 9703/4053 tok/s;   6467 sec
[2021-04-25 02:56:31,319 INFO] Step 32500/50000; acc:  71.87; ppl:  2.60; xent: 0.96; lr: 0.00010; 10254/4239 tok/s;   6477 sec
[2021-04-25 02:56:40,871 INFO] Step 32550/50000; acc:  72.62; ppl:  2.53; xent: 0.93; lr: 0.00010; 10368/4465 tok/s;   6486 sec
[2021-04-25 02:56:50,799 INFO] Step 32600/50000; acc:  71.72; ppl:  2.62; xent: 0.96; lr: 0.00010; 10464/4322 tok/s;   6496 sec
[2021-04-25 02:57:00,860 INFO] Step 32650/50000; acc:  72.22; ppl:  2.57; xent: 0.94; lr: 0.00010; 10053/4244 tok/s;   6506 sec
[2021-04-25 02:57:11,171 INFO] Step 32700/50000; acc:  72.39; ppl:  2.55; xent: 0.93; lr: 0.00010; 10031/4137 tok/s;   6517 sec
[2021-04-25 02:57:20,332 INFO] Step 32750/50000; acc:  72.61; ppl:  2.55; xent: 0.94; lr: 0.00010; 11005/4619 tok/s;   6526 sec
[2021-04-25 02:57:30,203 INFO] Step 32800/50000; acc:  72.26; ppl:  2.56; xent: 0.94; lr: 0.00010; 10141/4293 tok/s;   6536 sec
[2021-04-25 02:57:30,213 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 02:57:40,490 INFO] Step 32850/50000; acc:  72.17; ppl:  2.57; xent: 0.94; lr: 0.00010; 10126/4225 tok/s;   6546 sec
[2021-04-25 02:57:50,253 INFO] Step 32900/50000; acc:  72.32; ppl:  2.56; xent: 0.94; lr: 0.00010; 10101/4310 tok/s;   6556 sec
[2021-04-25 02:58:00,639 INFO] Step 32950/50000; acc:  71.81; ppl:  2.59; xent: 0.95; lr: 0.00010; 9847/4199 tok/s;   6566 sec
[2021-04-25 02:58:10,490 INFO] Step 33000/50000; acc:  72.12; ppl:  2.58; xent: 0.95; lr: 0.00010; 10319/4358 tok/s;   6576 sec
[2021-04-25 02:58:20,102 INFO] Step 33050/50000; acc:  72.37; ppl:  2.57; xent: 0.94; lr: 0.00010; 10620/4444 tok/s;   6585 sec
[2021-04-25 02:58:29,816 INFO] Step 33100/50000; acc:  71.89; ppl:  2.59; xent: 0.95; lr: 0.00010; 10424/4446 tok/s;   6595 sec
[2021-04-25 02:58:39,643 INFO] Step 33150/50000; acc:  72.40; ppl:  2.56; xent: 0.94; lr: 0.00010; 10316/4269 tok/s;   6605 sec
[2021-04-25 02:58:50,527 INFO] Step 33200/50000; acc:  71.86; ppl:  2.59; xent: 0.95; lr: 0.00010; 9657/3988 tok/s;   6616 sec
[2021-04-25 02:58:59,543 INFO] Step 33250/50000; acc:  72.61; ppl:  2.54; xent: 0.93; lr: 0.00010; 10977/4627 tok/s;   6625 sec
[2021-04-25 02:59:09,818 INFO] Step 33300/50000; acc:  71.86; ppl:  2.59; xent: 0.95; lr: 0.00010; 10085/4211 tok/s;   6635 sec
[2021-04-25 02:59:19,379 INFO] Step 33350/50000; acc:  72.75; ppl:  2.53; xent: 0.93; lr: 0.00010; 10334/4387 tok/s;   6645 sec
[2021-04-25 02:59:29,970 INFO] Step 33400/50000; acc:  72.20; ppl:  2.57; xent: 0.94; lr: 0.00010; 9878/4064 tok/s;   6655 sec
[2021-04-25 02:59:39,300 INFO] Step 33450/50000; acc:  72.46; ppl:  2.54; xent: 0.93; lr: 0.00010; 10779/4541 tok/s;   6665 sec
[2021-04-25 02:59:48,828 INFO] Step 33500/50000; acc:  72.47; ppl:  2.55; xent: 0.93; lr: 0.00010; 10711/4469 tok/s;   6674 sec
[2021-04-25 02:59:55,902 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 02:59:58,754 INFO] Step 33550/50000; acc:  72.37; ppl:  2.55; xent: 0.93; lr: 0.00010; 10283/4326 tok/s;   6684 sec
[2021-04-25 03:00:08,555 INFO] Step 33600/50000; acc:  72.77; ppl:  2.52; xent: 0.93; lr: 0.00010; 10213/4350 tok/s;   6694 sec
[2021-04-25 03:00:19,226 INFO] Step 33650/50000; acc:  71.79; ppl:  2.61; xent: 0.96; lr: 0.00010; 9752/4067 tok/s;   6705 sec
[2021-04-25 03:00:29,086 INFO] Step 33700/50000; acc:  72.72; ppl:  2.52; xent: 0.92; lr: 0.00010; 9903/4388 tok/s;   6714 sec
[2021-04-25 03:00:39,003 INFO] Step 33750/50000; acc:  72.47; ppl:  2.55; xent: 0.94; lr: 0.00010; 10397/4306 tok/s;   6724 sec
[2021-04-25 03:00:48,701 INFO] Step 33800/50000; acc:  72.40; ppl:  2.56; xent: 0.94; lr: 0.00010; 10453/4397 tok/s;   6734 sec
[2021-04-25 03:00:58,578 INFO] Step 33850/50000; acc:  72.14; ppl:  2.58; xent: 0.95; lr: 0.00010; 10411/4384 tok/s;   6744 sec
[2021-04-25 03:01:08,511 INFO] Step 33900/50000; acc:  72.38; ppl:  2.54; xent: 0.93; lr: 0.00010; 10336/4172 tok/s;   6754 sec
[2021-04-25 03:01:18,699 INFO] Step 33950/50000; acc:  72.34; ppl:  2.55; xent: 0.93; lr: 0.00010; 9872/4246 tok/s;   6764 sec
[2021-04-25 03:01:29,018 INFO] Step 34000/50000; acc:  72.26; ppl:  2.56; xent: 0.94; lr: 0.00010; 10079/4194 tok/s;   6774 sec
[2021-04-25 03:01:38,487 INFO] Step 34050/50000; acc:  72.79; ppl:  2.53; xent: 0.93; lr: 0.00010; 10529/4425 tok/s;   6784 sec
[2021-04-25 03:01:48,744 INFO] Step 34100/50000; acc:  72.28; ppl:  2.56; xent: 0.94; lr: 0.00010; 10064/4151 tok/s;   6794 sec
[2021-04-25 03:01:58,591 INFO] Step 34150/50000; acc:  72.91; ppl:  2.50; xent: 0.92; lr: 0.00010; 10067/4316 tok/s;   6804 sec
[2021-04-25 03:02:08,163 INFO] Step 34200/50000; acc:  72.48; ppl:  2.54; xent: 0.93; lr: 0.00010; 10830/4483 tok/s;   6814 sec
[2021-04-25 03:02:17,919 INFO] Step 34250/50000; acc:  72.76; ppl:  2.52; xent: 0.93; lr: 0.00010; 10294/4379 tok/s;   6823 sec
[2021-04-25 03:02:22,149 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 03:02:27,609 INFO] Step 34300/50000; acc:  72.56; ppl:  2.55; xent: 0.94; lr: 0.00010; 10621/4426 tok/s;   6833 sec
[2021-04-25 03:02:38,041 INFO] Step 34350/50000; acc:  72.18; ppl:  2.55; xent: 0.94; lr: 0.00010; 9811/4089 tok/s;   6843 sec
[2021-04-25 03:02:48,345 INFO] Step 34400/50000; acc:  72.36; ppl:  2.53; xent: 0.93; lr: 0.00010; 9647/4179 tok/s;   6854 sec
[2021-04-25 03:02:58,305 INFO] Step 34450/50000; acc:  72.53; ppl:  2.56; xent: 0.94; lr: 0.00010; 10407/4339 tok/s;   6864 sec
[2021-04-25 03:03:07,990 INFO] Step 34500/50000; acc:  72.62; ppl:  2.52; xent: 0.92; lr: 0.00010; 10112/4418 tok/s;   6873 sec
[2021-04-25 03:03:17,521 INFO] Step 34550/50000; acc:  72.53; ppl:  2.55; xent: 0.94; lr: 0.00010; 10789/4484 tok/s;   6883 sec
[2021-04-25 03:03:27,367 INFO] Step 34600/50000; acc:  72.10; ppl:  2.55; xent: 0.94; lr: 0.00010; 10470/4268 tok/s;   6893 sec
[2021-04-25 03:03:37,769 INFO] Step 34650/50000; acc:  72.44; ppl:  2.55; xent: 0.94; lr: 0.00010; 9890/4089 tok/s;   6903 sec
[2021-04-25 03:03:47,504 INFO] Step 34700/50000; acc:  72.36; ppl:  2.55; xent: 0.94; lr: 0.00010; 10447/4411 tok/s;   6913 sec
[2021-04-25 03:03:57,217 INFO] Step 34750/50000; acc:  72.97; ppl:  2.50; xent: 0.91; lr: 0.00010; 10328/4390 tok/s;   6923 sec
[2021-04-25 03:04:07,066 INFO] Step 34800/50000; acc:  72.21; ppl:  2.57; xent: 0.94; lr: 0.00010; 10582/4320 tok/s;   6932 sec
[2021-04-25 03:04:17,224 INFO] Step 34850/50000; acc:  73.00; ppl:  2.50; xent: 0.92; lr: 0.00010; 9812/4233 tok/s;   6943 sec
[2021-04-25 03:04:27,136 INFO] Step 34900/50000; acc:  72.75; ppl:  2.51; xent: 0.92; lr: 0.00010; 10418/4218 tok/s;   6953 sec
[2021-04-25 03:04:36,268 INFO] Step 34950/50000; acc:  72.86; ppl:  2.51; xent: 0.92; lr: 0.00010; 10783/4681 tok/s;   6962 sec
[2021-04-25 03:04:46,283 INFO] Step 35000/50000; acc:  72.43; ppl:  2.55; xent: 0.94; lr: 0.00010; 10397/4320 tok/s;   6972 sec
[2021-04-25 03:04:46,286 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/strict/valid.txt, align=None)...
[2021-04-25 03:04:54,272 INFO] Validation perplexity: 2.82888
[2021-04-25 03:04:54,272 INFO] Validation accuracy: 70.7992
[2021-04-25 03:04:54,274 INFO] Saving checkpoint ../models/group1_params/strict_ops/model_step_35000.pt
[2021-04-25 03:04:56,390 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 03:05:04,675 INFO] Step 35050/50000; acc:  72.72; ppl:  2.50; xent: 0.92; lr: 0.00010; 5494/2337 tok/s;   6990 sec
[2021-04-25 03:05:14,739 INFO] Step 35100/50000; acc:  72.30; ppl:  2.55; xent: 0.94; lr: 0.00010; 10179/4220 tok/s;   7000 sec
[2021-04-25 03:05:25,070 INFO] Step 35150/50000; acc:  72.58; ppl:  2.53; xent: 0.93; lr: 0.00010; 9787/4221 tok/s;   7010 sec
[2021-04-25 03:05:34,622 INFO] Step 35200/50000; acc:  72.72; ppl:  2.52; xent: 0.92; lr: 0.00010; 10461/4463 tok/s;   7020 sec
[2021-04-25 03:05:44,412 INFO] Step 35250/50000; acc:  72.29; ppl:  2.56; xent: 0.94; lr: 0.00010; 10596/4323 tok/s;   7030 sec
[2021-04-25 03:05:53,991 INFO] Step 35300/50000; acc:  73.16; ppl:  2.48; xent: 0.91; lr: 0.00010; 10250/4491 tok/s;   7039 sec
[2021-04-25 03:06:04,119 INFO] Step 35350/50000; acc:  72.46; ppl:  2.56; xent: 0.94; lr: 0.00010; 10385/4189 tok/s;   7049 sec
[2021-04-25 03:06:14,348 INFO] Step 35400/50000; acc:  72.55; ppl:  2.52; xent: 0.92; lr: 0.00010; 9934/4166 tok/s;   7060 sec
[2021-04-25 03:06:24,366 INFO] Step 35450/50000; acc:  72.47; ppl:  2.54; xent: 0.93; lr: 0.00010; 10215/4272 tok/s;   7070 sec
[2021-04-25 03:06:34,226 INFO] Step 35500/50000; acc:  72.60; ppl:  2.53; xent: 0.93; lr: 0.00010; 10343/4356 tok/s;   7080 sec
[2021-04-25 03:06:43,914 INFO] Step 35550/50000; acc:  72.78; ppl:  2.51; xent: 0.92; lr: 0.00010; 10290/4382 tok/s;   7089 sec
[2021-04-25 03:06:54,335 INFO] Step 35600/50000; acc:  72.79; ppl:  2.50; xent: 0.92; lr: 0.00010; 10104/4123 tok/s;   7100 sec
[2021-04-25 03:07:03,403 INFO] Step 35650/50000; acc:  73.30; ppl:  2.47; xent: 0.90; lr: 0.00010; 10918/4568 tok/s;   7109 sec
[2021-04-25 03:07:13,153 INFO] Step 35700/50000; acc:  72.57; ppl:  2.54; xent: 0.93; lr: 0.00010; 10528/4420 tok/s;   7119 sec
[2021-04-25 03:07:15,443 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 03:07:23,106 INFO] Step 35750/50000; acc:  73.03; ppl:  2.49; xent: 0.91; lr: 0.00010; 9979/4365 tok/s;   7128 sec
[2021-04-25 03:07:32,989 INFO] Step 35800/50000; acc:  72.77; ppl:  2.52; xent: 0.92; lr: 0.00010; 10541/4300 tok/s;   7138 sec
[2021-04-25 03:07:42,903 INFO] Step 35850/50000; acc:  72.51; ppl:  2.53; xent: 0.93; lr: 0.00010; 10081/4337 tok/s;   7148 sec
[2021-04-25 03:07:53,122 INFO] Step 35900/50000; acc:  72.59; ppl:  2.52; xent: 0.93; lr: 0.00010; 10042/4253 tok/s;   7158 sec
[2021-04-25 03:08:02,394 INFO] Step 35950/50000; acc:  72.81; ppl:  2.52; xent: 0.92; lr: 0.00010; 10950/4601 tok/s;   7168 sec
[2021-04-25 03:08:12,091 INFO] Step 36000/50000; acc:  73.02; ppl:  2.50; xent: 0.92; lr: 0.00010; 10225/4380 tok/s;   7177 sec
[2021-04-25 03:08:22,002 INFO] Step 36050/50000; acc:  72.54; ppl:  2.54; xent: 0.93; lr: 0.00010; 10620/4315 tok/s;   7187 sec
[2021-04-25 03:08:32,251 INFO] Step 36100/50000; acc:  73.01; ppl:  2.48; xent: 0.91; lr: 0.00010; 9718/4135 tok/s;   7198 sec
[2021-04-25 03:08:42,268 INFO] Step 36150/50000; acc:  72.58; ppl:  2.54; xent: 0.93; lr: 0.00010; 10299/4225 tok/s;   7208 sec
[2021-04-25 03:08:52,061 INFO] Step 36200/50000; acc:  73.01; ppl:  2.49; xent: 0.91; lr: 0.00010; 10402/4388 tok/s;   7217 sec
[2021-04-25 03:09:01,776 INFO] Step 36250/50000; acc:  72.63; ppl:  2.51; xent: 0.92; lr: 0.00010; 10520/4414 tok/s;   7227 sec
[2021-04-25 03:09:12,108 INFO] Step 36300/50000; acc:  72.71; ppl:  2.52; xent: 0.92; lr: 0.00010; 9863/4151 tok/s;   7237 sec
[2021-04-25 03:09:21,862 INFO] Step 36350/50000; acc:  73.35; ppl:  2.45; xent: 0.90; lr: 0.00010; 10337/4328 tok/s;   7247 sec
[2021-04-25 03:09:31,223 INFO] Step 36400/50000; acc:  72.65; ppl:  2.52; xent: 0.93; lr: 0.00010; 11047/4557 tok/s;   7257 sec
[2021-04-25 03:09:40,701 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 03:09:41,179 INFO] Step 36450/50000; acc:  73.05; ppl:  2.48; xent: 0.91; lr: 0.00010; 9997/4262 tok/s;   7267 sec
[2021-04-25 03:09:51,115 INFO] Step 36500/50000; acc:  72.90; ppl:  2.49; xent: 0.91; lr: 0.00010; 10366/4366 tok/s;   7276 sec
[2021-04-25 03:10:00,803 INFO] Step 36550/50000; acc:  72.90; ppl:  2.50; xent: 0.92; lr: 0.00010; 10256/4300 tok/s;   7286 sec
[2021-04-25 03:10:11,222 INFO] Step 36600/50000; acc:  72.65; ppl:  2.52; xent: 0.92; lr: 0.00010; 9875/4220 tok/s;   7297 sec
[2021-04-25 03:10:21,093 INFO] Step 36650/50000; acc:  72.70; ppl:  2.51; xent: 0.92; lr: 0.00010; 10209/4328 tok/s;   7306 sec
[2021-04-25 03:10:30,820 INFO] Step 36700/50000; acc:  72.64; ppl:  2.52; xent: 0.92; lr: 0.00010; 10550/4445 tok/s;   7316 sec
[2021-04-25 03:10:40,593 INFO] Step 36750/50000; acc:  72.85; ppl:  2.51; xent: 0.92; lr: 0.00010; 10378/4400 tok/s;   7326 sec
[2021-04-25 03:10:50,320 INFO] Step 36800/50000; acc:  73.22; ppl:  2.47; xent: 0.90; lr: 0.00010; 10406/4240 tok/s;   7336 sec
[2021-04-25 03:11:01,340 INFO] Step 36850/50000; acc:  72.62; ppl:  2.53; xent: 0.93; lr: 0.00010; 9537/3968 tok/s;   7347 sec
[2021-04-25 03:11:10,224 INFO] Step 36900/50000; acc:  73.31; ppl:  2.45; xent: 0.90; lr: 0.00010; 10964/4721 tok/s;   7356 sec
[2021-04-25 03:11:20,515 INFO] Step 36950/50000; acc:  72.55; ppl:  2.53; xent: 0.93; lr: 0.00010; 10107/4206 tok/s;   7366 sec
[2021-04-25 03:11:30,384 INFO] Step 37000/50000; acc:  72.80; ppl:  2.50; xent: 0.92; lr: 0.00010; 10284/4292 tok/s;   7376 sec
[2021-04-25 03:11:40,776 INFO] Step 37050/50000; acc:  73.26; ppl:  2.46; xent: 0.90; lr: 0.00010; 9889/4102 tok/s;   7386 sec
[2021-04-25 03:11:50,047 INFO] Step 37100/50000; acc:  73.01; ppl:  2.48; xent: 0.91; lr: 0.00010; 10944/4577 tok/s;   7395 sec
[2021-04-25 03:11:59,363 INFO] Step 37150/50000; acc:  73.41; ppl:  2.45; xent: 0.90; lr: 0.00010; 10681/4526 tok/s;   7405 sec
[2021-04-25 03:12:06,322 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 03:12:09,585 INFO] Step 37200/50000; acc:  72.74; ppl:  2.52; xent: 0.92; lr: 0.00010; 10257/4257 tok/s;   7415 sec
[2021-04-25 03:12:19,518 INFO] Step 37250/50000; acc:  73.11; ppl:  2.46; xent: 0.90; lr: 0.00010; 10039/4277 tok/s;   7425 sec
[2021-04-25 03:12:30,071 INFO] Step 37300/50000; acc:  72.43; ppl:  2.52; xent: 0.93; lr: 0.00010; 9724/4141 tok/s;   7435 sec
[2021-04-25 03:12:39,786 INFO] Step 37350/50000; acc:  73.58; ppl:  2.45; xent: 0.89; lr: 0.00010; 10127/4411 tok/s;   7445 sec
[2021-04-25 03:12:49,584 INFO] Step 37400/50000; acc:  72.72; ppl:  2.50; xent: 0.92; lr: 0.00010; 10593/4369 tok/s;   7455 sec
[2021-04-25 03:12:59,394 INFO] Step 37450/50000; acc:  73.06; ppl:  2.49; xent: 0.91; lr: 0.00010; 10225/4363 tok/s;   7465 sec
[2021-04-25 03:13:09,214 INFO] Step 37500/50000; acc:  72.86; ppl:  2.51; xent: 0.92; lr: 0.00010; 10553/4382 tok/s;   7475 sec
[2021-04-25 03:13:19,517 INFO] Step 37550/50000; acc:  73.15; ppl:  2.48; xent: 0.91; lr: 0.00010; 9984/4029 tok/s;   7485 sec
[2021-04-25 03:13:29,355 INFO] Step 37600/50000; acc:  73.21; ppl:  2.48; xent: 0.91; lr: 0.00010; 10192/4379 tok/s;   7495 sec
[2021-04-25 03:13:39,679 INFO] Step 37650/50000; acc:  72.94; ppl:  2.50; xent: 0.92; lr: 0.00010; 10064/4188 tok/s;   7505 sec
[2021-04-25 03:13:48,935 INFO] Step 37700/50000; acc:  73.31; ppl:  2.45; xent: 0.90; lr: 0.00010; 10622/4527 tok/s;   7514 sec
[2021-04-25 03:13:59,196 INFO] Step 37750/50000; acc:  73.12; ppl:  2.49; xent: 0.91; lr: 0.00010; 10095/4152 tok/s;   7525 sec
[2021-04-25 03:14:09,127 INFO] Step 37800/50000; acc:  73.18; ppl:  2.46; xent: 0.90; lr: 0.00010; 10263/4276 tok/s;   7534 sec
[2021-04-25 03:14:18,588 INFO] Step 37850/50000; acc:  73.07; ppl:  2.47; xent: 0.90; lr: 0.00010; 10768/4540 tok/s;   7544 sec
[2021-04-25 03:14:28,331 INFO] Step 37900/50000; acc:  73.22; ppl:  2.46; xent: 0.90; lr: 0.00010; 10385/4408 tok/s;   7554 sec
[2021-04-25 03:14:32,181 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 03:14:37,829 INFO] Step 37950/50000; acc:  73.12; ppl:  2.47; xent: 0.90; lr: 0.00010; 10580/4475 tok/s;   7563 sec
[2021-04-25 03:14:48,432 INFO] Step 38000/50000; acc:  72.61; ppl:  2.52; xent: 0.93; lr: 0.00010; 9871/4057 tok/s;   7574 sec
[2021-04-25 03:14:58,765 INFO] Step 38050/50000; acc:  73.15; ppl:  2.46; xent: 0.90; lr: 0.00010; 9581/4178 tok/s;   7584 sec
[2021-04-25 03:15:08,520 INFO] Step 38100/50000; acc:  73.13; ppl:  2.48; xent: 0.91; lr: 0.00010; 10507/4412 tok/s;   7594 sec
[2021-04-25 03:15:18,094 INFO] Step 38150/50000; acc:  73.38; ppl:  2.47; xent: 0.90; lr: 0.00010; 10305/4439 tok/s;   7603 sec
[2021-04-25 03:15:27,889 INFO] Step 38200/50000; acc:  72.98; ppl:  2.50; xent: 0.92; lr: 0.00010; 10560/4390 tok/s;   7613 sec
[2021-04-25 03:15:37,282 INFO] Step 38250/50000; acc:  73.12; ppl:  2.45; xent: 0.90; lr: 0.00010; 10891/4495 tok/s;   7623 sec
[2021-04-25 03:15:47,882 INFO] Step 38300/50000; acc:  73.05; ppl:  2.49; xent: 0.91; lr: 0.00010; 9760/4009 tok/s;   7633 sec
[2021-04-25 03:15:57,722 INFO] Step 38350/50000; acc:  72.88; ppl:  2.50; xent: 0.91; lr: 0.00010; 10339/4341 tok/s;   7643 sec
[2021-04-25 03:16:07,280 INFO] Step 38400/50000; acc:  73.22; ppl:  2.45; xent: 0.90; lr: 0.00010; 10492/4449 tok/s;   7653 sec
[2021-04-25 03:16:17,338 INFO] Step 38450/50000; acc:  72.94; ppl:  2.50; xent: 0.91; lr: 0.00010; 10337/4271 tok/s;   7663 sec
[2021-04-25 03:16:27,301 INFO] Step 38500/50000; acc:  73.61; ppl:  2.43; xent: 0.89; lr: 0.00010; 9883/4253 tok/s;   7673 sec
[2021-04-25 03:16:37,144 INFO] Step 38550/50000; acc:  73.29; ppl:  2.45; xent: 0.90; lr: 0.00010; 10508/4293 tok/s;   7683 sec
[2021-04-25 03:16:46,644 INFO] Step 38600/50000; acc:  73.24; ppl:  2.48; xent: 0.91; lr: 0.00010; 10666/4511 tok/s;   7692 sec
[2021-04-25 03:16:56,634 INFO] Step 38650/50000; acc:  73.15; ppl:  2.47; xent: 0.91; lr: 0.00010; 10248/4292 tok/s;   7702 sec
[2021-04-25 03:16:57,777 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 03:17:06,573 INFO] Step 38700/50000; acc:  73.38; ppl:  2.45; xent: 0.89; lr: 0.00010; 10260/4357 tok/s;   7712 sec
[2021-04-25 03:17:16,314 INFO] Step 38750/50000; acc:  73.43; ppl:  2.45; xent: 0.90; lr: 0.00010; 10233/4322 tok/s;   7722 sec
[2021-04-25 03:17:26,750 INFO] Step 38800/50000; acc:  72.92; ppl:  2.49; xent: 0.91; lr: 0.00010; 9949/4187 tok/s;   7732 sec
[2021-04-25 03:17:36,323 INFO] Step 38850/50000; acc:  73.39; ppl:  2.46; xent: 0.90; lr: 0.00010; 10374/4478 tok/s;   7742 sec
[2021-04-25 03:17:46,132 INFO] Step 38900/50000; acc:  73.09; ppl:  2.47; xent: 0.90; lr: 0.00010; 10455/4309 tok/s;   7752 sec
[2021-04-25 03:17:55,627 INFO] Step 38950/50000; acc:  73.75; ppl:  2.43; xent: 0.89; lr: 0.00010; 10427/4554 tok/s;   7761 sec
[2021-04-25 03:18:05,526 INFO] Step 39000/50000; acc:  72.95; ppl:  2.49; xent: 0.91; lr: 0.00010; 10684/4265 tok/s;   7771 sec
[2021-04-25 03:18:15,878 INFO] Step 39050/50000; acc:  73.17; ppl:  2.47; xent: 0.90; lr: 0.00010; 9732/4170 tok/s;   7781 sec
[2021-04-25 03:18:25,672 INFO] Step 39100/50000; acc:  73.10; ppl:  2.47; xent: 0.90; lr: 0.00010; 10498/4311 tok/s;   7791 sec
[2021-04-25 03:18:35,481 INFO] Step 39150/50000; acc:  73.14; ppl:  2.46; xent: 0.90; lr: 0.00010; 10403/4368 tok/s;   7801 sec
[2021-04-25 03:18:45,227 INFO] Step 39200/50000; acc:  73.36; ppl:  2.44; xent: 0.89; lr: 0.00010; 10218/4387 tok/s;   7811 sec
[2021-04-25 03:18:55,660 INFO] Step 39250/50000; acc:  73.47; ppl:  2.43; xent: 0.89; lr: 0.00010; 10081/4081 tok/s;   7821 sec
[2021-04-25 03:19:04,729 INFO] Step 39300/50000; acc:  74.04; ppl:  2.40; xent: 0.87; lr: 0.00010; 10784/4567 tok/s;   7830 sec
[2021-04-25 03:19:14,802 INFO] Step 39350/50000; acc:  72.91; ppl:  2.48; xent: 0.91; lr: 0.00010; 10215/4327 tok/s;   7840 sec
[2021-04-25 03:19:16,603 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 03:19:24,645 INFO] Step 39400/50000; acc:  73.36; ppl:  2.44; xent: 0.89; lr: 0.00010; 10366/4400 tok/s;   7850 sec
[2021-04-25 03:19:34,403 INFO] Step 39450/50000; acc:  73.39; ppl:  2.45; xent: 0.90; lr: 0.00010; 10502/4350 tok/s;   7860 sec
[2021-04-25 03:19:44,438 INFO] Step 39500/50000; acc:  73.10; ppl:  2.47; xent: 0.90; lr: 0.00010; 10030/4264 tok/s;   7870 sec
[2021-04-25 03:19:54,264 INFO] Step 39550/50000; acc:  73.34; ppl:  2.44; xent: 0.89; lr: 0.00010; 10187/4430 tok/s;   7880 sec
[2021-04-25 03:20:03,852 INFO] Step 39600/50000; acc:  72.95; ppl:  2.48; xent: 0.91; lr: 0.00010; 10869/4437 tok/s;   7889 sec
[2021-04-25 03:20:13,298 INFO] Step 39650/50000; acc:  73.40; ppl:  2.44; xent: 0.89; lr: 0.00010; 10449/4524 tok/s;   7899 sec
[2021-04-25 03:20:23,171 INFO] Step 39700/50000; acc:  73.31; ppl:  2.45; xent: 0.90; lr: 0.00010; 10541/4316 tok/s;   7909 sec
[2021-04-25 03:20:33,439 INFO] Step 39750/50000; acc:  73.65; ppl:  2.42; xent: 0.88; lr: 0.00010; 9742/4132 tok/s;   7919 sec
[2021-04-25 03:20:43,665 INFO] Step 39800/50000; acc:  73.10; ppl:  2.48; xent: 0.91; lr: 0.00010; 10167/4181 tok/s;   7929 sec
[2021-04-25 03:20:53,375 INFO] Step 39850/50000; acc:  73.77; ppl:  2.43; xent: 0.89; lr: 0.00010; 10379/4379 tok/s;   7939 sec
[2021-04-25 03:21:03,320 INFO] Step 39900/50000; acc:  73.17; ppl:  2.46; xent: 0.90; lr: 0.00010; 10354/4325 tok/s;   7949 sec
[2021-04-25 03:21:13,532 INFO] Step 39950/50000; acc:  73.53; ppl:  2.44; xent: 0.89; lr: 0.00010; 9993/4200 tok/s;   7959 sec
[2021-04-25 03:21:23,209 INFO] Step 40000/50000; acc:  73.88; ppl:  2.40; xent: 0.87; lr: 0.00010; 10411/4333 tok/s;   7969 sec
[2021-04-25 03:21:23,213 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/strict/valid.txt, align=None)...
[2021-04-25 03:21:31,180 INFO] Validation perplexity: 2.82104
[2021-04-25 03:21:31,180 INFO] Validation accuracy: 70.9775
[2021-04-25 03:21:31,182 INFO] Saving checkpoint ../models/group1_params/strict_ops/model_step_40000.pt
[2021-04-25 03:21:41,247 INFO] Step 40050/50000; acc:  73.16; ppl:  2.47; xent: 0.90; lr: 0.00010; 5718/2388 tok/s;   7987 sec
[2021-04-25 03:21:50,242 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 03:21:51,009 INFO] Step 40100/50000; acc:  73.92; ppl:  2.40; xent: 0.88; lr: 0.00010; 10072/4313 tok/s;   7996 sec
[2021-04-25 03:22:00,942 INFO] Step 40150/50000; acc:  73.18; ppl:  2.43; xent: 0.89; lr: 0.00010; 10394/4358 tok/s;   8006 sec
[2021-04-25 03:22:10,997 INFO] Step 40200/50000; acc:  73.11; ppl:  2.49; xent: 0.91; lr: 0.00010; 10147/4207 tok/s;   8016 sec
[2021-04-25 03:22:21,179 INFO] Step 40250/50000; acc:  73.42; ppl:  2.43; xent: 0.89; lr: 0.00010; 9946/4304 tok/s;   8027 sec
[2021-04-25 03:22:30,997 INFO] Step 40300/50000; acc:  73.36; ppl:  2.44; xent: 0.89; lr: 0.00010; 10341/4355 tok/s;   8036 sec
[2021-04-25 03:22:40,449 INFO] Step 40350/50000; acc:  73.50; ppl:  2.43; xent: 0.89; lr: 0.00010; 10569/4498 tok/s;   8046 sec
[2021-04-25 03:22:50,314 INFO] Step 40400/50000; acc:  72.99; ppl:  2.47; xent: 0.90; lr: 0.00010; 10583/4396 tok/s;   8056 sec
[2021-04-25 03:23:00,063 INFO] Step 40450/50000; acc:  73.92; ppl:  2.41; xent: 0.88; lr: 0.00010; 10319/4228 tok/s;   8065 sec
[2021-04-25 03:23:10,878 INFO] Step 40500/50000; acc:  73.20; ppl:  2.46; xent: 0.90; lr: 0.00010; 9601/4048 tok/s;   8076 sec
[2021-04-25 03:23:19,986 INFO] Step 40550/50000; acc:  73.81; ppl:  2.40; xent: 0.88; lr: 0.00010; 10777/4616 tok/s;   8085 sec
[2021-04-25 03:23:30,232 INFO] Step 40600/50000; acc:  73.10; ppl:  2.47; xent: 0.90; lr: 0.00010; 10205/4232 tok/s;   8096 sec
[2021-04-25 03:23:39,962 INFO] Step 40650/50000; acc:  73.71; ppl:  2.42; xent: 0.88; lr: 0.00010; 10351/4310 tok/s;   8105 sec
[2021-04-25 03:23:50,431 INFO] Step 40700/50000; acc:  73.50; ppl:  2.41; xent: 0.88; lr: 0.00010; 9864/4106 tok/s;   8116 sec
[2021-04-25 03:23:59,800 INFO] Step 40750/50000; acc:  73.57; ppl:  2.42; xent: 0.89; lr: 0.00010; 10839/4523 tok/s;   8125 sec
[2021-04-25 03:24:09,321 INFO] Step 40800/50000; acc:  73.78; ppl:  2.40; xent: 0.88; lr: 0.00010; 10427/4455 tok/s;   8135 sec
[2021-04-25 03:24:15,762 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 03:24:19,537 INFO] Step 40850/50000; acc:  73.53; ppl:  2.44; xent: 0.89; lr: 0.00010; 10249/4229 tok/s;   8145 sec
[2021-04-25 03:24:29,247 INFO] Step 40900/50000; acc:  73.98; ppl:  2.40; xent: 0.87; lr: 0.00010; 10160/4381 tok/s;   8155 sec
[2021-04-25 03:24:39,726 INFO] Step 40950/50000; acc:  72.89; ppl:  2.46; xent: 0.90; lr: 0.00010; 9803/4142 tok/s;   8165 sec
[2021-04-25 03:24:49,607 INFO] Step 41000/50000; acc:  73.86; ppl:  2.41; xent: 0.88; lr: 0.00010; 10236/4310 tok/s;   8175 sec
[2021-04-25 03:24:59,479 INFO] Step 41050/50000; acc:  73.59; ppl:  2.42; xent: 0.88; lr: 0.00010; 10348/4382 tok/s;   8185 sec
[2021-04-25 03:25:09,284 INFO] Step 41100/50000; acc:  73.58; ppl:  2.44; xent: 0.89; lr: 0.00010; 10314/4350 tok/s;   8195 sec
[2021-04-25 03:25:18,839 INFO] Step 41150/50000; acc:  73.76; ppl:  2.42; xent: 0.88; lr: 0.00010; 10552/4474 tok/s;   8204 sec
[2021-04-25 03:25:29,324 INFO] Step 41200/50000; acc:  73.31; ppl:  2.45; xent: 0.90; lr: 0.00010; 10085/4019 tok/s;   8215 sec
[2021-04-25 03:25:39,137 INFO] Step 41250/50000; acc:  73.66; ppl:  2.41; xent: 0.88; lr: 0.00010; 10147/4365 tok/s;   8225 sec
[2021-04-25 03:25:49,514 INFO] Step 41300/50000; acc:  73.63; ppl:  2.42; xent: 0.88; lr: 0.00010; 9911/4164 tok/s;   8235 sec
[2021-04-25 03:25:58,823 INFO] Step 41350/50000; acc:  73.86; ppl:  2.40; xent: 0.87; lr: 0.00010; 10636/4517 tok/s;   8244 sec
[2021-04-25 03:26:09,201 INFO] Step 41400/50000; acc:  73.55; ppl:  2.44; xent: 0.89; lr: 0.00010; 10048/4125 tok/s;   8255 sec
[2021-04-25 03:26:19,211 INFO] Step 41450/50000; acc:  73.82; ppl:  2.39; xent: 0.87; lr: 0.00010; 10083/4231 tok/s;   8265 sec
[2021-04-25 03:26:28,685 INFO] Step 41500/50000; acc:  73.66; ppl:  2.41; xent: 0.88; lr: 0.00010; 10801/4513 tok/s;   8274 sec
[2021-04-25 03:26:38,398 INFO] Step 41550/50000; acc:  74.12; ppl:  2.39; xent: 0.87; lr: 0.00010; 10446/4412 tok/s;   8284 sec
[2021-04-25 03:26:41,923 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 03:26:48,052 INFO] Step 41600/50000; acc:  73.92; ppl:  2.41; xent: 0.88; lr: 0.00010; 10369/4438 tok/s;   8293 sec
[2021-04-25 03:26:58,347 INFO] Step 41650/50000; acc:  73.22; ppl:  2.44; xent: 0.89; lr: 0.00010; 10166/4156 tok/s;   8304 sec
[2021-04-25 03:27:08,607 INFO] Step 41700/50000; acc:  73.94; ppl:  2.39; xent: 0.87; lr: 0.00010; 9523/4211 tok/s;   8314 sec
[2021-04-25 03:27:18,293 INFO] Step 41750/50000; acc:  73.96; ppl:  2.42; xent: 0.88; lr: 0.00010; 10618/4438 tok/s;   8324 sec
[2021-04-25 03:27:28,156 INFO] Step 41800/50000; acc:  73.70; ppl:  2.42; xent: 0.88; lr: 0.00010; 10292/4296 tok/s;   8334 sec
[2021-04-25 03:27:37,904 INFO] Step 41850/50000; acc:  73.67; ppl:  2.43; xent: 0.89; lr: 0.00010; 10429/4462 tok/s;   8343 sec
[2021-04-25 03:27:47,486 INFO] Step 41900/50000; acc:  73.84; ppl:  2.40; xent: 0.88; lr: 0.00010; 10772/4364 tok/s;   8353 sec
[2021-04-25 03:27:57,667 INFO] Step 41950/50000; acc:  73.85; ppl:  2.40; xent: 0.88; lr: 0.00010; 9858/4173 tok/s;   8363 sec
[2021-04-25 03:28:07,628 INFO] Step 42000/50000; acc:  73.25; ppl:  2.46; xent: 0.90; lr: 0.00010; 10516/4299 tok/s;   8373 sec
[2021-04-25 03:28:17,182 INFO] Step 42050/50000; acc:  74.11; ppl:  2.39; xent: 0.87; lr: 0.00010; 10434/4456 tok/s;   8383 sec
[2021-04-25 03:28:26,981 INFO] Step 42100/50000; acc:  73.79; ppl:  2.42; xent: 0.88; lr: 0.00010; 10481/4364 tok/s;   8392 sec
[2021-04-25 03:28:37,261 INFO] Step 42150/50000; acc:  74.26; ppl:  2.36; xent: 0.86; lr: 0.00010; 9665/4157 tok/s;   8403 sec
[2021-04-25 03:28:47,004 INFO] Step 42200/50000; acc:  73.87; ppl:  2.40; xent: 0.88; lr: 0.00010; 10674/4321 tok/s;   8412 sec
[2021-04-25 03:28:56,610 INFO] Step 42250/50000; acc:  73.93; ppl:  2.39; xent: 0.87; lr: 0.00010; 10451/4480 tok/s;   8422 sec
[2021-04-25 03:29:00,939 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 03:29:06,533 INFO] Step 42300/50000; acc:  73.90; ppl:  2.42; xent: 0.88; lr: 0.00010; 10388/4342 tok/s;   8432 sec
[2021-04-25 03:29:16,465 INFO] Step 42350/50000; acc:  73.70; ppl:  2.39; xent: 0.87; lr: 0.00010; 10267/4277 tok/s;   8442 sec
[2021-04-25 03:29:26,169 INFO] Step 42400/50000; acc:  74.02; ppl:  2.40; xent: 0.88; lr: 0.00010; 10246/4391 tok/s;   8452 sec
[2021-04-25 03:29:36,491 INFO] Step 42450/50000; acc:  73.68; ppl:  2.42; xent: 0.88; lr: 0.00010; 10051/4228 tok/s;   8462 sec
[2021-04-25 03:29:45,950 INFO] Step 42500/50000; acc:  74.03; ppl:  2.38; xent: 0.87; lr: 0.00010; 10366/4513 tok/s;   8471 sec
[2021-04-25 03:29:55,555 INFO] Step 42550/50000; acc:  73.87; ppl:  2.41; xent: 0.88; lr: 0.00010; 10704/4414 tok/s;   8481 sec
[2021-04-25 03:30:05,357 INFO] Step 42600/50000; acc:  73.88; ppl:  2.40; xent: 0.88; lr: 0.00010; 10384/4421 tok/s;   8491 sec
[2021-04-25 03:30:15,598 INFO] Step 42650/50000; acc:  73.62; ppl:  2.42; xent: 0.88; lr: 0.00010; 10180/4167 tok/s;   8501 sec
[2021-04-25 03:30:25,616 INFO] Step 42700/50000; acc:  73.78; ppl:  2.42; xent: 0.88; lr: 0.00010; 10126/4221 tok/s;   8511 sec
[2021-04-25 03:30:35,382 INFO] Step 42750/50000; acc:  74.09; ppl:  2.37; xent: 0.86; lr: 0.00010; 10247/4340 tok/s;   8521 sec
[2021-04-25 03:30:45,360 INFO] Step 42800/50000; acc:  73.34; ppl:  2.42; xent: 0.89; lr: 0.00010; 10498/4305 tok/s;   8531 sec
[2021-04-25 03:30:55,227 INFO] Step 42850/50000; acc:  73.97; ppl:  2.38; xent: 0.87; lr: 0.00010; 10074/4340 tok/s;   8541 sec
[2021-04-25 03:31:05,719 INFO] Step 42900/50000; acc:  74.14; ppl:  2.36; xent: 0.86; lr: 0.00010; 9883/4066 tok/s;   8551 sec
[2021-04-25 03:31:14,773 INFO] Step 42950/50000; acc:  74.28; ppl:  2.34; xent: 0.85; lr: 0.00010; 10872/4574 tok/s;   8560 sec
[2021-04-25 03:31:24,769 INFO] Step 43000/50000; acc:  73.72; ppl:  2.43; xent: 0.89; lr: 0.00010; 10363/4345 tok/s;   8570 sec
[2021-04-25 03:31:26,305 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 03:31:34,807 INFO] Step 43050/50000; acc:  74.14; ppl:  2.37; xent: 0.86; lr: 0.00010; 10061/4315 tok/s;   8580 sec
[2021-04-25 03:31:44,657 INFO] Step 43100/50000; acc:  73.84; ppl:  2.40; xent: 0.87; lr: 0.00010; 10468/4317 tok/s;   8590 sec
[2021-04-25 03:31:54,845 INFO] Step 43150/50000; acc:  73.93; ppl:  2.40; xent: 0.87; lr: 0.00010; 9897/4218 tok/s;   8600 sec
[2021-04-25 03:32:04,795 INFO] Step 43200/50000; acc:  74.00; ppl:  2.39; xent: 0.87; lr: 0.00010; 10041/4348 tok/s;   8610 sec
[2021-04-25 03:32:14,314 INFO] Step 43250/50000; acc:  73.72; ppl:  2.42; xent: 0.88; lr: 0.00010; 10934/4467 tok/s;   8620 sec
[2021-04-25 03:32:23,706 INFO] Step 43300/50000; acc:  74.22; ppl:  2.37; xent: 0.86; lr: 0.00010; 10368/4541 tok/s;   8629 sec
[2021-04-25 03:32:33,695 INFO] Step 43350/50000; acc:  73.57; ppl:  2.40; xent: 0.88; lr: 0.00010; 10475/4271 tok/s;   8639 sec
[2021-04-25 03:32:44,349 INFO] Step 43400/50000; acc:  74.19; ppl:  2.38; xent: 0.87; lr: 0.00010; 9625/3998 tok/s;   8650 sec
[2021-04-25 03:32:54,114 INFO] Step 43450/50000; acc:  73.96; ppl:  2.40; xent: 0.87; lr: 0.00010; 10480/4375 tok/s;   8659 sec
[2021-04-25 03:33:04,091 INFO] Step 43500/50000; acc:  73.92; ppl:  2.39; xent: 0.87; lr: 0.00010; 10189/4279 tok/s;   8669 sec
[2021-04-25 03:33:13,598 INFO] Step 43550/50000; acc:  74.23; ppl:  2.36; xent: 0.86; lr: 0.00010; 10520/4452 tok/s;   8679 sec
[2021-04-25 03:33:23,985 INFO] Step 43600/50000; acc:  73.83; ppl:  2.40; xent: 0.88; lr: 0.00010; 10108/4179 tok/s;   8689 sec
[2021-04-25 03:33:33,314 INFO] Step 43650/50000; acc:  74.44; ppl:  2.34; xent: 0.85; lr: 0.00010; 10723/4483 tok/s;   8699 sec
[2021-04-25 03:33:42,810 INFO] Step 43700/50000; acc:  74.13; ppl:  2.38; xent: 0.87; lr: 0.00010; 10745/4530 tok/s;   8708 sec
[2021-04-25 03:33:51,436 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 03:33:52,500 INFO] Step 43750/50000; acc:  74.49; ppl:  2.36; xent: 0.86; lr: 0.00010; 10220/4352 tok/s;   8718 sec
[2021-04-25 03:34:02,441 INFO] Step 43800/50000; acc:  74.09; ppl:  2.38; xent: 0.87; lr: 0.00010; 10450/4343 tok/s;   8728 sec
[2021-04-25 03:34:12,486 INFO] Step 43850/50000; acc:  73.87; ppl:  2.41; xent: 0.88; lr: 0.00010; 10057/4224 tok/s;   8738 sec
[2021-04-25 03:34:22,735 INFO] Step 43900/50000; acc:  73.81; ppl:  2.38; xent: 0.87; lr: 0.00010; 9936/4282 tok/s;   8748 sec
[2021-04-25 03:34:32,630 INFO] Step 43950/50000; acc:  73.95; ppl:  2.38; xent: 0.87; lr: 0.00010; 10278/4337 tok/s;   8758 sec
[2021-04-25 03:34:42,161 INFO] Step 44000/50000; acc:  74.22; ppl:  2.38; xent: 0.87; lr: 0.00010; 10467/4434 tok/s;   8768 sec
[2021-04-25 03:34:52,197 INFO] Step 44050/50000; acc:  73.69; ppl:  2.41; xent: 0.88; lr: 0.00010; 10393/4349 tok/s;   8778 sec
[2021-04-25 03:35:01,708 INFO] Step 44100/50000; acc:  74.63; ppl:  2.33; xent: 0.85; lr: 0.00010; 10442/4282 tok/s;   8787 sec
[2021-04-25 03:35:12,633 INFO] Step 44150/50000; acc:  74.01; ppl:  2.38; xent: 0.87; lr: 0.00010; 9518/4031 tok/s;   8798 sec
[2021-04-25 03:35:21,967 INFO] Step 44200/50000; acc:  74.03; ppl:  2.38; xent: 0.87; lr: 0.00010; 10840/4526 tok/s;   8807 sec
[2021-04-25 03:35:32,083 INFO] Step 44250/50000; acc:  73.83; ppl:  2.39; xent: 0.87; lr: 0.00010; 10162/4268 tok/s;   8817 sec
[2021-04-25 03:35:42,345 INFO] Step 44300/50000; acc:  74.18; ppl:  2.38; xent: 0.87; lr: 0.00010; 9890/4157 tok/s;   8828 sec
[2021-04-25 03:35:52,203 INFO] Step 44350/50000; acc:  74.59; ppl:  2.32; xent: 0.84; lr: 0.00010; 10199/4297 tok/s;   8838 sec
[2021-04-25 03:36:01,649 INFO] Step 44400/50000; acc:  73.87; ppl:  2.40; xent: 0.88; lr: 0.00010; 11037/4472 tok/s;   8847 sec
[2021-04-25 03:36:11,337 INFO] Step 44450/50000; acc:  74.53; ppl:  2.34; xent: 0.85; lr: 0.00010; 10203/4420 tok/s;   8857 sec
[2021-04-25 03:36:17,198 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 03:36:21,328 INFO] Step 44500/50000; acc:  74.06; ppl:  2.36; xent: 0.86; lr: 0.00010; 10344/4318 tok/s;   8867 sec
[2021-04-25 03:36:31,224 INFO] Step 44550/50000; acc:  74.38; ppl:  2.34; xent: 0.85; lr: 0.00010; 10060/4268 tok/s;   8877 sec
[2021-04-25 03:36:41,741 INFO] Step 44600/50000; acc:  73.68; ppl:  2.42; xent: 0.88; lr: 0.00010; 9812/4134 tok/s;   8887 sec
[2021-04-25 03:36:51,646 INFO] Step 44650/50000; acc:  74.21; ppl:  2.36; xent: 0.86; lr: 0.00010; 10126/4308 tok/s;   8897 sec
[2021-04-25 03:37:01,449 INFO] Step 44700/50000; acc:  74.22; ppl:  2.37; xent: 0.86; lr: 0.00010; 10481/4411 tok/s;   8907 sec
[2021-04-25 03:37:11,145 INFO] Step 44750/50000; acc:  74.38; ppl:  2.36; xent: 0.86; lr: 0.00010; 10419/4398 tok/s;   8917 sec
[2021-04-25 03:37:20,657 INFO] Step 44800/50000; acc:  74.34; ppl:  2.34; xent: 0.85; lr: 0.00010; 10616/4462 tok/s;   8926 sec
[2021-04-25 03:37:31,180 INFO] Step 44850/50000; acc:  73.85; ppl:  2.39; xent: 0.87; lr: 0.00010; 10022/4039 tok/s;   8937 sec
[2021-04-25 03:37:40,959 INFO] Step 44900/50000; acc:  74.61; ppl:  2.35; xent: 0.86; lr: 0.00010; 10048/4369 tok/s;   8946 sec
[2021-04-25 03:37:51,272 INFO] Step 44950/50000; acc:  74.48; ppl:  2.35; xent: 0.86; lr: 0.00010; 10007/4171 tok/s;   8957 sec
[2021-04-25 03:38:00,851 INFO] Step 45000/50000; acc:  74.10; ppl:  2.36; xent: 0.86; lr: 0.00010; 10628/4409 tok/s;   8966 sec
[2021-04-25 03:38:00,855 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/strict/valid.txt, align=None)...
[2021-04-25 03:38:08,824 INFO] Validation perplexity: 2.82649
[2021-04-25 03:38:08,824 INFO] Validation accuracy: 71.0275
[2021-04-25 03:38:08,827 INFO] Saving checkpoint ../models/group1_params/strict_ops/model_step_45000.pt
[2021-04-25 03:38:19,424 INFO] Step 45050/50000; acc:  74.21; ppl:  2.37; xent: 0.86; lr: 0.00010; 5508/2308 tok/s;   8985 sec
[2021-04-25 03:38:29,462 INFO] Step 45100/50000; acc:  74.58; ppl:  2.32; xent: 0.84; lr: 0.00010; 10156/4212 tok/s;   8995 sec
[2021-04-25 03:38:38,662 INFO] Step 45150/50000; acc:  74.59; ppl:  2.33; xent: 0.85; lr: 0.00010; 10828/4629 tok/s;   9004 sec
[2021-04-25 03:38:48,826 INFO] Step 45200/50000; acc:  74.21; ppl:  2.36; xent: 0.86; lr: 0.00010; 10265/4253 tok/s;   9014 sec
[2021-04-25 03:38:51,720 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 03:38:58,402 INFO] Step 45250/50000; acc:  74.42; ppl:  2.34; xent: 0.85; lr: 0.00010; 10402/4473 tok/s;   9024 sec
[2021-04-25 03:39:08,417 INFO] Step 45300/50000; acc:  73.93; ppl:  2.38; xent: 0.87; lr: 0.00010; 10307/4237 tok/s;   9034 sec
[2021-04-25 03:39:18,641 INFO] Step 45350/50000; acc:  74.49; ppl:  2.35; xent: 0.85; lr: 0.00010; 9625/4238 tok/s;   9044 sec
[2021-04-25 03:39:28,370 INFO] Step 45400/50000; acc:  74.42; ppl:  2.36; xent: 0.86; lr: 0.00010; 10629/4411 tok/s;   9054 sec
[2021-04-25 03:39:38,255 INFO] Step 45450/50000; acc:  74.30; ppl:  2.36; xent: 0.86; lr: 0.00010; 10181/4309 tok/s;   9064 sec
[2021-04-25 03:39:47,986 INFO] Step 45500/50000; acc:  74.25; ppl:  2.36; xent: 0.86; lr: 0.00010; 10515/4430 tok/s;   9073 sec
[2021-04-25 03:39:57,986 INFO] Step 45550/50000; acc:  74.49; ppl:  2.34; xent: 0.85; lr: 0.00010; 10347/4215 tok/s;   9083 sec
[2021-04-25 03:40:07,975 INFO] Step 45600/50000; acc:  74.47; ppl:  2.33; xent: 0.85; lr: 0.00010; 10027/4262 tok/s;   9093 sec
[2021-04-25 03:40:17,999 INFO] Step 45650/50000; acc:  73.93; ppl:  2.40; xent: 0.87; lr: 0.00010; 10427/4237 tok/s;   9103 sec
[2021-04-25 03:40:27,503 INFO] Step 45700/50000; acc:  74.93; ppl:  2.30; xent: 0.83; lr: 0.00010; 10361/4507 tok/s;   9113 sec
[2021-04-25 03:40:37,412 INFO] Step 45750/50000; acc:  74.12; ppl:  2.37; xent: 0.86; lr: 0.00010; 10389/4302 tok/s;   9123 sec
[2021-04-25 03:40:47,668 INFO] Step 45800/50000; acc:  74.74; ppl:  2.32; xent: 0.84; lr: 0.00010; 9959/4185 tok/s;   9133 sec
[2021-04-25 03:40:57,280 INFO] Step 45850/50000; acc:  74.48; ppl:  2.34; xent: 0.85; lr: 0.00010; 10638/4352 tok/s;   9143 sec
[2021-04-25 03:41:06,948 INFO] Step 45900/50000; acc:  74.55; ppl:  2.33; xent: 0.85; lr: 0.00010; 10459/4479 tok/s;   9152 sec
[2021-04-25 03:41:10,814 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 03:41:16,818 INFO] Step 45950/50000; acc:  74.49; ppl:  2.33; xent: 0.85; lr: 0.00010; 10183/4336 tok/s;   9162 sec
[2021-04-25 03:41:27,130 INFO] Step 46000/50000; acc:  74.09; ppl:  2.35; xent: 0.86; lr: 0.00010; 10146/4155 tok/s;   9173 sec
[2021-04-25 03:41:36,984 INFO] Step 46050/50000; acc:  74.48; ppl:  2.33; xent: 0.85; lr: 0.00010; 10047/4351 tok/s;   9182 sec
[2021-04-25 03:41:47,059 INFO] Step 46100/50000; acc:  74.44; ppl:  2.35; xent: 0.85; lr: 0.00010; 10169/4285 tok/s;   9192 sec
[2021-04-25 03:41:56,711 INFO] Step 46150/50000; acc:  74.38; ppl:  2.34; xent: 0.85; lr: 0.00010; 10250/4440 tok/s;   9202 sec
[2021-04-25 03:42:06,648 INFO] Step 46200/50000; acc:  74.33; ppl:  2.36; xent: 0.86; lr: 0.00010; 10395/4285 tok/s;   9212 sec
[2021-04-25 03:42:16,168 INFO] Step 46250/50000; acc:  74.66; ppl:  2.33; xent: 0.84; lr: 0.00010; 10613/4524 tok/s;   9222 sec
[2021-04-25 03:42:26,293 INFO] Step 46300/50000; acc:  74.26; ppl:  2.36; xent: 0.86; lr: 0.00010; 10337/4185 tok/s;   9232 sec
[2021-04-25 03:42:36,476 INFO] Step 46350/50000; acc:  74.26; ppl:  2.36; xent: 0.86; lr: 0.00010; 9981/4184 tok/s;   9242 sec
[2021-04-25 03:42:46,095 INFO] Step 46400/50000; acc:  74.95; ppl:  2.30; xent: 0.83; lr: 0.00010; 10403/4391 tok/s;   9251 sec
[2021-04-25 03:42:56,062 INFO] Step 46450/50000; acc:  73.89; ppl:  2.37; xent: 0.86; lr: 0.00010; 10471/4311 tok/s;   9261 sec
[2021-04-25 03:43:05,818 INFO] Step 46500/50000; acc:  74.80; ppl:  2.32; xent: 0.84; lr: 0.00010; 10064/4363 tok/s;   9271 sec
[2021-04-25 03:43:16,278 INFO] Step 46550/50000; acc:  74.56; ppl:  2.31; xent: 0.84; lr: 0.00010; 9941/4115 tok/s;   9282 sec
[2021-04-25 03:43:25,651 INFO] Step 46600/50000; acc:  74.81; ppl:  2.32; xent: 0.84; lr: 0.00010; 10788/4439 tok/s;   9291 sec
[2021-04-25 03:43:35,491 INFO] Step 46650/50000; acc:  74.34; ppl:  2.35; xent: 0.85; lr: 0.00010; 10373/4384 tok/s;   9301 sec
[2021-04-25 03:43:36,673 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 03:43:45,462 INFO] Step 46700/50000; acc:  74.67; ppl:  2.32; xent: 0.84; lr: 0.00010; 10201/4322 tok/s;   9311 sec
[2021-04-25 03:43:55,181 INFO] Step 46750/50000; acc:  74.72; ppl:  2.33; xent: 0.84; lr: 0.00010; 10341/4367 tok/s;   9321 sec
[2021-04-25 03:44:05,726 INFO] Step 46800/50000; acc:  74.34; ppl:  2.34; xent: 0.85; lr: 0.00010; 9801/4108 tok/s;   9331 sec
[2021-04-25 03:44:15,618 INFO] Step 46850/50000; acc:  74.76; ppl:  2.32; xent: 0.84; lr: 0.00010; 10056/4351 tok/s;   9341 sec
[2021-04-25 03:44:25,317 INFO] Step 46900/50000; acc:  74.48; ppl:  2.34; xent: 0.85; lr: 0.00010; 10606/4409 tok/s;   9351 sec
[2021-04-25 03:44:34,753 INFO] Step 46950/50000; acc:  74.78; ppl:  2.32; xent: 0.84; lr: 0.00010; 10398/4506 tok/s;   9360 sec
[2021-04-25 03:44:44,839 INFO] Step 47000/50000; acc:  74.35; ppl:  2.35; xent: 0.85; lr: 0.00010; 10444/4211 tok/s;   9370 sec
[2021-04-25 03:44:55,331 INFO] Step 47050/50000; acc:  74.88; ppl:  2.31; xent: 0.84; lr: 0.00010; 9676/4104 tok/s;   9381 sec
[2021-04-25 03:45:04,935 INFO] Step 47100/50000; acc:  74.51; ppl:  2.35; xent: 0.85; lr: 0.00010; 10705/4397 tok/s;   9390 sec
[2021-04-25 03:45:14,901 INFO] Step 47150/50000; acc:  74.62; ppl:  2.33; xent: 0.85; lr: 0.00010; 10233/4314 tok/s;   9400 sec
[2021-04-25 03:45:24,537 INFO] Step 47200/50000; acc:  74.97; ppl:  2.31; xent: 0.84; lr: 0.00010; 10360/4404 tok/s;   9410 sec
[2021-04-25 03:45:34,811 INFO] Step 47250/50000; acc:  74.50; ppl:  2.34; xent: 0.85; lr: 0.00010; 10203/4197 tok/s;   9420 sec
[2021-04-25 03:45:44,021 INFO] Step 47300/50000; acc:  75.16; ppl:  2.28; xent: 0.82; lr: 0.00010; 10688/4545 tok/s;   9429 sec
[2021-04-25 03:45:53,683 INFO] Step 47350/50000; acc:  74.42; ppl:  2.33; xent: 0.84; lr: 0.00010; 10600/4457 tok/s;   9439 sec
[2021-04-25 03:46:01,714 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 03:46:03,211 INFO] Step 47400/50000; acc:  74.67; ppl:  2.32; xent: 0.84; lr: 0.00010; 10702/4437 tok/s;   9449 sec
[2021-04-25 03:46:13,316 INFO] Step 47450/50000; acc:  74.79; ppl:  2.30; xent: 0.83; lr: 0.00010; 10116/4290 tok/s;   9459 sec
[2021-04-25 03:46:23,610 INFO] Step 47500/50000; acc:  74.29; ppl:  2.35; xent: 0.85; lr: 0.00010; 9873/4149 tok/s;   9469 sec
[2021-04-25 03:46:33,533 INFO] Step 47550/50000; acc:  74.97; ppl:  2.29; xent: 0.83; lr: 0.00010; 10009/4357 tok/s;   9479 sec
[2021-04-25 03:46:43,549 INFO] Step 47600/50000; acc:  74.57; ppl:  2.34; xent: 0.85; lr: 0.00010; 10432/4308 tok/s;   9489 sec
[2021-04-25 03:46:53,080 INFO] Step 47650/50000; acc:  74.87; ppl:  2.32; xent: 0.84; lr: 0.00010; 10402/4440 tok/s;   9498 sec
[2021-04-25 03:47:02,933 INFO] Step 47700/50000; acc:  74.52; ppl:  2.34; xent: 0.85; lr: 0.00010; 10473/4443 tok/s;   9508 sec
[2021-04-25 03:47:12,504 INFO] Step 47750/50000; acc:  75.33; ppl:  2.28; xent: 0.82; lr: 0.00010; 10472/4249 tok/s;   9518 sec
[2021-04-25 03:47:23,171 INFO] Step 47800/50000; acc:  74.27; ppl:  2.35; xent: 0.85; lr: 0.00010; 9775/4112 tok/s;   9529 sec
[2021-04-25 03:47:32,518 INFO] Step 47850/50000; acc:  74.64; ppl:  2.32; xent: 0.84; lr: 0.00010; 10747/4542 tok/s;   9538 sec
[2021-04-25 03:47:42,557 INFO] Step 47900/50000; acc:  74.56; ppl:  2.33; xent: 0.85; lr: 0.00010; 10284/4272 tok/s;   9548 sec
[2021-04-25 03:47:52,962 INFO] Step 47950/50000; acc:  74.83; ppl:  2.32; xent: 0.84; lr: 0.00010; 9769/4094 tok/s;   9558 sec
[2021-04-25 03:48:02,720 INFO] Step 48000/50000; acc:  75.38; ppl:  2.27; xent: 0.82; lr: 0.00010; 10279/4328 tok/s;   9568 sec
[2021-04-25 03:48:12,278 INFO] Step 48050/50000; acc:  74.46; ppl:  2.34; xent: 0.85; lr: 0.00010; 10899/4449 tok/s;   9578 sec
[2021-04-25 03:48:21,935 INFO] Step 48100/50000; acc:  75.45; ppl:  2.27; xent: 0.82; lr: 0.00010; 10106/4400 tok/s;   9587 sec
[2021-04-25 03:48:27,486 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 03:48:31,884 INFO] Step 48150/50000; acc:  74.45; ppl:  2.33; xent: 0.84; lr: 0.00010; 10410/4367 tok/s;   9597 sec
[2021-04-25 03:48:42,137 INFO] Step 48200/50000; acc:  74.58; ppl:  2.32; xent: 0.84; lr: 0.00010; 9998/4144 tok/s;   9608 sec
[2021-04-25 03:48:52,408 INFO] Step 48250/50000; acc:  74.75; ppl:  2.32; xent: 0.84; lr: 0.00010; 9871/4192 tok/s;   9618 sec
[2021-04-25 03:49:02,489 INFO] Step 48300/50000; acc:  74.81; ppl:  2.31; xent: 0.84; lr: 0.00010; 10016/4285 tok/s;   9628 sec
[2021-04-25 03:49:12,159 INFO] Step 48350/50000; acc:  74.90; ppl:  2.30; xent: 0.83; lr: 0.00010; 10348/4423 tok/s;   9638 sec
[2021-04-25 03:49:21,951 INFO] Step 48400/50000; acc:  74.64; ppl:  2.33; xent: 0.85; lr: 0.00010; 10598/4384 tok/s;   9647 sec
[2021-04-25 03:49:31,284 INFO] Step 48450/50000; acc:  74.97; ppl:  2.29; xent: 0.83; lr: 0.00010; 10775/4521 tok/s;   9657 sec
[2021-04-25 03:49:41,749 INFO] Step 48500/50000; acc:  74.73; ppl:  2.32; xent: 0.84; lr: 0.00010; 9956/4049 tok/s;   9667 sec
[2021-04-25 03:49:51,636 INFO] Step 48550/50000; acc:  74.78; ppl:  2.31; xent: 0.84; lr: 0.00010; 10017/4307 tok/s;   9677 sec
[2021-04-25 03:50:01,684 INFO] Step 48600/50000; acc:  74.96; ppl:  2.31; xent: 0.84; lr: 0.00010; 10331/4327 tok/s;   9687 sec
[2021-04-25 03:50:11,170 INFO] Step 48650/50000; acc:  74.83; ppl:  2.30; xent: 0.83; lr: 0.00010; 10627/4471 tok/s;   9697 sec
[2021-04-25 03:50:21,201 INFO] Step 48700/50000; acc:  74.95; ppl:  2.30; xent: 0.83; lr: 0.00010; 10272/4214 tok/s;   9707 sec
[2021-04-25 03:50:31,269 INFO] Step 48750/50000; acc:  75.32; ppl:  2.27; xent: 0.82; lr: 0.00010; 10124/4221 tok/s;   9717 sec
[2021-04-25 03:50:40,466 INFO] Step 48800/50000; acc:  75.12; ppl:  2.28; xent: 0.82; lr: 0.00010; 10794/4622 tok/s;   9726 sec
[2021-04-25 03:50:50,695 INFO] Step 48850/50000; acc:  74.78; ppl:  2.31; xent: 0.84; lr: 0.00010; 10214/4224 tok/s;   9736 sec
[2021-04-25 03:50:53,180 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 03:50:59,910 INFO] Step 48900/50000; acc:  75.08; ppl:  2.27; xent: 0.82; lr: 0.00010; 10673/4662 tok/s;   9745 sec
[2021-04-25 03:51:10,037 INFO] Step 48950/50000; acc:  74.87; ppl:  2.31; xent: 0.84; lr: 0.00010; 10201/4163 tok/s;   9755 sec
[2021-04-25 03:51:20,485 INFO] Step 49000/50000; acc:  74.64; ppl:  2.33; xent: 0.85; lr: 0.00010; 9687/4196 tok/s;   9766 sec
[2021-04-25 03:51:29,967 INFO] Step 49050/50000; acc:  75.05; ppl:  2.29; xent: 0.83; lr: 0.00010; 10745/4512 tok/s;   9775 sec
[2021-04-25 03:51:39,819 INFO] Step 49100/50000; acc:  74.86; ppl:  2.31; xent: 0.84; lr: 0.00010; 10273/4324 tok/s;   9785 sec
[2021-04-25 03:51:49,328 INFO] Step 49150/50000; acc:  75.23; ppl:  2.28; xent: 0.82; lr: 0.00010; 10495/4486 tok/s;   9795 sec
[2021-04-25 03:51:59,565 INFO] Step 49200/50000; acc:  74.76; ppl:  2.31; xent: 0.84; lr: 0.00010; 10371/4172 tok/s;   9805 sec
[2021-04-25 03:52:09,286 INFO] Step 49250/50000; acc:  75.43; ppl:  2.27; xent: 0.82; lr: 0.00010; 10249/4373 tok/s;   9815 sec
[2021-04-25 03:52:19,363 INFO] Step 49300/50000; acc:  74.77; ppl:  2.32; xent: 0.84; lr: 0.00010; 10254/4212 tok/s;   9825 sec
[2021-04-25 03:52:29,012 INFO] Step 49350/50000; acc:  75.25; ppl:  2.27; xent: 0.82; lr: 0.00010; 10279/4448 tok/s;   9834 sec
[2021-04-25 03:52:38,961 INFO] Step 49400/50000; acc:  74.80; ppl:  2.32; xent: 0.84; lr: 0.00010; 10399/4294 tok/s;   9844 sec
[2021-04-25 03:52:48,999 INFO] Step 49450/50000; acc:  75.67; ppl:  2.25; xent: 0.81; lr: 0.00010; 10099/4253 tok/s;   9854 sec
[2021-04-25 03:52:58,551 INFO] Step 49500/50000; acc:  75.04; ppl:  2.29; xent: 0.83; lr: 0.00010; 10769/4393 tok/s;   9864 sec
[2021-04-25 03:53:08,453 INFO] Step 49550/50000; acc:  75.01; ppl:  2.28; xent: 0.82; lr: 0.00010; 10204/4347 tok/s;   9874 sec
[2021-04-25 03:53:11,869 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/strict/train.txt, align=None)...
[2021-04-25 03:53:18,158 INFO] Step 49600/50000; acc:  74.91; ppl:  2.29; xent: 0.83; lr: 0.00010; 10364/4438 tok/s;   9884 sec
[2021-04-25 03:53:28,158 INFO] Step 49650/50000; acc:  74.56; ppl:  2.31; xent: 0.84; lr: 0.00010; 10436/4263 tok/s;   9894 sec
[2021-04-25 03:53:37,866 INFO] Step 49700/50000; acc:  75.33; ppl:  2.27; xent: 0.82; lr: 0.00010; 10037/4394 tok/s;   9903 sec
[2021-04-25 03:53:48,019 INFO] Step 49750/50000; acc:  74.90; ppl:  2.31; xent: 0.84; lr: 0.00010; 10139/4301 tok/s;   9913 sec
[2021-04-25 03:53:57,737 INFO] Step 49800/50000; acc:  75.01; ppl:  2.30; xent: 0.83; lr: 0.00010; 10471/4395 tok/s;   9923 sec
[2021-04-25 03:54:07,322 INFO] Step 49850/50000; acc:  75.07; ppl:  2.29; xent: 0.83; lr: 0.00010; 10595/4418 tok/s;   9933 sec
[2021-04-25 03:54:16,877 INFO] Step 49900/50000; acc:  75.01; ppl:  2.29; xent: 0.83; lr: 0.00010; 10678/4533 tok/s;   9942 sec
[2021-04-25 03:54:27,105 INFO] Step 49950/50000; acc:  75.15; ppl:  2.28; xent: 0.82; lr: 0.00010; 9961/4094 tok/s;   9952 sec
[2021-04-25 03:54:37,251 INFO] Step 50000/50000; acc:  74.63; ppl:  2.32; xent: 0.84; lr: 0.00005; 10280/4196 tok/s;   9963 sec
[2021-04-25 03:54:37,253 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/strict/valid.txt, align=None)...
[2021-04-25 03:54:45,231 INFO] Validation perplexity: 2.85549
[2021-04-25 03:54:45,231 INFO] Validation accuracy: 71.0647
[2021-04-25 03:54:45,233 INFO] Saving checkpoint ../models/group1_params/strict_ops/model_step_50000.pt

Loosely condensed EditOperations:

modelGroup1Loose = HephaestusModel(MODEL_GROUP1_LOOSE)
modelGroup1Loose.train(
    DATA_SMALL_METHODS_TRAIN_BUGGY,
    DATA_SMALL_OPS_GENERAL_LOOSE_TRAIN,
    DATA_SMALL_METHODS_VALID_BUGGY,
    DATA_SMALL_OPS_GENERAL_LOOSE_VALID,
    **GROUP1_PARAMS
)
[2021-04-25 03:54:47,750 INFO] Counter vocab from -1 samples.
[2021-04-25 03:54:47,750 INFO] n_sample=-1: Build vocab on full datasets.
[2021-04-25 03:54:47,754 INFO] corpus_1's transforms: TransformPipe()
[2021-04-25 03:54:47,755 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 03:54:48,356 INFO] Counters src:429
[2021-04-25 03:54:48,356 INFO] Counters tgt:444
[2021-04-25 03:54:48,356 WARNING] path ../models/group1_params/loose_ops/save_data.vocab.src exists, may overwrite...
[2021-04-25 03:54:48,358 WARNING] path ../models/group1_params/loose_ops/save_data.vocab.tgt exists, may overwrite...
[2021-04-25 03:54:49,011 INFO] Parsed 2 corpora from -data.
[2021-04-25 03:54:49,012 INFO] Get special vocabs from Transforms: {'src': set(), 'tgt': set()}.
[2021-04-25 03:54:49,012 INFO] Loading vocab from text file...
[2021-04-25 03:54:49,012 INFO] Loading src vocabulary from ../models/group1_params/loose_ops/save_data.vocab.src
[2021-04-25 03:54:49,014 INFO] Loaded src vocab has 429 tokens.
[2021-04-25 03:54:49,014 INFO] Loading tgt vocabulary from ../models/group1_params/loose_ops/save_data.vocab.tgt
[2021-04-25 03:54:49,016 INFO] Loaded tgt vocab has 444 tokens.
[2021-04-25 03:54:49,016 INFO] Building fields with vocab in counters...
[2021-04-25 03:54:49,016 INFO]  * tgt vocab size: 448.
[2021-04-25 03:54:49,017 INFO]  * src vocab size: 431.
[2021-04-25 03:54:49,017 INFO]  * src vocab size = 431
[2021-04-25 03:54:49,017 INFO]  * tgt vocab size = 448
[2021-04-25 03:54:49,018 INFO] Building model...
[2021-04-25 03:54:50,169 INFO] NMTModel(
  (encoder): RNNEncoder(
    (embeddings): Embeddings(
      (make_embedding): Sequential(
        (emb_luts): Elementwise(
          (0): Embedding(431, 512, padding_idx=1)
        )
      )
    )
    (rnn): GRU(512, 256, num_layers=2, dropout=0.2)
  )
  (decoder): InputFeedRNNDecoder(
    (embeddings): Embeddings(
      (make_embedding): Sequential(
        (emb_luts): Elementwise(
          (0): Embedding(448, 512, padding_idx=1)
        )
      )
    )
    (dropout): Dropout(p=0.2, inplace=False)
    (rnn): StackedGRU(
      (dropout): Dropout(p=0.2, inplace=False)
      (layers): ModuleList(
        (0): GRUCell(768, 256)
        (1): GRUCell(256, 256)
      )
    )
    (attn): GlobalAttention(
      (linear_context): Linear(in_features=256, out_features=256, bias=False)
      (linear_query): Linear(in_features=256, out_features=256, bias=True)
      (v): Linear(in_features=256, out_features=1, bias=False)
      (linear_out): Linear(in_features=512, out_features=256, bias=True)
    )
  )
  (generator): Sequential(
    (0): Linear(in_features=256, out_features=448, bias=True)
    (1): Cast()
    (2): LogSoftmax(dim=-1)
  )
)
[2021-04-25 03:54:50,170 INFO] encoder: 1206784
[2021-04-25 03:54:50,170 INFO] decoder: 1790144
[2021-04-25 03:54:50,170 INFO] * number of parameters: 2996928
[2021-04-25 03:54:50,171 INFO] Starting training on GPU: [0]
[2021-04-25 03:54:50,171 INFO] Start training loop and validate every 5000 steps...
[2021-04-25 03:54:50,171 INFO] corpus_1's transforms: TransformPipe()
[2021-04-25 03:54:50,171 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 03:54:59,294 INFO] Step 50/50000; acc:  17.26; ppl: 103.55; xent: 4.64; lr: 0.00010; 11256/4409 tok/s;      9 sec
[2021-04-25 03:55:07,994 INFO] Step 100/50000; acc:  27.43; ppl: 31.53; xent: 3.45; lr: 0.00010; 11504/4596 tok/s;     18 sec
[2021-04-25 03:55:17,134 INFO] Step 150/50000; acc:  37.98; ppl: 19.36; xent: 2.96; lr: 0.00010; 11270/4521 tok/s;     27 sec
[2021-04-25 03:55:25,394 INFO] Step 200/50000; acc:  44.93; ppl: 11.60; xent: 2.45; lr: 0.00010; 12274/4847 tok/s;     35 sec
[2021-04-25 03:55:34,103 INFO] Step 250/50000; acc:  45.40; ppl: 10.26; xent: 2.33; lr: 0.00010; 11652/4608 tok/s;     44 sec
[2021-04-25 03:55:42,697 INFO] Step 300/50000; acc:  45.77; ppl:  9.61; xent: 2.26; lr: 0.00010; 11719/4605 tok/s;     53 sec
[2021-04-25 03:55:51,736 INFO] Step 350/50000; acc:  46.32; ppl:  9.26; xent: 2.23; lr: 0.00010; 11465/4417 tok/s;     62 sec
[2021-04-25 03:56:00,781 INFO] Step 400/50000; acc:  46.59; ppl:  8.97; xent: 2.19; lr: 0.00010; 11351/4498 tok/s;     71 sec
[2021-04-25 03:56:09,382 INFO] Step 450/50000; acc:  46.49; ppl:  8.73; xent: 2.17; lr: 0.00010; 11565/4643 tok/s;     79 sec
[2021-04-25 03:56:18,366 INFO] Step 500/50000; acc:  46.94; ppl:  8.74; xent: 2.17; lr: 0.00010; 11580/4506 tok/s;     88 sec
[2021-04-25 03:56:27,116 INFO] Step 550/50000; acc:  47.18; ppl:  8.55; xent: 2.15; lr: 0.00010; 11556/4682 tok/s;     97 sec
[2021-04-25 03:56:35,766 INFO] Step 600/50000; acc:  48.28; ppl:  8.06; xent: 2.09; lr: 0.00010; 12028/4523 tok/s;    106 sec
[2021-04-25 03:56:43,838 INFO] Step 650/50000; acc:  48.38; ppl:  7.95; xent: 2.07; lr: 0.00010; 12101/4919 tok/s;    114 sec
[2021-04-25 03:56:52,568 INFO] Step 700/50000; acc:  49.06; ppl:  7.82; xent: 2.06; lr: 0.00010; 11751/4640 tok/s;    122 sec
[2021-04-25 03:56:53,144 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 03:57:01,281 INFO] Step 750/50000; acc:  49.67; ppl:  7.56; xent: 2.02; lr: 0.00010; 11864/4650 tok/s;    131 sec
[2021-04-25 03:57:10,084 INFO] Step 800/50000; acc:  50.11; ppl:  7.47; xent: 2.01; lr: 0.00010; 11319/4543 tok/s;    140 sec
[2021-04-25 03:57:19,423 INFO] Step 850/50000; acc:  50.47; ppl:  7.27; xent: 1.98; lr: 0.00010; 10874/4347 tok/s;    149 sec
[2021-04-25 03:57:28,145 INFO] Step 900/50000; acc:  50.86; ppl:  7.02; xent: 1.95; lr: 0.00010; 11534/4699 tok/s;    158 sec
[2021-04-25 03:57:36,767 INFO] Step 950/50000; acc:  51.99; ppl:  6.80; xent: 1.92; lr: 0.00010; 11969/4654 tok/s;    167 sec
[2021-04-25 03:57:45,505 INFO] Step 1000/50000; acc:  52.24; ppl:  6.50; xent: 1.87; lr: 0.00010; 11557/4629 tok/s;    175 sec
[2021-04-25 03:57:54,242 INFO] Step 1050/50000; acc:  52.83; ppl:  6.41; xent: 1.86; lr: 0.00010; 11796/4507 tok/s;    184 sec
[2021-04-25 03:58:03,435 INFO] Step 1100/50000; acc:  53.16; ppl:  6.23; xent: 1.83; lr: 0.00010; 11021/4369 tok/s;    193 sec
[2021-04-25 03:58:12,158 INFO] Step 1150/50000; acc:  53.83; ppl:  6.06; xent: 1.80; lr: 0.00010; 11664/4613 tok/s;    202 sec
[2021-04-25 03:58:21,052 INFO] Step 1200/50000; acc:  53.78; ppl:  5.99; xent: 1.79; lr: 0.00010; 11581/4530 tok/s;    211 sec
[2021-04-25 03:58:29,544 INFO] Step 1250/50000; acc:  53.97; ppl:  5.96; xent: 1.79; lr: 0.00010; 11712/4716 tok/s;    219 sec
[2021-04-25 03:58:38,665 INFO] Step 1300/50000; acc:  54.22; ppl:  5.80; xent: 1.76; lr: 0.00010; 11430/4434 tok/s;    228 sec
[2021-04-25 03:58:46,809 INFO] Step 1350/50000; acc:  54.63; ppl:  5.72; xent: 1.74; lr: 0.00010; 12464/4957 tok/s;    237 sec
[2021-04-25 03:58:55,415 INFO] Step 1400/50000; acc:  55.29; ppl:  5.55; xent: 1.71; lr: 0.00010; 11935/4560 tok/s;    245 sec
[2021-04-25 03:59:02,160 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 03:59:03,630 INFO] Step 1450/50000; acc:  55.71; ppl:  5.35; xent: 1.68; lr: 0.00010; 11971/4888 tok/s;    253 sec
[2021-04-25 03:59:12,720 INFO] Step 1500/50000; acc:  55.20; ppl:  5.51; xent: 1.71; lr: 0.00010; 11306/4425 tok/s;    263 sec
[2021-04-25 03:59:22,211 INFO] Step 1550/50000; acc:  55.31; ppl:  5.48; xent: 1.70; lr: 0.00010; 10892/4331 tok/s;    272 sec
[2021-04-25 03:59:30,898 INFO] Step 1600/50000; acc:  55.73; ppl:  5.33; xent: 1.67; lr: 0.00010; 11322/4674 tok/s;    281 sec
[2021-04-25 03:59:39,667 INFO] Step 1650/50000; acc:  55.77; ppl:  5.33; xent: 1.67; lr: 0.00010; 11670/4644 tok/s;    289 sec
[2021-04-25 03:59:48,222 INFO] Step 1700/50000; acc:  56.39; ppl:  5.20; xent: 1.65; lr: 0.00010; 11736/4669 tok/s;    298 sec
[2021-04-25 03:59:57,158 INFO] Step 1750/50000; acc:  56.60; ppl:  5.21; xent: 1.65; lr: 0.00010; 11602/4468 tok/s;    307 sec
[2021-04-25 04:00:05,837 INFO] Step 1800/50000; acc:  56.55; ppl:  5.11; xent: 1.63; lr: 0.00010; 11835/4526 tok/s;    316 sec
[2021-04-25 04:00:15,180 INFO] Step 1850/50000; acc:  56.83; ppl:  5.10; xent: 1.63; lr: 0.00010; 10971/4366 tok/s;    325 sec
[2021-04-25 04:00:23,920 INFO] Step 1900/50000; acc:  57.24; ppl:  5.01; xent: 1.61; lr: 0.00010; 11426/4632 tok/s;    334 sec
[2021-04-25 04:00:32,391 INFO] Step 1950/50000; acc:  57.07; ppl:  5.02; xent: 1.61; lr: 0.00010; 12072/4741 tok/s;    342 sec
[2021-04-25 04:00:41,410 INFO] Step 2000/50000; acc:  57.23; ppl:  5.09; xent: 1.63; lr: 0.00010; 11390/4451 tok/s;    351 sec
[2021-04-25 04:00:50,070 INFO] Step 2050/50000; acc:  57.68; ppl:  4.81; xent: 1.57; lr: 0.00010; 11564/4588 tok/s;    360 sec
[2021-04-25 04:00:58,416 INFO] Step 2100/50000; acc:  57.68; ppl:  4.94; xent: 1.60; lr: 0.00010; 12411/4805 tok/s;    368 sec
[2021-04-25 04:01:06,770 INFO] Step 2150/50000; acc:  58.41; ppl:  4.72; xent: 1.55; lr: 0.00010; 12023/4767 tok/s;    377 sec
[2021-04-25 04:01:11,544 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:01:15,950 INFO] Step 2200/50000; acc:  57.94; ppl:  4.80; xent: 1.57; lr: 0.00010; 11297/4427 tok/s;    386 sec
[2021-04-25 04:01:24,518 INFO] Step 2250/50000; acc:  58.43; ppl:  4.76; xent: 1.56; lr: 0.00010; 11531/4661 tok/s;    394 sec
[2021-04-25 04:01:34,009 INFO] Step 2300/50000; acc:  58.16; ppl:  4.79; xent: 1.57; lr: 0.00010; 10780/4332 tok/s;    404 sec
[2021-04-25 04:01:42,955 INFO] Step 2350/50000; acc:  58.07; ppl:  4.76; xent: 1.56; lr: 0.00010; 11463/4557 tok/s;    413 sec
[2021-04-25 04:01:51,412 INFO] Step 2400/50000; acc:  59.05; ppl:  4.66; xent: 1.54; lr: 0.00010; 11724/4747 tok/s;    421 sec
[2021-04-25 04:02:00,076 INFO] Step 2450/50000; acc:  58.84; ppl:  4.69; xent: 1.55; lr: 0.00010; 11754/4600 tok/s;    430 sec
[2021-04-25 04:02:08,457 INFO] Step 2500/50000; acc:  58.78; ppl:  4.63; xent: 1.53; lr: 0.00010; 12156/4769 tok/s;    438 sec
[2021-04-25 04:02:17,766 INFO] Step 2550/50000; acc:  58.82; ppl:  4.64; xent: 1.53; lr: 0.00010; 11212/4268 tok/s;    448 sec
[2021-04-25 04:02:26,728 INFO] Step 2600/50000; acc:  59.14; ppl:  4.58; xent: 1.52; lr: 0.00010; 11347/4539 tok/s;    457 sec
[2021-04-25 04:02:35,374 INFO] Step 2650/50000; acc:  59.16; ppl:  4.56; xent: 1.52; lr: 0.00010; 11795/4600 tok/s;    465 sec
[2021-04-25 04:02:44,255 INFO] Step 2700/50000; acc:  59.15; ppl:  4.58; xent: 1.52; lr: 0.00010; 11302/4583 tok/s;    474 sec
[2021-04-25 04:02:53,017 INFO] Step 2750/50000; acc:  59.45; ppl:  4.58; xent: 1.52; lr: 0.00010; 11604/4562 tok/s;    483 sec
[2021-04-25 04:03:01,681 INFO] Step 2800/50000; acc:  59.79; ppl:  4.45; xent: 1.49; lr: 0.00010; 11936/4620 tok/s;    492 sec
[2021-04-25 04:03:09,872 INFO] Step 2850/50000; acc:  59.72; ppl:  4.46; xent: 1.50; lr: 0.00010; 12077/4890 tok/s;    500 sec
[2021-04-25 04:03:18,536 INFO] Step 2900/50000; acc:  60.32; ppl:  4.42; xent: 1.49; lr: 0.00010; 11967/4655 tok/s;    508 sec
[2021-04-25 04:03:20,643 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:03:27,179 INFO] Step 2950/50000; acc:  60.11; ppl:  4.45; xent: 1.49; lr: 0.00010; 11724/4700 tok/s;    517 sec
[2021-04-25 04:03:36,222 INFO] Step 3000/50000; acc:  59.97; ppl:  4.47; xent: 1.50; lr: 0.00010; 11440/4401 tok/s;    526 sec
[2021-04-25 04:03:45,277 INFO] Step 3050/50000; acc:  60.26; ppl:  4.37; xent: 1.48; lr: 0.00010; 10802/4569 tok/s;    535 sec
[2021-04-25 04:03:53,927 INFO] Step 3100/50000; acc:  60.06; ppl:  4.41; xent: 1.48; lr: 0.00010; 11853/4610 tok/s;    544 sec
[2021-04-25 04:04:02,689 INFO] Step 3150/50000; acc:  60.21; ppl:  4.40; xent: 1.48; lr: 0.00010; 11725/4663 tok/s;    553 sec
[2021-04-25 04:04:11,058 INFO] Step 3200/50000; acc:  60.86; ppl:  4.26; xent: 1.45; lr: 0.00010; 11823/4738 tok/s;    561 sec
[2021-04-25 04:04:20,057 INFO] Step 3250/50000; acc:  60.23; ppl:  4.40; xent: 1.48; lr: 0.00010; 11581/4360 tok/s;    570 sec
[2021-04-25 04:04:28,774 INFO] Step 3300/50000; acc:  60.60; ppl:  4.30; xent: 1.46; lr: 0.00010; 11580/4702 tok/s;    579 sec
[2021-04-25 04:04:37,759 INFO] Step 3350/50000; acc:  60.61; ppl:  4.34; xent: 1.47; lr: 0.00010; 11529/4427 tok/s;    588 sec
[2021-04-25 04:04:46,332 INFO] Step 3400/50000; acc:  60.62; ppl:  4.27; xent: 1.45; lr: 0.00010; 11880/4752 tok/s;    596 sec
[2021-04-25 04:04:55,006 INFO] Step 3450/50000; acc:  60.59; ppl:  4.36; xent: 1.47; lr: 0.00010; 11706/4619 tok/s;    605 sec
[2021-04-25 04:05:03,809 INFO] Step 3500/50000; acc:  60.77; ppl:  4.24; xent: 1.45; lr: 0.00010; 11460/4545 tok/s;    614 sec
[2021-04-25 04:05:11,906 INFO] Step 3550/50000; acc:  61.15; ppl:  4.23; xent: 1.44; lr: 0.00010; 12558/4855 tok/s;    622 sec
[2021-04-25 04:05:20,653 INFO] Step 3600/50000; acc:  61.11; ppl:  4.21; xent: 1.44; lr: 0.00010; 11719/4619 tok/s;    630 sec
[2021-04-25 04:05:23,193 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:05:29,380 INFO] Step 3650/50000; acc:  61.36; ppl:  4.18; xent: 1.43; lr: 0.00010; 11446/4671 tok/s;    639 sec
[2021-04-25 04:05:38,350 INFO] Step 3700/50000; acc:  61.49; ppl:  4.24; xent: 1.45; lr: 0.00010; 11598/4488 tok/s;    648 sec
[2021-04-25 04:05:47,513 INFO] Step 3750/50000; acc:  61.33; ppl:  4.21; xent: 1.44; lr: 0.00010; 10962/4415 tok/s;    657 sec
[2021-04-25 04:05:56,659 INFO] Step 3800/50000; acc:  61.41; ppl:  4.20; xent: 1.44; lr: 0.00010; 11255/4492 tok/s;    666 sec
[2021-04-25 04:06:04,717 INFO] Step 3850/50000; acc:  61.67; ppl:  4.14; xent: 1.42; lr: 0.00010; 12187/4988 tok/s;    675 sec
[2021-04-25 04:06:13,312 INFO] Step 3900/50000; acc:  61.38; ppl:  4.18; xent: 1.43; lr: 0.00010; 11904/4624 tok/s;    683 sec
[2021-04-25 04:06:21,887 INFO] Step 3950/50000; acc:  61.51; ppl:  4.17; xent: 1.43; lr: 0.00010; 12071/4627 tok/s;    692 sec
[2021-04-25 04:06:30,782 INFO] Step 4000/50000; acc:  61.70; ppl:  4.11; xent: 1.41; lr: 0.00010; 11367/4505 tok/s;    701 sec
[2021-04-25 04:06:39,790 INFO] Step 4050/50000; acc:  61.86; ppl:  4.09; xent: 1.41; lr: 0.00010; 11363/4493 tok/s;    710 sec
[2021-04-25 04:06:48,496 INFO] Step 4100/50000; acc:  61.49; ppl:  4.12; xent: 1.42; lr: 0.00010; 11576/4611 tok/s;    718 sec
[2021-04-25 04:06:57,416 INFO] Step 4150/50000; acc:  61.33; ppl:  4.19; xent: 1.43; lr: 0.00010; 11588/4529 tok/s;    727 sec
[2021-04-25 04:07:06,221 INFO] Step 4200/50000; acc:  61.70; ppl:  4.13; xent: 1.42; lr: 0.00010; 11554/4644 tok/s;    736 sec
[2021-04-25 04:07:14,758 INFO] Step 4250/50000; acc:  61.94; ppl:  4.07; xent: 1.40; lr: 0.00010; 12012/4543 tok/s;    745 sec
[2021-04-25 04:07:22,905 INFO] Step 4300/50000; acc:  62.18; ppl:  3.99; xent: 1.38; lr: 0.00010; 12247/4921 tok/s;    753 sec
[2021-04-25 04:07:31,653 INFO] Step 4350/50000; acc:  62.30; ppl:  4.02; xent: 1.39; lr: 0.00010; 11602/4610 tok/s;    761 sec
[2021-04-25 04:07:31,935 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:07:40,402 INFO] Step 4400/50000; acc:  62.35; ppl:  4.01; xent: 1.39; lr: 0.00010; 11779/4618 tok/s;    770 sec
[2021-04-25 04:07:49,278 INFO] Step 4450/50000; acc:  62.35; ppl:  4.04; xent: 1.40; lr: 0.00010; 11265/4548 tok/s;    779 sec
[2021-04-25 04:07:58,943 INFO] Step 4500/50000; acc:  61.64; ppl:  4.09; xent: 1.41; lr: 0.00010; 10638/4196 tok/s;    789 sec
[2021-04-25 04:08:07,517 INFO] Step 4550/50000; acc:  62.27; ppl:  4.01; xent: 1.39; lr: 0.00010; 11762/4781 tok/s;    797 sec
[2021-04-25 04:08:16,283 INFO] Step 4600/50000; acc:  62.25; ppl:  4.06; xent: 1.40; lr: 0.00010; 11795/4572 tok/s;    806 sec
[2021-04-25 04:08:24,978 INFO] Step 4650/50000; acc:  62.68; ppl:  3.93; xent: 1.37; lr: 0.00010; 11214/4640 tok/s;    815 sec
[2021-04-25 04:08:33,714 INFO] Step 4700/50000; acc:  62.19; ppl:  4.06; xent: 1.40; lr: 0.00010; 11935/4481 tok/s;    824 sec
[2021-04-25 04:08:43,090 INFO] Step 4750/50000; acc:  62.22; ppl:  3.99; xent: 1.38; lr: 0.00010; 11086/4370 tok/s;    833 sec
[2021-04-25 04:08:51,500 INFO] Step 4800/50000; acc:  63.09; ppl:  3.90; xent: 1.36; lr: 0.00010; 11794/4723 tok/s;    841 sec
[2021-04-25 04:09:00,395 INFO] Step 4850/50000; acc:  61.87; ppl:  4.01; xent: 1.39; lr: 0.00010; 11537/4526 tok/s;    850 sec
[2021-04-25 04:09:09,172 INFO] Step 4900/50000; acc:  62.68; ppl:  3.98; xent: 1.38; lr: 0.00010; 11473/4602 tok/s;    859 sec
[2021-04-25 04:09:18,082 INFO] Step 4950/50000; acc:  62.68; ppl:  3.94; xent: 1.37; lr: 0.00010; 11657/4463 tok/s;    868 sec
[2021-04-25 04:09:26,303 INFO] Step 5000/50000; acc:  62.66; ppl:  3.96; xent: 1.38; lr: 0.00010; 12385/4925 tok/s;    876 sec
[2021-04-25 04:09:26,303 INFO] valid's transforms: TransformPipe()
[2021-04-25 04:09:26,306 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/loose/valid.txt, align=None)...
[2021-04-25 04:09:33,505 INFO] Validation perplexity: 3.73719
[2021-04-25 04:09:33,505 INFO] Validation accuracy: 64.1518
[2021-04-25 04:09:33,507 INFO] Saving checkpoint ../models/group1_params/loose_ops/model_step_5000.pt
[2021-04-25 04:09:42,310 INFO] Step 5050/50000; acc:  63.02; ppl:  3.90; xent: 1.36; lr: 0.00010; 6319/2482 tok/s;    892 sec
[2021-04-25 04:09:48,730 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:09:50,588 INFO] Step 5100/50000; acc:  63.15; ppl:  3.86; xent: 1.35; lr: 0.00010; 12162/4847 tok/s;    900 sec
[2021-04-25 04:09:59,780 INFO] Step 5150/50000; acc:  63.05; ppl:  3.93; xent: 1.37; lr: 0.00010; 11048/4383 tok/s;    910 sec
[2021-04-25 04:10:09,103 INFO] Step 5200/50000; acc:  62.80; ppl:  3.96; xent: 1.38; lr: 0.00010; 11049/4345 tok/s;    919 sec
[2021-04-25 04:10:18,007 INFO] Step 5250/50000; acc:  63.09; ppl:  3.87; xent: 1.35; lr: 0.00010; 11085/4593 tok/s;    928 sec
[2021-04-25 04:10:26,818 INFO] Step 5300/50000; acc:  62.72; ppl:  3.95; xent: 1.37; lr: 0.00010; 11790/4665 tok/s;    937 sec
[2021-04-25 04:10:35,402 INFO] Step 5350/50000; acc:  63.32; ppl:  3.86; xent: 1.35; lr: 0.00010; 11725/4613 tok/s;    945 sec
[2021-04-25 04:10:44,197 INFO] Step 5400/50000; acc:  63.00; ppl:  3.91; xent: 1.36; lr: 0.00010; 11796/4565 tok/s;    954 sec
[2021-04-25 04:10:52,796 INFO] Step 5450/50000; acc:  63.29; ppl:  3.80; xent: 1.34; lr: 0.00010; 11582/4516 tok/s;    963 sec
[2021-04-25 04:11:02,090 INFO] Step 5500/50000; acc:  62.98; ppl:  3.87; xent: 1.35; lr: 0.00010; 11112/4418 tok/s;    972 sec
[2021-04-25 04:11:10,956 INFO] Step 5550/50000; acc:  62.76; ppl:  3.89; xent: 1.36; lr: 0.00010; 11592/4539 tok/s;    981 sec
[2021-04-25 04:11:19,354 INFO] Step 5600/50000; acc:  63.38; ppl:  3.81; xent: 1.34; lr: 0.00010; 11874/4804 tok/s;    989 sec
[2021-04-25 04:11:28,544 INFO] Step 5650/50000; acc:  62.95; ppl:  3.90; xent: 1.36; lr: 0.00010; 11115/4383 tok/s;    998 sec
[2021-04-25 04:11:36,998 INFO] Step 5700/50000; acc:  63.49; ppl:  3.78; xent: 1.33; lr: 0.00010; 11968/4717 tok/s;   1007 sec
[2021-04-25 04:11:45,299 INFO] Step 5750/50000; acc:  63.21; ppl:  3.85; xent: 1.35; lr: 0.00010; 12437/4789 tok/s;   1015 sec
[2021-04-25 04:11:53,765 INFO] Step 5800/50000; acc:  63.70; ppl:  3.76; xent: 1.32; lr: 0.00010; 11943/4766 tok/s;   1024 sec
[2021-04-25 04:11:58,127 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:12:02,834 INFO] Step 5850/50000; acc:  63.42; ppl:  3.82; xent: 1.34; lr: 0.00010; 11253/4450 tok/s;   1033 sec
[2021-04-25 04:12:11,631 INFO] Step 5900/50000; acc:  63.96; ppl:  3.78; xent: 1.33; lr: 0.00010; 11498/4522 tok/s;   1041 sec
[2021-04-25 04:12:21,133 INFO] Step 5950/50000; acc:  63.22; ppl:  3.79; xent: 1.33; lr: 0.00010; 10620/4346 tok/s;   1051 sec
[2021-04-25 04:12:30,003 INFO] Step 6000/50000; acc:  63.37; ppl:  3.82; xent: 1.34; lr: 0.00010; 11544/4566 tok/s;   1060 sec
[2021-04-25 04:12:38,587 INFO] Step 6050/50000; acc:  63.64; ppl:  3.78; xent: 1.33; lr: 0.00010; 11574/4722 tok/s;   1068 sec
[2021-04-25 04:12:47,246 INFO] Step 6100/50000; acc:  63.68; ppl:  3.80; xent: 1.34; lr: 0.00010; 11926/4573 tok/s;   1077 sec
[2021-04-25 04:12:55,844 INFO] Step 6150/50000; acc:  63.89; ppl:  3.75; xent: 1.32; lr: 0.00010; 11897/4666 tok/s;   1086 sec
[2021-04-25 04:13:05,032 INFO] Step 6200/50000; acc:  63.45; ppl:  3.80; xent: 1.34; lr: 0.00010; 11363/4307 tok/s;   1095 sec
[2021-04-25 04:13:13,864 INFO] Step 6250/50000; acc:  64.10; ppl:  3.69; xent: 1.31; lr: 0.00010; 11143/4618 tok/s;   1104 sec
[2021-04-25 04:13:22,373 INFO] Step 6300/50000; acc:  63.71; ppl:  3.76; xent: 1.32; lr: 0.00010; 12097/4708 tok/s;   1112 sec
[2021-04-25 04:13:31,351 INFO] Step 6350/50000; acc:  63.18; ppl:  3.84; xent: 1.35; lr: 0.00010; 11476/4489 tok/s;   1121 sec
[2021-04-25 04:13:39,925 INFO] Step 6400/50000; acc:  64.08; ppl:  3.71; xent: 1.31; lr: 0.00010; 11591/4665 tok/s;   1130 sec
[2021-04-25 04:13:48,548 INFO] Step 6450/50000; acc:  63.90; ppl:  3.69; xent: 1.31; lr: 0.00010; 11920/4662 tok/s;   1138 sec
[2021-04-25 04:13:56,938 INFO] Step 6500/50000; acc:  64.12; ppl:  3.71; xent: 1.31; lr: 0.00010; 11922/4737 tok/s;   1147 sec
[2021-04-25 04:14:05,679 INFO] Step 6550/50000; acc:  64.06; ppl:  3.71; xent: 1.31; lr: 0.00010; 11854/4632 tok/s;   1156 sec
[2021-04-25 04:14:07,367 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:14:14,191 INFO] Step 6600/50000; acc:  64.24; ppl:  3.70; xent: 1.31; lr: 0.00010; 11951/4732 tok/s;   1164 sec
[2021-04-25 04:14:23,338 INFO] Step 6650/50000; acc:  64.05; ppl:  3.74; xent: 1.32; lr: 0.00010; 11125/4369 tok/s;   1173 sec
[2021-04-25 04:14:32,361 INFO] Step 6700/50000; acc:  64.29; ppl:  3.66; xent: 1.30; lr: 0.00010; 11074/4588 tok/s;   1182 sec
[2021-04-25 04:14:41,047 INFO] Step 6750/50000; acc:  64.04; ppl:  3.72; xent: 1.31; lr: 0.00010; 11690/4627 tok/s;   1191 sec
[2021-04-25 04:14:49,886 INFO] Step 6800/50000; acc:  64.02; ppl:  3.73; xent: 1.32; lr: 0.00010; 11583/4574 tok/s;   1200 sec
[2021-04-25 04:14:58,264 INFO] Step 6850/50000; acc:  64.70; ppl:  3.61; xent: 1.28; lr: 0.00010; 11859/4732 tok/s;   1208 sec
[2021-04-25 04:15:07,443 INFO] Step 6900/50000; acc:  63.67; ppl:  3.74; xent: 1.32; lr: 0.00010; 11502/4287 tok/s;   1217 sec
[2021-04-25 04:15:16,034 INFO] Step 6950/50000; acc:  64.41; ppl:  3.62; xent: 1.29; lr: 0.00010; 11785/4803 tok/s;   1226 sec
[2021-04-25 04:15:25,139 INFO] Step 7000/50000; acc:  64.03; ppl:  3.73; xent: 1.32; lr: 0.00010; 11376/4342 tok/s;   1235 sec
[2021-04-25 04:15:33,640 INFO] Step 7050/50000; acc:  64.26; ppl:  3.63; xent: 1.29; lr: 0.00010; 11607/4760 tok/s;   1243 sec
[2021-04-25 04:15:42,400 INFO] Step 7100/50000; acc:  63.98; ppl:  3.71; xent: 1.31; lr: 0.00010; 11686/4618 tok/s;   1252 sec
[2021-04-25 04:15:51,303 INFO] Step 7150/50000; acc:  64.38; ppl:  3.65; xent: 1.30; lr: 0.00010; 11653/4474 tok/s;   1261 sec
[2021-04-25 04:15:59,189 INFO] Step 7200/50000; acc:  64.79; ppl:  3.58; xent: 1.28; lr: 0.00010; 12571/5019 tok/s;   1269 sec
[2021-04-25 04:16:07,912 INFO] Step 7250/50000; acc:  64.94; ppl:  3.60; xent: 1.28; lr: 0.00010; 11674/4595 tok/s;   1278 sec
[2021-04-25 04:16:10,355 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:16:16,778 INFO] Step 7300/50000; acc:  64.31; ppl:  3.65; xent: 1.29; lr: 0.00010; 11433/4621 tok/s;   1287 sec
[2021-04-25 04:16:25,795 INFO] Step 7350/50000; acc:  64.65; ppl:  3.64; xent: 1.29; lr: 0.00010; 11484/4458 tok/s;   1296 sec
[2021-04-25 04:16:34,805 INFO] Step 7400/50000; acc:  64.45; ppl:  3.64; xent: 1.29; lr: 0.00010; 11191/4469 tok/s;   1305 sec
[2021-04-25 04:16:43,854 INFO] Step 7450/50000; acc:  64.65; ppl:  3.62; xent: 1.29; lr: 0.00010; 11215/4524 tok/s;   1314 sec
[2021-04-25 04:16:52,036 INFO] Step 7500/50000; acc:  64.40; ppl:  3.63; xent: 1.29; lr: 0.00010; 12270/4951 tok/s;   1322 sec
[2021-04-25 04:17:00,661 INFO] Step 7550/50000; acc:  64.60; ppl:  3.59; xent: 1.28; lr: 0.00010; 11713/4599 tok/s;   1330 sec
[2021-04-25 04:17:09,257 INFO] Step 7600/50000; acc:  64.52; ppl:  3.64; xent: 1.29; lr: 0.00010; 12032/4589 tok/s;   1339 sec
[2021-04-25 04:17:18,534 INFO] Step 7650/50000; acc:  64.74; ppl:  3.58; xent: 1.28; lr: 0.00010; 10924/4365 tok/s;   1348 sec
[2021-04-25 04:17:27,401 INFO] Step 7700/50000; acc:  64.71; ppl:  3.60; xent: 1.28; lr: 0.00010; 11692/4561 tok/s;   1357 sec
[2021-04-25 04:17:36,197 INFO] Step 7750/50000; acc:  64.64; ppl:  3.60; xent: 1.28; lr: 0.00010; 11499/4590 tok/s;   1366 sec
[2021-04-25 04:17:45,063 INFO] Step 7800/50000; acc:  64.29; ppl:  3.63; xent: 1.29; lr: 0.00010; 11658/4506 tok/s;   1375 sec
[2021-04-25 04:17:53,748 INFO] Step 7850/50000; acc:  64.82; ppl:  3.57; xent: 1.27; lr: 0.00010; 11343/4696 tok/s;   1384 sec
[2021-04-25 04:18:02,316 INFO] Step 7900/50000; acc:  64.70; ppl:  3.58; xent: 1.27; lr: 0.00010; 12097/4575 tok/s;   1392 sec
[2021-04-25 04:18:10,721 INFO] Step 7950/50000; acc:  64.75; ppl:  3.56; xent: 1.27; lr: 0.00010; 12166/4771 tok/s;   1401 sec
[2021-04-25 04:18:19,302 INFO] Step 8000/50000; acc:  65.42; ppl:  3.51; xent: 1.25; lr: 0.00010; 11576/4692 tok/s;   1409 sec
[2021-04-25 04:18:19,311 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:18:28,142 INFO] Step 8050/50000; acc:  65.12; ppl:  3.54; xent: 1.27; lr: 0.00010; 11580/4537 tok/s;   1418 sec
[2021-04-25 04:18:36,959 INFO] Step 8100/50000; acc:  65.07; ppl:  3.54; xent: 1.27; lr: 0.00010; 11470/4610 tok/s;   1427 sec
[2021-04-25 04:18:46,632 INFO] Step 8150/50000; acc:  64.42; ppl:  3.61; xent: 1.28; lr: 0.00010; 10588/4223 tok/s;   1436 sec
[2021-04-25 04:18:55,235 INFO] Step 8200/50000; acc:  64.94; ppl:  3.55; xent: 1.27; lr: 0.00010; 11799/4747 tok/s;   1445 sec
[2021-04-25 04:19:04,058 INFO] Step 8250/50000; acc:  65.16; ppl:  3.54; xent: 1.27; lr: 0.00010; 11546/4512 tok/s;   1454 sec
[2021-04-25 04:19:12,866 INFO] Step 8300/50000; acc:  65.00; ppl:  3.53; xent: 1.26; lr: 0.00010; 11326/4590 tok/s;   1463 sec
[2021-04-25 04:19:21,588 INFO] Step 8350/50000; acc:  65.03; ppl:  3.54; xent: 1.27; lr: 0.00010; 11803/4478 tok/s;   1471 sec
[2021-04-25 04:19:30,973 INFO] Step 8400/50000; acc:  64.61; ppl:  3.57; xent: 1.27; lr: 0.00010; 11047/4367 tok/s;   1481 sec
[2021-04-25 04:19:39,287 INFO] Step 8450/50000; acc:  65.59; ppl:  3.46; xent: 1.24; lr: 0.00010; 11931/4779 tok/s;   1489 sec
[2021-04-25 04:19:48,302 INFO] Step 8500/50000; acc:  64.55; ppl:  3.59; xent: 1.28; lr: 0.00010; 11570/4501 tok/s;   1498 sec
[2021-04-25 04:19:57,120 INFO] Step 8550/50000; acc:  65.24; ppl:  3.54; xent: 1.26; lr: 0.00010; 11461/4593 tok/s;   1507 sec
[2021-04-25 04:20:06,014 INFO] Step 8600/50000; acc:  65.09; ppl:  3.52; xent: 1.26; lr: 0.00010; 11685/4442 tok/s;   1516 sec
[2021-04-25 04:20:14,082 INFO] Step 8650/50000; acc:  65.75; ppl:  3.45; xent: 1.24; lr: 0.00010; 12168/4988 tok/s;   1524 sec
[2021-04-25 04:20:22,525 INFO] Step 8700/50000; acc:  65.46; ppl:  3.48; xent: 1.25; lr: 0.00010; 12102/4727 tok/s;   1532 sec
[2021-04-25 04:20:28,717 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:20:31,272 INFO] Step 8750/50000; acc:  65.45; ppl:  3.49; xent: 1.25; lr: 0.00010; 11836/4645 tok/s;   1541 sec
[2021-04-25 04:20:40,064 INFO] Step 8800/50000; acc:  65.96; ppl:  3.46; xent: 1.24; lr: 0.00010; 11290/4537 tok/s;   1550 sec
[2021-04-25 04:20:49,292 INFO] Step 8850/50000; acc:  64.91; ppl:  3.53; xent: 1.26; lr: 0.00010; 11092/4347 tok/s;   1559 sec
[2021-04-25 04:20:58,029 INFO] Step 8900/50000; acc:  65.57; ppl:  3.47; xent: 1.24; lr: 0.00010; 11441/4708 tok/s;   1568 sec
[2021-04-25 04:21:06,772 INFO] Step 8950/50000; acc:  65.16; ppl:  3.54; xent: 1.26; lr: 0.00010; 11825/4686 tok/s;   1577 sec
[2021-04-25 04:21:15,479 INFO] Step 9000/50000; acc:  65.54; ppl:  3.47; xent: 1.25; lr: 0.00010; 11617/4591 tok/s;   1585 sec
[2021-04-25 04:21:24,121 INFO] Step 9050/50000; acc:  65.22; ppl:  3.50; xent: 1.25; lr: 0.00010; 11852/4572 tok/s;   1594 sec
[2021-04-25 04:21:32,984 INFO] Step 9100/50000; acc:  65.28; ppl:  3.44; xent: 1.24; lr: 0.00010; 11482/4462 tok/s;   1603 sec
[2021-04-25 04:21:42,176 INFO] Step 9150/50000; acc:  65.44; ppl:  3.47; xent: 1.24; lr: 0.00010; 11096/4450 tok/s;   1612 sec
[2021-04-25 04:21:50,989 INFO] Step 9200/50000; acc:  65.37; ppl:  3.49; xent: 1.25; lr: 0.00010; 11634/4547 tok/s;   1621 sec
[2021-04-25 04:21:59,495 INFO] Step 9250/50000; acc:  65.83; ppl:  3.41; xent: 1.23; lr: 0.00010; 11734/4743 tok/s;   1629 sec
[2021-04-25 04:22:08,421 INFO] Step 9300/50000; acc:  65.24; ppl:  3.53; xent: 1.26; lr: 0.00010; 11618/4512 tok/s;   1638 sec
[2021-04-25 04:22:17,046 INFO] Step 9350/50000; acc:  65.89; ppl:  3.40; xent: 1.23; lr: 0.00010; 11777/4619 tok/s;   1647 sec
[2021-04-25 04:22:25,478 INFO] Step 9400/50000; acc:  65.48; ppl:  3.50; xent: 1.25; lr: 0.00010; 12254/4755 tok/s;   1655 sec
[2021-04-25 04:22:33,664 INFO] Step 9450/50000; acc:  66.46; ppl:  3.32; xent: 1.20; lr: 0.00010; 11931/4873 tok/s;   1663 sec
[2021-04-25 04:22:37,807 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:22:42,692 INFO] Step 9500/50000; acc:  65.40; ppl:  3.49; xent: 1.25; lr: 0.00010; 11417/4484 tok/s;   1673 sec
[2021-04-25 04:22:51,772 INFO] Step 9550/50000; acc:  65.72; ppl:  3.47; xent: 1.24; lr: 0.00010; 11427/4382 tok/s;   1682 sec
[2021-04-25 04:23:01,132 INFO] Step 9600/50000; acc:  66.02; ppl:  3.39; xent: 1.22; lr: 0.00010; 10548/4412 tok/s;   1691 sec
[2021-04-25 04:23:09,890 INFO] Step 9650/50000; acc:  65.76; ppl:  3.44; xent: 1.24; lr: 0.00010; 11623/4626 tok/s;   1700 sec
[2021-04-25 04:23:18,654 INFO] Step 9700/50000; acc:  65.65; ppl:  3.46; xent: 1.24; lr: 0.00010; 11457/4653 tok/s;   1708 sec
[2021-04-25 04:23:27,336 INFO] Step 9750/50000; acc:  65.65; ppl:  3.43; xent: 1.23; lr: 0.00010; 11860/4568 tok/s;   1717 sec
[2021-04-25 04:23:35,836 INFO] Step 9800/50000; acc:  65.63; ppl:  3.43; xent: 1.23; lr: 0.00010; 12106/4636 tok/s;   1726 sec
[2021-04-25 04:23:45,056 INFO] Step 9850/50000; acc:  65.86; ppl:  3.43; xent: 1.23; lr: 0.00010; 11139/4348 tok/s;   1735 sec
[2021-04-25 04:23:54,032 INFO] Step 9900/50000; acc:  65.89; ppl:  3.39; xent: 1.22; lr: 0.00010; 11193/4530 tok/s;   1744 sec
[2021-04-25 04:24:02,451 INFO] Step 9950/50000; acc:  65.87; ppl:  3.38; xent: 1.22; lr: 0.00010; 12097/4760 tok/s;   1752 sec
[2021-04-25 04:24:11,408 INFO] Step 10000/50000; acc:  65.55; ppl:  3.49; xent: 1.25; lr: 0.00010; 11476/4478 tok/s;   1761 sec
[2021-04-25 04:24:11,412 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/loose/valid.txt, align=None)...
[2021-04-25 04:24:18,598 INFO] Validation perplexity: 3.30943
[2021-04-25 04:24:18,598 INFO] Validation accuracy: 66.7092
[2021-04-25 04:24:18,600 INFO] Saving checkpoint ../models/group1_params/loose_ops/model_step_10000.pt
[2021-04-25 04:24:27,746 INFO] Step 10050/50000; acc:  66.15; ppl:  3.36; xent: 1.21; lr: 0.00010; 6096/2476 tok/s;   1778 sec
[2021-04-25 04:24:36,327 INFO] Step 10100/50000; acc:  65.97; ppl:  3.38; xent: 1.22; lr: 0.00010; 12146/4621 tok/s;   1786 sec
[2021-04-25 04:24:44,757 INFO] Step 10150/50000; acc:  66.06; ppl:  3.38; xent: 1.22; lr: 0.00010; 11915/4743 tok/s;   1795 sec
[2021-04-25 04:24:53,489 INFO] Step 10200/50000; acc:  66.09; ppl:  3.39; xent: 1.22; lr: 0.00010; 11874/4643 tok/s;   1803 sec
[2021-04-25 04:24:54,718 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:25:01,855 INFO] Step 10250/50000; acc:  66.48; ppl:  3.34; xent: 1.21; lr: 0.00010; 11777/4807 tok/s;   1812 sec
[2021-04-25 04:25:11,060 INFO] Step 10300/50000; acc:  65.95; ppl:  3.43; xent: 1.23; lr: 0.00010; 11139/4362 tok/s;   1821 sec
[2021-04-25 04:25:20,153 INFO] Step 10350/50000; acc:  66.12; ppl:  3.38; xent: 1.22; lr: 0.00010; 11295/4545 tok/s;   1830 sec
[2021-04-25 04:25:28,535 INFO] Step 10400/50000; acc:  66.21; ppl:  3.36; xent: 1.21; lr: 0.00010; 11812/4794 tok/s;   1838 sec
[2021-04-25 04:25:37,333 INFO] Step 10450/50000; acc:  65.83; ppl:  3.41; xent: 1.23; lr: 0.00010; 11578/4603 tok/s;   1847 sec
[2021-04-25 04:25:45,749 INFO] Step 10500/50000; acc:  66.42; ppl:  3.34; xent: 1.21; lr: 0.00010; 11975/4707 tok/s;   1856 sec
[2021-04-25 04:25:54,886 INFO] Step 10550/50000; acc:  65.76; ppl:  3.42; xent: 1.23; lr: 0.00010; 11514/4308 tok/s;   1865 sec
[2021-04-25 04:26:03,507 INFO] Step 10600/50000; acc:  66.44; ppl:  3.32; xent: 1.20; lr: 0.00010; 11773/4769 tok/s;   1873 sec
[2021-04-25 04:26:12,641 INFO] Step 10650/50000; acc:  65.86; ppl:  3.40; xent: 1.22; lr: 0.00010; 11188/4364 tok/s;   1882 sec
[2021-04-25 04:26:21,187 INFO] Step 10700/50000; acc:  66.19; ppl:  3.36; xent: 1.21; lr: 0.00010; 11785/4716 tok/s;   1891 sec
[2021-04-25 04:26:29,882 INFO] Step 10750/50000; acc:  65.99; ppl:  3.40; xent: 1.22; lr: 0.00010; 11628/4673 tok/s;   1900 sec
[2021-04-25 04:26:38,578 INFO] Step 10800/50000; acc:  66.29; ppl:  3.35; xent: 1.21; lr: 0.00010; 11922/4539 tok/s;   1908 sec
[2021-04-25 04:26:46,627 INFO] Step 10850/50000; acc:  66.89; ppl:  3.28; xent: 1.19; lr: 0.00010; 12338/4947 tok/s;   1916 sec
[2021-04-25 04:26:55,212 INFO] Step 10900/50000; acc:  66.40; ppl:  3.35; xent: 1.21; lr: 0.00010; 12044/4672 tok/s;   1925 sec
[2021-04-25 04:26:57,307 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:27:04,121 INFO] Step 10950/50000; acc:  66.31; ppl:  3.32; xent: 1.20; lr: 0.00010; 11402/4571 tok/s;   1934 sec
[2021-04-25 04:27:13,304 INFO] Step 11000/50000; acc:  66.56; ppl:  3.35; xent: 1.21; lr: 0.00010; 11288/4412 tok/s;   1943 sec
[2021-04-25 04:27:22,075 INFO] Step 11050/50000; acc:  66.51; ppl:  3.31; xent: 1.20; lr: 0.00010; 11110/4549 tok/s;   1952 sec
[2021-04-25 04:27:31,298 INFO] Step 11100/50000; acc:  66.24; ppl:  3.36; xent: 1.21; lr: 0.00010; 11113/4471 tok/s;   1961 sec
[2021-04-25 04:27:39,761 INFO] Step 11150/50000; acc:  66.08; ppl:  3.38; xent: 1.22; lr: 0.00010; 12182/4792 tok/s;   1970 sec
[2021-04-25 04:27:48,073 INFO] Step 11200/50000; acc:  66.80; ppl:  3.27; xent: 1.18; lr: 0.00010; 11857/4766 tok/s;   1978 sec
[2021-04-25 04:27:56,660 INFO] Step 11250/50000; acc:  66.38; ppl:  3.34; xent: 1.21; lr: 0.00010; 12004/4569 tok/s;   1986 sec
[2021-04-25 04:28:05,959 INFO] Step 11300/50000; acc:  66.32; ppl:  3.33; xent: 1.20; lr: 0.00010; 11017/4339 tok/s;   1996 sec
[2021-04-25 04:28:14,911 INFO] Step 11350/50000; acc:  66.35; ppl:  3.33; xent: 1.20; lr: 0.00010; 11524/4580 tok/s;   2005 sec
[2021-04-25 04:28:23,727 INFO] Step 11400/50000; acc:  66.49; ppl:  3.32; xent: 1.20; lr: 0.00010; 11545/4567 tok/s;   2014 sec
[2021-04-25 04:28:32,507 INFO] Step 11450/50000; acc:  66.50; ppl:  3.33; xent: 1.20; lr: 0.00010; 11594/4533 tok/s;   2022 sec
[2021-04-25 04:28:41,288 INFO] Step 11500/50000; acc:  66.24; ppl:  3.31; xent: 1.20; lr: 0.00010; 11468/4612 tok/s;   2031 sec
[2021-04-25 04:28:49,734 INFO] Step 11550/50000; acc:  66.66; ppl:  3.28; xent: 1.19; lr: 0.00010; 12118/4690 tok/s;   2040 sec
[2021-04-25 04:28:58,063 INFO] Step 11600/50000; acc:  66.75; ppl:  3.28; xent: 1.19; lr: 0.00010; 12261/4769 tok/s;   2048 sec
[2021-04-25 04:29:06,326 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:29:06,632 INFO] Step 11650/50000; acc:  67.18; ppl:  3.23; xent: 1.17; lr: 0.00010; 11622/4709 tok/s;   2056 sec
[2021-04-25 04:29:15,603 INFO] Step 11700/50000; acc:  66.67; ppl:  3.32; xent: 1.20; lr: 0.00010; 11551/4496 tok/s;   2065 sec
[2021-04-25 04:29:24,755 INFO] Step 11750/50000; acc:  66.57; ppl:  3.31; xent: 1.20; lr: 0.00010; 11092/4484 tok/s;   2075 sec
[2021-04-25 04:29:34,201 INFO] Step 11800/50000; acc:  66.59; ppl:  3.34; xent: 1.21; lr: 0.00010; 10850/4258 tok/s;   2084 sec
[2021-04-25 04:29:42,663 INFO] Step 11850/50000; acc:  67.07; ppl:  3.26; xent: 1.18; lr: 0.00010; 11600/4818 tok/s;   2092 sec
[2021-04-25 04:29:51,592 INFO] Step 11900/50000; acc:  66.67; ppl:  3.31; xent: 1.20; lr: 0.00010; 11519/4480 tok/s;   2101 sec
[2021-04-25 04:30:00,442 INFO] Step 11950/50000; acc:  66.66; ppl:  3.31; xent: 1.20; lr: 0.00010; 11599/4542 tok/s;   2110 sec
[2021-04-25 04:30:08,957 INFO] Step 12000/50000; acc:  66.72; ppl:  3.26; xent: 1.18; lr: 0.00010; 11820/4621 tok/s;   2119 sec
[2021-04-25 04:30:18,362 INFO] Step 12050/50000; acc:  66.31; ppl:  3.31; xent: 1.20; lr: 0.00010; 10957/4326 tok/s;   2128 sec
[2021-04-25 04:30:26,793 INFO] Step 12100/50000; acc:  67.04; ppl:  3.25; xent: 1.18; lr: 0.00010; 11881/4739 tok/s;   2137 sec
[2021-04-25 04:30:35,828 INFO] Step 12150/50000; acc:  66.24; ppl:  3.34; xent: 1.20; lr: 0.00010; 11519/4504 tok/s;   2146 sec
[2021-04-25 04:30:44,715 INFO] Step 12200/50000; acc:  66.94; ppl:  3.28; xent: 1.19; lr: 0.00010; 11421/4551 tok/s;   2155 sec
[2021-04-25 04:30:53,509 INFO] Step 12250/50000; acc:  67.11; ppl:  3.23; xent: 1.17; lr: 0.00010; 11656/4466 tok/s;   2163 sec
[2021-04-25 04:31:01,709 INFO] Step 12300/50000; acc:  67.20; ppl:  3.25; xent: 1.18; lr: 0.00010; 12217/4971 tok/s;   2172 sec
[2021-04-25 04:31:10,023 INFO] Step 12350/50000; acc:  67.03; ppl:  3.22; xent: 1.17; lr: 0.00010; 12141/4724 tok/s;   2180 sec
[2021-04-25 04:31:15,998 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:31:18,984 INFO] Step 12400/50000; acc:  66.80; ppl:  3.26; xent: 1.18; lr: 0.00010; 11522/4562 tok/s;   2189 sec
[2021-04-25 04:31:27,660 INFO] Step 12450/50000; acc:  67.53; ppl:  3.23; xent: 1.17; lr: 0.00010; 11493/4615 tok/s;   2197 sec
[2021-04-25 04:31:37,146 INFO] Step 12500/50000; acc:  66.18; ppl:  3.32; xent: 1.20; lr: 0.00010; 10927/4272 tok/s;   2207 sec
[2021-04-25 04:31:45,889 INFO] Step 12550/50000; acc:  67.37; ppl:  3.24; xent: 1.17; lr: 0.00010; 11485/4684 tok/s;   2216 sec
[2021-04-25 04:31:54,805 INFO] Step 12600/50000; acc:  66.65; ppl:  3.29; xent: 1.19; lr: 0.00010; 11588/4565 tok/s;   2225 sec
[2021-04-25 04:32:03,250 INFO] Step 12650/50000; acc:  67.35; ppl:  3.21; xent: 1.17; lr: 0.00010; 11592/4680 tok/s;   2233 sec
[2021-04-25 04:32:12,045 INFO] Step 12700/50000; acc:  66.61; ppl:  3.28; xent: 1.19; lr: 0.00010; 11773/4553 tok/s;   2242 sec
[2021-04-25 04:32:21,090 INFO] Step 12750/50000; acc:  66.62; ppl:  3.26; xent: 1.18; lr: 0.00010; 11524/4357 tok/s;   2251 sec
[2021-04-25 04:32:30,089 INFO] Step 12800/50000; acc:  66.92; ppl:  3.23; xent: 1.17; lr: 0.00010; 11081/4547 tok/s;   2260 sec
[2021-04-25 04:32:38,913 INFO] Step 12850/50000; acc:  66.83; ppl:  3.26; xent: 1.18; lr: 0.00010; 11558/4503 tok/s;   2269 sec
[2021-04-25 04:32:47,479 INFO] Step 12900/50000; acc:  67.11; ppl:  3.20; xent: 1.16; lr: 0.00010; 11775/4761 tok/s;   2277 sec
[2021-04-25 04:32:56,524 INFO] Step 12950/50000; acc:  66.75; ppl:  3.30; xent: 1.19; lr: 0.00010; 11452/4447 tok/s;   2286 sec
[2021-04-25 04:33:05,116 INFO] Step 13000/50000; acc:  67.39; ppl:  3.19; xent: 1.16; lr: 0.00010; 11860/4615 tok/s;   2295 sec
[2021-04-25 04:33:13,349 INFO] Step 13050/50000; acc:  67.32; ppl:  3.22; xent: 1.17; lr: 0.00010; 12356/4865 tok/s;   2303 sec
[2021-04-25 04:33:21,644 INFO] Step 13100/50000; acc:  67.36; ppl:  3.17; xent: 1.15; lr: 0.00010; 12049/4851 tok/s;   2311 sec
[2021-04-25 04:33:25,416 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:33:30,771 INFO] Step 13150/50000; acc:  66.88; ppl:  3.24; xent: 1.18; lr: 0.00010; 11147/4446 tok/s;   2321 sec
[2021-04-25 04:33:39,745 INFO] Step 13200/50000; acc:  67.05; ppl:  3.25; xent: 1.18; lr: 0.00010; 11538/4376 tok/s;   2330 sec
[2021-04-25 04:33:49,009 INFO] Step 13250/50000; acc:  67.34; ppl:  3.20; xent: 1.16; lr: 0.00010; 10689/4494 tok/s;   2339 sec
[2021-04-25 04:33:57,973 INFO] Step 13300/50000; acc:  67.12; ppl:  3.25; xent: 1.18; lr: 0.00010; 11519/4527 tok/s;   2348 sec
[2021-04-25 04:34:06,665 INFO] Step 13350/50000; acc:  67.16; ppl:  3.23; xent: 1.17; lr: 0.00010; 11592/4697 tok/s;   2356 sec
[2021-04-25 04:34:15,225 INFO] Step 13400/50000; acc:  66.92; ppl:  3.23; xent: 1.17; lr: 0.00010; 12038/4585 tok/s;   2365 sec
[2021-04-25 04:34:23,605 INFO] Step 13450/50000; acc:  67.47; ppl:  3.16; xent: 1.15; lr: 0.00010; 11889/4699 tok/s;   2373 sec
[2021-04-25 04:34:32,717 INFO] Step 13500/50000; acc:  67.09; ppl:  3.22; xent: 1.17; lr: 0.00010; 11351/4446 tok/s;   2383 sec
[2021-04-25 04:34:41,873 INFO] Step 13550/50000; acc:  67.10; ppl:  3.24; xent: 1.18; lr: 0.00010; 11284/4429 tok/s;   2392 sec
[2021-04-25 04:34:50,155 INFO] Step 13600/50000; acc:  67.68; ppl:  3.15; xent: 1.15; lr: 0.00010; 12016/4836 tok/s;   2400 sec
[2021-04-25 04:34:58,987 INFO] Step 13650/50000; acc:  67.11; ppl:  3.25; xent: 1.18; lr: 0.00010; 11566/4532 tok/s;   2409 sec
[2021-04-25 04:35:07,776 INFO] Step 13700/50000; acc:  67.48; ppl:  3.18; xent: 1.16; lr: 0.00010; 11481/4647 tok/s;   2418 sec
[2021-04-25 04:35:16,051 INFO] Step 13750/50000; acc:  67.34; ppl:  3.18; xent: 1.16; lr: 0.00010; 12528/4752 tok/s;   2426 sec
[2021-04-25 04:35:24,497 INFO] Step 13800/50000; acc:  67.63; ppl:  3.18; xent: 1.16; lr: 0.00010; 11975/4712 tok/s;   2434 sec
[2021-04-25 04:35:33,117 INFO] Step 13850/50000; acc:  67.48; ppl:  3.18; xent: 1.16; lr: 0.00010; 11840/4731 tok/s;   2443 sec
[2021-04-25 04:35:34,080 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:35:41,807 INFO] Step 13900/50000; acc:  67.49; ppl:  3.18; xent: 1.16; lr: 0.00010; 11583/4658 tok/s;   2452 sec
[2021-04-25 04:35:50,813 INFO] Step 13950/50000; acc:  66.99; ppl:  3.22; xent: 1.17; lr: 0.00010; 11242/4423 tok/s;   2461 sec
[2021-04-25 04:36:00,136 INFO] Step 14000/50000; acc:  67.44; ppl:  3.18; xent: 1.16; lr: 0.00010; 10981/4408 tok/s;   2470 sec
[2021-04-25 04:36:08,447 INFO] Step 14050/50000; acc:  67.53; ppl:  3.17; xent: 1.15; lr: 0.00010; 11973/4855 tok/s;   2478 sec
[2021-04-25 04:36:17,231 INFO] Step 14100/50000; acc:  67.00; ppl:  3.22; xent: 1.17; lr: 0.00010; 11753/4615 tok/s;   2487 sec
[2021-04-25 04:36:25,660 INFO] Step 14150/50000; acc:  67.35; ppl:  3.17; xent: 1.15; lr: 0.00010; 12010/4693 tok/s;   2495 sec
[2021-04-25 04:36:35,090 INFO] Step 14200/50000; acc:  67.04; ppl:  3.22; xent: 1.17; lr: 0.00010; 11159/4219 tok/s;   2505 sec
[2021-04-25 04:36:43,616 INFO] Step 14250/50000; acc:  67.87; ppl:  3.11; xent: 1.13; lr: 0.00010; 11525/4745 tok/s;   2513 sec
[2021-04-25 04:36:52,729 INFO] Step 14300/50000; acc:  67.19; ppl:  3.21; xent: 1.17; lr: 0.00010; 11296/4379 tok/s;   2523 sec
[2021-04-25 04:37:01,519 INFO] Step 14350/50000; acc:  67.26; ppl:  3.21; xent: 1.16; lr: 0.00010; 11773/4644 tok/s;   2531 sec
[2021-04-25 04:37:09,990 INFO] Step 14400/50000; acc:  67.78; ppl:  3.15; xent: 1.15; lr: 0.00010; 11672/4727 tok/s;   2540 sec
[2021-04-25 04:37:18,736 INFO] Step 14450/50000; acc:  67.63; ppl:  3.14; xent: 1.14; lr: 0.00010; 11792/4559 tok/s;   2549 sec
[2021-04-25 04:37:26,907 INFO] Step 14500/50000; acc:  68.11; ppl:  3.10; xent: 1.13; lr: 0.00010; 12263/4871 tok/s;   2557 sec
[2021-04-25 04:37:35,417 INFO] Step 14550/50000; acc:  67.53; ppl:  3.18; xent: 1.16; lr: 0.00010; 12118/4727 tok/s;   2565 sec
[2021-04-25 04:37:37,074 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:37:44,295 INFO] Step 14600/50000; acc:  67.52; ppl:  3.15; xent: 1.15; lr: 0.00010; 11489/4574 tok/s;   2574 sec
[2021-04-25 04:37:53,349 INFO] Step 14650/50000; acc:  67.73; ppl:  3.16; xent: 1.15; lr: 0.00010; 11286/4457 tok/s;   2583 sec
[2021-04-25 04:38:02,278 INFO] Step 14700/50000; acc:  67.60; ppl:  3.14; xent: 1.15; lr: 0.00010; 11166/4481 tok/s;   2592 sec
[2021-04-25 04:38:11,358 INFO] Step 14750/50000; acc:  67.65; ppl:  3.16; xent: 1.15; lr: 0.00010; 11161/4545 tok/s;   2601 sec
[2021-04-25 04:38:19,848 INFO] Step 14800/50000; acc:  67.16; ppl:  3.19; xent: 1.16; lr: 0.00010; 12105/4759 tok/s;   2610 sec
[2021-04-25 04:38:28,146 INFO] Step 14850/50000; acc:  67.99; ppl:  3.09; xent: 1.13; lr: 0.00010; 11907/4770 tok/s;   2618 sec
[2021-04-25 04:38:36,988 INFO] Step 14900/50000; acc:  67.57; ppl:  3.18; xent: 1.16; lr: 0.00010; 11836/4437 tok/s;   2627 sec
[2021-04-25 04:38:46,310 INFO] Step 14950/50000; acc:  67.39; ppl:  3.16; xent: 1.15; lr: 0.00010; 11004/4373 tok/s;   2636 sec
[2021-04-25 04:38:55,222 INFO] Step 15000/50000; acc:  67.80; ppl:  3.14; xent: 1.14; lr: 0.00010; 11582/4548 tok/s;   2645 sec
[2021-04-25 04:38:55,226 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/loose/valid.txt, align=None)...
[2021-04-25 04:39:02,408 INFO] Validation perplexity: 3.13111
[2021-04-25 04:39:02,408 INFO] Validation accuracy: 68.0666
[2021-04-25 04:39:02,410 INFO] Saving checkpoint ../models/group1_params/loose_ops/model_step_15000.pt
[2021-04-25 04:39:11,731 INFO] Step 15050/50000; acc:  68.11; ppl:  3.11; xent: 1.13; lr: 0.00010; 5969/2433 tok/s;   2662 sec
[2021-04-25 04:39:20,290 INFO] Step 15100/50000; acc:  67.61; ppl:  3.18; xent: 1.16; lr: 0.00010; 12008/4695 tok/s;   2670 sec
[2021-04-25 04:39:29,418 INFO] Step 15150/50000; acc:  67.63; ppl:  3.14; xent: 1.15; lr: 0.00010; 11332/4416 tok/s;   2679 sec
[2021-04-25 04:39:37,557 INFO] Step 15200/50000; acc:  68.30; ppl:  3.08; xent: 1.12; lr: 0.00010; 12291/4874 tok/s;   2687 sec
[2021-04-25 04:39:45,833 INFO] Step 15250/50000; acc:  68.05; ppl:  3.10; xent: 1.13; lr: 0.00010; 12249/4801 tok/s;   2696 sec
[2021-04-25 04:39:53,912 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:39:54,499 INFO] Step 15300/50000; acc:  68.06; ppl:  3.09; xent: 1.13; lr: 0.00010; 11629/4675 tok/s;   2704 sec
[2021-04-25 04:40:03,455 INFO] Step 15350/50000; acc:  67.86; ppl:  3.14; xent: 1.14; lr: 0.00010; 11537/4494 tok/s;   2713 sec
[2021-04-25 04:40:12,676 INFO] Step 15400/50000; acc:  67.56; ppl:  3.15; xent: 1.15; lr: 0.00010; 11072/4423 tok/s;   2723 sec
[2021-04-25 04:40:21,747 INFO] Step 15450/50000; acc:  67.65; ppl:  3.13; xent: 1.14; lr: 0.00010; 11123/4455 tok/s;   2732 sec
[2021-04-25 04:40:30,254 INFO] Step 15500/50000; acc:  67.83; ppl:  3.13; xent: 1.14; lr: 0.00010; 11795/4809 tok/s;   2740 sec
[2021-04-25 04:40:39,061 INFO] Step 15550/50000; acc:  68.06; ppl:  3.11; xent: 1.13; lr: 0.00010; 11527/4500 tok/s;   2749 sec
[2021-04-25 04:40:48,057 INFO] Step 15600/50000; acc:  67.68; ppl:  3.14; xent: 1.14; lr: 0.00010; 11401/4464 tok/s;   2758 sec
[2021-04-25 04:40:56,430 INFO] Step 15650/50000; acc:  67.84; ppl:  3.08; xent: 1.13; lr: 0.00010; 12051/4709 tok/s;   2766 sec
[2021-04-25 04:41:06,016 INFO] Step 15700/50000; acc:  67.19; ppl:  3.17; xent: 1.15; lr: 0.00010; 10907/4251 tok/s;   2776 sec
[2021-04-25 04:41:14,499 INFO] Step 15750/50000; acc:  68.34; ppl:  3.08; xent: 1.12; lr: 0.00010; 11832/4775 tok/s;   2784 sec
[2021-04-25 04:41:23,349 INFO] Step 15800/50000; acc:  67.49; ppl:  3.15; xent: 1.15; lr: 0.00010; 11767/4554 tok/s;   2793 sec
[2021-04-25 04:41:32,057 INFO] Step 15850/50000; acc:  68.34; ppl:  3.07; xent: 1.12; lr: 0.00010; 11262/4612 tok/s;   2802 sec
[2021-04-25 04:41:40,886 INFO] Step 15900/50000; acc:  68.24; ppl:  3.08; xent: 1.12; lr: 0.00010; 11709/4490 tok/s;   2811 sec
[2021-04-25 04:41:49,228 INFO] Step 15950/50000; acc:  67.73; ppl:  3.13; xent: 1.14; lr: 0.00010; 12356/4863 tok/s;   2819 sec
[2021-04-25 04:41:57,330 INFO] Step 16000/50000; acc:  68.52; ppl:  3.03; xent: 1.11; lr: 0.00010; 12170/4849 tok/s;   2827 sec
[2021-04-25 04:42:03,159 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:42:06,367 INFO] Step 16050/50000; acc:  68.13; ppl:  3.09; xent: 1.13; lr: 0.00010; 11366/4499 tok/s;   2836 sec
[2021-04-25 04:42:15,163 INFO] Step 16100/50000; acc:  68.31; ppl:  3.10; xent: 1.13; lr: 0.00010; 11483/4591 tok/s;   2845 sec
[2021-04-25 04:42:24,652 INFO] Step 16150/50000; acc:  67.50; ppl:  3.13; xent: 1.14; lr: 0.00010; 10867/4284 tok/s;   2854 sec
[2021-04-25 04:42:33,208 INFO] Step 16200/50000; acc:  68.25; ppl:  3.08; xent: 1.12; lr: 0.00010; 11803/4714 tok/s;   2863 sec
[2021-04-25 04:42:42,090 INFO] Step 16250/50000; acc:  67.99; ppl:  3.10; xent: 1.13; lr: 0.00010; 11456/4592 tok/s;   2872 sec
[2021-04-25 04:42:50,897 INFO] Step 16300/50000; acc:  68.11; ppl:  3.10; xent: 1.13; lr: 0.00010; 11353/4544 tok/s;   2881 sec
[2021-04-25 04:42:59,394 INFO] Step 16350/50000; acc:  68.04; ppl:  3.10; xent: 1.13; lr: 0.00010; 12049/4692 tok/s;   2889 sec
[2021-04-25 04:43:08,547 INFO] Step 16400/50000; acc:  68.01; ppl:  3.10; xent: 1.13; lr: 0.00010; 11378/4249 tok/s;   2898 sec
[2021-04-25 04:43:17,556 INFO] Step 16450/50000; acc:  68.17; ppl:  3.06; xent: 1.12; lr: 0.00010; 11080/4606 tok/s;   2907 sec
[2021-04-25 04:43:26,482 INFO] Step 16500/50000; acc:  67.94; ppl:  3.12; xent: 1.14; lr: 0.00010; 11598/4461 tok/s;   2916 sec
[2021-04-25 04:43:35,168 INFO] Step 16550/50000; acc:  68.34; ppl:  3.06; xent: 1.12; lr: 0.00010; 11648/4727 tok/s;   2925 sec
[2021-04-25 04:43:44,092 INFO] Step 16600/50000; acc:  68.04; ppl:  3.13; xent: 1.14; lr: 0.00010; 11612/4440 tok/s;   2934 sec
[2021-04-25 04:43:52,459 INFO] Step 16650/50000; acc:  68.97; ppl:  2.98; xent: 1.09; lr: 0.00010; 11780/4761 tok/s;   2942 sec
[2021-04-25 04:44:00,726 INFO] Step 16700/50000; acc:  68.14; ppl:  3.07; xent: 1.12; lr: 0.00010; 12416/4831 tok/s;   2951 sec
[2021-04-25 04:44:09,117 INFO] Step 16750/50000; acc:  68.29; ppl:  3.06; xent: 1.12; lr: 0.00010; 12234/4811 tok/s;   2959 sec
[2021-04-25 04:44:12,443 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:44:18,008 INFO] Step 16800/50000; acc:  68.44; ppl:  3.06; xent: 1.12; lr: 0.00010; 11208/4561 tok/s;   2968 sec
[2021-04-25 04:44:27,096 INFO] Step 16850/50000; acc:  68.13; ppl:  3.09; xent: 1.13; lr: 0.00010; 11285/4383 tok/s;   2977 sec
[2021-04-25 04:44:36,299 INFO] Step 16900/50000; acc:  68.54; ppl:  3.05; xent: 1.11; lr: 0.00010; 10890/4483 tok/s;   2986 sec
[2021-04-25 04:44:45,227 INFO] Step 16950/50000; acc:  68.14; ppl:  3.09; xent: 1.13; lr: 0.00010; 11532/4523 tok/s;   2995 sec
[2021-04-25 04:44:53,849 INFO] Step 17000/50000; acc:  67.81; ppl:  3.09; xent: 1.13; lr: 0.00010; 11747/4712 tok/s;   3004 sec
[2021-04-25 04:45:02,428 INFO] Step 17050/50000; acc:  68.47; ppl:  3.05; xent: 1.12; lr: 0.00010; 11838/4609 tok/s;   3012 sec
[2021-04-25 04:45:10,938 INFO] Step 17100/50000; acc:  68.53; ppl:  3.04; xent: 1.11; lr: 0.00010; 11977/4630 tok/s;   3021 sec
[2021-04-25 04:45:19,938 INFO] Step 17150/50000; acc:  68.36; ppl:  3.05; xent: 1.11; lr: 0.00010; 11342/4480 tok/s;   3030 sec
[2021-04-25 04:45:29,070 INFO] Step 17200/50000; acc:  68.12; ppl:  3.10; xent: 1.13; lr: 0.00010; 11284/4435 tok/s;   3039 sec
[2021-04-25 04:45:37,489 INFO] Step 17250/50000; acc:  68.56; ppl:  3.00; xent: 1.10; lr: 0.00010; 11862/4776 tok/s;   3047 sec
[2021-04-25 04:45:46,394 INFO] Step 17300/50000; acc:  67.84; ppl:  3.11; xent: 1.14; lr: 0.00010; 11626/4501 tok/s;   3056 sec
[2021-04-25 04:45:55,169 INFO] Step 17350/50000; acc:  68.88; ppl:  3.02; xent: 1.11; lr: 0.00010; 11550/4632 tok/s;   3065 sec
[2021-04-25 04:46:03,486 INFO] Step 17400/50000; acc:  68.50; ppl:  3.04; xent: 1.11; lr: 0.00010; 12452/4750 tok/s;   3073 sec
[2021-04-25 04:46:11,851 INFO] Step 17450/50000; acc:  69.03; ppl:  2.99; xent: 1.10; lr: 0.00010; 11704/4750 tok/s;   3082 sec
[2021-04-25 04:46:20,638 INFO] Step 17500/50000; acc:  68.31; ppl:  3.06; xent: 1.12; lr: 0.00010; 11721/4643 tok/s;   3090 sec
[2021-04-25 04:46:21,298 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:46:29,607 INFO] Step 17550/50000; acc:  68.37; ppl:  3.06; xent: 1.12; lr: 0.00010; 11544/4482 tok/s;   3099 sec
[2021-04-25 04:46:38,376 INFO] Step 17600/50000; acc:  68.76; ppl:  3.03; xent: 1.11; lr: 0.00010; 11271/4572 tok/s;   3108 sec
[2021-04-25 04:46:47,544 INFO] Step 17650/50000; acc:  68.25; ppl:  3.03; xent: 1.11; lr: 0.00010; 11108/4481 tok/s;   3117 sec
[2021-04-25 04:46:55,877 INFO] Step 17700/50000; acc:  68.51; ppl:  3.04; xent: 1.11; lr: 0.00010; 12056/4830 tok/s;   3126 sec
[2021-04-25 04:47:04,773 INFO] Step 17750/50000; acc:  68.50; ppl:  3.06; xent: 1.12; lr: 0.00010; 11575/4557 tok/s;   3135 sec
[2021-04-25 04:47:13,149 INFO] Step 17800/50000; acc:  68.49; ppl:  3.02; xent: 1.11; lr: 0.00010; 12153/4691 tok/s;   3143 sec
[2021-04-25 04:47:22,362 INFO] Step 17850/50000; acc:  68.35; ppl:  3.06; xent: 1.12; lr: 0.00010; 11260/4350 tok/s;   3152 sec
[2021-04-25 04:47:31,098 INFO] Step 17900/50000; acc:  68.71; ppl:  2.98; xent: 1.09; lr: 0.00010; 11490/4653 tok/s;   3161 sec
[2021-04-25 04:47:39,750 INFO] Step 17950/50000; acc:  68.34; ppl:  3.05; xent: 1.12; lr: 0.00010; 11737/4599 tok/s;   3170 sec
[2021-04-25 04:47:48,724 INFO] Step 18000/50000; acc:  68.01; ppl:  3.08; xent: 1.12; lr: 0.00010; 11506/4512 tok/s;   3179 sec
[2021-04-25 04:47:57,212 INFO] Step 18050/50000; acc:  68.85; ppl:  3.00; xent: 1.10; lr: 0.00010; 11694/4789 tok/s;   3187 sec
[2021-04-25 04:48:05,931 INFO] Step 18100/50000; acc:  68.48; ppl:  3.02; xent: 1.10; lr: 0.00010; 12003/4534 tok/s;   3196 sec
[2021-04-25 04:48:14,223 INFO] Step 18150/50000; acc:  68.83; ppl:  3.00; xent: 1.10; lr: 0.00010; 12135/4814 tok/s;   3204 sec
[2021-04-25 04:48:22,994 INFO] Step 18200/50000; acc:  68.48; ppl:  3.02; xent: 1.11; lr: 0.00010; 11752/4605 tok/s;   3213 sec
[2021-04-25 04:48:24,087 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:48:31,444 INFO] Step 18250/50000; acc:  69.16; ppl:  2.97; xent: 1.09; lr: 0.00010; 11674/4765 tok/s;   3221 sec
[2021-04-25 04:48:40,494 INFO] Step 18300/50000; acc:  68.36; ppl:  3.04; xent: 1.11; lr: 0.00010; 11399/4455 tok/s;   3230 sec
[2021-04-25 04:48:49,696 INFO] Step 18350/50000; acc:  68.30; ppl:  3.03; xent: 1.11; lr: 0.00010; 11115/4371 tok/s;   3240 sec
[2021-04-25 04:48:58,381 INFO] Step 18400/50000; acc:  68.68; ppl:  3.00; xent: 1.10; lr: 0.00010; 11418/4746 tok/s;   3248 sec
[2021-04-25 04:49:06,890 INFO] Step 18450/50000; acc:  68.31; ppl:  3.05; xent: 1.11; lr: 0.00010; 11988/4733 tok/s;   3257 sec
[2021-04-25 04:49:15,344 INFO] Step 18500/50000; acc:  68.91; ppl:  2.98; xent: 1.09; lr: 0.00010; 11849/4729 tok/s;   3265 sec
[2021-04-25 04:49:24,198 INFO] Step 18550/50000; acc:  68.51; ppl:  3.04; xent: 1.11; lr: 0.00010; 11780/4417 tok/s;   3274 sec
[2021-04-25 04:49:33,593 INFO] Step 18600/50000; acc:  68.59; ppl:  3.01; xent: 1.10; lr: 0.00010; 10940/4327 tok/s;   3283 sec
[2021-04-25 04:49:42,416 INFO] Step 18650/50000; acc:  68.81; ppl:  3.00; xent: 1.10; lr: 0.00010; 11547/4569 tok/s;   3292 sec
[2021-04-25 04:49:51,254 INFO] Step 18700/50000; acc:  68.68; ppl:  3.00; xent: 1.10; lr: 0.00010; 11373/4582 tok/s;   3301 sec
[2021-04-25 04:49:59,907 INFO] Step 18750/50000; acc:  68.91; ppl:  3.01; xent: 1.10; lr: 0.00010; 11755/4622 tok/s;   3310 sec
[2021-04-25 04:50:09,025 INFO] Step 18800/50000; acc:  68.68; ppl:  3.01; xent: 1.10; lr: 0.00010; 11321/4398 tok/s;   3319 sec
[2021-04-25 04:50:17,210 INFO] Step 18850/50000; acc:  69.08; ppl:  2.95; xent: 1.08; lr: 0.00010; 12251/4866 tok/s;   3327 sec
[2021-04-25 04:50:25,620 INFO] Step 18900/50000; acc:  68.73; ppl:  2.99; xent: 1.09; lr: 0.00010; 12233/4757 tok/s;   3335 sec
[2021-04-25 04:50:33,226 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:50:34,136 INFO] Step 18950/50000; acc:  69.14; ppl:  2.95; xent: 1.08; lr: 0.00010; 11871/4744 tok/s;   3344 sec
[2021-04-25 04:50:43,230 INFO] Step 19000/50000; acc:  68.79; ppl:  3.00; xent: 1.10; lr: 0.00010; 11351/4413 tok/s;   3353 sec
[2021-04-25 04:50:52,175 INFO] Step 19050/50000; acc:  68.96; ppl:  2.97; xent: 1.09; lr: 0.00010; 11038/4535 tok/s;   3362 sec
[2021-04-25 04:51:01,365 INFO] Step 19100/50000; acc:  68.51; ppl:  3.01; xent: 1.10; lr: 0.00010; 11104/4443 tok/s;   3371 sec
[2021-04-25 04:51:10,161 INFO] Step 19150/50000; acc:  68.65; ppl:  3.03; xent: 1.11; lr: 0.00010; 11709/4640 tok/s;   3380 sec
[2021-04-25 04:51:18,712 INFO] Step 19200/50000; acc:  69.00; ppl:  2.97; xent: 1.09; lr: 0.00010; 11589/4644 tok/s;   3389 sec
[2021-04-25 04:51:27,605 INFO] Step 19250/50000; acc:  68.82; ppl:  3.00; xent: 1.10; lr: 0.00010; 11496/4473 tok/s;   3397 sec
[2021-04-25 04:51:36,051 INFO] Step 19300/50000; acc:  69.04; ppl:  2.95; xent: 1.08; lr: 0.00010; 12068/4714 tok/s;   3406 sec
[2021-04-25 04:51:45,646 INFO] Step 19350/50000; acc:  68.22; ppl:  3.04; xent: 1.11; lr: 0.00010; 10862/4244 tok/s;   3415 sec
[2021-04-25 04:51:54,391 INFO] Step 19400/50000; acc:  69.13; ppl:  2.96; xent: 1.09; lr: 0.00010; 11533/4632 tok/s;   3424 sec
[2021-04-25 04:52:02,977 INFO] Step 19450/50000; acc:  68.84; ppl:  2.99; xent: 1.09; lr: 0.00010; 11942/4671 tok/s;   3433 sec
[2021-04-25 04:52:11,709 INFO] Step 19500/50000; acc:  69.19; ppl:  2.96; xent: 1.09; lr: 0.00010; 11486/4589 tok/s;   3442 sec
[2021-04-25 04:52:20,432 INFO] Step 19550/50000; acc:  69.08; ppl:  2.95; xent: 1.08; lr: 0.00010; 11695/4568 tok/s;   3450 sec
[2021-04-25 04:52:28,752 INFO] Step 19600/50000; acc:  68.66; ppl:  3.01; xent: 1.10; lr: 0.00010; 12375/4814 tok/s;   3459 sec
[2021-04-25 04:52:37,138 INFO] Step 19650/50000; acc:  69.49; ppl:  2.90; xent: 1.06; lr: 0.00010; 11785/4761 tok/s;   3467 sec
[2021-04-25 04:52:42,668 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:52:46,454 INFO] Step 19700/50000; acc:  68.68; ppl:  3.00; xent: 1.10; lr: 0.00010; 11176/4369 tok/s;   3476 sec
[2021-04-25 04:52:55,174 INFO] Step 19750/50000; acc:  69.07; ppl:  2.98; xent: 1.09; lr: 0.00010; 11651/4632 tok/s;   3485 sec
[2021-04-25 04:53:04,658 INFO] Step 19800/50000; acc:  68.31; ppl:  3.02; xent: 1.10; lr: 0.00010; 10853/4262 tok/s;   3494 sec
[2021-04-25 04:53:13,127 INFO] Step 19850/50000; acc:  69.59; ppl:  2.91; xent: 1.07; lr: 0.00010; 11526/4773 tok/s;   3503 sec
[2021-04-25 04:53:22,080 INFO] Step 19900/50000; acc:  68.91; ppl:  2.99; xent: 1.10; lr: 0.00010; 11490/4530 tok/s;   3512 sec
[2021-04-25 04:53:30,911 INFO] Step 19950/50000; acc:  68.65; ppl:  3.00; xent: 1.10; lr: 0.00010; 11621/4530 tok/s;   3521 sec
[2021-04-25 04:53:39,318 INFO] Step 20000/50000; acc:  69.23; ppl:  2.93; xent: 1.07; lr: 0.00010; 11897/4764 tok/s;   3529 sec
[2021-04-25 04:53:39,320 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/loose/valid.txt, align=None)...
[2021-04-25 04:53:46,523 INFO] Validation perplexity: 3.0204
[2021-04-25 04:53:46,523 INFO] Validation accuracy: 68.9775
[2021-04-25 04:53:46,525 INFO] Saving checkpoint ../models/group1_params/loose_ops/model_step_20000.pt
[2021-04-25 04:53:56,225 INFO] Step 20050/50000; acc:  68.75; ppl:  2.99; xent: 1.09; lr: 0.00010; 6128/2319 tok/s;   3546 sec
[2021-04-25 04:54:05,243 INFO] Step 20100/50000; acc:  69.14; ppl:  2.94; xent: 1.08; lr: 0.00010; 11187/4536 tok/s;   3555 sec
[2021-04-25 04:54:14,024 INFO] Step 20150/50000; acc:  68.95; ppl:  2.98; xent: 1.09; lr: 0.00010; 11767/4550 tok/s;   3564 sec
[2021-04-25 04:54:22,793 INFO] Step 20200/50000; acc:  69.33; ppl:  2.95; xent: 1.08; lr: 0.00010; 11592/4649 tok/s;   3573 sec
[2021-04-25 04:54:31,745 INFO] Step 20250/50000; acc:  69.17; ppl:  2.97; xent: 1.09; lr: 0.00010; 11409/4450 tok/s;   3582 sec
[2021-04-25 04:54:40,359 INFO] Step 20300/50000; acc:  69.46; ppl:  2.89; xent: 1.06; lr: 0.00010; 11688/4653 tok/s;   3590 sec
[2021-04-25 04:54:48,530 INFO] Step 20350/50000; acc:  69.24; ppl:  2.94; xent: 1.08; lr: 0.00010; 12379/4865 tok/s;   3598 sec
[2021-04-25 04:54:57,047 INFO] Step 20400/50000; acc:  69.28; ppl:  2.93; xent: 1.08; lr: 0.00010; 12047/4746 tok/s;   3607 sec
[2021-04-25 04:54:59,981 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:55:05,912 INFO] Step 20450/50000; acc:  69.07; ppl:  2.95; xent: 1.08; lr: 0.00010; 11260/4592 tok/s;   3616 sec
[2021-04-25 04:55:14,863 INFO] Step 20500/50000; acc:  68.89; ppl:  3.00; xent: 1.10; lr: 0.00010; 11621/4442 tok/s;   3625 sec
[2021-04-25 04:55:24,105 INFO] Step 20550/50000; acc:  69.20; ppl:  2.94; xent: 1.08; lr: 0.00010; 10888/4466 tok/s;   3634 sec
[2021-04-25 04:55:33,002 INFO] Step 20600/50000; acc:  68.95; ppl:  2.98; xent: 1.09; lr: 0.00010; 11582/4530 tok/s;   3643 sec
[2021-04-25 04:55:41,530 INFO] Step 20650/50000; acc:  69.41; ppl:  2.90; xent: 1.07; lr: 0.00010; 11482/4759 tok/s;   3651 sec
[2021-04-25 04:55:50,128 INFO] Step 20700/50000; acc:  69.16; ppl:  2.94; xent: 1.08; lr: 0.00010; 11929/4612 tok/s;   3660 sec
[2021-04-25 04:55:58,936 INFO] Step 20750/50000; acc:  69.13; ppl:  2.95; xent: 1.08; lr: 0.00010; 11888/4482 tok/s;   3669 sec
[2021-04-25 04:56:07,542 INFO] Step 20800/50000; acc:  69.47; ppl:  2.91; xent: 1.07; lr: 0.00010; 11560/4702 tok/s;   3677 sec
[2021-04-25 04:56:16,384 INFO] Step 20850/50000; acc:  69.02; ppl:  2.96; xent: 1.09; lr: 0.00010; 11614/4516 tok/s;   3686 sec
[2021-04-25 04:56:24,927 INFO] Step 20900/50000; acc:  69.15; ppl:  2.92; xent: 1.07; lr: 0.00010; 11816/4742 tok/s;   3695 sec
[2021-04-25 04:56:33,809 INFO] Step 20950/50000; acc:  69.17; ppl:  2.97; xent: 1.09; lr: 0.00010; 11609/4560 tok/s;   3704 sec
[2021-04-25 04:56:42,677 INFO] Step 21000/50000; acc:  69.65; ppl:  2.91; xent: 1.07; lr: 0.00010; 11503/4542 tok/s;   3713 sec
[2021-04-25 04:56:50,792 INFO] Step 21050/50000; acc:  69.40; ppl:  2.91; xent: 1.07; lr: 0.00010; 12568/4840 tok/s;   3721 sec
[2021-04-25 04:56:59,377 INFO] Step 21100/50000; acc:  69.67; ppl:  2.89; xent: 1.06; lr: 0.00010; 11648/4696 tok/s;   3729 sec
[2021-04-25 04:57:02,679 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:57:08,144 INFO] Step 21150/50000; acc:  69.25; ppl:  2.93; xent: 1.08; lr: 0.00010; 11617/4626 tok/s;   3738 sec
[2021-04-25 04:57:17,329 INFO] Step 21200/50000; acc:  69.05; ppl:  2.96; xent: 1.08; lr: 0.00010; 11239/4390 tok/s;   3747 sec
[2021-04-25 04:57:25,966 INFO] Step 21250/50000; acc:  69.56; ppl:  2.90; xent: 1.06; lr: 0.00010; 11461/4653 tok/s;   3756 sec
[2021-04-25 04:57:35,108 INFO] Step 21300/50000; acc:  69.18; ppl:  2.94; xent: 1.08; lr: 0.00010; 11302/4497 tok/s;   3765 sec
[2021-04-25 04:57:43,400 INFO] Step 21350/50000; acc:  69.16; ppl:  2.91; xent: 1.07; lr: 0.00010; 12172/4845 tok/s;   3773 sec
[2021-04-25 04:57:52,088 INFO] Step 21400/50000; acc:  68.94; ppl:  2.96; xent: 1.09; lr: 0.00010; 11860/4603 tok/s;   3782 sec
[2021-04-25 04:58:00,518 INFO] Step 21450/50000; acc:  69.92; ppl:  2.86; xent: 1.05; lr: 0.00010; 11672/4691 tok/s;   3790 sec
[2021-04-25 04:58:09,740 INFO] Step 21500/50000; acc:  68.90; ppl:  2.96; xent: 1.09; lr: 0.00010; 11376/4333 tok/s;   3800 sec
[2021-04-25 04:58:18,768 INFO] Step 21550/50000; acc:  69.44; ppl:  2.90; xent: 1.07; lr: 0.00010; 11401/4518 tok/s;   3809 sec
[2021-04-25 04:58:27,453 INFO] Step 21600/50000; acc:  69.75; ppl:  2.90; xent: 1.06; lr: 0.00010; 11426/4594 tok/s;   3817 sec
[2021-04-25 04:58:36,370 INFO] Step 21650/50000; acc:  69.18; ppl:  2.94; xent: 1.08; lr: 0.00010; 11510/4545 tok/s;   3826 sec
[2021-04-25 04:58:45,007 INFO] Step 21700/50000; acc:  69.57; ppl:  2.90; xent: 1.07; lr: 0.00010; 11657/4702 tok/s;   3835 sec
[2021-04-25 04:58:53,749 INFO] Step 21750/50000; acc:  69.49; ppl:  2.90; xent: 1.06; lr: 0.00010; 11901/4502 tok/s;   3844 sec
[2021-04-25 04:59:01,957 INFO] Step 21800/50000; acc:  69.78; ppl:  2.88; xent: 1.06; lr: 0.00010; 12314/4846 tok/s;   3852 sec
[2021-04-25 04:59:10,659 INFO] Step 21850/50000; acc:  69.39; ppl:  2.90; xent: 1.06; lr: 0.00010; 11671/4644 tok/s;   3860 sec
[2021-04-25 04:59:11,506 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 04:59:19,194 INFO] Step 21900/50000; acc:  69.48; ppl:  2.88; xent: 1.06; lr: 0.00010; 11793/4742 tok/s;   3869 sec
[2021-04-25 04:59:28,247 INFO] Step 21950/50000; acc:  69.57; ppl:  2.91; xent: 1.07; lr: 0.00010; 11265/4463 tok/s;   3878 sec
[2021-04-25 04:59:37,567 INFO] Step 22000/50000; acc:  69.12; ppl:  2.93; xent: 1.07; lr: 0.00010; 10959/4325 tok/s;   3887 sec
[2021-04-25 04:59:46,267 INFO] Step 22050/50000; acc:  69.57; ppl:  2.89; xent: 1.06; lr: 0.00010; 11423/4701 tok/s;   3896 sec
[2021-04-25 04:59:54,878 INFO] Step 22100/50000; acc:  69.11; ppl:  2.96; xent: 1.08; lr: 0.00010; 12029/4677 tok/s;   3905 sec
[2021-04-25 05:00:03,526 INFO] Step 22150/50000; acc:  69.75; ppl:  2.86; xent: 1.05; lr: 0.00010; 11609/4678 tok/s;   3913 sec
[2021-04-25 05:00:12,495 INFO] Step 22200/50000; acc:  69.55; ppl:  2.93; xent: 1.07; lr: 0.00010; 11664/4351 tok/s;   3922 sec
[2021-04-25 05:00:21,551 INFO] Step 22250/50000; acc:  69.71; ppl:  2.85; xent: 1.05; lr: 0.00010; 10960/4453 tok/s;   3931 sec
[2021-04-25 05:00:30,413 INFO] Step 22300/50000; acc:  69.40; ppl:  2.93; xent: 1.08; lr: 0.00010; 11620/4541 tok/s;   3940 sec
[2021-04-25 05:00:39,288 INFO] Step 22350/50000; acc:  69.36; ppl:  2.91; xent: 1.07; lr: 0.00010; 11629/4566 tok/s;   3949 sec
[2021-04-25 05:00:47,747 INFO] Step 22400/50000; acc:  70.05; ppl:  2.85; xent: 1.05; lr: 0.00010; 11722/4715 tok/s;   3958 sec
[2021-04-25 05:00:56,850 INFO] Step 22450/50000; acc:  69.60; ppl:  2.89; xent: 1.06; lr: 0.00010; 11291/4456 tok/s;   3967 sec
[2021-04-25 05:01:04,970 INFO] Step 22500/50000; acc:  69.84; ppl:  2.86; xent: 1.05; lr: 0.00010; 12466/4933 tok/s;   3975 sec
[2021-04-25 05:01:13,547 INFO] Step 22550/50000; acc:  69.72; ppl:  2.88; xent: 1.06; lr: 0.00010; 11975/4599 tok/s;   3983 sec
[2021-04-25 05:01:20,687 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:01:21,921 INFO] Step 22600/50000; acc:  70.08; ppl:  2.84; xent: 1.04; lr: 0.00010; 12136/4816 tok/s;   3992 sec
[2021-04-25 05:01:31,001 INFO] Step 22650/50000; acc:  69.74; ppl:  2.89; xent: 1.06; lr: 0.00010; 11214/4427 tok/s;   4001 sec
[2021-04-25 05:01:40,009 INFO] Step 22700/50000; acc:  69.57; ppl:  2.90; xent: 1.06; lr: 0.00010; 11170/4526 tok/s;   4010 sec
[2021-04-25 05:01:49,096 INFO] Step 22750/50000; acc:  69.68; ppl:  2.89; xent: 1.06; lr: 0.00010; 11090/4484 tok/s;   4019 sec
[2021-04-25 05:01:57,959 INFO] Step 22800/50000; acc:  69.32; ppl:  2.93; xent: 1.07; lr: 0.00010; 11612/4586 tok/s;   4028 sec
[2021-04-25 05:02:06,452 INFO] Step 22850/50000; acc:  69.93; ppl:  2.86; xent: 1.05; lr: 0.00010; 11684/4714 tok/s;   4036 sec
[2021-04-25 05:02:15,328 INFO] Step 22900/50000; acc:  69.38; ppl:  2.91; xent: 1.07; lr: 0.00010; 11703/4496 tok/s;   4045 sec
[2021-04-25 05:02:24,092 INFO] Step 22950/50000; acc:  69.47; ppl:  2.87; xent: 1.05; lr: 0.00010; 11670/4520 tok/s;   4054 sec
[2021-04-25 05:02:33,468 INFO] Step 23000/50000; acc:  69.63; ppl:  2.90; xent: 1.06; lr: 0.00010; 11097/4346 tok/s;   4063 sec
[2021-04-25 05:02:42,006 INFO] Step 23050/50000; acc:  70.25; ppl:  2.83; xent: 1.04; lr: 0.00010; 11449/4705 tok/s;   4072 sec
[2021-04-25 05:02:50,698 INFO] Step 23100/50000; acc:  69.70; ppl:  2.89; xent: 1.06; lr: 0.00010; 11906/4649 tok/s;   4081 sec
[2021-04-25 05:02:59,690 INFO] Step 23150/50000; acc:  69.66; ppl:  2.89; xent: 1.06; lr: 0.00010; 11455/4469 tok/s;   4090 sec
[2021-04-25 05:03:08,293 INFO] Step 23200/50000; acc:  70.25; ppl:  2.82; xent: 1.04; lr: 0.00010; 11598/4596 tok/s;   4098 sec
[2021-04-25 05:03:16,671 INFO] Step 23250/50000; acc:  69.55; ppl:  2.89; xent: 1.06; lr: 0.00010; 12196/4804 tok/s;   4107 sec
[2021-04-25 05:03:25,023 INFO] Step 23300/50000; acc:  70.01; ppl:  2.83; xent: 1.04; lr: 0.00010; 11983/4792 tok/s;   4115 sec
[2021-04-25 05:03:30,075 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:03:34,114 INFO] Step 23350/50000; acc:  69.84; ppl:  2.87; xent: 1.05; lr: 0.00010; 11406/4454 tok/s;   4124 sec
[2021-04-25 05:03:42,870 INFO] Step 23400/50000; acc:  69.93; ppl:  2.88; xent: 1.06; lr: 0.00010; 11671/4560 tok/s;   4133 sec
[2021-04-25 05:03:52,326 INFO] Step 23450/50000; acc:  69.34; ppl:  2.88; xent: 1.06; lr: 0.00010; 10712/4355 tok/s;   4142 sec
[2021-04-25 05:04:00,907 INFO] Step 23500/50000; acc:  70.26; ppl:  2.83; xent: 1.04; lr: 0.00010; 11635/4698 tok/s;   4151 sec
[2021-04-25 05:04:09,649 INFO] Step 23550/50000; acc:  69.93; ppl:  2.87; xent: 1.05; lr: 0.00010; 11620/4624 tok/s;   4159 sec
[2021-04-25 05:04:18,438 INFO] Step 23600/50000; acc:  69.67; ppl:  2.89; xent: 1.06; lr: 0.00010; 11646/4545 tok/s;   4168 sec
[2021-04-25 05:04:26,732 INFO] Step 23650/50000; acc:  69.90; ppl:  2.83; xent: 1.04; lr: 0.00010; 12123/4819 tok/s;   4177 sec
[2021-04-25 05:04:36,026 INFO] Step 23700/50000; acc:  69.42; ppl:  2.90; xent: 1.07; lr: 0.00010; 11282/4272 tok/s;   4186 sec
[2021-04-25 05:04:44,902 INFO] Step 23750/50000; acc:  69.91; ppl:  2.85; xent: 1.05; lr: 0.00010; 11403/4584 tok/s;   4195 sec
[2021-04-25 05:04:53,601 INFO] Step 23800/50000; acc:  69.72; ppl:  2.87; xent: 1.05; lr: 0.00010; 11895/4569 tok/s;   4203 sec
[2021-04-25 05:05:02,270 INFO] Step 23850/50000; acc:  70.54; ppl:  2.81; xent: 1.03; lr: 0.00010; 11332/4683 tok/s;   4212 sec
[2021-04-25 05:05:11,262 INFO] Step 23900/50000; acc:  69.83; ppl:  2.87; xent: 1.05; lr: 0.00010; 11448/4446 tok/s;   4221 sec
[2021-04-25 05:05:19,892 INFO] Step 23950/50000; acc:  70.03; ppl:  2.82; xent: 1.04; lr: 0.00010; 12000/4664 tok/s;   4230 sec
[2021-04-25 05:05:27,936 INFO] Step 24000/50000; acc:  70.26; ppl:  2.81; xent: 1.03; lr: 0.00010; 12267/4950 tok/s;   4238 sec
[2021-04-25 05:05:36,507 INFO] Step 24050/50000; acc:  70.09; ppl:  2.83; xent: 1.04; lr: 0.00010; 11930/4683 tok/s;   4246 sec
[2021-04-25 05:05:39,148 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:05:45,299 INFO] Step 24100/50000; acc:  70.08; ppl:  2.84; xent: 1.05; lr: 0.00010; 11486/4634 tok/s;   4255 sec
[2021-04-25 05:05:54,351 INFO] Step 24150/50000; acc:  69.41; ppl:  2.90; xent: 1.07; lr: 0.00010; 11428/4418 tok/s;   4264 sec
[2021-04-25 05:06:03,592 INFO] Step 24200/50000; acc:  70.15; ppl:  2.83; xent: 1.04; lr: 0.00010; 10950/4484 tok/s;   4273 sec
[2021-04-25 05:06:12,318 INFO] Step 24250/50000; acc:  70.04; ppl:  2.85; xent: 1.05; lr: 0.00010; 11632/4558 tok/s;   4282 sec
[2021-04-25 05:06:20,968 INFO] Step 24300/50000; acc:  70.02; ppl:  2.84; xent: 1.04; lr: 0.00010; 11571/4732 tok/s;   4291 sec
[2021-04-25 05:06:29,439 INFO] Step 24350/50000; acc:  70.41; ppl:  2.83; xent: 1.04; lr: 0.00010; 11959/4667 tok/s;   4299 sec
[2021-04-25 05:06:38,419 INFO] Step 24400/50000; acc:  69.68; ppl:  2.86; xent: 1.05; lr: 0.00010; 11654/4367 tok/s;   4308 sec
[2021-04-25 05:06:47,117 INFO] Step 24450/50000; acc:  70.40; ppl:  2.80; xent: 1.03; lr: 0.00010; 11468/4706 tok/s;   4317 sec
[2021-04-25 05:06:56,343 INFO] Step 24500/50000; acc:  69.59; ppl:  2.88; xent: 1.06; lr: 0.00010; 11276/4324 tok/s;   4326 sec
[2021-04-25 05:07:04,945 INFO] Step 24550/50000; acc:  70.09; ppl:  2.83; xent: 1.04; lr: 0.00010; 11782/4756 tok/s;   4335 sec
[2021-04-25 05:07:13,647 INFO] Step 24600/50000; acc:  69.86; ppl:  2.86; xent: 1.05; lr: 0.00010; 11848/4580 tok/s;   4343 sec
[2021-04-25 05:07:22,435 INFO] Step 24650/50000; acc:  70.76; ppl:  2.76; xent: 1.02; lr: 0.00010; 11225/4567 tok/s;   4352 sec
[2021-04-25 05:07:30,567 INFO] Step 24700/50000; acc:  70.19; ppl:  2.82; xent: 1.04; lr: 0.00010; 12667/4824 tok/s;   4360 sec
[2021-04-25 05:07:39,346 INFO] Step 24750/50000; acc:  70.05; ppl:  2.83; xent: 1.04; lr: 0.00010; 11690/4621 tok/s;   4369 sec
[2021-04-25 05:07:42,140 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:07:48,059 INFO] Step 24800/50000; acc:  70.20; ppl:  2.81; xent: 1.03; lr: 0.00010; 11444/4665 tok/s;   4378 sec
[2021-04-25 05:07:56,948 INFO] Step 24850/50000; acc:  70.07; ppl:  2.84; xent: 1.05; lr: 0.00010; 11528/4517 tok/s;   4387 sec
[2021-04-25 05:08:06,054 INFO] Step 24900/50000; acc:  69.98; ppl:  2.83; xent: 1.04; lr: 0.00010; 11006/4429 tok/s;   4396 sec
[2021-04-25 05:08:15,228 INFO] Step 24950/50000; acc:  70.19; ppl:  2.84; xent: 1.04; lr: 0.00010; 11217/4505 tok/s;   4405 sec
[2021-04-25 05:08:23,427 INFO] Step 25000/50000; acc:  70.13; ppl:  2.84; xent: 1.04; lr: 0.00010; 12383/4884 tok/s;   4413 sec
[2021-04-25 05:08:23,428 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/loose/valid.txt, align=None)...
[2021-04-25 05:08:30,626 INFO] Validation perplexity: 2.97118
[2021-04-25 05:08:30,626 INFO] Validation accuracy: 69.534
[2021-04-25 05:08:30,628 INFO] Saving checkpoint ../models/group1_params/loose_ops/model_step_25000.pt
[2021-04-25 05:08:39,914 INFO] Step 25050/50000; acc:  70.06; ppl:  2.83; xent: 1.04; lr: 0.00010; 6146/2416 tok/s;   4430 sec
[2021-04-25 05:08:48,334 INFO] Step 25100/50000; acc:  70.39; ppl:  2.80; xent: 1.03; lr: 0.00010; 11961/4708 tok/s;   4438 sec
[2021-04-25 05:08:57,576 INFO] Step 25150/50000; acc:  70.00; ppl:  2.83; xent: 1.04; lr: 0.00010; 11199/4319 tok/s;   4447 sec
[2021-04-25 05:09:06,517 INFO] Step 25200/50000; acc:  70.33; ppl:  2.81; xent: 1.03; lr: 0.00010; 11509/4539 tok/s;   4456 sec
[2021-04-25 05:09:15,161 INFO] Step 25250/50000; acc:  70.18; ppl:  2.80; xent: 1.03; lr: 0.00010; 11522/4647 tok/s;   4465 sec
[2021-04-25 05:09:24,074 INFO] Step 25300/50000; acc:  69.94; ppl:  2.85; xent: 1.05; lr: 0.00010; 11648/4512 tok/s;   4474 sec
[2021-04-25 05:09:32,781 INFO] Step 25350/50000; acc:  70.09; ppl:  2.82; xent: 1.04; lr: 0.00010; 11621/4709 tok/s;   4483 sec
[2021-04-25 05:09:41,493 INFO] Step 25400/50000; acc:  70.35; ppl:  2.81; xent: 1.03; lr: 0.00010; 11945/4476 tok/s;   4491 sec
[2021-04-25 05:09:49,578 INFO] Step 25450/50000; acc:  71.02; ppl:  2.74; xent: 1.01; lr: 0.00010; 12076/4912 tok/s;   4499 sec
[2021-04-25 05:09:58,402 INFO] Step 25500/50000; acc:  70.10; ppl:  2.83; xent: 1.04; lr: 0.00010; 11651/4609 tok/s;   4508 sec
[2021-04-25 05:09:58,980 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:10:07,067 INFO] Step 25550/50000; acc:  70.23; ppl:  2.81; xent: 1.03; lr: 0.00010; 11908/4671 tok/s;   4517 sec
[2021-04-25 05:10:15,931 INFO] Step 25600/50000; acc:  70.34; ppl:  2.80; xent: 1.03; lr: 0.00010; 11250/4521 tok/s;   4526 sec
[2021-04-25 05:10:25,399 INFO] Step 25650/50000; acc:  70.01; ppl:  2.82; xent: 1.04; lr: 0.00010; 10711/4285 tok/s;   4535 sec
[2021-04-25 05:10:34,007 INFO] Step 25700/50000; acc:  70.41; ppl:  2.80; xent: 1.03; lr: 0.00010; 11676/4750 tok/s;   4544 sec
[2021-04-25 05:10:42,801 INFO] Step 25750/50000; acc:  70.07; ppl:  2.83; xent: 1.04; lr: 0.00010; 11748/4570 tok/s;   4553 sec
[2021-04-25 05:10:51,635 INFO] Step 25800/50000; acc:  70.47; ppl:  2.79; xent: 1.03; lr: 0.00010; 11421/4585 tok/s;   4561 sec
[2021-04-25 05:11:00,404 INFO] Step 25850/50000; acc:  70.22; ppl:  2.81; xent: 1.03; lr: 0.00010; 11764/4484 tok/s;   4570 sec
[2021-04-25 05:11:09,468 INFO] Step 25900/50000; acc:  70.18; ppl:  2.78; xent: 1.02; lr: 0.00010; 11174/4473 tok/s;   4579 sec
[2021-04-25 05:11:17,993 INFO] Step 25950/50000; acc:  70.27; ppl:  2.80; xent: 1.03; lr: 0.00010; 11920/4659 tok/s;   4588 sec
[2021-04-25 05:11:26,924 INFO] Step 26000/50000; acc:  70.12; ppl:  2.83; xent: 1.04; lr: 0.00010; 11554/4522 tok/s;   4597 sec
[2021-04-25 05:11:35,594 INFO] Step 26050/50000; acc:  70.82; ppl:  2.76; xent: 1.02; lr: 0.00010; 11476/4633 tok/s;   4605 sec
[2021-04-25 05:11:44,661 INFO] Step 26100/50000; acc:  70.31; ppl:  2.81; xent: 1.03; lr: 0.00010; 11493/4438 tok/s;   4614 sec
[2021-04-25 05:11:52,770 INFO] Step 26150/50000; acc:  70.66; ppl:  2.78; xent: 1.02; lr: 0.00010; 12493/4995 tok/s;   4623 sec
[2021-04-25 05:12:01,234 INFO] Step 26200/50000; acc:  70.20; ppl:  2.79; xent: 1.03; lr: 0.00010; 12139/4645 tok/s;   4631 sec
[2021-04-25 05:12:07,892 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:12:09,365 INFO] Step 26250/50000; acc:  71.20; ppl:  2.70; xent: 0.99; lr: 0.00010; 12111/4930 tok/s;   4639 sec
[2021-04-25 05:12:18,539 INFO] Step 26300/50000; acc:  70.27; ppl:  2.81; xent: 1.03; lr: 0.00010; 11209/4406 tok/s;   4648 sec
[2021-04-25 05:12:28,014 INFO] Step 26350/50000; acc:  69.81; ppl:  2.83; xent: 1.04; lr: 0.00010; 10894/4321 tok/s;   4658 sec
[2021-04-25 05:12:36,787 INFO] Step 26400/50000; acc:  70.81; ppl:  2.75; xent: 1.01; lr: 0.00010; 11222/4621 tok/s;   4667 sec
[2021-04-25 05:12:45,506 INFO] Step 26450/50000; acc:  70.07; ppl:  2.83; xent: 1.04; lr: 0.00010; 11743/4701 tok/s;   4675 sec
[2021-04-25 05:12:54,014 INFO] Step 26500/50000; acc:  70.51; ppl:  2.77; xent: 1.02; lr: 0.00010; 11795/4653 tok/s;   4684 sec
[2021-04-25 05:13:02,898 INFO] Step 26550/50000; acc:  70.18; ppl:  2.83; xent: 1.04; lr: 0.00010; 11668/4509 tok/s;   4693 sec
[2021-04-25 05:13:11,797 INFO] Step 26600/50000; acc:  70.36; ppl:  2.77; xent: 1.02; lr: 0.00010; 11570/4404 tok/s;   4702 sec
[2021-04-25 05:13:21,085 INFO] Step 26650/50000; acc:  70.53; ppl:  2.78; xent: 1.02; lr: 0.00010; 11011/4408 tok/s;   4711 sec
[2021-04-25 05:13:29,878 INFO] Step 26700/50000; acc:  70.59; ppl:  2.78; xent: 1.02; lr: 0.00010; 11376/4588 tok/s;   4720 sec
[2021-04-25 05:13:38,303 INFO] Step 26750/50000; acc:  70.61; ppl:  2.77; xent: 1.02; lr: 0.00010; 12119/4768 tok/s;   4728 sec
[2021-04-25 05:13:47,474 INFO] Step 26800/50000; acc:  70.36; ppl:  2.80; xent: 1.03; lr: 0.00010; 11208/4388 tok/s;   4737 sec
[2021-04-25 05:13:56,028 INFO] Step 26850/50000; acc:  70.99; ppl:  2.72; xent: 1.00; lr: 0.00010; 11688/4656 tok/s;   4746 sec
[2021-04-25 05:14:04,312 INFO] Step 26900/50000; acc:  70.36; ppl:  2.80; xent: 1.03; lr: 0.00010; 12514/4825 tok/s;   4754 sec
[2021-04-25 05:14:12,718 INFO] Step 26950/50000; acc:  70.66; ppl:  2.74; xent: 1.01; lr: 0.00010; 11955/4774 tok/s;   4763 sec
[2021-04-25 05:14:17,433 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:14:21,897 INFO] Step 27000/50000; acc:  70.46; ppl:  2.80; xent: 1.03; lr: 0.00010; 11285/4415 tok/s;   4772 sec
[2021-04-25 05:14:30,534 INFO] Step 27050/50000; acc:  70.74; ppl:  2.75; xent: 1.01; lr: 0.00010; 11472/4607 tok/s;   4780 sec
[2021-04-25 05:14:40,024 INFO] Step 27100/50000; acc:  70.27; ppl:  2.80; xent: 1.03; lr: 0.00010; 10763/4331 tok/s;   4790 sec
[2021-04-25 05:14:49,073 INFO] Step 27150/50000; acc:  70.44; ppl:  2.79; xent: 1.03; lr: 0.00010; 11328/4520 tok/s;   4799 sec
[2021-04-25 05:14:57,524 INFO] Step 27200/50000; acc:  70.92; ppl:  2.75; xent: 1.01; lr: 0.00010; 11729/4741 tok/s;   4807 sec
[2021-04-25 05:15:06,114 INFO] Step 27250/50000; acc:  70.33; ppl:  2.79; xent: 1.03; lr: 0.00010; 11848/4631 tok/s;   4816 sec
[2021-04-25 05:15:14,544 INFO] Step 27300/50000; acc:  70.72; ppl:  2.74; xent: 1.01; lr: 0.00010; 12094/4728 tok/s;   4824 sec
[2021-04-25 05:15:23,880 INFO] Step 27350/50000; acc:  70.13; ppl:  2.79; xent: 1.03; lr: 0.00010; 11187/4268 tok/s;   4834 sec
[2021-04-25 05:15:32,884 INFO] Step 27400/50000; acc:  70.60; ppl:  2.76; xent: 1.01; lr: 0.00010; 11303/4563 tok/s;   4843 sec
[2021-04-25 05:15:41,283 INFO] Step 27450/50000; acc:  70.50; ppl:  2.76; xent: 1.01; lr: 0.00010; 12127/4678 tok/s;   4851 sec
[2021-04-25 05:15:50,039 INFO] Step 27500/50000; acc:  70.81; ppl:  2.76; xent: 1.02; lr: 0.00010; 11459/4658 tok/s;   4860 sec
[2021-04-25 05:15:58,852 INFO] Step 27550/50000; acc:  70.46; ppl:  2.77; xent: 1.02; lr: 0.00010; 11542/4517 tok/s;   4869 sec
[2021-04-25 05:16:07,469 INFO] Step 27600/50000; acc:  70.58; ppl:  2.75; xent: 1.01; lr: 0.00010; 11998/4671 tok/s;   4877 sec
[2021-04-25 05:16:15,757 INFO] Step 27650/50000; acc:  70.98; ppl:  2.73; xent: 1.00; lr: 0.00010; 11928/4808 tok/s;   4886 sec
[2021-04-25 05:16:24,484 INFO] Step 27700/50000; acc:  70.61; ppl:  2.77; xent: 1.02; lr: 0.00010; 11906/4620 tok/s;   4894 sec
[2021-04-25 05:16:26,560 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:16:33,022 INFO] Step 27750/50000; acc:  70.77; ppl:  2.75; xent: 1.01; lr: 0.00010; 11860/4765 tok/s;   4903 sec
[2021-04-25 05:16:42,189 INFO] Step 27800/50000; acc:  70.41; ppl:  2.81; xent: 1.03; lr: 0.00010; 11276/4361 tok/s;   4912 sec
[2021-04-25 05:16:51,204 INFO] Step 27850/50000; acc:  71.06; ppl:  2.73; xent: 1.00; lr: 0.00010; 10845/4572 tok/s;   4921 sec
[2021-04-25 05:16:59,853 INFO] Step 27900/50000; acc:  70.53; ppl:  2.77; xent: 1.02; lr: 0.00010; 11875/4630 tok/s;   4930 sec
[2021-04-25 05:17:08,650 INFO] Step 27950/50000; acc:  70.66; ppl:  2.76; xent: 1.01; lr: 0.00010; 11667/4635 tok/s;   4938 sec
[2021-04-25 05:17:16,984 INFO] Step 28000/50000; acc:  71.07; ppl:  2.71; xent: 1.00; lr: 0.00010; 11879/4744 tok/s;   4947 sec
[2021-04-25 05:17:26,020 INFO] Step 28050/50000; acc:  70.49; ppl:  2.78; xent: 1.02; lr: 0.00010; 11524/4345 tok/s;   4956 sec
[2021-04-25 05:17:34,592 INFO] Step 28100/50000; acc:  70.88; ppl:  2.72; xent: 1.00; lr: 0.00010; 11780/4797 tok/s;   4964 sec
[2021-04-25 05:17:43,832 INFO] Step 28150/50000; acc:  70.53; ppl:  2.79; xent: 1.03; lr: 0.00010; 11214/4301 tok/s;   4974 sec
[2021-04-25 05:17:52,441 INFO] Step 28200/50000; acc:  70.54; ppl:  2.75; xent: 1.01; lr: 0.00010; 11837/4707 tok/s;   4982 sec
[2021-04-25 05:18:01,193 INFO] Step 28250/50000; acc:  70.80; ppl:  2.74; xent: 1.01; lr: 0.00010; 11589/4612 tok/s;   4991 sec
[2021-04-25 05:18:09,917 INFO] Step 28300/50000; acc:  71.18; ppl:  2.70; xent: 0.99; lr: 0.00010; 11578/4564 tok/s;   5000 sec
[2021-04-25 05:18:18,011 INFO] Step 28350/50000; acc:  70.93; ppl:  2.73; xent: 1.01; lr: 0.00010; 12558/4863 tok/s;   5008 sec
[2021-04-25 05:18:26,661 INFO] Step 28400/50000; acc:  70.96; ppl:  2.74; xent: 1.01; lr: 0.00010; 11825/4670 tok/s;   5016 sec
[2021-04-25 05:18:29,351 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:18:35,497 INFO] Step 28450/50000; acc:  70.81; ppl:  2.72; xent: 1.00; lr: 0.00010; 11335/4637 tok/s;   5025 sec
[2021-04-25 05:18:44,396 INFO] Step 28500/50000; acc:  70.54; ppl:  2.77; xent: 1.02; lr: 0.00010; 11670/4492 tok/s;   5034 sec
[2021-04-25 05:18:53,436 INFO] Step 28550/50000; acc:  70.82; ppl:  2.74; xent: 1.01; lr: 0.00010; 11105/4474 tok/s;   5043 sec
[2021-04-25 05:19:02,579 INFO] Step 28600/50000; acc:  70.66; ppl:  2.76; xent: 1.02; lr: 0.00010; 11269/4490 tok/s;   5052 sec
[2021-04-25 05:19:10,663 INFO] Step 28650/50000; acc:  71.13; ppl:  2.72; xent: 1.00; lr: 0.00010; 12160/4974 tok/s;   5060 sec
[2021-04-25 05:19:19,303 INFO] Step 28700/50000; acc:  70.73; ppl:  2.73; xent: 1.01; lr: 0.00010; 11835/4616 tok/s;   5069 sec
[2021-04-25 05:19:27,909 INFO] Step 28750/50000; acc:  70.78; ppl:  2.75; xent: 1.01; lr: 0.00010; 12044/4605 tok/s;   5078 sec
[2021-04-25 05:19:37,080 INFO] Step 28800/50000; acc:  70.82; ppl:  2.73; xent: 1.00; lr: 0.00010; 11015/4392 tok/s;   5087 sec
[2021-04-25 05:19:46,091 INFO] Step 28850/50000; acc:  71.17; ppl:  2.72; xent: 1.00; lr: 0.00010; 11347/4502 tok/s;   5096 sec
[2021-04-25 05:19:54,638 INFO] Step 28900/50000; acc:  70.85; ppl:  2.72; xent: 1.00; lr: 0.00010; 11792/4654 tok/s;   5104 sec
[2021-04-25 05:20:03,564 INFO] Step 28950/50000; acc:  70.77; ppl:  2.74; xent: 1.01; lr: 0.00010; 11583/4535 tok/s;   5113 sec
[2021-04-25 05:20:12,348 INFO] Step 29000/50000; acc:  70.83; ppl:  2.72; xent: 1.00; lr: 0.00010; 11592/4655 tok/s;   5122 sec
[2021-04-25 05:20:20,887 INFO] Step 29050/50000; acc:  71.01; ppl:  2.71; xent: 1.00; lr: 0.00010; 12015/4536 tok/s;   5131 sec
[2021-04-25 05:20:29,118 INFO] Step 29100/50000; acc:  71.43; ppl:  2.68; xent: 0.99; lr: 0.00010; 12093/4885 tok/s;   5139 sec
[2021-04-25 05:20:37,911 INFO] Step 29150/50000; acc:  71.09; ppl:  2.72; xent: 1.00; lr: 0.00010; 11565/4581 tok/s;   5148 sec
[2021-04-25 05:20:38,191 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:20:46,705 INFO] Step 29200/50000; acc:  70.98; ppl:  2.73; xent: 1.00; lr: 0.00010; 11709/4577 tok/s;   5157 sec
[2021-04-25 05:20:55,655 INFO] Step 29250/50000; acc:  71.16; ppl:  2.71; xent: 1.00; lr: 0.00010; 11173/4523 tok/s;   5165 sec
[2021-04-25 05:21:05,359 INFO] Step 29300/50000; acc:  70.59; ppl:  2.75; xent: 1.01; lr: 0.00010; 10594/4198 tok/s;   5175 sec
[2021-04-25 05:21:13,861 INFO] Step 29350/50000; acc:  71.00; ppl:  2.73; xent: 1.00; lr: 0.00010; 11876/4801 tok/s;   5184 sec
[2021-04-25 05:21:22,713 INFO] Step 29400/50000; acc:  70.70; ppl:  2.74; xent: 1.01; lr: 0.00010; 11678/4528 tok/s;   5193 sec
[2021-04-25 05:21:31,372 INFO] Step 29450/50000; acc:  71.36; ppl:  2.68; xent: 0.98; lr: 0.00010; 11259/4673 tok/s;   5201 sec
[2021-04-25 05:21:40,126 INFO] Step 29500/50000; acc:  70.85; ppl:  2.74; xent: 1.01; lr: 0.00010; 11915/4442 tok/s;   5210 sec
[2021-04-25 05:21:49,571 INFO] Step 29550/50000; acc:  70.88; ppl:  2.73; xent: 1.00; lr: 0.00010; 10996/4370 tok/s;   5219 sec
[2021-04-25 05:21:57,674 INFO] Step 29600/50000; acc:  71.47; ppl:  2.68; xent: 0.99; lr: 0.00010; 12210/4871 tok/s;   5228 sec
[2021-04-25 05:22:06,576 INFO] Step 29650/50000; acc:  70.92; ppl:  2.75; xent: 1.01; lr: 0.00010; 11553/4546 tok/s;   5236 sec
[2021-04-25 05:22:15,403 INFO] Step 29700/50000; acc:  71.27; ppl:  2.69; xent: 0.99; lr: 0.00010; 11410/4572 tok/s;   5245 sec
[2021-04-25 05:22:24,261 INFO] Step 29750/50000; acc:  71.13; ppl:  2.71; xent: 1.00; lr: 0.00010; 11727/4479 tok/s;   5254 sec
[2021-04-25 05:22:32,516 INFO] Step 29800/50000; acc:  71.09; ppl:  2.71; xent: 1.00; lr: 0.00010; 12305/4904 tok/s;   5262 sec
[2021-04-25 05:22:40,837 INFO] Step 29850/50000; acc:  71.25; ppl:  2.68; xent: 0.99; lr: 0.00010; 12164/4773 tok/s;   5271 sec
[2021-04-25 05:22:47,386 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:22:49,249 INFO] Step 29900/50000; acc:  71.59; ppl:  2.66; xent: 0.98; lr: 0.00010; 11982/4802 tok/s;   5279 sec
[2021-04-25 05:22:58,316 INFO] Step 29950/50000; acc:  71.02; ppl:  2.70; xent: 0.99; lr: 0.00010; 11208/4430 tok/s;   5288 sec
[2021-04-25 05:23:07,626 INFO] Step 30000/50000; acc:  70.49; ppl:  2.75; xent: 1.01; lr: 0.00010; 11060/4334 tok/s;   5297 sec
[2021-04-25 05:23:07,627 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/loose/valid.txt, align=None)...
[2021-04-25 05:23:14,844 INFO] Validation perplexity: 2.9406
[2021-04-25 05:23:14,845 INFO] Validation accuracy: 69.7281
[2021-04-25 05:23:14,847 INFO] Saving checkpoint ../models/group1_params/loose_ops/model_step_30000.pt
[2021-04-25 05:23:24,100 INFO] Step 30050/50000; acc:  71.37; ppl:  2.68; xent: 0.98; lr: 0.00010; 5996/2487 tok/s;   5314 sec
[2021-04-25 05:23:32,929 INFO] Step 30100/50000; acc:  70.60; ppl:  2.74; xent: 1.01; lr: 0.00010; 11744/4638 tok/s;   5323 sec
[2021-04-25 05:23:41,577 INFO] Step 30150/50000; acc:  71.36; ppl:  2.69; xent: 0.99; lr: 0.00010; 11643/4601 tok/s;   5331 sec
[2021-04-25 05:23:50,340 INFO] Step 30200/50000; acc:  70.83; ppl:  2.74; xent: 1.01; lr: 0.00010; 11853/4544 tok/s;   5340 sec
[2021-04-25 05:23:59,102 INFO] Step 30250/50000; acc:  71.46; ppl:  2.65; xent: 0.97; lr: 0.00010; 11373/4474 tok/s;   5349 sec
[2021-04-25 05:24:08,334 INFO] Step 30300/50000; acc:  71.04; ppl:  2.71; xent: 1.00; lr: 0.00010; 11181/4444 tok/s;   5358 sec
[2021-04-25 05:24:17,239 INFO] Step 30350/50000; acc:  70.98; ppl:  2.73; xent: 1.00; lr: 0.00010; 11537/4534 tok/s;   5367 sec
[2021-04-25 05:24:25,543 INFO] Step 30400/50000; acc:  71.58; ppl:  2.65; xent: 0.98; lr: 0.00010; 11987/4832 tok/s;   5375 sec
[2021-04-25 05:24:34,586 INFO] Step 30450/50000; acc:  71.27; ppl:  2.70; xent: 0.99; lr: 0.00010; 11311/4462 tok/s;   5384 sec
[2021-04-25 05:24:43,146 INFO] Step 30500/50000; acc:  71.33; ppl:  2.67; xent: 0.98; lr: 0.00010; 11824/4654 tok/s;   5393 sec
[2021-04-25 05:24:51,477 INFO] Step 30550/50000; acc:  70.84; ppl:  2.72; xent: 1.00; lr: 0.00010; 12402/4786 tok/s;   5401 sec
[2021-04-25 05:24:59,884 INFO] Step 30600/50000; acc:  71.49; ppl:  2.65; xent: 0.98; lr: 0.00010; 12017/4773 tok/s;   5410 sec
[2021-04-25 05:25:04,270 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:25:08,933 INFO] Step 30650/50000; acc:  71.15; ppl:  2.71; xent: 1.00; lr: 0.00010; 11276/4457 tok/s;   5419 sec
[2021-04-25 05:25:17,607 INFO] Step 30700/50000; acc:  71.31; ppl:  2.68; xent: 0.99; lr: 0.00010; 11655/4581 tok/s;   5427 sec
[2021-04-25 05:25:27,177 INFO] Step 30750/50000; acc:  71.00; ppl:  2.70; xent: 0.99; lr: 0.00010; 10562/4340 tok/s;   5437 sec
[2021-04-25 05:25:36,034 INFO] Step 30800/50000; acc:  71.19; ppl:  2.70; xent: 0.99; lr: 0.00010; 11548/4552 tok/s;   5446 sec
[2021-04-25 05:25:44,645 INFO] Step 30850/50000; acc:  71.43; ppl:  2.68; xent: 0.99; lr: 0.00010; 11543/4717 tok/s;   5454 sec
[2021-04-25 05:25:53,257 INFO] Step 30900/50000; acc:  70.94; ppl:  2.72; xent: 1.00; lr: 0.00010; 11988/4602 tok/s;   5463 sec
[2021-04-25 05:26:01,742 INFO] Step 30950/50000; acc:  71.22; ppl:  2.67; xent: 0.98; lr: 0.00010; 12058/4711 tok/s;   5472 sec
[2021-04-25 05:26:10,921 INFO] Step 31000/50000; acc:  70.81; ppl:  2.72; xent: 1.00; lr: 0.00010; 11365/4349 tok/s;   5481 sec
[2021-04-25 05:26:19,770 INFO] Step 31050/50000; acc:  71.83; ppl:  2.64; xent: 0.97; lr: 0.00010; 11113/4579 tok/s;   5490 sec
[2021-04-25 05:26:28,235 INFO] Step 31100/50000; acc:  71.27; ppl:  2.69; xent: 0.99; lr: 0.00010; 12168/4729 tok/s;   5498 sec
[2021-04-25 05:26:37,233 INFO] Step 31150/50000; acc:  71.16; ppl:  2.71; xent: 1.00; lr: 0.00010; 11455/4486 tok/s;   5507 sec
[2021-04-25 05:26:45,859 INFO] Step 31200/50000; acc:  71.82; ppl:  2.64; xent: 0.97; lr: 0.00010; 11509/4638 tok/s;   5516 sec
[2021-04-25 05:26:54,411 INFO] Step 31250/50000; acc:  71.48; ppl:  2.66; xent: 0.98; lr: 0.00010; 12019/4706 tok/s;   5524 sec
[2021-04-25 05:27:02,734 INFO] Step 31300/50000; acc:  71.54; ppl:  2.66; xent: 0.98; lr: 0.00010; 12032/4760 tok/s;   5533 sec
[2021-04-25 05:27:11,387 INFO] Step 31350/50000; acc:  71.59; ppl:  2.68; xent: 0.98; lr: 0.00010; 11972/4665 tok/s;   5541 sec
[2021-04-25 05:27:13,118 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:27:19,989 INFO] Step 31400/50000; acc:  71.36; ppl:  2.68; xent: 0.99; lr: 0.00010; 11842/4708 tok/s;   5550 sec
[2021-04-25 05:27:29,170 INFO] Step 31450/50000; acc:  71.16; ppl:  2.71; xent: 1.00; lr: 0.00010; 11075/4345 tok/s;   5559 sec
[2021-04-25 05:27:38,227 INFO] Step 31500/50000; acc:  71.69; ppl:  2.64; xent: 0.97; lr: 0.00010; 11031/4577 tok/s;   5568 sec
[2021-04-25 05:27:46,797 INFO] Step 31550/50000; acc:  71.12; ppl:  2.68; xent: 0.99; lr: 0.00010; 11837/4701 tok/s;   5577 sec
[2021-04-25 05:27:55,540 INFO] Step 31600/50000; acc:  71.40; ppl:  2.68; xent: 0.99; lr: 0.00010; 11713/4636 tok/s;   5585 sec
[2021-04-25 05:28:03,928 INFO] Step 31650/50000; acc:  71.65; ppl:  2.65; xent: 0.97; lr: 0.00010; 11859/4713 tok/s;   5594 sec
[2021-04-25 05:28:13,119 INFO] Step 31700/50000; acc:  71.02; ppl:  2.71; xent: 1.00; lr: 0.00010; 11485/4276 tok/s;   5603 sec
[2021-04-25 05:28:21,621 INFO] Step 31750/50000; acc:  71.59; ppl:  2.64; xent: 0.97; lr: 0.00010; 11887/4857 tok/s;   5611 sec
[2021-04-25 05:28:30,909 INFO] Step 31800/50000; acc:  71.27; ppl:  2.71; xent: 1.00; lr: 0.00010; 11166/4288 tok/s;   5621 sec
[2021-04-25 05:28:39,321 INFO] Step 31850/50000; acc:  71.64; ppl:  2.64; xent: 0.97; lr: 0.00010; 11721/4768 tok/s;   5629 sec
[2021-04-25 05:28:48,086 INFO] Step 31900/50000; acc:  71.34; ppl:  2.68; xent: 0.98; lr: 0.00010; 11685/4626 tok/s;   5638 sec
[2021-04-25 05:28:56,873 INFO] Step 31950/50000; acc:  71.70; ppl:  2.65; xent: 0.97; lr: 0.00010; 11815/4537 tok/s;   5647 sec
[2021-04-25 05:29:04,769 INFO] Step 32000/50000; acc:  72.03; ppl:  2.61; xent: 0.96; lr: 0.00010; 12553/4986 tok/s;   5655 sec
[2021-04-25 05:29:13,228 INFO] Step 32050/50000; acc:  71.73; ppl:  2.65; xent: 0.97; lr: 0.00010; 12041/4743 tok/s;   5663 sec
[2021-04-25 05:29:15,746 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:29:22,163 INFO] Step 32100/50000; acc:  71.65; ppl:  2.66; xent: 0.98; lr: 0.00010; 11332/4592 tok/s;   5672 sec
[2021-04-25 05:29:31,237 INFO] Step 32150/50000; acc:  71.49; ppl:  2.68; xent: 0.99; lr: 0.00010; 11416/4452 tok/s;   5681 sec
[2021-04-25 05:29:40,030 INFO] Step 32200/50000; acc:  71.26; ppl:  2.67; xent: 0.98; lr: 0.00010; 11463/4547 tok/s;   5690 sec
[2021-04-25 05:29:49,247 INFO] Step 32250/50000; acc:  71.66; ppl:  2.67; xent: 0.98; lr: 0.00010; 11023/4462 tok/s;   5699 sec
[2021-04-25 05:29:57,418 INFO] Step 32300/50000; acc:  71.47; ppl:  2.66; xent: 0.98; lr: 0.00010; 12275/4963 tok/s;   5707 sec
[2021-04-25 05:30:06,039 INFO] Step 32350/50000; acc:  71.66; ppl:  2.65; xent: 0.97; lr: 0.00010; 11716/4593 tok/s;   5716 sec
[2021-04-25 05:30:14,572 INFO] Step 32400/50000; acc:  71.57; ppl:  2.67; xent: 0.98; lr: 0.00010; 12136/4619 tok/s;   5724 sec
[2021-04-25 05:30:23,800 INFO] Step 32450/50000; acc:  71.54; ppl:  2.64; xent: 0.97; lr: 0.00010; 10978/4379 tok/s;   5734 sec
[2021-04-25 05:30:32,816 INFO] Step 32500/50000; acc:  71.62; ppl:  2.67; xent: 0.98; lr: 0.00010; 11483/4498 tok/s;   5743 sec
[2021-04-25 05:30:41,461 INFO] Step 32550/50000; acc:  71.28; ppl:  2.68; xent: 0.98; lr: 0.00010; 11708/4680 tok/s;   5751 sec
[2021-04-25 05:30:50,443 INFO] Step 32600/50000; acc:  71.63; ppl:  2.66; xent: 0.98; lr: 0.00010; 11516/4431 tok/s;   5760 sec
[2021-04-25 05:30:59,004 INFO] Step 32650/50000; acc:  71.79; ppl:  2.60; xent: 0.96; lr: 0.00010; 11502/4755 tok/s;   5769 sec
[2021-04-25 05:31:07,580 INFO] Step 32700/50000; acc:  71.76; ppl:  2.64; xent: 0.97; lr: 0.00010; 12072/4562 tok/s;   5777 sec
[2021-04-25 05:31:15,990 INFO] Step 32750/50000; acc:  71.89; ppl:  2.64; xent: 0.97; lr: 0.00010; 12174/4798 tok/s;   5786 sec
[2021-04-25 05:31:24,589 INFO] Step 32800/50000; acc:  72.00; ppl:  2.61; xent: 0.96; lr: 0.00010; 11554/4663 tok/s;   5794 sec
[2021-04-25 05:31:24,598 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:31:33,641 INFO] Step 32850/50000; acc:  71.78; ppl:  2.63; xent: 0.97; lr: 0.00010; 11295/4453 tok/s;   5803 sec
[2021-04-25 05:31:42,526 INFO] Step 32900/50000; acc:  71.59; ppl:  2.65; xent: 0.97; lr: 0.00010; 11388/4570 tok/s;   5812 sec
[2021-04-25 05:31:51,817 INFO] Step 32950/50000; acc:  71.28; ppl:  2.68; xent: 0.99; lr: 0.00010; 11029/4384 tok/s;   5822 sec
[2021-04-25 05:32:00,509 INFO] Step 33000/50000; acc:  71.78; ppl:  2.64; xent: 0.97; lr: 0.00010; 11665/4703 tok/s;   5830 sec
[2021-04-25 05:32:09,354 INFO] Step 33050/50000; acc:  71.51; ppl:  2.64; xent: 0.97; lr: 0.00010; 11519/4507 tok/s;   5839 sec
[2021-04-25 05:32:18,027 INFO] Step 33100/50000; acc:  71.84; ppl:  2.63; xent: 0.97; lr: 0.00010; 11525/4642 tok/s;   5848 sec
[2021-04-25 05:32:26,694 INFO] Step 33150/50000; acc:  71.70; ppl:  2.63; xent: 0.97; lr: 0.00010; 11881/4519 tok/s;   5857 sec
[2021-04-25 05:32:36,047 INFO] Step 33200/50000; acc:  71.28; ppl:  2.67; xent: 0.98; lr: 0.00010; 11074/4355 tok/s;   5866 sec
[2021-04-25 05:32:44,477 INFO] Step 33250/50000; acc:  72.38; ppl:  2.60; xent: 0.95; lr: 0.00010; 11752/4747 tok/s;   5874 sec
[2021-04-25 05:32:53,521 INFO] Step 33300/50000; acc:  71.25; ppl:  2.68; xent: 0.99; lr: 0.00010; 11542/4458 tok/s;   5883 sec
[2021-04-25 05:33:02,370 INFO] Step 33350/50000; acc:  71.86; ppl:  2.62; xent: 0.96; lr: 0.00010; 11421/4613 tok/s;   5892 sec
[2021-04-25 05:33:11,231 INFO] Step 33400/50000; acc:  71.68; ppl:  2.63; xent: 0.97; lr: 0.00010; 11734/4449 tok/s;   5901 sec
[2021-04-25 05:33:19,294 INFO] Step 33450/50000; acc:  72.38; ppl:  2.58; xent: 0.95; lr: 0.00010; 12168/4997 tok/s;   5909 sec
[2021-04-25 05:33:27,622 INFO] Step 33500/50000; acc:  71.74; ppl:  2.63; xent: 0.97; lr: 0.00010; 12265/4762 tok/s;   5917 sec
[2021-04-25 05:33:33,916 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:33:36,480 INFO] Step 33550/50000; acc:  71.76; ppl:  2.63; xent: 0.97; lr: 0.00010; 11683/4613 tok/s;   5926 sec
[2021-04-25 05:33:45,239 INFO] Step 33600/50000; acc:  72.16; ppl:  2.61; xent: 0.96; lr: 0.00010; 11351/4548 tok/s;   5935 sec
[2021-04-25 05:33:54,531 INFO] Step 33650/50000; acc:  71.35; ppl:  2.66; xent: 0.98; lr: 0.00010; 11001/4336 tok/s;   5944 sec
[2021-04-25 05:34:03,339 INFO] Step 33700/50000; acc:  72.06; ppl:  2.61; xent: 0.96; lr: 0.00010; 11364/4670 tok/s;   5953 sec
[2021-04-25 05:34:12,232 INFO] Step 33750/50000; acc:  71.57; ppl:  2.66; xent: 0.98; lr: 0.00010; 11610/4578 tok/s;   5962 sec
[2021-04-25 05:34:20,922 INFO] Step 33800/50000; acc:  71.80; ppl:  2.64; xent: 0.97; lr: 0.00010; 11649/4599 tok/s;   5971 sec
[2021-04-25 05:34:29,616 INFO] Step 33850/50000; acc:  71.74; ppl:  2.64; xent: 0.97; lr: 0.00010; 11788/4567 tok/s;   5979 sec
[2021-04-25 05:34:38,390 INFO] Step 33900/50000; acc:  71.77; ppl:  2.60; xent: 0.96; lr: 0.00010; 11583/4453 tok/s;   5988 sec
[2021-04-25 05:34:47,615 INFO] Step 33950/50000; acc:  71.90; ppl:  2.62; xent: 0.96; lr: 0.00010; 11059/4454 tok/s;   5997 sec
[2021-04-25 05:34:56,488 INFO] Step 34000/50000; acc:  71.85; ppl:  2.64; xent: 0.97; lr: 0.00010; 11556/4533 tok/s;   6006 sec
[2021-04-25 05:35:05,036 INFO] Step 34050/50000; acc:  72.48; ppl:  2.57; xent: 0.94; lr: 0.00010; 11667/4717 tok/s;   6015 sec
[2021-04-25 05:35:14,205 INFO] Step 34100/50000; acc:  71.68; ppl:  2.65; xent: 0.97; lr: 0.00010; 11331/4428 tok/s;   6024 sec
[2021-04-25 05:35:22,696 INFO] Step 34150/50000; acc:  72.12; ppl:  2.59; xent: 0.95; lr: 0.00010; 11946/4673 tok/s;   6033 sec
[2021-04-25 05:35:31,011 INFO] Step 34200/50000; acc:  71.80; ppl:  2.62; xent: 0.96; lr: 0.00010; 12427/4805 tok/s;   6041 sec
[2021-04-25 05:35:39,147 INFO] Step 34250/50000; acc:  72.61; ppl:  2.55; xent: 0.94; lr: 0.00010; 12004/4908 tok/s;   6049 sec
[2021-04-25 05:35:43,340 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:35:48,271 INFO] Step 34300/50000; acc:  71.58; ppl:  2.64; xent: 0.97; lr: 0.00010; 11302/4454 tok/s;   6058 sec
[2021-04-25 05:35:57,230 INFO] Step 34350/50000; acc:  71.61; ppl:  2.64; xent: 0.97; lr: 0.00010; 11576/4425 tok/s;   6067 sec
[2021-04-25 05:36:06,571 INFO] Step 34400/50000; acc:  72.12; ppl:  2.60; xent: 0.96; lr: 0.00010; 10572/4418 tok/s;   6076 sec
[2021-04-25 05:36:15,409 INFO] Step 34450/50000; acc:  71.96; ppl:  2.62; xent: 0.96; lr: 0.00010; 11518/4576 tok/s;   6085 sec
[2021-04-25 05:36:24,097 INFO] Step 34500/50000; acc:  71.67; ppl:  2.62; xent: 0.97; lr: 0.00010; 11557/4709 tok/s;   6094 sec
[2021-04-25 05:36:32,703 INFO] Step 34550/50000; acc:  71.92; ppl:  2.63; xent: 0.97; lr: 0.00010; 11970/4601 tok/s;   6103 sec
[2021-04-25 05:36:41,281 INFO] Step 34600/50000; acc:  72.03; ppl:  2.61; xent: 0.96; lr: 0.00010; 11995/4595 tok/s;   6111 sec
[2021-04-25 05:36:50,380 INFO] Step 34650/50000; acc:  71.75; ppl:  2.62; xent: 0.96; lr: 0.00010; 11274/4433 tok/s;   6120 sec
[2021-04-25 05:36:59,341 INFO] Step 34700/50000; acc:  72.00; ppl:  2.61; xent: 0.96; lr: 0.00010; 11222/4523 tok/s;   6129 sec
[2021-04-25 05:37:07,779 INFO] Step 34750/50000; acc:  72.28; ppl:  2.60; xent: 0.96; lr: 0.00010; 12076/4752 tok/s;   6138 sec
[2021-04-25 05:37:16,751 INFO] Step 34800/50000; acc:  71.78; ppl:  2.64; xent: 0.97; lr: 0.00010; 11447/4460 tok/s;   6147 sec
[2021-04-25 05:37:25,460 INFO] Step 34850/50000; acc:  72.36; ppl:  2.56; xent: 0.94; lr: 0.00010; 11441/4648 tok/s;   6155 sec
[2021-04-25 05:37:33,976 INFO] Step 34900/50000; acc:  72.06; ppl:  2.62; xent: 0.96; lr: 0.00010; 12227/4656 tok/s;   6164 sec
[2021-04-25 05:37:42,313 INFO] Step 34950/50000; acc:  71.93; ppl:  2.59; xent: 0.95; lr: 0.00010; 12072/4782 tok/s;   6172 sec
[2021-04-25 05:37:51,085 INFO] Step 35000/50000; acc:  72.03; ppl:  2.61; xent: 0.96; lr: 0.00010; 11808/4654 tok/s;   6181 sec
[2021-04-25 05:37:51,089 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/loose/valid.txt, align=None)...
[2021-04-25 05:37:58,294 INFO] Validation perplexity: 2.92731
[2021-04-25 05:37:58,294 INFO] Validation accuracy: 70.0322
[2021-04-25 05:37:58,296 INFO] Saving checkpoint ../models/group1_params/loose_ops/model_step_35000.pt
[2021-04-25 05:37:59,983 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:38:07,281 INFO] Step 35050/50000; acc:  72.45; ppl:  2.56; xent: 0.94; lr: 0.00010; 6082/2469 tok/s;   6197 sec
[2021-04-25 05:38:16,459 INFO] Step 35100/50000; acc:  71.80; ppl:  2.65; xent: 0.97; lr: 0.00010; 11174/4372 tok/s;   6206 sec
[2021-04-25 05:38:25,780 INFO] Step 35150/50000; acc:  72.01; ppl:  2.61; xent: 0.96; lr: 0.00010; 11001/4417 tok/s;   6216 sec
[2021-04-25 05:38:34,006 INFO] Step 35200/50000; acc:  72.32; ppl:  2.58; xent: 0.95; lr: 0.00010; 12065/4897 tok/s;   6224 sec
[2021-04-25 05:38:42,735 INFO] Step 35250/50000; acc:  71.99; ppl:  2.60; xent: 0.96; lr: 0.00010; 11655/4653 tok/s;   6233 sec
[2021-04-25 05:38:51,233 INFO] Step 35300/50000; acc:  72.16; ppl:  2.59; xent: 0.95; lr: 0.00010; 11869/4637 tok/s;   6241 sec
[2021-04-25 05:39:00,516 INFO] Step 35350/50000; acc:  71.80; ppl:  2.61; xent: 0.96; lr: 0.00010; 11331/4287 tok/s;   6250 sec
[2021-04-25 05:39:09,192 INFO] Step 35400/50000; acc:  72.28; ppl:  2.57; xent: 0.94; lr: 0.00010; 11714/4698 tok/s;   6259 sec
[2021-04-25 05:39:18,281 INFO] Step 35450/50000; acc:  71.92; ppl:  2.62; xent: 0.96; lr: 0.00010; 11223/4367 tok/s;   6268 sec
[2021-04-25 05:39:26,981 INFO] Step 35500/50000; acc:  72.32; ppl:  2.58; xent: 0.95; lr: 0.00010; 11586/4688 tok/s;   6277 sec
[2021-04-25 05:39:35,599 INFO] Step 35550/50000; acc:  72.20; ppl:  2.59; xent: 0.95; lr: 0.00010; 11739/4679 tok/s;   6285 sec
[2021-04-25 05:39:44,263 INFO] Step 35600/50000; acc:  72.20; ppl:  2.59; xent: 0.95; lr: 0.00010; 11971/4557 tok/s;   6294 sec
[2021-04-25 05:39:52,354 INFO] Step 35650/50000; acc:  72.59; ppl:  2.53; xent: 0.93; lr: 0.00010; 12247/4936 tok/s;   6302 sec
[2021-04-25 05:40:00,908 INFO] Step 35700/50000; acc:  72.03; ppl:  2.60; xent: 0.95; lr: 0.00010; 12087/4684 tok/s;   6311 sec
[2021-04-25 05:40:02,923 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:40:09,649 INFO] Step 35750/50000; acc:  72.18; ppl:  2.58; xent: 0.95; lr: 0.00010; 11620/4664 tok/s;   6319 sec
[2021-04-25 05:40:18,959 INFO] Step 35800/50000; acc:  72.02; ppl:  2.62; xent: 0.96; lr: 0.00010; 11132/4354 tok/s;   6329 sec
[2021-04-25 05:40:27,726 INFO] Step 35850/50000; acc:  72.50; ppl:  2.55; xent: 0.94; lr: 0.00010; 11130/4536 tok/s;   6338 sec
[2021-04-25 05:40:36,808 INFO] Step 35900/50000; acc:  72.00; ppl:  2.61; xent: 0.96; lr: 0.00010; 11299/4541 tok/s;   6347 sec
[2021-04-25 05:40:45,266 INFO] Step 35950/50000; acc:  72.06; ppl:  2.59; xent: 0.95; lr: 0.00010; 12172/4795 tok/s;   6355 sec
[2021-04-25 05:40:53,587 INFO] Step 36000/50000; acc:  72.41; ppl:  2.56; xent: 0.94; lr: 0.00010; 11833/4760 tok/s;   6363 sec
[2021-04-25 05:41:02,225 INFO] Step 36050/50000; acc:  72.25; ppl:  2.58; xent: 0.95; lr: 0.00010; 11953/4542 tok/s;   6372 sec
[2021-04-25 05:41:11,552 INFO] Step 36100/50000; acc:  72.05; ppl:  2.58; xent: 0.95; lr: 0.00010; 10965/4336 tok/s;   6381 sec
[2021-04-25 05:41:20,539 INFO] Step 36150/50000; acc:  72.27; ppl:  2.60; xent: 0.95; lr: 0.00010; 11487/4546 tok/s;   6390 sec
[2021-04-25 05:41:29,479 INFO] Step 36200/50000; acc:  72.17; ppl:  2.59; xent: 0.95; lr: 0.00010; 11383/4504 tok/s;   6399 sec
[2021-04-25 05:41:38,041 INFO] Step 36250/50000; acc:  72.27; ppl:  2.56; xent: 0.94; lr: 0.00010; 11901/4668 tok/s;   6408 sec
[2021-04-25 05:41:46,890 INFO] Step 36300/50000; acc:  72.38; ppl:  2.56; xent: 0.94; lr: 0.00010; 11373/4565 tok/s;   6417 sec
[2021-04-25 05:41:55,228 INFO] Step 36350/50000; acc:  72.35; ppl:  2.55; xent: 0.94; lr: 0.00010; 12283/4744 tok/s;   6425 sec
[2021-04-25 05:42:03,552 INFO] Step 36400/50000; acc:  72.33; ppl:  2.58; xent: 0.95; lr: 0.00010; 12257/4783 tok/s;   6433 sec
[2021-04-25 05:42:11,889 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:42:12,209 INFO] Step 36450/50000; acc:  72.70; ppl:  2.53; xent: 0.93; lr: 0.00010; 11491/4679 tok/s;   6442 sec
[2021-04-25 05:42:21,154 INFO] Step 36500/50000; acc:  72.28; ppl:  2.59; xent: 0.95; lr: 0.00010; 11600/4479 tok/s;   6451 sec
[2021-04-25 05:42:30,286 INFO] Step 36550/50000; acc:  72.38; ppl:  2.59; xent: 0.95; lr: 0.00010; 11124/4496 tok/s;   6460 sec
[2021-04-25 05:42:39,483 INFO] Step 36600/50000; acc:  72.07; ppl:  2.59; xent: 0.95; lr: 0.00010; 11144/4391 tok/s;   6469 sec
[2021-04-25 05:42:47,964 INFO] Step 36650/50000; acc:  72.63; ppl:  2.54; xent: 0.93; lr: 0.00010; 11570/4810 tok/s;   6478 sec
[2021-04-25 05:42:56,837 INFO] Step 36700/50000; acc:  72.18; ppl:  2.60; xent: 0.95; lr: 0.00010; 11583/4481 tok/s;   6487 sec
[2021-04-25 05:43:05,689 INFO] Step 36750/50000; acc:  72.29; ppl:  2.58; xent: 0.95; lr: 0.00010; 11608/4550 tok/s;   6496 sec
[2021-04-25 05:43:14,211 INFO] Step 36800/50000; acc:  72.41; ppl:  2.54; xent: 0.93; lr: 0.00010; 11802/4618 tok/s;   6504 sec
[2021-04-25 05:43:23,657 INFO] Step 36850/50000; acc:  71.81; ppl:  2.60; xent: 0.95; lr: 0.00010; 10922/4313 tok/s;   6513 sec
[2021-04-25 05:43:32,022 INFO] Step 36900/50000; acc:  72.68; ppl:  2.53; xent: 0.93; lr: 0.00010; 11950/4780 tok/s;   6522 sec
[2021-04-25 05:43:40,992 INFO] Step 36950/50000; acc:  72.17; ppl:  2.60; xent: 0.95; lr: 0.00010; 11608/4532 tok/s;   6531 sec
[2021-04-25 05:43:49,952 INFO] Step 37000/50000; acc:  72.58; ppl:  2.55; xent: 0.94; lr: 0.00010; 11322/4520 tok/s;   6540 sec
[2021-04-25 05:43:58,720 INFO] Step 37050/50000; acc:  72.59; ppl:  2.54; xent: 0.93; lr: 0.00010; 11673/4482 tok/s;   6549 sec
[2021-04-25 05:44:06,931 INFO] Step 37100/50000; acc:  72.35; ppl:  2.56; xent: 0.94; lr: 0.00010; 12228/4949 tok/s;   6557 sec
[2021-04-25 05:44:15,178 INFO] Step 37150/50000; acc:  72.47; ppl:  2.55; xent: 0.93; lr: 0.00010; 12245/4768 tok/s;   6565 sec
[2021-04-25 05:44:21,277 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:44:24,245 INFO] Step 37200/50000; acc:  72.48; ppl:  2.56; xent: 0.94; lr: 0.00010; 11395/4496 tok/s;   6574 sec
[2021-04-25 05:44:32,976 INFO] Step 37250/50000; acc:  72.59; ppl:  2.55; xent: 0.94; lr: 0.00010; 11430/4599 tok/s;   6583 sec
[2021-04-25 05:44:42,458 INFO] Step 37300/50000; acc:  71.73; ppl:  2.61; xent: 0.96; lr: 0.00010; 10913/4282 tok/s;   6592 sec
[2021-04-25 05:44:51,031 INFO] Step 37350/50000; acc:  72.69; ppl:  2.54; xent: 0.93; lr: 0.00010; 11711/4752 tok/s;   6601 sec
[2021-04-25 05:44:59,953 INFO] Step 37400/50000; acc:  72.30; ppl:  2.57; xent: 0.95; lr: 0.00010; 11586/4568 tok/s;   6610 sec
[2021-04-25 05:45:08,446 INFO] Step 37450/50000; acc:  72.62; ppl:  2.53; xent: 0.93; lr: 0.00010; 11514/4658 tok/s;   6618 sec
[2021-04-25 05:45:17,198 INFO] Step 37500/50000; acc:  72.19; ppl:  2.58; xent: 0.95; lr: 0.00010; 11836/4590 tok/s;   6627 sec
[2021-04-25 05:45:26,187 INFO] Step 37550/50000; acc:  72.24; ppl:  2.56; xent: 0.94; lr: 0.00010; 11619/4334 tok/s;   6636 sec
[2021-04-25 05:45:35,208 INFO] Step 37600/50000; acc:  72.87; ppl:  2.53; xent: 0.93; lr: 0.00010; 11037/4569 tok/s;   6645 sec
[2021-04-25 05:45:44,022 INFO] Step 37650/50000; acc:  72.53; ppl:  2.56; xent: 0.94; lr: 0.00010; 11569/4519 tok/s;   6654 sec
[2021-04-25 05:45:52,540 INFO] Step 37700/50000; acc:  72.85; ppl:  2.51; xent: 0.92; lr: 0.00010; 11843/4777 tok/s;   6662 sec
[2021-04-25 05:46:01,613 INFO] Step 37750/50000; acc:  72.43; ppl:  2.57; xent: 0.94; lr: 0.00010; 11420/4420 tok/s;   6671 sec
[2021-04-25 05:46:10,213 INFO] Step 37800/50000; acc:  72.62; ppl:  2.53; xent: 0.93; lr: 0.00010; 11846/4657 tok/s;   6680 sec
[2021-04-25 05:46:18,386 INFO] Step 37850/50000; acc:  72.58; ppl:  2.54; xent: 0.93; lr: 0.00010; 12441/4869 tok/s;   6688 sec
[2021-04-25 05:46:26,722 INFO] Step 37900/50000; acc:  72.77; ppl:  2.53; xent: 0.93; lr: 0.00010; 11982/4821 tok/s;   6697 sec
[2021-04-25 05:46:30,461 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:46:35,627 INFO] Step 37950/50000; acc:  72.27; ppl:  2.56; xent: 0.94; lr: 0.00010; 11462/4553 tok/s;   6705 sec
[2021-04-25 05:46:44,779 INFO] Step 38000/50000; acc:  72.34; ppl:  2.57; xent: 0.95; lr: 0.00010; 11268/4344 tok/s;   6715 sec
[2021-04-25 05:46:53,954 INFO] Step 38050/50000; acc:  72.71; ppl:  2.53; xent: 0.93; lr: 0.00010; 10800/4491 tok/s;   6724 sec
[2021-04-25 05:47:02,948 INFO] Step 38100/50000; acc:  72.68; ppl:  2.55; xent: 0.94; lr: 0.00010; 11478/4525 tok/s;   6733 sec
[2021-04-25 05:47:11,426 INFO] Step 38150/50000; acc:  72.50; ppl:  2.54; xent: 0.93; lr: 0.00010; 11894/4790 tok/s;   6741 sec
[2021-04-25 05:47:20,060 INFO] Step 38200/50000; acc:  72.52; ppl:  2.56; xent: 0.94; lr: 0.00010; 11936/4576 tok/s;   6750 sec
[2021-04-25 05:47:28,416 INFO] Step 38250/50000; acc:  73.20; ppl:  2.50; xent: 0.91; lr: 0.00010; 11930/4691 tok/s;   6758 sec
[2021-04-25 05:47:37,409 INFO] Step 38300/50000; acc:  72.58; ppl:  2.55; xent: 0.93; lr: 0.00010; 11508/4481 tok/s;   6767 sec
[2021-04-25 05:47:46,636 INFO] Step 38350/50000; acc:  72.44; ppl:  2.57; xent: 0.94; lr: 0.00010; 11187/4425 tok/s;   6776 sec
[2021-04-25 05:47:54,946 INFO] Step 38400/50000; acc:  72.92; ppl:  2.51; xent: 0.92; lr: 0.00010; 11984/4803 tok/s;   6785 sec
[2021-04-25 05:48:03,921 INFO] Step 38450/50000; acc:  72.69; ppl:  2.54; xent: 0.93; lr: 0.00010; 11374/4489 tok/s;   6794 sec
[2021-04-25 05:48:12,615 INFO] Step 38500/50000; acc:  73.03; ppl:  2.51; xent: 0.92; lr: 0.00010; 11610/4669 tok/s;   6802 sec
[2021-04-25 05:48:20,989 INFO] Step 38550/50000; acc:  72.77; ppl:  2.52; xent: 0.92; lr: 0.00010; 12376/4710 tok/s;   6811 sec
[2021-04-25 05:48:29,536 INFO] Step 38600/50000; acc:  72.61; ppl:  2.53; xent: 0.93; lr: 0.00010; 11837/4670 tok/s;   6819 sec
[2021-04-25 05:48:38,248 INFO] Step 38650/50000; acc:  72.80; ppl:  2.53; xent: 0.93; lr: 0.00010; 11710/4669 tok/s;   6828 sec
[2021-04-25 05:48:39,211 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:48:46,953 INFO] Step 38700/50000; acc:  72.73; ppl:  2.54; xent: 0.93; lr: 0.00010; 11579/4638 tok/s;   6837 sec
[2021-04-25 05:48:55,947 INFO] Step 38750/50000; acc:  72.55; ppl:  2.54; xent: 0.93; lr: 0.00010; 11258/4435 tok/s;   6846 sec
[2021-04-25 05:49:05,233 INFO] Step 38800/50000; acc:  72.68; ppl:  2.54; xent: 0.93; lr: 0.00010; 11024/4426 tok/s;   6855 sec
[2021-04-25 05:49:13,580 INFO] Step 38850/50000; acc:  72.83; ppl:  2.51; xent: 0.92; lr: 0.00010; 11899/4838 tok/s;   6863 sec
[2021-04-25 05:49:22,380 INFO] Step 38900/50000; acc:  72.54; ppl:  2.55; xent: 0.94; lr: 0.00010; 11744/4594 tok/s;   6872 sec
[2021-04-25 05:49:30,702 INFO] Step 38950/50000; acc:  72.68; ppl:  2.53; xent: 0.93; lr: 0.00010; 12167/4755 tok/s;   6881 sec
[2021-04-25 05:49:40,006 INFO] Step 39000/50000; acc:  72.57; ppl:  2.54; xent: 0.93; lr: 0.00010; 11303/4283 tok/s;   6890 sec
[2021-04-25 05:49:48,518 INFO] Step 39050/50000; acc:  73.31; ppl:  2.47; xent: 0.90; lr: 0.00010; 11553/4758 tok/s;   6898 sec
[2021-04-25 05:49:57,410 INFO] Step 39100/50000; acc:  72.36; ppl:  2.56; xent: 0.94; lr: 0.00010; 11567/4504 tok/s;   6907 sec
[2021-04-25 05:50:06,293 INFO] Step 39150/50000; acc:  72.22; ppl:  2.55; xent: 0.94; lr: 0.00010; 11646/4554 tok/s;   6916 sec
[2021-04-25 05:50:14,786 INFO] Step 39200/50000; acc:  73.27; ppl:  2.48; xent: 0.91; lr: 0.00010; 11644/4749 tok/s;   6925 sec
[2021-04-25 05:50:23,518 INFO] Step 39250/50000; acc:  72.83; ppl:  2.50; xent: 0.92; lr: 0.00010; 11818/4553 tok/s;   6933 sec
[2021-04-25 05:50:31,703 INFO] Step 39300/50000; acc:  73.07; ppl:  2.49; xent: 0.91; lr: 0.00010; 12259/4862 tok/s;   6942 sec
[2021-04-25 05:50:40,391 INFO] Step 39350/50000; acc:  72.46; ppl:  2.55; xent: 0.93; lr: 0.00010; 11861/4647 tok/s;   6950 sec
[2021-04-25 05:50:42,002 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:50:49,067 INFO] Step 39400/50000; acc:  72.85; ppl:  2.53; xent: 0.93; lr: 0.00010; 11750/4680 tok/s;   6959 sec
[2021-04-25 05:50:58,065 INFO] Step 39450/50000; acc:  72.90; ppl:  2.53; xent: 0.93; lr: 0.00010; 11350/4461 tok/s;   6968 sec
[2021-04-25 05:51:06,840 INFO] Step 39500/50000; acc:  72.92; ppl:  2.51; xent: 0.92; lr: 0.00010; 11351/4579 tok/s;   6977 sec
[2021-04-25 05:51:15,930 INFO] Step 39550/50000; acc:  72.75; ppl:  2.53; xent: 0.93; lr: 0.00010; 11171/4536 tok/s;   6986 sec
[2021-04-25 05:51:24,496 INFO] Step 39600/50000; acc:  72.46; ppl:  2.54; xent: 0.93; lr: 0.00010; 11987/4714 tok/s;   6994 sec
[2021-04-25 05:51:32,811 INFO] Step 39650/50000; acc:  72.95; ppl:  2.48; xent: 0.91; lr: 0.00010; 11886/4767 tok/s;   7003 sec
[2021-04-25 05:51:41,669 INFO] Step 39700/50000; acc:  72.84; ppl:  2.53; xent: 0.93; lr: 0.00010; 11817/4434 tok/s;   7011 sec
[2021-04-25 05:51:51,020 INFO] Step 39750/50000; acc:  72.62; ppl:  2.51; xent: 0.92; lr: 0.00010; 10946/4356 tok/s;   7021 sec
[2021-04-25 05:51:59,907 INFO] Step 39800/50000; acc:  72.96; ppl:  2.52; xent: 0.92; lr: 0.00010; 11638/4544 tok/s;   7030 sec
[2021-04-25 05:52:08,834 INFO] Step 39850/50000; acc:  73.14; ppl:  2.48; xent: 0.91; lr: 0.00010; 11017/4516 tok/s;   7039 sec
[2021-04-25 05:52:17,427 INFO] Step 39900/50000; acc:  72.91; ppl:  2.51; xent: 0.92; lr: 0.00010; 11983/4661 tok/s;   7047 sec
[2021-04-25 05:52:26,584 INFO] Step 39950/50000; acc:  72.94; ppl:  2.50; xent: 0.92; lr: 0.00010; 11298/4411 tok/s;   7056 sec
[2021-04-25 05:52:34,652 INFO] Step 40000/50000; acc:  73.12; ppl:  2.47; xent: 0.90; lr: 0.00010; 12404/4899 tok/s;   7064 sec
[2021-04-25 05:52:34,655 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/loose/valid.txt, align=None)...
[2021-04-25 05:52:41,864 INFO] Validation perplexity: 2.93475
[2021-04-25 05:52:41,864 INFO] Validation accuracy: 70.1869
[2021-04-25 05:52:41,867 INFO] Saving checkpoint ../models/group1_params/loose_ops/model_step_40000.pt
[2021-04-25 05:52:50,707 INFO] Step 40050/50000; acc:  73.07; ppl:  2.51; xent: 0.92; lr: 0.00010; 6308/2471 tok/s;   7081 sec
[2021-04-25 05:52:58,873 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:52:59,452 INFO] Step 40100/50000; acc:  72.96; ppl:  2.49; xent: 0.91; lr: 0.00010; 11529/4640 tok/s;   7089 sec
[2021-04-25 05:53:08,458 INFO] Step 40150/50000; acc:  72.78; ppl:  2.52; xent: 0.92; lr: 0.00010; 11469/4474 tok/s;   7098 sec
[2021-04-25 05:53:17,654 INFO] Step 40200/50000; acc:  72.74; ppl:  2.53; xent: 0.93; lr: 0.00010; 11090/4437 tok/s;   7107 sec
[2021-04-25 05:53:26,786 INFO] Step 40250/50000; acc:  72.84; ppl:  2.50; xent: 0.92; lr: 0.00010; 11068/4437 tok/s;   7117 sec
[2021-04-25 05:53:35,304 INFO] Step 40300/50000; acc:  72.98; ppl:  2.50; xent: 0.92; lr: 0.00010; 11767/4787 tok/s;   7125 sec
[2021-04-25 05:53:44,033 INFO] Step 40350/50000; acc:  73.04; ppl:  2.50; xent: 0.92; lr: 0.00010; 11629/4551 tok/s;   7134 sec
[2021-04-25 05:53:52,972 INFO] Step 40400/50000; acc:  72.73; ppl:  2.53; xent: 0.93; lr: 0.00010; 11492/4487 tok/s;   7143 sec
[2021-04-25 05:54:01,337 INFO] Step 40450/50000; acc:  73.26; ppl:  2.46; xent: 0.90; lr: 0.00010; 12045/4720 tok/s;   7151 sec
[2021-04-25 05:54:10,936 INFO] Step 40500/50000; acc:  72.41; ppl:  2.55; xent: 0.94; lr: 0.00010; 10891/4249 tok/s;   7161 sec
[2021-04-25 05:54:19,469 INFO] Step 40550/50000; acc:  73.47; ppl:  2.47; xent: 0.90; lr: 0.00010; 11766/4740 tok/s;   7169 sec
[2021-04-25 05:54:28,349 INFO] Step 40600/50000; acc:  72.70; ppl:  2.52; xent: 0.92; lr: 0.00010; 11720/4536 tok/s;   7178 sec
[2021-04-25 05:54:37,045 INFO] Step 40650/50000; acc:  73.82; ppl:  2.45; xent: 0.89; lr: 0.00010; 11288/4622 tok/s;   7187 sec
[2021-04-25 05:54:45,875 INFO] Step 40700/50000; acc:  72.89; ppl:  2.49; xent: 0.91; lr: 0.00010; 11696/4505 tok/s;   7196 sec
[2021-04-25 05:54:54,201 INFO] Step 40750/50000; acc:  72.83; ppl:  2.52; xent: 0.92; lr: 0.00010; 12386/4830 tok/s;   7204 sec
[2021-04-25 05:55:02,559 INFO] Step 40800/50000; acc:  73.48; ppl:  2.45; xent: 0.90; lr: 0.00010; 11788/4743 tok/s;   7212 sec
[2021-04-25 05:55:08,333 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:55:11,588 INFO] Step 40850/50000; acc:  73.14; ppl:  2.48; xent: 0.91; lr: 0.00010; 11378/4505 tok/s;   7221 sec
[2021-04-25 05:55:20,363 INFO] Step 40900/50000; acc:  73.14; ppl:  2.49; xent: 0.91; lr: 0.00010; 11540/4598 tok/s;   7230 sec
[2021-04-25 05:55:29,825 INFO] Step 40950/50000; acc:  72.45; ppl:  2.52; xent: 0.93; lr: 0.00010; 10878/4288 tok/s;   7240 sec
[2021-04-25 05:55:38,447 INFO] Step 41000/50000; acc:  73.36; ppl:  2.47; xent: 0.90; lr: 0.00010; 11702/4691 tok/s;   7248 sec
[2021-04-25 05:55:47,327 INFO] Step 41050/50000; acc:  73.06; ppl:  2.50; xent: 0.92; lr: 0.00010; 11476/4563 tok/s;   7257 sec
[2021-04-25 05:55:55,993 INFO] Step 41100/50000; acc:  73.05; ppl:  2.48; xent: 0.91; lr: 0.00010; 11534/4621 tok/s;   7266 sec
[2021-04-25 05:56:04,496 INFO] Step 41150/50000; acc:  72.91; ppl:  2.49; xent: 0.91; lr: 0.00010; 12029/4673 tok/s;   7274 sec
[2021-04-25 05:56:13,640 INFO] Step 41200/50000; acc:  72.62; ppl:  2.51; xent: 0.92; lr: 0.00010; 11408/4289 tok/s;   7283 sec
[2021-04-25 05:56:22,715 INFO] Step 41250/50000; acc:  73.20; ppl:  2.48; xent: 0.91; lr: 0.00010; 10983/4558 tok/s;   7293 sec
[2021-04-25 05:56:31,497 INFO] Step 41300/50000; acc:  72.89; ppl:  2.50; xent: 0.92; lr: 0.00010; 11800/4513 tok/s;   7301 sec
[2021-04-25 05:56:40,206 INFO] Step 41350/50000; acc:  73.25; ppl:  2.46; xent: 0.90; lr: 0.00010; 11608/4711 tok/s;   7310 sec
[2021-04-25 05:56:49,188 INFO] Step 41400/50000; acc:  73.15; ppl:  2.50; xent: 0.92; lr: 0.00010; 11547/4426 tok/s;   7319 sec
[2021-04-25 05:56:57,657 INFO] Step 41450/50000; acc:  74.04; ppl:  2.40; xent: 0.88; lr: 0.00010; 11631/4719 tok/s;   7327 sec
[2021-04-25 05:57:06,003 INFO] Step 41500/50000; acc:  73.00; ppl:  2.48; xent: 0.91; lr: 0.00010; 12281/4783 tok/s;   7336 sec
[2021-04-25 05:57:14,413 INFO] Step 41550/50000; acc:  73.23; ppl:  2.47; xent: 0.90; lr: 0.00010; 12221/4801 tok/s;   7344 sec
[2021-04-25 05:57:17,738 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:57:23,271 INFO] Step 41600/50000; acc:  73.30; ppl:  2.47; xent: 0.90; lr: 0.00010; 11226/4580 tok/s;   7353 sec
[2021-04-25 05:57:32,146 INFO] Step 41650/50000; acc:  73.13; ppl:  2.50; xent: 0.92; lr: 0.00010; 11575/4457 tok/s;   7362 sec
[2021-04-25 05:57:41,434 INFO] Step 41700/50000; acc:  73.24; ppl:  2.47; xent: 0.90; lr: 0.00010; 10792/4476 tok/s;   7371 sec
[2021-04-25 05:57:50,299 INFO] Step 41750/50000; acc:  73.10; ppl:  2.49; xent: 0.91; lr: 0.00010; 11618/4561 tok/s;   7380 sec
[2021-04-25 05:57:58,957 INFO] Step 41800/50000; acc:  73.00; ppl:  2.49; xent: 0.91; lr: 0.00010; 11705/4694 tok/s;   7389 sec
[2021-04-25 05:58:07,491 INFO] Step 41850/50000; acc:  73.27; ppl:  2.47; xent: 0.90; lr: 0.00010; 11892/4612 tok/s;   7397 sec
[2021-04-25 05:58:16,101 INFO] Step 41900/50000; acc:  73.44; ppl:  2.45; xent: 0.90; lr: 0.00010; 11844/4617 tok/s;   7406 sec
[2021-04-25 05:58:24,893 INFO] Step 41950/50000; acc:  73.48; ppl:  2.45; xent: 0.90; lr: 0.00010; 11591/4555 tok/s;   7415 sec
[2021-04-25 05:58:33,852 INFO] Step 42000/50000; acc:  73.03; ppl:  2.50; xent: 0.92; lr: 0.00010; 11527/4486 tok/s;   7424 sec
[2021-04-25 05:58:42,325 INFO] Step 42050/50000; acc:  73.70; ppl:  2.43; xent: 0.89; lr: 0.00010; 11770/4780 tok/s;   7432 sec
[2021-04-25 05:58:51,181 INFO] Step 42100/50000; acc:  73.16; ppl:  2.50; xent: 0.92; lr: 0.00010; 11686/4541 tok/s;   7441 sec
[2021-04-25 05:59:00,018 INFO] Step 42150/50000; acc:  73.65; ppl:  2.43; xent: 0.89; lr: 0.00010; 11480/4592 tok/s;   7450 sec
[2021-04-25 05:59:08,312 INFO] Step 42200/50000; acc:  73.21; ppl:  2.48; xent: 0.91; lr: 0.00010; 12494/4744 tok/s;   7458 sec
[2021-04-25 05:59:16,763 INFO] Step 42250/50000; acc:  73.79; ppl:  2.41; xent: 0.88; lr: 0.00010; 11568/4751 tok/s;   7467 sec
[2021-04-25 05:59:20,428 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 05:59:25,598 INFO] Step 42300/50000; acc:  73.11; ppl:  2.48; xent: 0.91; lr: 0.00010; 11673/4582 tok/s;   7475 sec
[2021-04-25 05:59:34,679 INFO] Step 42350/50000; acc:  73.05; ppl:  2.49; xent: 0.91; lr: 0.00010; 11387/4466 tok/s;   7485 sec
[2021-04-25 05:59:43,178 INFO] Step 42400/50000; acc:  73.34; ppl:  2.45; xent: 0.90; lr: 0.00010; 11627/4696 tok/s;   7493 sec
[2021-04-25 05:59:52,376 INFO] Step 42450/50000; acc:  73.23; ppl:  2.47; xent: 0.91; lr: 0.00010; 11072/4484 tok/s;   7502 sec
[2021-04-25 06:00:00,675 INFO] Step 42500/50000; acc:  73.16; ppl:  2.47; xent: 0.90; lr: 0.00010; 12117/4823 tok/s;   7511 sec
[2021-04-25 06:00:09,503 INFO] Step 42550/50000; acc:  73.23; ppl:  2.48; xent: 0.91; lr: 0.00010; 11666/4571 tok/s;   7519 sec
[2021-04-25 06:00:17,992 INFO] Step 42600/50000; acc:  73.35; ppl:  2.46; xent: 0.90; lr: 0.00010; 11982/4631 tok/s;   7528 sec
[2021-04-25 06:00:27,168 INFO] Step 42650/50000; acc:  73.35; ppl:  2.46; xent: 0.90; lr: 0.00010; 11313/4360 tok/s;   7537 sec
[2021-04-25 06:00:35,970 INFO] Step 42700/50000; acc:  73.65; ppl:  2.43; xent: 0.89; lr: 0.00010; 11403/4647 tok/s;   7546 sec
[2021-04-25 06:00:44,742 INFO] Step 42750/50000; acc:  73.36; ppl:  2.47; xent: 0.91; lr: 0.00010; 11586/4529 tok/s;   7555 sec
[2021-04-25 06:00:53,717 INFO] Step 42800/50000; acc:  73.31; ppl:  2.47; xent: 0.91; lr: 0.00010; 11495/4513 tok/s;   7564 sec
[2021-04-25 06:01:02,265 INFO] Step 42850/50000; acc:  73.67; ppl:  2.42; xent: 0.88; lr: 0.00010; 11633/4732 tok/s;   7572 sec
[2021-04-25 06:01:11,114 INFO] Step 42900/50000; acc:  73.31; ppl:  2.46; xent: 0.90; lr: 0.00010; 11801/4470 tok/s;   7581 sec
[2021-04-25 06:01:19,330 INFO] Step 42950/50000; acc:  73.77; ppl:  2.43; xent: 0.89; lr: 0.00010; 12249/4869 tok/s;   7589 sec
[2021-04-25 06:01:28,131 INFO] Step 43000/50000; acc:  72.84; ppl:  2.48; xent: 0.91; lr: 0.00010; 11713/4585 tok/s;   7598 sec
[2021-04-25 06:01:29,242 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 06:01:36,651 INFO] Step 43050/50000; acc:  74.10; ppl:  2.40; xent: 0.88; lr: 0.00010; 11564/4744 tok/s;   7606 sec
[2021-04-25 06:01:45,717 INFO] Step 43100/50000; acc:  73.36; ppl:  2.47; xent: 0.90; lr: 0.00010; 11378/4450 tok/s;   7616 sec
[2021-04-25 06:01:55,104 INFO] Step 43150/50000; acc:  73.24; ppl:  2.46; xent: 0.90; lr: 0.00010; 10906/4285 tok/s;   7625 sec
[2021-04-25 06:02:03,716 INFO] Step 43200/50000; acc:  73.70; ppl:  2.45; xent: 0.89; lr: 0.00010; 11509/4758 tok/s;   7634 sec
[2021-04-25 06:02:12,292 INFO] Step 43250/50000; acc:  73.15; ppl:  2.47; xent: 0.90; lr: 0.00010; 11902/4717 tok/s;   7642 sec
[2021-04-25 06:02:20,712 INFO] Step 43300/50000; acc:  73.93; ppl:  2.43; xent: 0.89; lr: 0.00010; 11889/4744 tok/s;   7651 sec
[2021-04-25 06:02:29,769 INFO] Step 43350/50000; acc:  73.09; ppl:  2.47; xent: 0.91; lr: 0.00010; 11548/4325 tok/s;   7660 sec
[2021-04-25 06:02:38,979 INFO] Step 43400/50000; acc:  73.49; ppl:  2.44; xent: 0.89; lr: 0.00010; 11135/4391 tok/s;   7669 sec
[2021-04-25 06:02:47,748 INFO] Step 43450/50000; acc:  73.53; ppl:  2.45; xent: 0.90; lr: 0.00010; 11633/4598 tok/s;   7678 sec
[2021-04-25 06:02:56,554 INFO] Step 43500/50000; acc:  73.28; ppl:  2.45; xent: 0.89; lr: 0.00010; 11411/4605 tok/s;   7686 sec
[2021-04-25 06:03:05,278 INFO] Step 43550/50000; acc:  73.78; ppl:  2.43; xent: 0.89; lr: 0.00010; 11639/4589 tok/s;   7695 sec
[2021-04-25 06:03:14,306 INFO] Step 43600/50000; acc:  73.54; ppl:  2.44; xent: 0.89; lr: 0.00010; 11450/4438 tok/s;   7704 sec
[2021-04-25 06:03:22,422 INFO] Step 43650/50000; acc:  73.81; ppl:  2.40; xent: 0.87; lr: 0.00010; 12338/4919 tok/s;   7712 sec
[2021-04-25 06:03:31,029 INFO] Step 43700/50000; acc:  73.26; ppl:  2.46; xent: 0.90; lr: 0.00010; 11964/4632 tok/s;   7721 sec
[2021-04-25 06:03:38,533 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 06:03:39,463 INFO] Step 43750/50000; acc:  73.83; ppl:  2.41; xent: 0.88; lr: 0.00010; 11990/4817 tok/s;   7729 sec
[2021-04-25 06:03:48,589 INFO] Step 43800/50000; acc:  73.49; ppl:  2.46; xent: 0.90; lr: 0.00010; 11324/4374 tok/s;   7738 sec
[2021-04-25 06:03:57,578 INFO] Step 43850/50000; acc:  73.79; ppl:  2.42; xent: 0.88; lr: 0.00010; 10961/4535 tok/s;   7747 sec
[2021-04-25 06:04:06,592 INFO] Step 43900/50000; acc:  73.65; ppl:  2.45; xent: 0.90; lr: 0.00010; 11320/4517 tok/s;   7756 sec
[2021-04-25 06:04:15,361 INFO] Step 43950/50000; acc:  73.32; ppl:  2.47; xent: 0.90; lr: 0.00010; 11753/4655 tok/s;   7765 sec
[2021-04-25 06:04:23,903 INFO] Step 44000/50000; acc:  74.02; ppl:  2.42; xent: 0.88; lr: 0.00010; 11593/4628 tok/s;   7774 sec
[2021-04-25 06:04:32,775 INFO] Step 44050/50000; acc:  73.50; ppl:  2.45; xent: 0.90; lr: 0.00010; 11529/4525 tok/s;   7783 sec
[2021-04-25 06:04:41,381 INFO] Step 44100/50000; acc:  73.79; ppl:  2.41; xent: 0.88; lr: 0.00010; 11850/4594 tok/s;   7791 sec
[2021-04-25 06:04:50,834 INFO] Step 44150/50000; acc:  73.49; ppl:  2.45; xent: 0.90; lr: 0.00010; 11009/4329 tok/s;   7801 sec
[2021-04-25 06:04:59,617 INFO] Step 44200/50000; acc:  73.68; ppl:  2.44; xent: 0.89; lr: 0.00010; 11506/4580 tok/s;   7809 sec
[2021-04-25 06:05:08,309 INFO] Step 44250/50000; acc:  73.67; ppl:  2.42; xent: 0.89; lr: 0.00010; 11791/4635 tok/s;   7818 sec
[2021-04-25 06:05:16,988 INFO] Step 44300/50000; acc:  73.97; ppl:  2.41; xent: 0.88; lr: 0.00010; 11561/4604 tok/s;   7827 sec
[2021-04-25 06:05:25,837 INFO] Step 44350/50000; acc:  73.55; ppl:  2.41; xent: 0.88; lr: 0.00010; 11538/4516 tok/s;   7836 sec
[2021-04-25 06:05:34,223 INFO] Step 44400/50000; acc:  73.37; ppl:  2.46; xent: 0.90; lr: 0.00010; 12259/4781 tok/s;   7844 sec
[2021-04-25 06:05:42,584 INFO] Step 44450/50000; acc:  74.17; ppl:  2.38; xent: 0.87; lr: 0.00010; 11823/4781 tok/s;   7852 sec
[2021-04-25 06:05:47,936 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 06:05:51,742 INFO] Step 44500/50000; acc:  73.61; ppl:  2.44; xent: 0.89; lr: 0.00010; 11364/4432 tok/s;   7862 sec
[2021-04-25 06:06:00,542 INFO] Step 44550/50000; acc:  73.72; ppl:  2.44; xent: 0.89; lr: 0.00010; 11567/4563 tok/s;   7870 sec
[2021-04-25 06:06:09,929 INFO] Step 44600/50000; acc:  73.41; ppl:  2.45; xent: 0.89; lr: 0.00010; 10947/4358 tok/s;   7880 sec
[2021-04-25 06:06:18,496 INFO] Step 44650/50000; acc:  74.10; ppl:  2.39; xent: 0.87; lr: 0.00010; 11403/4709 tok/s;   7888 sec
[2021-04-25 06:06:27,364 INFO] Step 44700/50000; acc:  73.51; ppl:  2.45; xent: 0.89; lr: 0.00010; 11596/4551 tok/s;   7897 sec
[2021-04-25 06:06:36,115 INFO] Step 44750/50000; acc:  73.44; ppl:  2.45; xent: 0.90; lr: 0.00010; 11710/4583 tok/s;   7906 sec
[2021-04-25 06:06:44,395 INFO] Step 44800/50000; acc:  74.04; ppl:  2.39; xent: 0.87; lr: 0.00010; 12109/4829 tok/s;   7914 sec
[2021-04-25 06:06:53,610 INFO] Step 44850/50000; acc:  73.39; ppl:  2.44; xent: 0.89; lr: 0.00010; 11235/4266 tok/s;   7923 sec
[2021-04-25 06:07:02,589 INFO] Step 44900/50000; acc:  73.69; ppl:  2.43; xent: 0.89; lr: 0.00010; 11231/4557 tok/s;   7932 sec
[2021-04-25 06:07:11,301 INFO] Step 44950/50000; acc:  73.56; ppl:  2.43; xent: 0.89; lr: 0.00010; 11863/4592 tok/s;   7941 sec
[2021-04-25 06:07:20,028 INFO] Step 45000/50000; acc:  73.80; ppl:  2.41; xent: 0.88; lr: 0.00010; 11640/4650 tok/s;   7950 sec
[2021-04-25 06:07:20,031 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/loose/valid.txt, align=None)...
[2021-04-25 06:07:27,212 INFO] Validation perplexity: 2.94042
[2021-04-25 06:07:27,212 INFO] Validation accuracy: 70.2561
[2021-04-25 06:07:27,214 INFO] Saving checkpoint ../models/group1_params/loose_ops/model_step_45000.pt
[2021-04-25 06:07:36,586 INFO] Step 45050/50000; acc:  73.96; ppl:  2.41; xent: 0.88; lr: 0.00010; 6159/2404 tok/s;   7966 sec
[2021-04-25 06:07:45,195 INFO] Step 45100/50000; acc:  74.22; ppl:  2.37; xent: 0.86; lr: 0.00010; 11717/4686 tok/s;   7975 sec
[2021-04-25 06:07:53,366 INFO] Step 45150/50000; acc:  74.10; ppl:  2.39; xent: 0.87; lr: 0.00010; 12386/4848 tok/s;   7983 sec
[2021-04-25 06:08:01,894 INFO] Step 45200/50000; acc:  73.76; ppl:  2.42; xent: 0.88; lr: 0.00010; 12038/4732 tok/s;   7992 sec
[2021-04-25 06:08:04,816 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 06:08:10,689 INFO] Step 45250/50000; acc:  73.92; ppl:  2.41; xent: 0.88; lr: 0.00010; 11345/4619 tok/s;   8001 sec
[2021-04-25 06:08:19,897 INFO] Step 45300/50000; acc:  73.30; ppl:  2.46; xent: 0.90; lr: 0.00010; 11285/4357 tok/s;   8010 sec
[2021-04-25 06:08:28,812 INFO] Step 45350/50000; acc:  73.73; ppl:  2.41; xent: 0.88; lr: 0.00010; 11282/4624 tok/s;   8019 sec
[2021-04-25 06:08:37,714 INFO] Step 45400/50000; acc:  73.68; ppl:  2.43; xent: 0.89; lr: 0.00010; 11571/4505 tok/s;   8028 sec
[2021-04-25 06:08:46,220 INFO] Step 45450/50000; acc:  74.06; ppl:  2.39; xent: 0.87; lr: 0.00010; 11525/4764 tok/s;   8036 sec
[2021-04-25 06:08:54,787 INFO] Step 45500/50000; acc:  73.94; ppl:  2.41; xent: 0.88; lr: 0.00010; 11964/4624 tok/s;   8045 sec
[2021-04-25 06:09:03,715 INFO] Step 45550/50000; acc:  73.67; ppl:  2.41; xent: 0.88; lr: 0.00010; 11746/4435 tok/s;   8054 sec
[2021-04-25 06:09:12,439 INFO] Step 45600/50000; acc:  74.11; ppl:  2.39; xent: 0.87; lr: 0.00010; 11403/4637 tok/s;   8062 sec
[2021-04-25 06:09:21,397 INFO] Step 45650/50000; acc:  73.69; ppl:  2.44; xent: 0.89; lr: 0.00010; 11458/4467 tok/s;   8071 sec
[2021-04-25 06:09:29,930 INFO] Step 45700/50000; acc:  73.88; ppl:  2.39; xent: 0.87; lr: 0.00010; 11836/4751 tok/s;   8080 sec
[2021-04-25 06:09:38,820 INFO] Step 45750/50000; acc:  73.89; ppl:  2.42; xent: 0.89; lr: 0.00010; 11589/4536 tok/s;   8089 sec
[2021-04-25 06:09:47,745 INFO] Step 45800/50000; acc:  74.12; ppl:  2.38; xent: 0.87; lr: 0.00010; 11426/4507 tok/s;   8098 sec
[2021-04-25 06:09:55,843 INFO] Step 45850/50000; acc:  74.02; ppl:  2.39; xent: 0.87; lr: 0.00010; 12600/4852 tok/s;   8106 sec
[2021-04-25 06:10:04,397 INFO] Step 45900/50000; acc:  74.43; ppl:  2.37; xent: 0.86; lr: 0.00010; 11675/4712 tok/s;   8114 sec
[2021-04-25 06:10:07,692 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 06:10:13,316 INFO] Step 45950/50000; acc:  73.79; ppl:  2.41; xent: 0.88; lr: 0.00010; 11442/4559 tok/s;   8123 sec
[2021-04-25 06:10:22,307 INFO] Step 46000/50000; acc:  73.69; ppl:  2.43; xent: 0.89; lr: 0.00010; 11463/4469 tok/s;   8132 sec
[2021-04-25 06:10:31,323 INFO] Step 46050/50000; acc:  74.12; ppl:  2.39; xent: 0.87; lr: 0.00010; 11000/4473 tok/s;   8141 sec
[2021-04-25 06:10:40,538 INFO] Step 46100/50000; acc:  73.79; ppl:  2.42; xent: 0.88; lr: 0.00010; 11196/4480 tok/s;   8150 sec
[2021-04-25 06:10:48,827 INFO] Step 46150/50000; acc:  73.92; ppl:  2.40; xent: 0.87; lr: 0.00010; 12193/4847 tok/s;   8159 sec
[2021-04-25 06:10:57,759 INFO] Step 46200/50000; acc:  73.75; ppl:  2.42; xent: 0.88; lr: 0.00010; 11522/4468 tok/s;   8168 sec
[2021-04-25 06:11:06,034 INFO] Step 46250/50000; acc:  74.50; ppl:  2.36; xent: 0.86; lr: 0.00010; 11898/4771 tok/s;   8176 sec
[2021-04-25 06:11:15,214 INFO] Step 46300/50000; acc:  73.41; ppl:  2.42; xent: 0.88; lr: 0.00010; 11404/4350 tok/s;   8185 sec
[2021-04-25 06:11:24,191 INFO] Step 46350/50000; acc:  73.93; ppl:  2.40; xent: 0.87; lr: 0.00010; 11492/4545 tok/s;   8194 sec
[2021-04-25 06:11:32,762 INFO] Step 46400/50000; acc:  74.25; ppl:  2.37; xent: 0.86; lr: 0.00010; 11591/4662 tok/s;   8203 sec
[2021-04-25 06:11:41,657 INFO] Step 46450/50000; acc:  73.97; ppl:  2.41; xent: 0.88; lr: 0.00010; 11519/4528 tok/s;   8211 sec
[2021-04-25 06:11:50,338 INFO] Step 46500/50000; acc:  74.14; ppl:  2.38; xent: 0.87; lr: 0.00010; 11604/4689 tok/s;   8220 sec
[2021-04-25 06:11:59,152 INFO] Step 46550/50000; acc:  73.81; ppl:  2.39; xent: 0.87; lr: 0.00010; 11803/4451 tok/s;   8229 sec
[2021-04-25 06:12:07,369 INFO] Step 46600/50000; acc:  74.21; ppl:  2.38; xent: 0.87; lr: 0.00010; 12291/4844 tok/s;   8237 sec
[2021-04-25 06:12:16,150 INFO] Step 46650/50000; acc:  73.85; ppl:  2.39; xent: 0.87; lr: 0.00010; 11591/4612 tok/s;   8246 sec
[2021-04-25 06:12:17,029 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 06:12:24,687 INFO] Step 46700/50000; acc:  74.07; ppl:  2.37; xent: 0.86; lr: 0.00010; 11778/4749 tok/s;   8255 sec
[2021-04-25 06:12:33,743 INFO] Step 46750/50000; acc:  74.04; ppl:  2.41; xent: 0.88; lr: 0.00010; 11267/4447 tok/s;   8264 sec
[2021-04-25 06:12:43,286 INFO] Step 46800/50000; acc:  73.79; ppl:  2.40; xent: 0.88; lr: 0.00010; 10690/4242 tok/s;   8273 sec
[2021-04-25 06:12:51,853 INFO] Step 46850/50000; acc:  74.01; ppl:  2.39; xent: 0.87; lr: 0.00010; 11600/4752 tok/s;   8282 sec
[2021-04-25 06:13:00,660 INFO] Step 46900/50000; acc:  73.79; ppl:  2.42; xent: 0.88; lr: 0.00010; 11769/4578 tok/s;   8290 sec
[2021-04-25 06:13:09,364 INFO] Step 46950/50000; acc:  74.33; ppl:  2.38; xent: 0.87; lr: 0.00010; 11530/4653 tok/s;   8299 sec
[2021-04-25 06:13:18,201 INFO] Step 47000/50000; acc:  74.02; ppl:  2.41; xent: 0.88; lr: 0.00010; 11844/4417 tok/s;   8308 sec
[2021-04-25 06:13:27,108 INFO] Step 47050/50000; acc:  74.56; ppl:  2.34; xent: 0.85; lr: 0.00010; 11131/4533 tok/s;   8317 sec
[2021-04-25 06:13:35,837 INFO] Step 47100/50000; acc:  73.89; ppl:  2.41; xent: 0.88; lr: 0.00010; 11786/4605 tok/s;   8326 sec
[2021-04-25 06:13:44,760 INFO] Step 47150/50000; acc:  73.84; ppl:  2.41; xent: 0.88; lr: 0.00010; 11589/4538 tok/s;   8335 sec
[2021-04-25 06:13:53,351 INFO] Step 47200/50000; acc:  74.82; ppl:  2.33; xent: 0.85; lr: 0.00010; 11537/4658 tok/s;   8343 sec
[2021-04-25 06:14:02,386 INFO] Step 47250/50000; acc:  74.20; ppl:  2.38; xent: 0.87; lr: 0.00010; 11376/4463 tok/s;   8352 sec
[2021-04-25 06:14:10,452 INFO] Step 47300/50000; acc:  74.40; ppl:  2.35; xent: 0.85; lr: 0.00010; 12525/4979 tok/s;   8360 sec
[2021-04-25 06:14:18,988 INFO] Step 47350/50000; acc:  74.00; ppl:  2.39; xent: 0.87; lr: 0.00010; 12037/4627 tok/s;   8369 sec
[2021-04-25 06:14:26,044 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 06:14:27,279 INFO] Step 47400/50000; acc:  74.46; ppl:  2.35; xent: 0.85; lr: 0.00010; 12274/4865 tok/s;   8377 sec
[2021-04-25 06:14:36,489 INFO] Step 47450/50000; acc:  74.03; ppl:  2.39; xent: 0.87; lr: 0.00010; 11062/4385 tok/s;   8386 sec
[2021-04-25 06:14:45,471 INFO] Step 47500/50000; acc:  74.06; ppl:  2.38; xent: 0.87; lr: 0.00010; 11189/4506 tok/s;   8395 sec
[2021-04-25 06:14:54,669 INFO] Step 47550/50000; acc:  74.23; ppl:  2.38; xent: 0.87; lr: 0.00010; 10970/4434 tok/s;   8404 sec
[2021-04-25 06:15:03,421 INFO] Step 47600/50000; acc:  73.68; ppl:  2.42; xent: 0.89; lr: 0.00010; 11761/4663 tok/s;   8413 sec
[2021-04-25 06:15:11,889 INFO] Step 47650/50000; acc:  74.53; ppl:  2.34; xent: 0.85; lr: 0.00010; 11710/4703 tok/s;   8422 sec
[2021-04-25 06:15:20,777 INFO] Step 47700/50000; acc:  73.84; ppl:  2.41; xent: 0.88; lr: 0.00010; 11689/4486 tok/s;   8431 sec
[2021-04-25 06:15:29,592 INFO] Step 47750/50000; acc:  74.17; ppl:  2.36; xent: 0.86; lr: 0.00010; 11630/4501 tok/s;   8439 sec
[2021-04-25 06:15:38,780 INFO] Step 47800/50000; acc:  73.99; ppl:  2.39; xent: 0.87; lr: 0.00010; 11294/4450 tok/s;   8449 sec
[2021-04-25 06:15:47,365 INFO] Step 47850/50000; acc:  74.53; ppl:  2.34; xent: 0.85; lr: 0.00010; 11404/4671 tok/s;   8457 sec
[2021-04-25 06:15:55,983 INFO] Step 47900/50000; acc:  74.08; ppl:  2.38; xent: 0.87; lr: 0.00010; 11989/4679 tok/s;   8466 sec
[2021-04-25 06:16:05,168 INFO] Step 47950/50000; acc:  74.33; ppl:  2.36; xent: 0.86; lr: 0.00010; 11221/4400 tok/s;   8475 sec
[2021-04-25 06:16:13,612 INFO] Step 48000/50000; acc:  74.84; ppl:  2.32; xent: 0.84; lr: 0.00010; 11801/4671 tok/s;   8483 sec
[2021-04-25 06:16:21,951 INFO] Step 48050/50000; acc:  74.02; ppl:  2.39; xent: 0.87; lr: 0.00010; 12263/4815 tok/s;   8492 sec
[2021-04-25 06:16:30,357 INFO] Step 48100/50000; acc:  74.54; ppl:  2.34; xent: 0.85; lr: 0.00010; 11912/4775 tok/s;   8500 sec
[2021-04-25 06:16:35,460 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 06:16:39,585 INFO] Step 48150/50000; acc:  74.22; ppl:  2.38; xent: 0.87; lr: 0.00010; 11228/4412 tok/s;   8509 sec
[2021-04-25 06:16:48,314 INFO] Step 48200/50000; acc:  74.09; ppl:  2.38; xent: 0.87; lr: 0.00010; 11734/4538 tok/s;   8518 sec
[2021-04-25 06:16:57,774 INFO] Step 48250/50000; acc:  74.05; ppl:  2.37; xent: 0.86; lr: 0.00010; 10693/4358 tok/s;   8528 sec
[2021-04-25 06:17:06,474 INFO] Step 48300/50000; acc:  74.39; ppl:  2.35; xent: 0.85; lr: 0.00010; 11466/4645 tok/s;   8536 sec
[2021-04-25 06:17:15,248 INFO] Step 48350/50000; acc:  74.28; ppl:  2.37; xent: 0.86; lr: 0.00010; 11583/4601 tok/s;   8545 sec
[2021-04-25 06:17:23,941 INFO] Step 48400/50000; acc:  74.15; ppl:  2.39; xent: 0.87; lr: 0.00010; 11762/4587 tok/s;   8554 sec
[2021-04-25 06:17:32,161 INFO] Step 48450/50000; acc:  74.52; ppl:  2.33; xent: 0.85; lr: 0.00010; 12242/4829 tok/s;   8562 sec
[2021-04-25 06:17:41,508 INFO] Step 48500/50000; acc:  73.73; ppl:  2.40; xent: 0.87; lr: 0.00010; 11224/4291 tok/s;   8571 sec
[2021-04-25 06:17:50,422 INFO] Step 48550/50000; acc:  74.24; ppl:  2.37; xent: 0.86; lr: 0.00010; 11362/4584 tok/s;   8580 sec
[2021-04-25 06:17:58,951 INFO] Step 48600/50000; acc:  74.60; ppl:  2.36; xent: 0.86; lr: 0.00010; 12122/4616 tok/s;   8589 sec
[2021-04-25 06:18:07,488 INFO] Step 48650/50000; acc:  74.74; ppl:  2.33; xent: 0.84; lr: 0.00010; 11505/4762 tok/s;   8597 sec
[2021-04-25 06:18:16,465 INFO] Step 48700/50000; acc:  74.56; ppl:  2.35; xent: 0.86; lr: 0.00010; 11468/4449 tok/s;   8606 sec
[2021-04-25 06:18:25,112 INFO] Step 48750/50000; acc:  74.50; ppl:  2.33; xent: 0.85; lr: 0.00010; 11971/4671 tok/s;   8615 sec
[2021-04-25 06:18:33,235 INFO] Step 48800/50000; acc:  74.72; ppl:  2.32; xent: 0.84; lr: 0.00010; 12135/4871 tok/s;   8623 sec
[2021-04-25 06:18:41,856 INFO] Step 48850/50000; acc:  74.33; ppl:  2.35; xent: 0.86; lr: 0.00010; 11886/4671 tok/s;   8632 sec
[2021-04-25 06:18:44,452 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 06:18:50,493 INFO] Step 48900/50000; acc:  74.28; ppl:  2.35; xent: 0.86; lr: 0.00010; 11692/4718 tok/s;   8640 sec
[2021-04-25 06:18:59,583 INFO] Step 48950/50000; acc:  74.14; ppl:  2.38; xent: 0.87; lr: 0.00010; 11372/4410 tok/s;   8649 sec
[2021-04-25 06:19:08,809 INFO] Step 49000/50000; acc:  74.39; ppl:  2.36; xent: 0.86; lr: 0.00010; 10957/4491 tok/s;   8659 sec
[2021-04-25 06:19:17,386 INFO] Step 49050/50000; acc:  74.41; ppl:  2.36; xent: 0.86; lr: 0.00010; 11853/4638 tok/s;   8667 sec
[2021-04-25 06:19:25,993 INFO] Step 49100/50000; acc:  74.50; ppl:  2.35; xent: 0.85; lr: 0.00010; 11617/4754 tok/s;   8676 sec
[2021-04-25 06:19:34,454 INFO] Step 49150/50000; acc:  74.56; ppl:  2.35; xent: 0.85; lr: 0.00010; 11982/4660 tok/s;   8684 sec
[2021-04-25 06:19:43,467 INFO] Step 49200/50000; acc:  74.07; ppl:  2.37; xent: 0.86; lr: 0.00010; 11606/4356 tok/s;   8693 sec
[2021-04-25 06:19:52,018 INFO] Step 49250/50000; acc:  74.87; ppl:  2.31; xent: 0.84; lr: 0.00010; 11668/4804 tok/s;   8702 sec
[2021-04-25 06:20:01,245 INFO] Step 49300/50000; acc:  74.15; ppl:  2.39; xent: 0.87; lr: 0.00010; 11280/4315 tok/s;   8711 sec
[2021-04-25 06:20:09,840 INFO] Step 49350/50000; acc:  74.43; ppl:  2.35; xent: 0.85; lr: 0.00010; 11792/4738 tok/s;   8720 sec
[2021-04-25 06:20:18,603 INFO] Step 49400/50000; acc:  74.54; ppl:  2.34; xent: 0.85; lr: 0.00010; 11751/4586 tok/s;   8728 sec
[2021-04-25 06:20:27,249 INFO] Step 49450/50000; acc:  75.24; ppl:  2.29; xent: 0.83; lr: 0.00010; 11422/4603 tok/s;   8737 sec
[2021-04-25 06:20:35,302 INFO] Step 49500/50000; acc:  74.70; ppl:  2.33; xent: 0.85; lr: 0.00010; 12795/4891 tok/s;   8745 sec
[2021-04-25 06:20:44,022 INFO] Step 49550/50000; acc:  74.28; ppl:  2.36; xent: 0.86; lr: 0.00010; 11747/4657 tok/s;   8754 sec
[2021-04-25 06:20:46,961 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/loose/train.txt, align=None)...
[2021-04-25 06:20:52,715 INFO] Step 49600/50000; acc:  74.55; ppl:  2.33; xent: 0.85; lr: 0.00010; 11495/4676 tok/s;   8763 sec
[2021-04-25 06:21:01,713 INFO] Step 49650/50000; acc:  74.48; ppl:  2.35; xent: 0.86; lr: 0.00010; 11379/4461 tok/s;   8772 sec
[2021-04-25 06:21:10,316 INFO] Step 49700/50000; acc:  74.84; ppl:  2.31; xent: 0.84; lr: 0.00010; 11626/4666 tok/s;   8780 sec
[2021-04-25 06:21:19,604 INFO] Step 49750/50000; acc:  74.21; ppl:  2.37; xent: 0.86; lr: 0.00010; 11100/4454 tok/s;   8789 sec
[2021-04-25 06:21:27,850 INFO] Step 49800/50000; acc:  74.37; ppl:  2.35; xent: 0.85; lr: 0.00010; 12313/4860 tok/s;   8798 sec
[2021-04-25 06:21:36,541 INFO] Step 49850/50000; acc:  74.77; ppl:  2.34; xent: 0.85; lr: 0.00010; 11661/4593 tok/s;   8806 sec
[2021-04-25 06:21:44,977 INFO] Step 49900/50000; acc:  74.68; ppl:  2.32; xent: 0.84; lr: 0.00010; 11949/4708 tok/s;   8815 sec
[2021-04-25 06:21:54,242 INFO] Step 49950/50000; acc:  74.58; ppl:  2.34; xent: 0.85; lr: 0.00010; 11165/4316 tok/s;   8824 sec
[2021-04-25 06:22:03,280 INFO] Step 50000/50000; acc:  74.47; ppl:  2.35; xent: 0.85; lr: 0.00005; 11381/4490 tok/s;   8833 sec
[2021-04-25 06:22:03,284 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/loose/valid.txt, align=None)...
[2021-04-25 06:22:10,481 INFO] Validation perplexity: 2.9772
[2021-04-25 06:22:10,481 INFO] Validation accuracy: 70.0919
[2021-04-25 06:22:10,483 INFO] Saving checkpoint ../models/group1_params/loose_ops/model_step_50000.pt

Parameter group 2

These models are trained with CompoundOperation machine strings in typed form instead of the default general form. All other parameters are the same as the default parameter group.

Note that this parameter groups's control model is identical to the one in the default parameters group, so a new control model won't be trained.

Model paths

Variable name Value
MODEL_GROUP2_CONTROL "../models/default_params/control"
MODEL_GROUP2_BASIC "../models/group2_params/basic_ops"
MODEL_GROUP2_STRICT "../models/group2_params/strict_ops"
MODEL_GROUP2_LOOSE "../models/group2_params/loose_ops"

Models

Basic condensed EditOperations:

modelGroup2Basic = HephaestusModel(MODEL_GROUP2_BASIC)
modelGroup2Basic.train(
    DATA_SMALL_METHODS_TRAIN_BUGGY,
    DATA_SMALL_OPS_TYPED_BASIC_TRAIN,
    DATA_SMALL_METHODS_VALID_BUGGY,
    DATA_SMALL_OPS_TYPED_BASIC_VALID
)
[2021-05-15 17:38:10,177 INFO] Counter vocab from -1 samples.
[2021-05-15 17:38:10,177 INFO] n_sample=-1: Build vocab on full datasets.
[2021-05-15 17:38:10,185 INFO] corpus_1's transforms: TransformPipe()
[2021-05-15 17:38:10,185 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 17:38:11,288 INFO] Counters src:429
[2021-05-15 17:38:11,288 INFO] Counters tgt:442
[2021-05-15 17:38:11,288 WARNING] path ../models/group2_params/basic_ops/save_data.vocab.src exists, may overwrite...
[2021-05-15 17:38:11,290 WARNING] path ../models/group2_params/basic_ops/save_data.vocab.tgt exists, may overwrite...
[2021-05-15 17:38:12,011 INFO] Parsed 2 corpora from -data.
[2021-05-15 17:38:12,011 INFO] Get special vocabs from Transforms: {'src': set(), 'tgt': set()}.
[2021-05-15 17:38:12,011 INFO] Loading vocab from text file...
[2021-05-15 17:38:12,012 INFO] Loading src vocabulary from ../models/group2_params/basic_ops/save_data.vocab.src
[2021-05-15 17:38:12,013 INFO] Loaded src vocab has 429 tokens.
[2021-05-15 17:38:12,013 INFO] Loading tgt vocabulary from ../models/group2_params/basic_ops/save_data.vocab.tgt
[2021-05-15 17:38:12,014 INFO] Loaded tgt vocab has 442 tokens.
[2021-05-15 17:38:12,014 INFO] Building fields with vocab in counters...
[2021-05-15 17:38:12,015 INFO]  * tgt vocab size: 446.
[2021-05-15 17:38:12,016 INFO]  * src vocab size: 431.
[2021-05-15 17:38:12,016 INFO]  * src vocab size = 431
[2021-05-15 17:38:12,016 INFO]  * tgt vocab size = 446
[2021-05-15 17:38:12,017 INFO] Building model...
[2021-05-15 17:38:14,603 INFO] NMTModel(
  (encoder): RNNEncoder(
    (embeddings): Embeddings(
      (make_embedding): Sequential(
        (emb_luts): Elementwise(
          (0): Embedding(431, 512, padding_idx=1)
        )
      )
    )
    (rnn): LSTM(512, 256, num_layers=2, dropout=0.2)
  )
  (decoder): InputFeedRNNDecoder(
    (embeddings): Embeddings(
      (make_embedding): Sequential(
        (emb_luts): Elementwise(
          (0): Embedding(446, 512, padding_idx=1)
        )
      )
    )
    (dropout): Dropout(p=0.2, inplace=False)
    (rnn): StackedLSTM(
      (dropout): Dropout(p=0.2, inplace=False)
      (layers): ModuleList(
        (0): LSTMCell(768, 256)
        (1): LSTMCell(256, 256)
      )
    )
    (attn): GlobalAttention(
      (linear_context): Linear(in_features=256, out_features=256, bias=False)
      (linear_query): Linear(in_features=256, out_features=256, bias=True)
      (v): Linear(in_features=256, out_features=1, bias=False)
      (linear_out): Linear(in_features=512, out_features=256, bias=True)
    )
  )
  (generator): Sequential(
    (0): Linear(in_features=256, out_features=446, bias=True)
    (1): Cast()
    (2): LogSoftmax(dim=-1)
  )
)
[2021-05-15 17:38:14,604 INFO] encoder: 1535488
[2021-05-15 17:38:14,604 INFO] decoder: 2182846
[2021-05-15 17:38:14,604 INFO] * number of parameters: 3718334
[2021-05-15 17:38:14,605 INFO] Starting training on GPU: [0]
[2021-05-15 17:38:14,605 INFO] Start training loop and validate every 5000 steps...
[2021-05-15 17:38:14,605 INFO] corpus_1's transforms: TransformPipe()
[2021-05-15 17:38:14,605 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 17:38:34,082 INFO] Step 50/50000; acc:  15.29; ppl: 147.40; xent: 4.99; lr: 0.00010; 5357/6589 tok/s;     19 sec
[2021-05-15 17:38:52,581 INFO] Step 100/50000; acc:  16.87; ppl: 37.74; xent: 3.63; lr: 0.00010; 5473/6874 tok/s;     38 sec
[2021-05-15 17:39:11,381 INFO] Step 150/50000; acc:  19.06; ppl: 33.34; xent: 3.51; lr: 0.00010; 5336/6676 tok/s;     57 sec
[2021-05-15 17:39:30,724 INFO] Step 200/50000; acc:  33.59; ppl: 23.04; xent: 3.14; lr: 0.00010; 5382/6725 tok/s;     76 sec
[2021-05-15 17:39:49,306 INFO] Step 250/50000; acc:  49.19; ppl: 10.81; xent: 2.38; lr: 0.00010; 5214/6657 tok/s;     95 sec
[2021-05-15 17:40:08,516 INFO] Step 300/50000; acc:  52.67; ppl:  7.25; xent: 1.98; lr: 0.00010; 5422/6517 tok/s;    114 sec
[2021-05-15 17:40:27,026 INFO] Step 350/50000; acc:  54.02; ppl:  6.15; xent: 1.82; lr: 0.00010; 5550/6817 tok/s;    132 sec
[2021-05-15 17:40:46,145 INFO] Step 400/50000; acc:  55.04; ppl:  5.66; xent: 1.73; lr: 0.00010; 5321/6671 tok/s;    152 sec
[2021-05-15 17:41:05,514 INFO] Step 450/50000; acc:  56.46; ppl:  5.20; xent: 1.65; lr: 0.00010; 5177/6586 tok/s;    171 sec
[2021-05-15 17:41:24,882 INFO] Step 500/50000; acc:  57.93; ppl:  4.88; xent: 1.58; lr: 0.00010; 5376/6729 tok/s;    190 sec
[2021-05-15 17:41:43,966 INFO] Step 550/50000; acc:  59.60; ppl:  4.54; xent: 1.51; lr: 0.00010; 5369/6639 tok/s;    209 sec
[2021-05-15 17:42:02,342 INFO] Step 600/50000; acc:  62.41; ppl:  4.04; xent: 1.40; lr: 0.00010; 5432/6754 tok/s;    228 sec
[2021-05-15 17:42:21,254 INFO] Step 650/50000; acc:  64.62; ppl:  3.67; xent: 1.30; lr: 0.00010; 5465/6981 tok/s;    247 sec
[2021-05-15 17:42:39,590 INFO] Step 700/50000; acc:  65.93; ppl:  3.42; xent: 1.23; lr: 0.00010; 5437/6877 tok/s;    265 sec
[2021-05-15 17:42:41,030 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 17:42:58,628 INFO] Step 750/50000; acc:  67.27; ppl:  3.27; xent: 1.18; lr: 0.00010; 5411/6828 tok/s;    284 sec
[2021-05-15 17:43:16,837 INFO] Step 800/50000; acc:  69.66; ppl:  2.96; xent: 1.08; lr: 0.00010; 5457/6849 tok/s;    302 sec
[2021-05-15 17:43:35,566 INFO] Step 850/50000; acc:  70.19; ppl:  2.91; xent: 1.07; lr: 0.00010; 5504/6806 tok/s;    321 sec
[2021-05-15 17:43:54,983 INFO] Step 900/50000; acc:  71.70; ppl:  2.74; xent: 1.01; lr: 0.00010; 5238/6636 tok/s;    340 sec
[2021-05-15 17:44:14,028 INFO] Step 950/50000; acc:  73.48; ppl:  2.61; xent: 0.96; lr: 0.00010; 5289/6531 tok/s;    359 sec
[2021-05-15 17:44:33,355 INFO] Step 1000/50000; acc:  74.92; ppl:  2.48; xent: 0.91; lr: 0.00010; 5357/6694 tok/s;    379 sec
[2021-05-15 17:44:51,902 INFO] Step 1050/50000; acc:  75.48; ppl:  2.46; xent: 0.90; lr: 0.00010; 5317/6498 tok/s;    397 sec
[2021-05-15 17:45:10,823 INFO] Step 1100/50000; acc:  76.93; ppl:  2.32; xent: 0.84; lr: 0.00010; 5540/6774 tok/s;    416 sec
[2021-05-15 17:45:30,155 INFO] Step 1150/50000; acc:  78.24; ppl:  2.23; xent: 0.80; lr: 0.00010; 5214/6635 tok/s;    436 sec
[2021-05-15 17:45:49,338 INFO] Step 1200/50000; acc:  78.54; ppl:  2.21; xent: 0.79; lr: 0.00010; 5321/6654 tok/s;    455 sec
[2021-05-15 17:46:07,767 INFO] Step 1250/50000; acc:  79.23; ppl:  2.17; xent: 0.77; lr: 0.00010; 5433/6922 tok/s;    473 sec
[2021-05-15 17:46:27,159 INFO] Step 1300/50000; acc:  79.53; ppl:  2.15; xent: 0.77; lr: 0.00010; 5389/6476 tok/s;    493 sec
[2021-05-15 17:46:45,595 INFO] Step 1350/50000; acc:  81.69; ppl:  2.01; xent: 0.70; lr: 0.00010; 5584/7236 tok/s;    511 sec
[2021-05-15 17:47:04,187 INFO] Step 1400/50000; acc:  81.09; ppl:  2.04; xent: 0.72; lr: 0.00010; 5289/6662 tok/s;    530 sec
[2021-05-15 17:47:19,045 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 17:47:23,145 INFO] Step 1450/50000; acc:  82.14; ppl:  1.97; xent: 0.68; lr: 0.00010; 5479/6923 tok/s;    549 sec
[2021-05-15 17:47:41,789 INFO] Step 1500/50000; acc:  81.96; ppl:  2.00; xent: 0.69; lr: 0.00010; 5366/6649 tok/s;    567 sec
[2021-05-15 17:48:00,696 INFO] Step 1550/50000; acc:  82.62; ppl:  1.95; xent: 0.67; lr: 0.00010; 5452/6844 tok/s;    586 sec
[2021-05-15 17:48:18,966 INFO] Step 1600/50000; acc:  81.71; ppl:  2.02; xent: 0.70; lr: 0.00010; 5364/6643 tok/s;    604 sec
[2021-05-15 17:48:38,901 INFO] Step 1650/50000; acc:  83.66; ppl:  1.89; xent: 0.64; lr: 0.00010; 5222/6611 tok/s;    624 sec
[2021-05-15 17:48:58,022 INFO] Step 1700/50000; acc:  83.39; ppl:  1.91; xent: 0.65; lr: 0.00010; 5301/6551 tok/s;    643 sec
[2021-05-15 17:49:16,886 INFO] Step 1750/50000; acc:  83.71; ppl:  1.89; xent: 0.64; lr: 0.00010; 5358/6730 tok/s;    662 sec
[2021-05-15 17:49:35,911 INFO] Step 1800/50000; acc:  84.09; ppl:  1.86; xent: 0.62; lr: 0.00010; 5534/6699 tok/s;    681 sec
[2021-05-15 17:49:54,725 INFO] Step 1850/50000; acc:  83.69; ppl:  1.88; xent: 0.63; lr: 0.00010; 5214/6533 tok/s;    700 sec
[2021-05-15 17:50:14,230 INFO] Step 1900/50000; acc:  84.84; ppl:  1.82; xent: 0.60; lr: 0.00010; 5303/6715 tok/s;    720 sec
[2021-05-15 17:50:33,906 INFO] Step 1950/50000; acc:  84.32; ppl:  1.85; xent: 0.61; lr: 0.00010; 5144/6504 tok/s;    739 sec
[2021-05-15 17:50:52,845 INFO] Step 2000/50000; acc:  84.55; ppl:  1.84; xent: 0.61; lr: 0.00010; 5379/6761 tok/s;    758 sec
[2021-05-15 17:51:11,767 INFO] Step 2050/50000; acc:  84.93; ppl:  1.81; xent: 0.59; lr: 0.00010; 5327/6603 tok/s;    777 sec
[2021-05-15 17:51:30,081 INFO] Step 2100/50000; acc:  85.46; ppl:  1.78; xent: 0.58; lr: 0.00010; 5668/7165 tok/s;    795 sec
[2021-05-15 17:51:49,114 INFO] Step 2150/50000; acc:  85.05; ppl:  1.80; xent: 0.59; lr: 0.00010; 5345/6526 tok/s;    815 sec
[2021-05-15 17:51:58,420 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 17:52:07,563 INFO] Step 2200/50000; acc:  85.26; ppl:  1.79; xent: 0.58; lr: 0.00010; 5392/6969 tok/s;    833 sec
[2021-05-15 17:52:26,606 INFO] Step 2250/50000; acc:  85.61; ppl:  1.77; xent: 0.57; lr: 0.00010; 5485/6817 tok/s;    852 sec
[2021-05-15 17:52:44,539 INFO] Step 2300/50000; acc:  84.66; ppl:  1.82; xent: 0.60; lr: 0.00010; 5548/6868 tok/s;    870 sec
[2021-05-15 17:53:03,799 INFO] Step 2350/50000; acc:  85.63; ppl:  1.77; xent: 0.57; lr: 0.00010; 5305/6741 tok/s;    889 sec
[2021-05-15 17:53:23,146 INFO] Step 2400/50000; acc:  85.12; ppl:  1.80; xent: 0.59; lr: 0.00010; 5110/6413 tok/s;    909 sec
[2021-05-15 17:53:42,004 INFO] Step 2450/50000; acc:  85.99; ppl:  1.74; xent: 0.55; lr: 0.00010; 5489/6847 tok/s;    927 sec
[2021-05-15 17:54:01,548 INFO] Step 2500/50000; acc:  85.61; ppl:  1.76; xent: 0.57; lr: 0.00010; 5257/6471 tok/s;    947 sec
[2021-05-15 17:54:19,618 INFO] Step 2550/50000; acc:  85.72; ppl:  1.74; xent: 0.56; lr: 0.00010; 5643/6804 tok/s;    965 sec
[2021-05-15 17:54:39,454 INFO] Step 2600/50000; acc:  86.19; ppl:  1.73; xent: 0.55; lr: 0.00010; 5257/6631 tok/s;    985 sec
[2021-05-15 17:54:58,282 INFO] Step 2650/50000; acc:  85.61; ppl:  1.75; xent: 0.56; lr: 0.00010; 5179/6583 tok/s;   1004 sec
[2021-05-15 17:55:18,108 INFO] Step 2700/50000; acc:  86.34; ppl:  1.72; xent: 0.54; lr: 0.00010; 5244/6648 tok/s;   1024 sec
[2021-05-15 17:55:36,671 INFO] Step 2750/50000; acc:  85.65; ppl:  1.76; xent: 0.56; lr: 0.00010; 5426/6751 tok/s;   1042 sec
[2021-05-15 17:55:55,257 INFO] Step 2800/50000; acc:  86.44; ppl:  1.70; xent: 0.53; lr: 0.00010; 5514/6782 tok/s;   1061 sec
[2021-05-15 17:56:14,022 INFO] Step 2850/50000; acc:  86.49; ppl:  1.71; xent: 0.53; lr: 0.00010; 5307/6884 tok/s;   1079 sec
[2021-05-15 17:56:32,469 INFO] Step 2900/50000; acc:  86.58; ppl:  1.70; xent: 0.53; lr: 0.00010; 5631/7011 tok/s;   1098 sec
[2021-05-15 17:56:36,636 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 17:56:51,641 INFO] Step 2950/50000; acc:  86.45; ppl:  1.70; xent: 0.53; lr: 0.00010; 5355/6657 tok/s;   1117 sec
[2021-05-15 17:57:09,622 INFO] Step 3000/50000; acc:  86.19; ppl:  1.72; xent: 0.54; lr: 0.00010; 5515/6980 tok/s;   1135 sec
[2021-05-15 17:57:28,641 INFO] Step 3050/50000; acc:  86.17; ppl:  1.72; xent: 0.54; lr: 0.00010; 5437/6753 tok/s;   1154 sec
[2021-05-15 17:57:48,024 INFO] Step 3100/50000; acc:  86.48; ppl:  1.69; xent: 0.53; lr: 0.00010; 5146/6558 tok/s;   1173 sec
[2021-05-15 17:58:06,872 INFO] Step 3150/50000; acc:  86.54; ppl:  1.70; xent: 0.53; lr: 0.00010; 5441/6665 tok/s;   1192 sec
[2021-05-15 17:58:25,365 INFO] Step 3200/50000; acc:  86.48; ppl:  1.69; xent: 0.53; lr: 0.00010; 5327/6745 tok/s;   1211 sec
[2021-05-15 17:58:44,411 INFO] Step 3250/50000; acc:  86.85; ppl:  1.68; xent: 0.52; lr: 0.00010; 5560/6675 tok/s;   1230 sec
[2021-05-15 17:59:03,273 INFO] Step 3300/50000; acc:  86.85; ppl:  1.66; xent: 0.51; lr: 0.00010; 5399/6770 tok/s;   1249 sec
[2021-05-15 17:59:22,422 INFO] Step 3350/50000; acc:  86.75; ppl:  1.68; xent: 0.52; lr: 0.00010; 5281/6615 tok/s;   1268 sec
[2021-05-15 17:59:41,748 INFO] Step 3400/50000; acc:  87.29; ppl:  1.65; xent: 0.50; lr: 0.00010; 5405/6834 tok/s;   1287 sec
[2021-05-15 18:00:00,405 INFO] Step 3450/50000; acc:  86.43; ppl:  1.70; xent: 0.53; lr: 0.00010; 5204/6555 tok/s;   1306 sec
[2021-05-15 18:00:19,152 INFO] Step 3500/50000; acc:  86.88; ppl:  1.67; xent: 0.51; lr: 0.00010; 5572/6744 tok/s;   1325 sec
[2021-05-15 18:00:37,599 INFO] Step 3550/50000; acc:  87.51; ppl:  1.63; xent: 0.49; lr: 0.00010; 5457/7055 tok/s;   1343 sec
[2021-05-15 18:00:56,185 INFO] Step 3600/50000; acc:  87.36; ppl:  1.64; xent: 0.49; lr: 0.00010; 5467/6844 tok/s;   1362 sec
[2021-05-15 18:01:02,491 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 18:01:15,093 INFO] Step 3650/50000; acc:  87.03; ppl:  1.65; xent: 0.50; lr: 0.00010; 5322/6812 tok/s;   1380 sec
[2021-05-15 18:01:34,379 INFO] Step 3700/50000; acc:  87.32; ppl:  1.64; xent: 0.49; lr: 0.00010; 5401/6573 tok/s;   1400 sec
[2021-05-15 18:01:53,046 INFO] Step 3750/50000; acc:  87.14; ppl:  1.65; xent: 0.50; lr: 0.00010; 5461/6868 tok/s;   1418 sec
[2021-05-15 18:02:12,064 INFO] Step 3800/50000; acc:  86.53; ppl:  1.68; xent: 0.52; lr: 0.00010; 5181/6542 tok/s;   1437 sec
[2021-05-15 18:02:31,135 INFO] Step 3850/50000; acc:  87.56; ppl:  1.62; xent: 0.48; lr: 0.00010; 5444/6811 tok/s;   1457 sec
[2021-05-15 18:02:50,145 INFO] Step 3900/50000; acc:  87.27; ppl:  1.64; xent: 0.49; lr: 0.00010; 5234/6615 tok/s;   1476 sec
[2021-05-15 18:03:09,283 INFO] Step 3950/50000; acc:  87.46; ppl:  1.63; xent: 0.49; lr: 0.00010; 5385/6563 tok/s;   1495 sec
[2021-05-15 18:03:27,703 INFO] Step 4000/50000; acc:  87.01; ppl:  1.65; xent: 0.50; lr: 0.00010; 5477/6722 tok/s;   1513 sec
[2021-05-15 18:03:46,619 INFO] Step 4050/50000; acc:  87.58; ppl:  1.61; xent: 0.48; lr: 0.00010; 5494/6867 tok/s;   1532 sec
[2021-05-15 18:04:06,391 INFO] Step 4100/50000; acc:  87.46; ppl:  1.62; xent: 0.48; lr: 0.00010; 5152/6552 tok/s;   1552 sec
[2021-05-15 18:04:25,535 INFO] Step 4150/50000; acc:  87.34; ppl:  1.63; xent: 0.49; lr: 0.00010; 5268/6645 tok/s;   1571 sec
[2021-05-15 18:04:44,958 INFO] Step 4200/50000; acc:  87.24; ppl:  1.64; xent: 0.49; lr: 0.00010; 5377/6559 tok/s;   1590 sec
[2021-05-15 18:05:03,226 INFO] Step 4250/50000; acc:  87.44; ppl:  1.62; xent: 0.48; lr: 0.00010; 5364/6725 tok/s;   1609 sec
[2021-05-15 18:05:22,222 INFO] Step 4300/50000; acc:  88.16; ppl:  1.58; xent: 0.46; lr: 0.00010; 5448/6950 tok/s;   1628 sec
[2021-05-15 18:05:40,665 INFO] Step 4350/50000; acc:  87.70; ppl:  1.61; xent: 0.48; lr: 0.00010; 5442/6881 tok/s;   1646 sec
[2021-05-15 18:05:41,336 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 18:05:59,497 INFO] Step 4400/50000; acc:  87.66; ppl:  1.61; xent: 0.47; lr: 0.00010; 5424/6841 tok/s;   1665 sec
[2021-05-15 18:06:18,017 INFO] Step 4450/50000; acc:  87.62; ppl:  1.61; xent: 0.48; lr: 0.00010; 5437/6793 tok/s;   1683 sec
[2021-05-15 18:06:37,094 INFO] Step 4500/50000; acc:  87.22; ppl:  1.63; xent: 0.49; lr: 0.00010; 5394/6640 tok/s;   1702 sec
[2021-05-15 18:06:56,554 INFO] Step 4550/50000; acc:  87.67; ppl:  1.60; xent: 0.47; lr: 0.00010; 5252/6651 tok/s;   1722 sec
[2021-05-15 18:07:15,505 INFO] Step 4600/50000; acc:  87.40; ppl:  1.62; xent: 0.48; lr: 0.00010; 5231/6505 tok/s;   1741 sec
[2021-05-15 18:07:35,356 INFO] Step 4650/50000; acc:  87.96; ppl:  1.59; xent: 0.46; lr: 0.00010; 5195/6528 tok/s;   1761 sec
[2021-05-15 18:07:53,829 INFO] Step 4700/50000; acc:  87.46; ppl:  1.62; xent: 0.48; lr: 0.00010; 5495/6666 tok/s;   1779 sec
[2021-05-15 18:08:12,733 INFO] Step 4750/50000; acc:  87.48; ppl:  1.61; xent: 0.48; lr: 0.00010; 5480/6736 tok/s;   1798 sec
[2021-05-15 18:08:32,174 INFO] Step 4800/50000; acc:  87.95; ppl:  1.59; xent: 0.46; lr: 0.00010; 5087/6475 tok/s;   1818 sec
[2021-05-15 18:08:51,572 INFO] Step 4850/50000; acc:  87.80; ppl:  1.59; xent: 0.46; lr: 0.00010; 5372/6707 tok/s;   1837 sec
[2021-05-15 18:09:10,743 INFO] Step 4900/50000; acc:  87.96; ppl:  1.59; xent: 0.46; lr: 0.00010; 5303/6772 tok/s;   1856 sec
[2021-05-15 18:09:29,582 INFO] Step 4950/50000; acc:  87.43; ppl:  1.62; xent: 0.48; lr: 0.00010; 5386/6474 tok/s;   1875 sec
[2021-05-15 18:09:48,249 INFO] Step 5000/50000; acc:  88.40; ppl:  1.56; xent: 0.45; lr: 0.00010; 5605/7154 tok/s;   1894 sec
[2021-05-15 18:09:48,250 INFO] valid's transforms: TransformPipe()
[2021-05-15 18:09:48,250 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/basic/valid.txt, align=None)...
[2021-05-15 18:10:15,823 INFO] Validation perplexity: 1.56649
[2021-05-15 18:10:15,824 INFO] Validation accuracy: 88.1999
[2021-05-15 18:10:15,828 INFO] Saving checkpoint ../models/group2_params/basic_ops/model_step_5000.pt
[2021-05-15 18:10:34,313 INFO] Step 5050/50000; acc:  87.86; ppl:  1.59; xent: 0.46; lr: 0.00010; 2095/2693 tok/s;   1940 sec
[2021-05-15 18:10:48,365 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 18:10:53,144 INFO] Step 5100/50000; acc:  88.30; ppl:  1.56; xent: 0.45; lr: 0.00010; 5531/6973 tok/s;   1959 sec
[2021-05-15 18:11:12,101 INFO] Step 5150/50000; acc:  87.71; ppl:  1.60; xent: 0.47; lr: 0.00010; 5306/6554 tok/s;   1977 sec
[2021-05-15 18:11:31,068 INFO] Step 5200/50000; acc:  87.93; ppl:  1.59; xent: 0.46; lr: 0.00010; 5382/6789 tok/s;   1996 sec
[2021-05-15 18:11:49,652 INFO] Step 5250/50000; acc:  87.34; ppl:  1.62; xent: 0.48; lr: 0.00010; 5354/6600 tok/s;   2015 sec
[2021-05-15 18:12:09,390 INFO] Step 5300/50000; acc:  88.16; ppl:  1.57; xent: 0.45; lr: 0.00010; 5274/6620 tok/s;   2035 sec
[2021-05-15 18:12:28,585 INFO] Step 5350/50000; acc:  88.01; ppl:  1.58; xent: 0.46; lr: 0.00010; 5310/6607 tok/s;   2054 sec
[2021-05-15 18:12:47,539 INFO] Step 5400/50000; acc:  87.76; ppl:  1.59; xent: 0.47; lr: 0.00010; 5248/6618 tok/s;   2073 sec
[2021-05-15 18:13:06,435 INFO] Step 5450/50000; acc:  88.11; ppl:  1.57; xent: 0.45; lr: 0.00010; 5557/6740 tok/s;   2092 sec
[2021-05-15 18:13:25,741 INFO] Step 5500/50000; acc:  87.58; ppl:  1.60; xent: 0.47; lr: 0.00010; 5215/6444 tok/s;   2111 sec
[2021-05-15 18:13:45,105 INFO] Step 5550/50000; acc:  88.32; ppl:  1.56; xent: 0.44; lr: 0.00010; 5289/6705 tok/s;   2130 sec
[2021-05-15 18:14:04,340 INFO] Step 5600/50000; acc:  87.92; ppl:  1.58; xent: 0.46; lr: 0.00010; 5168/6602 tok/s;   2150 sec
[2021-05-15 18:14:23,494 INFO] Step 5650/50000; acc:  88.06; ppl:  1.58; xent: 0.46; lr: 0.00010; 5422/6699 tok/s;   2169 sec
[2021-05-15 18:14:42,618 INFO] Step 5700/50000; acc:  88.25; ppl:  1.56; xent: 0.44; lr: 0.00010; 5337/6641 tok/s;   2188 sec
[2021-05-15 18:15:00,617 INFO] Step 5750/50000; acc:  88.41; ppl:  1.56; xent: 0.44; lr: 0.00010; 5603/7169 tok/s;   2206 sec
[2021-05-15 18:15:19,586 INFO] Step 5800/50000; acc:  88.26; ppl:  1.56; xent: 0.45; lr: 0.00010; 5469/6692 tok/s;   2225 sec
[2021-05-15 18:15:28,088 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 18:15:38,148 INFO] Step 5850/50000; acc:  88.09; ppl:  1.57; xent: 0.45; lr: 0.00010; 5253/6821 tok/s;   2244 sec
[2021-05-15 18:15:57,450 INFO] Step 5900/50000; acc:  88.45; ppl:  1.56; xent: 0.44; lr: 0.00010; 5430/6758 tok/s;   2263 sec
[2021-05-15 18:16:15,324 INFO] Step 5950/50000; acc:  87.45; ppl:  1.60; xent: 0.47; lr: 0.00010; 5584/6824 tok/s;   2281 sec
[2021-05-15 18:16:34,756 INFO] Step 6000/50000; acc:  88.24; ppl:  1.56; xent: 0.45; lr: 0.00010; 5221/6660 tok/s;   2300 sec
[2021-05-15 18:16:54,315 INFO] Step 6050/50000; acc:  87.84; ppl:  1.58; xent: 0.46; lr: 0.00010; 5121/6379 tok/s;   2320 sec
[2021-05-15 18:17:13,149 INFO] Step 6100/50000; acc:  88.50; ppl:  1.55; xent: 0.44; lr: 0.00010; 5488/6843 tok/s;   2339 sec
[2021-05-15 18:17:32,382 INFO] Step 6150/50000; acc:  88.17; ppl:  1.57; xent: 0.45; lr: 0.00010; 5384/6619 tok/s;   2358 sec
[2021-05-15 18:17:50,330 INFO] Step 6200/50000; acc:  88.12; ppl:  1.56; xent: 0.45; lr: 0.00010; 5584/6837 tok/s;   2376 sec
[2021-05-15 18:18:10,160 INFO] Step 6250/50000; acc:  88.44; ppl:  1.55; xent: 0.44; lr: 0.00010; 5237/6577 tok/s;   2396 sec
[2021-05-15 18:18:29,218 INFO] Step 6300/50000; acc:  88.12; ppl:  1.55; xent: 0.44; lr: 0.00010; 5265/6610 tok/s;   2415 sec
[2021-05-15 18:18:48,571 INFO] Step 6350/50000; acc:  88.38; ppl:  1.56; xent: 0.44; lr: 0.00010; 5303/6766 tok/s;   2434 sec
[2021-05-15 18:19:06,791 INFO] Step 6400/50000; acc:  87.82; ppl:  1.58; xent: 0.46; lr: 0.00010; 5439/6766 tok/s;   2452 sec
[2021-05-15 18:19:25,698 INFO] Step 6450/50000; acc:  88.73; ppl:  1.53; xent: 0.43; lr: 0.00010; 5527/6822 tok/s;   2471 sec
[2021-05-15 18:19:44,351 INFO] Step 6500/50000; acc:  88.71; ppl:  1.54; xent: 0.43; lr: 0.00010; 5411/6972 tok/s;   2490 sec
[2021-05-15 18:20:02,637 INFO] Step 6550/50000; acc:  88.28; ppl:  1.55; xent: 0.44; lr: 0.00010; 5531/6906 tok/s;   2508 sec
[2021-05-15 18:20:06,110 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 18:20:21,986 INFO] Step 6600/50000; acc:  88.61; ppl:  1.54; xent: 0.43; lr: 0.00010; 5394/6725 tok/s;   2527 sec
[2021-05-15 18:20:39,903 INFO] Step 6650/50000; acc:  88.03; ppl:  1.57; xent: 0.45; lr: 0.00010; 5425/6916 tok/s;   2545 sec
[2021-05-15 18:20:59,069 INFO] Step 6700/50000; acc:  88.08; ppl:  1.57; xent: 0.45; lr: 0.00010; 5403/6657 tok/s;   2564 sec
[2021-05-15 18:21:18,285 INFO] Step 6750/50000; acc:  88.40; ppl:  1.55; xent: 0.44; lr: 0.00010; 5230/6635 tok/s;   2584 sec
[2021-05-15 18:21:37,367 INFO] Step 6800/50000; acc:  88.30; ppl:  1.55; xent: 0.44; lr: 0.00010; 5323/6606 tok/s;   2603 sec
[2021-05-15 18:21:56,252 INFO] Step 6850/50000; acc:  88.38; ppl:  1.55; xent: 0.44; lr: 0.00010; 5294/6622 tok/s;   2622 sec
[2021-05-15 18:22:15,006 INFO] Step 6900/50000; acc:  88.37; ppl:  1.55; xent: 0.44; lr: 0.00010; 5640/6758 tok/s;   2640 sec
[2021-05-15 18:22:34,117 INFO] Step 6950/50000; acc:  88.61; ppl:  1.53; xent: 0.43; lr: 0.00010; 5363/6770 tok/s;   2660 sec
[2021-05-15 18:22:53,144 INFO] Step 7000/50000; acc:  88.24; ppl:  1.55; xent: 0.44; lr: 0.00010; 5222/6553 tok/s;   2679 sec
[2021-05-15 18:23:12,655 INFO] Step 7050/50000; acc:  88.73; ppl:  1.53; xent: 0.42; lr: 0.00010; 5338/6790 tok/s;   2698 sec
[2021-05-15 18:23:31,402 INFO] Step 7100/50000; acc:  87.96; ppl:  1.57; xent: 0.45; lr: 0.00010; 5320/6551 tok/s;   2717 sec
[2021-05-15 18:23:50,037 INFO] Step 7150/50000; acc:  88.40; ppl:  1.54; xent: 0.43; lr: 0.00010; 5552/6770 tok/s;   2735 sec
[2021-05-15 18:24:08,471 INFO] Step 7200/50000; acc:  88.91; ppl:  1.51; xent: 0.41; lr: 0.00010; 5358/6981 tok/s;   2754 sec
[2021-05-15 18:24:27,010 INFO] Step 7250/50000; acc:  88.78; ppl:  1.53; xent: 0.42; lr: 0.00010; 5584/6930 tok/s;   2772 sec
[2021-05-15 18:24:32,565 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 18:24:46,297 INFO] Step 7300/50000; acc:  88.72; ppl:  1.53; xent: 0.42; lr: 0.00010; 5306/6869 tok/s;   2792 sec
[2021-05-15 18:25:04,893 INFO] Step 7350/50000; acc:  88.50; ppl:  1.54; xent: 0.43; lr: 0.00010; 5433/6615 tok/s;   2810 sec
[2021-05-15 18:25:23,613 INFO] Step 7400/50000; acc:  88.39; ppl:  1.55; xent: 0.44; lr: 0.00010; 5531/6856 tok/s;   2829 sec
[2021-05-15 18:25:42,534 INFO] Step 7450/50000; acc:  87.99; ppl:  1.56; xent: 0.45; lr: 0.00010; 5122/6571 tok/s;   2848 sec
[2021-05-15 18:26:01,823 INFO] Step 7500/50000; acc:  88.72; ppl:  1.53; xent: 0.43; lr: 0.00010; 5394/6714 tok/s;   2867 sec
[2021-05-15 18:26:20,617 INFO] Step 7550/50000; acc:  88.56; ppl:  1.53; xent: 0.43; lr: 0.00010; 5317/6724 tok/s;   2886 sec
[2021-05-15 18:26:39,683 INFO] Step 7600/50000; acc:  88.49; ppl:  1.54; xent: 0.43; lr: 0.00010; 5378/6523 tok/s;   2905 sec
[2021-05-15 18:26:58,280 INFO] Step 7650/50000; acc:  88.33; ppl:  1.54; xent: 0.43; lr: 0.00010; 5490/6722 tok/s;   2924 sec
[2021-05-15 18:27:17,148 INFO] Step 7700/50000; acc:  88.87; ppl:  1.52; xent: 0.42; lr: 0.00010; 5498/6858 tok/s;   2943 sec
[2021-05-15 18:27:36,555 INFO] Step 7750/50000; acc:  88.74; ppl:  1.52; xent: 0.42; lr: 0.00010; 5285/6701 tok/s;   2962 sec
[2021-05-15 18:27:55,541 INFO] Step 7800/50000; acc:  88.40; ppl:  1.54; xent: 0.43; lr: 0.00010; 5218/6640 tok/s;   2981 sec
[2021-05-15 18:28:14,889 INFO] Step 7850/50000; acc:  88.48; ppl:  1.54; xent: 0.43; lr: 0.00010; 5383/6528 tok/s;   3000 sec
[2021-05-15 18:28:33,449 INFO] Step 7900/50000; acc:  88.90; ppl:  1.51; xent: 0.41; lr: 0.00010; 5433/6870 tok/s;   3019 sec
[2021-05-15 18:28:52,413 INFO] Step 7950/50000; acc:  89.03; ppl:  1.51; xent: 0.41; lr: 0.00010; 5380/6835 tok/s;   3038 sec
[2021-05-15 18:29:10,688 INFO] Step 8000/50000; acc:  88.66; ppl:  1.53; xent: 0.42; lr: 0.00010; 5412/6901 tok/s;   3056 sec
[2021-05-15 18:29:10,704 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 18:29:29,700 INFO] Step 8050/50000; acc:  88.84; ppl:  1.52; xent: 0.42; lr: 0.00010; 5470/6800 tok/s;   3075 sec
[2021-05-15 18:29:48,608 INFO] Step 8100/50000; acc:  88.81; ppl:  1.52; xent: 0.42; lr: 0.00010; 5406/6792 tok/s;   3094 sec
[2021-05-15 18:30:07,299 INFO] Step 8150/50000; acc:  88.00; ppl:  1.56; xent: 0.45; lr: 0.00010; 5341/6583 tok/s;   3113 sec
[2021-05-15 18:30:26,535 INFO] Step 8200/50000; acc:  88.85; ppl:  1.52; xent: 0.42; lr: 0.00010; 5413/6846 tok/s;   3132 sec
[2021-05-15 18:30:45,166 INFO] Step 8250/50000; acc:  88.40; ppl:  1.54; xent: 0.43; lr: 0.00010; 5224/6483 tok/s;   3151 sec
[2021-05-15 18:31:05,082 INFO] Step 8300/50000; acc:  88.92; ppl:  1.51; xent: 0.41; lr: 0.00010; 5190/6524 tok/s;   3170 sec
[2021-05-15 18:31:23,724 INFO] Step 8350/50000; acc:  88.44; ppl:  1.54; xent: 0.43; lr: 0.00010; 5470/6614 tok/s;   3189 sec
[2021-05-15 18:31:42,775 INFO] Step 8400/50000; acc:  88.52; ppl:  1.53; xent: 0.42; lr: 0.00010; 5394/6682 tok/s;   3208 sec
[2021-05-15 18:32:02,526 INFO] Step 8450/50000; acc:  88.89; ppl:  1.51; xent: 0.41; lr: 0.00010; 5060/6392 tok/s;   3228 sec
[2021-05-15 18:32:21,962 INFO] Step 8500/50000; acc:  88.78; ppl:  1.52; xent: 0.42; lr: 0.00010; 5370/6725 tok/s;   3247 sec
[2021-05-15 18:32:41,362 INFO] Step 8550/50000; acc:  88.90; ppl:  1.51; xent: 0.41; lr: 0.00010; 5276/6674 tok/s;   3267 sec
[2021-05-15 18:33:00,107 INFO] Step 8600/50000; acc:  88.50; ppl:  1.53; xent: 0.42; lr: 0.00010; 5319/6496 tok/s;   3286 sec
[2021-05-15 18:33:18,360 INFO] Step 8650/50000; acc:  89.29; ppl:  1.50; xent: 0.40; lr: 0.00010; 5695/7333 tok/s;   3304 sec
[2021-05-15 18:33:36,851 INFO] Step 8700/50000; acc:  88.73; ppl:  1.52; xent: 0.42; lr: 0.00010; 5373/6700 tok/s;   3322 sec
[2021-05-15 18:33:50,111 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 18:33:55,588 INFO] Step 8750/50000; acc:  89.13; ppl:  1.50; xent: 0.40; lr: 0.00010; 5506/6975 tok/s;   3341 sec
[2021-05-15 18:34:14,360 INFO] Step 8800/50000; acc:  88.68; ppl:  1.53; xent: 0.42; lr: 0.00010; 5269/6656 tok/s;   3360 sec
[2021-05-15 18:34:33,045 INFO] Step 8850/50000; acc:  88.74; ppl:  1.52; xent: 0.42; lr: 0.00010; 5562/6896 tok/s;   3378 sec
[2021-05-15 18:34:52,157 INFO] Step 8900/50000; acc:  88.49; ppl:  1.53; xent: 0.43; lr: 0.00010; 5290/6570 tok/s;   3398 sec
[2021-05-15 18:35:11,432 INFO] Step 8950/50000; acc:  88.69; ppl:  1.52; xent: 0.42; lr: 0.00010; 5233/6597 tok/s;   3417 sec
[2021-05-15 18:35:30,744 INFO] Step 9000/50000; acc:  88.94; ppl:  1.51; xent: 0.41; lr: 0.00010; 5374/6606 tok/s;   3436 sec
[2021-05-15 18:35:49,392 INFO] Step 9050/50000; acc:  88.70; ppl:  1.52; xent: 0.42; lr: 0.00010; 5251/6693 tok/s;   3455 sec
[2021-05-15 18:36:08,264 INFO] Step 9100/50000; acc:  88.80; ppl:  1.51; xent: 0.42; lr: 0.00010; 5573/6717 tok/s;   3474 sec
[2021-05-15 18:36:27,540 INFO] Step 9150/50000; acc:  88.52; ppl:  1.52; xent: 0.42; lr: 0.00010; 5246/6531 tok/s;   3493 sec
[2021-05-15 18:36:47,115 INFO] Step 9200/50000; acc:  89.22; ppl:  1.49; xent: 0.40; lr: 0.00010; 5189/6618 tok/s;   3513 sec
[2021-05-15 18:37:06,165 INFO] Step 9250/50000; acc:  88.93; ppl:  1.50; xent: 0.41; lr: 0.00010; 5279/6753 tok/s;   3532 sec
[2021-05-15 18:37:25,167 INFO] Step 9300/50000; acc:  88.60; ppl:  1.53; xent: 0.42; lr: 0.00010; 5465/6616 tok/s;   3551 sec
[2021-05-15 18:37:44,245 INFO] Step 9350/50000; acc:  89.17; ppl:  1.49; xent: 0.40; lr: 0.00010; 5394/6712 tok/s;   3570 sec
[2021-05-15 18:38:02,056 INFO] Step 9400/50000; acc:  88.97; ppl:  1.51; xent: 0.41; lr: 0.00010; 5561/7195 tok/s;   3587 sec
[2021-05-15 18:38:21,011 INFO] Step 9450/50000; acc:  89.06; ppl:  1.50; xent: 0.41; lr: 0.00010; 5451/6666 tok/s;   3606 sec
[2021-05-15 18:38:28,952 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 18:38:39,754 INFO] Step 9500/50000; acc:  88.96; ppl:  1.50; xent: 0.41; lr: 0.00010; 5349/6836 tok/s;   3625 sec
[2021-05-15 18:38:58,918 INFO] Step 9550/50000; acc:  89.27; ppl:  1.49; xent: 0.40; lr: 0.00010; 5400/6847 tok/s;   3644 sec
[2021-05-15 18:39:16,840 INFO] Step 9600/50000; acc:  88.03; ppl:  1.55; xent: 0.44; lr: 0.00010; 5489/6711 tok/s;   3662 sec
[2021-05-15 18:39:36,310 INFO] Step 9650/50000; acc:  89.06; ppl:  1.50; xent: 0.41; lr: 0.00010; 5308/6710 tok/s;   3682 sec
[2021-05-15 18:39:56,201 INFO] Step 9700/50000; acc:  88.82; ppl:  1.51; xent: 0.41; lr: 0.00010; 5110/6378 tok/s;   3702 sec
[2021-05-15 18:40:14,841 INFO] Step 9750/50000; acc:  88.88; ppl:  1.51; xent: 0.41; lr: 0.00010; 5382/6743 tok/s;   3720 sec
[2021-05-15 18:40:34,385 INFO] Step 9800/50000; acc:  88.99; ppl:  1.50; xent: 0.41; lr: 0.00010; 5397/6573 tok/s;   3740 sec
[2021-05-15 18:40:52,503 INFO] Step 9850/50000; acc:  88.65; ppl:  1.52; xent: 0.42; lr: 0.00010; 5424/6722 tok/s;   3758 sec
[2021-05-15 18:41:11,924 INFO] Step 9900/50000; acc:  89.29; ppl:  1.49; xent: 0.40; lr: 0.00010; 5353/6695 tok/s;   3777 sec
[2021-05-15 18:41:30,848 INFO] Step 9950/50000; acc:  89.02; ppl:  1.50; xent: 0.40; lr: 0.00010; 5331/6726 tok/s;   3796 sec
[2021-05-15 18:41:50,305 INFO] Step 10000/50000; acc:  89.04; ppl:  1.50; xent: 0.41; lr: 0.00010; 5237/6627 tok/s;   3816 sec
[2021-05-15 18:41:50,306 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/basic/valid.txt, align=None)...
[2021-05-15 18:42:17,902 INFO] Validation perplexity: 1.49059
[2021-05-15 18:42:17,902 INFO] Validation accuracy: 89.165
[2021-05-15 18:42:17,906 INFO] Saving checkpoint ../models/group2_params/basic_ops/model_step_10000.pt
[2021-05-15 18:42:36,391 INFO] Step 10050/50000; acc:  88.60; ppl:  1.52; xent: 0.42; lr: 0.00010; 2177/2703 tok/s;   3862 sec
[2021-05-15 18:42:54,977 INFO] Step 10100/50000; acc:  89.46; ppl:  1.48; xent: 0.39; lr: 0.00010; 5617/6985 tok/s;   3880 sec
[2021-05-15 18:43:13,718 INFO] Step 10150/50000; acc:  89.37; ppl:  1.48; xent: 0.39; lr: 0.00010; 5426/6973 tok/s;   3899 sec
[2021-05-15 18:43:31,846 INFO] Step 10200/50000; acc:  88.87; ppl:  1.51; xent: 0.41; lr: 0.00010; 5484/6870 tok/s;   3917 sec
[2021-05-15 18:43:34,707 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 18:43:51,057 INFO] Step 10250/50000; acc:  89.23; ppl:  1.49; xent: 0.40; lr: 0.00010; 5420/6707 tok/s;   3936 sec
[2021-05-15 18:44:09,398 INFO] Step 10300/50000; acc:  88.80; ppl:  1.51; xent: 0.41; lr: 0.00010; 5443/6831 tok/s;   3955 sec
[2021-05-15 18:44:28,217 INFO] Step 10350/50000; acc:  88.73; ppl:  1.52; xent: 0.42; lr: 0.00010; 5437/6836 tok/s;   3974 sec
[2021-05-15 18:44:47,044 INFO] Step 10400/50000; acc:  88.93; ppl:  1.50; xent: 0.41; lr: 0.00010; 5244/6654 tok/s;   3992 sec
[2021-05-15 18:45:06,437 INFO] Step 10450/50000; acc:  89.03; ppl:  1.50; xent: 0.41; lr: 0.00010; 5342/6609 tok/s;   4012 sec
[2021-05-15 18:45:25,814 INFO] Step 10500/50000; acc:  89.17; ppl:  1.49; xent: 0.40; lr: 0.00010; 5241/6541 tok/s;   4031 sec
[2021-05-15 18:45:44,295 INFO] Step 10550/50000; acc:  88.69; ppl:  1.51; xent: 0.42; lr: 0.00010; 5566/6659 tok/s;   4050 sec
[2021-05-15 18:46:03,500 INFO] Step 10600/50000; acc:  89.30; ppl:  1.48; xent: 0.39; lr: 0.00010; 5417/6785 tok/s;   4069 sec
[2021-05-15 18:46:22,634 INFO] Step 10650/50000; acc:  88.95; ppl:  1.50; xent: 0.41; lr: 0.00010; 5109/6517 tok/s;   4088 sec
[2021-05-15 18:46:42,157 INFO] Step 10700/50000; acc:  89.24; ppl:  1.49; xent: 0.40; lr: 0.00010; 5338/6728 tok/s;   4108 sec
[2021-05-15 18:47:00,730 INFO] Step 10750/50000; acc:  88.70; ppl:  1.52; xent: 0.42; lr: 0.00010; 5394/6666 tok/s;   4126 sec
[2021-05-15 18:47:19,421 INFO] Step 10800/50000; acc:  89.18; ppl:  1.49; xent: 0.40; lr: 0.00010; 5500/6715 tok/s;   4145 sec
[2021-05-15 18:47:38,219 INFO] Step 10850/50000; acc:  89.73; ppl:  1.46; xent: 0.38; lr: 0.00010; 5322/6964 tok/s;   4164 sec
[2021-05-15 18:47:56,978 INFO] Step 10900/50000; acc:  89.24; ppl:  1.49; xent: 0.40; lr: 0.00010; 5518/6868 tok/s;   4182 sec
[2021-05-15 18:48:01,477 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 18:48:15,938 INFO] Step 10950/50000; acc:  89.24; ppl:  1.48; xent: 0.40; lr: 0.00010; 5425/6913 tok/s;   4201 sec
[2021-05-15 18:48:34,492 INFO] Step 11000/50000; acc:  89.04; ppl:  1.50; xent: 0.40; lr: 0.00010; 5358/6620 tok/s;   4220 sec
[2021-05-15 18:48:53,187 INFO] Step 11050/50000; acc:  88.97; ppl:  1.50; xent: 0.41; lr: 0.00010; 5513/6780 tok/s;   4239 sec
[2021-05-15 18:49:12,415 INFO] Step 11100/50000; acc:  88.79; ppl:  1.50; xent: 0.41; lr: 0.00010; 5185/6613 tok/s;   4258 sec
[2021-05-15 18:49:31,638 INFO] Step 11150/50000; acc:  89.24; ppl:  1.49; xent: 0.40; lr: 0.00010; 5351/6674 tok/s;   4277 sec
[2021-05-15 18:49:50,306 INFO] Step 11200/50000; acc:  89.05; ppl:  1.49; xent: 0.40; lr: 0.00010; 5259/6705 tok/s;   4296 sec
[2021-05-15 18:50:09,315 INFO] Step 11250/50000; acc:  89.17; ppl:  1.49; xent: 0.40; lr: 0.00010; 5508/6655 tok/s;   4315 sec
[2021-05-15 18:50:27,921 INFO] Step 11300/50000; acc:  89.12; ppl:  1.49; xent: 0.40; lr: 0.00010; 5563/6851 tok/s;   4333 sec
[2021-05-15 18:50:46,832 INFO] Step 11350/50000; acc:  89.18; ppl:  1.49; xent: 0.40; lr: 0.00010; 5320/6681 tok/s;   4352 sec
[2021-05-15 18:51:06,342 INFO] Step 11400/50000; acc:  89.31; ppl:  1.48; xent: 0.39; lr: 0.00010; 5354/6653 tok/s;   4372 sec
[2021-05-15 18:51:24,873 INFO] Step 11450/50000; acc:  89.10; ppl:  1.49; xent: 0.40; lr: 0.00010; 5248/6779 tok/s;   4390 sec
[2021-05-15 18:51:44,371 INFO] Step 11500/50000; acc:  89.00; ppl:  1.50; xent: 0.40; lr: 0.00010; 5351/6443 tok/s;   4410 sec
[2021-05-15 18:52:02,692 INFO] Step 11550/50000; acc:  89.44; ppl:  1.47; xent: 0.39; lr: 0.00010; 5529/7067 tok/s;   4428 sec
[2021-05-15 18:52:21,542 INFO] Step 11600/50000; acc:  89.58; ppl:  1.47; xent: 0.38; lr: 0.00010; 5372/6797 tok/s;   4447 sec
[2021-05-15 18:52:39,135 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 18:52:40,029 INFO] Step 11650/50000; acc:  89.27; ppl:  1.48; xent: 0.39; lr: 0.00010; 5424/6903 tok/s;   4465 sec
[2021-05-15 18:52:59,037 INFO] Step 11700/50000; acc:  89.30; ppl:  1.48; xent: 0.39; lr: 0.00010; 5459/6778 tok/s;   4484 sec
[2021-05-15 18:53:18,120 INFO] Step 11750/50000; acc:  89.29; ppl:  1.48; xent: 0.39; lr: 0.00010; 5391/6752 tok/s;   4504 sec
[2021-05-15 18:53:36,677 INFO] Step 11800/50000; acc:  88.44; ppl:  1.52; xent: 0.42; lr: 0.00010; 5291/6550 tok/s;   4522 sec
[2021-05-15 18:53:56,378 INFO] Step 11850/50000; acc:  89.47; ppl:  1.47; xent: 0.39; lr: 0.00010; 5263/6709 tok/s;   4542 sec
[2021-05-15 18:54:15,103 INFO] Step 11900/50000; acc:  89.08; ppl:  1.49; xent: 0.40; lr: 0.00010; 5350/6613 tok/s;   4560 sec
[2021-05-15 18:54:34,690 INFO] Step 11950/50000; acc:  89.48; ppl:  1.47; xent: 0.39; lr: 0.00010; 5224/6537 tok/s;   4580 sec
[2021-05-15 18:54:53,221 INFO] Step 12000/50000; acc:  88.97; ppl:  1.50; xent: 0.40; lr: 0.00010; 5413/6557 tok/s;   4599 sec
[2021-05-15 18:55:12,183 INFO] Step 12050/50000; acc:  89.09; ppl:  1.49; xent: 0.40; lr: 0.00010; 5516/6802 tok/s;   4618 sec
[2021-05-15 18:55:31,884 INFO] Step 12100/50000; acc:  89.71; ppl:  1.46; xent: 0.38; lr: 0.00010; 5145/6534 tok/s;   4637 sec
[2021-05-15 18:55:51,318 INFO] Step 12150/50000; acc:  89.02; ppl:  1.49; xent: 0.40; lr: 0.00010; 5221/6539 tok/s;   4657 sec
[2021-05-15 18:56:10,835 INFO] Step 12200/50000; acc:  89.45; ppl:  1.47; xent: 0.39; lr: 0.00010; 5334/6716 tok/s;   4676 sec
[2021-05-15 18:56:29,388 INFO] Step 12250/50000; acc:  89.05; ppl:  1.48; xent: 0.39; lr: 0.00010; 5282/6495 tok/s;   4695 sec
[2021-05-15 18:56:47,910 INFO] Step 12300/50000; acc:  89.89; ppl:  1.45; xent: 0.37; lr: 0.00010; 5612/7267 tok/s;   4713 sec
[2021-05-15 18:57:06,313 INFO] Step 12350/50000; acc:  89.21; ppl:  1.48; xent: 0.39; lr: 0.00010; 5422/6729 tok/s;   4732 sec
[2021-05-15 18:57:18,864 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 18:57:25,116 INFO] Step 12400/50000; acc:  89.51; ppl:  1.47; xent: 0.38; lr: 0.00010; 5444/6882 tok/s;   4751 sec
[2021-05-15 18:57:44,311 INFO] Step 12450/50000; acc:  89.27; ppl:  1.48; xent: 0.39; lr: 0.00010; 5233/6534 tok/s;   4770 sec
[2021-05-15 18:58:02,603 INFO] Step 12500/50000; acc:  89.26; ppl:  1.48; xent: 0.39; lr: 0.00010; 5671/7065 tok/s;   4788 sec
[2021-05-15 18:58:21,794 INFO] Step 12550/50000; acc:  88.98; ppl:  1.49; xent: 0.40; lr: 0.00010; 5306/6668 tok/s;   4807 sec
[2021-05-15 18:58:40,868 INFO] Step 12600/50000; acc:  89.25; ppl:  1.48; xent: 0.39; lr: 0.00010; 5192/6549 tok/s;   4826 sec
[2021-05-15 18:58:59,871 INFO] Step 12650/50000; acc:  89.36; ppl:  1.48; xent: 0.39; lr: 0.00010; 5446/6624 tok/s;   4845 sec
[2021-05-15 18:59:19,022 INFO] Step 12700/50000; acc:  89.28; ppl:  1.48; xent: 0.39; lr: 0.00010; 5262/6634 tok/s;   4864 sec
[2021-05-15 18:59:37,841 INFO] Step 12750/50000; acc:  89.26; ppl:  1.48; xent: 0.39; lr: 0.00010; 5517/6726 tok/s;   4883 sec
[2021-05-15 18:59:56,835 INFO] Step 12800/50000; acc:  89.07; ppl:  1.49; xent: 0.40; lr: 0.00010; 5236/6552 tok/s;   4902 sec
[2021-05-15 19:00:16,587 INFO] Step 12850/50000; acc:  89.64; ppl:  1.46; xent: 0.38; lr: 0.00010; 5246/6548 tok/s;   4922 sec
[2021-05-15 19:00:36,031 INFO] Step 12900/50000; acc:  89.67; ppl:  1.46; xent: 0.38; lr: 0.00010; 5239/6784 tok/s;   4941 sec
[2021-05-15 19:00:54,691 INFO] Step 12950/50000; acc:  88.96; ppl:  1.49; xent: 0.40; lr: 0.00010; 5417/6597 tok/s;   4960 sec
[2021-05-15 19:01:13,574 INFO] Step 13000/50000; acc:  89.73; ppl:  1.45; xent: 0.37; lr: 0.00010; 5538/6860 tok/s;   4979 sec
[2021-05-15 19:01:31,083 INFO] Step 13050/50000; acc:  89.45; ppl:  1.47; xent: 0.38; lr: 0.00010; 5547/7234 tok/s;   4996 sec
[2021-05-15 19:01:49,942 INFO] Step 13100/50000; acc:  89.53; ppl:  1.47; xent: 0.38; lr: 0.00010; 5494/6748 tok/s;   5015 sec
[2021-05-15 19:01:57,227 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 19:02:08,870 INFO] Step 13150/50000; acc:  89.37; ppl:  1.47; xent: 0.39; lr: 0.00010; 5319/6743 tok/s;   5034 sec
[2021-05-15 19:02:27,694 INFO] Step 13200/50000; acc:  89.75; ppl:  1.46; xent: 0.38; lr: 0.00010; 5452/6955 tok/s;   5053 sec
[2021-05-15 19:02:45,693 INFO] Step 13250/50000; acc:  88.55; ppl:  1.51; xent: 0.41; lr: 0.00010; 5544/6708 tok/s;   5071 sec
[2021-05-15 19:03:05,319 INFO] Step 13300/50000; acc:  89.60; ppl:  1.46; xent: 0.38; lr: 0.00010; 5264/6674 tok/s;   5091 sec
[2021-05-15 19:03:24,928 INFO] Step 13350/50000; acc:  89.35; ppl:  1.47; xent: 0.39; lr: 0.00010; 5217/6522 tok/s;   5110 sec
[2021-05-15 19:03:43,427 INFO] Step 13400/50000; acc:  89.26; ppl:  1.48; xent: 0.39; lr: 0.00010; 5331/6647 tok/s;   5129 sec
[2021-05-15 19:04:02,830 INFO] Step 13450/50000; acc:  89.42; ppl:  1.47; xent: 0.39; lr: 0.00010; 5420/6605 tok/s;   5148 sec
[2021-05-15 19:04:21,415 INFO] Step 13500/50000; acc:  89.22; ppl:  1.48; xent: 0.39; lr: 0.00010; 5424/6727 tok/s;   5167 sec
[2021-05-15 19:04:40,740 INFO] Step 13550/50000; acc:  89.68; ppl:  1.46; xent: 0.38; lr: 0.00010; 5325/6694 tok/s;   5186 sec
[2021-05-15 19:04:59,600 INFO] Step 13600/50000; acc:  89.34; ppl:  1.47; xent: 0.38; lr: 0.00010; 5263/6691 tok/s;   5205 sec
[2021-05-15 19:05:18,728 INFO] Step 13650/50000; acc:  89.43; ppl:  1.47; xent: 0.39; lr: 0.00010; 5427/6697 tok/s;   5224 sec
[2021-05-15 19:05:37,960 INFO] Step 13700/50000; acc:  89.36; ppl:  1.47; xent: 0.39; lr: 0.00010; 5300/6589 tok/s;   5243 sec
[2021-05-15 19:05:56,092 INFO] Step 13750/50000; acc:  89.71; ppl:  1.45; xent: 0.37; lr: 0.00010; 5580/7015 tok/s;   5261 sec
[2021-05-15 19:06:14,991 INFO] Step 13800/50000; acc:  89.72; ppl:  1.46; xent: 0.38; lr: 0.00010; 5486/6955 tok/s;   5280 sec
[2021-05-15 19:06:33,276 INFO] Step 13850/50000; acc:  89.55; ppl:  1.46; xent: 0.38; lr: 0.00010; 5336/6843 tok/s;   5299 sec
[2021-05-15 19:06:35,487 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 19:06:52,587 INFO] Step 13900/50000; acc:  89.62; ppl:  1.46; xent: 0.38; lr: 0.00010; 5401/6675 tok/s;   5318 sec
[2021-05-15 19:07:10,818 INFO] Step 13950/50000; acc:  89.32; ppl:  1.48; xent: 0.39; lr: 0.00010; 5499/6868 tok/s;   5336 sec
[2021-05-15 19:07:29,977 INFO] Step 14000/50000; acc:  89.13; ppl:  1.48; xent: 0.39; lr: 0.00010; 5293/6688 tok/s;   5355 sec
[2021-05-15 19:07:49,119 INFO] Step 14050/50000; acc:  89.40; ppl:  1.47; xent: 0.38; lr: 0.00010; 5239/6573 tok/s;   5375 sec
[2021-05-15 19:08:08,440 INFO] Step 14100/50000; acc:  89.60; ppl:  1.46; xent: 0.38; lr: 0.00010; 5351/6673 tok/s;   5394 sec
[2021-05-15 19:08:27,249 INFO] Step 14150/50000; acc:  89.54; ppl:  1.46; xent: 0.38; lr: 0.00010; 5443/6678 tok/s;   5413 sec
[2021-05-15 19:08:45,810 INFO] Step 14200/50000; acc:  89.18; ppl:  1.47; xent: 0.39; lr: 0.00010; 5449/6668 tok/s;   5431 sec
[2021-05-15 19:09:05,031 INFO] Step 14250/50000; acc:  89.67; ppl:  1.45; xent: 0.37; lr: 0.00010; 5395/6733 tok/s;   5450 sec
[2021-05-15 19:09:24,524 INFO] Step 14300/50000; acc:  89.45; ppl:  1.46; xent: 0.38; lr: 0.00010; 5147/6487 tok/s;   5470 sec
[2021-05-15 19:09:43,796 INFO] Step 14350/50000; acc:  89.64; ppl:  1.46; xent: 0.38; lr: 0.00010; 5345/6802 tok/s;   5489 sec
[2021-05-15 19:10:02,165 INFO] Step 14400/50000; acc:  89.08; ppl:  1.48; xent: 0.39; lr: 0.00010; 5370/6677 tok/s;   5508 sec
[2021-05-15 19:10:21,101 INFO] Step 14450/50000; acc:  89.79; ppl:  1.45; xent: 0.37; lr: 0.00010; 5535/6713 tok/s;   5526 sec
[2021-05-15 19:10:40,209 INFO] Step 14500/50000; acc:  90.10; ppl:  1.44; xent: 0.36; lr: 0.00010; 5301/6887 tok/s;   5546 sec
[2021-05-15 19:10:58,459 INFO] Step 14550/50000; acc:  89.63; ppl:  1.46; xent: 0.38; lr: 0.00010; 5506/6933 tok/s;   5564 sec
[2021-05-15 19:11:02,255 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 19:11:17,662 INFO] Step 14600/50000; acc:  89.60; ppl:  1.46; xent: 0.38; lr: 0.00010; 5447/6830 tok/s;   5583 sec
[2021-05-15 19:11:36,059 INFO] Step 14650/50000; acc:  89.47; ppl:  1.46; xent: 0.38; lr: 0.00010; 5311/6645 tok/s;   5601 sec
[2021-05-15 19:11:55,087 INFO] Step 14700/50000; acc:  89.39; ppl:  1.47; xent: 0.38; lr: 0.00010; 5430/6706 tok/s;   5620 sec
[2021-05-15 19:12:14,029 INFO] Step 14750/50000; acc:  89.09; ppl:  1.48; xent: 0.39; lr: 0.00010; 5293/6656 tok/s;   5639 sec
[2021-05-15 19:12:33,407 INFO] Step 14800/50000; acc:  89.76; ppl:  1.45; xent: 0.37; lr: 0.00010; 5259/6634 tok/s;   5659 sec
[2021-05-15 19:12:52,260 INFO] Step 14850/50000; acc:  89.46; ppl:  1.46; xent: 0.38; lr: 0.00010; 5275/6660 tok/s;   5678 sec
[2021-05-15 19:13:11,457 INFO] Step 14900/50000; acc:  89.47; ppl:  1.46; xent: 0.38; lr: 0.00010; 5461/6503 tok/s;   5697 sec
[2021-05-15 19:13:30,327 INFO] Step 14950/50000; acc:  89.38; ppl:  1.47; xent: 0.38; lr: 0.00010; 5507/6826 tok/s;   5716 sec
[2021-05-15 19:13:49,319 INFO] Step 15000/50000; acc:  89.73; ppl:  1.45; xent: 0.37; lr: 0.00010; 5212/6659 tok/s;   5735 sec
[2021-05-15 19:13:49,320 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/basic/valid.txt, align=None)...
[2021-05-15 19:14:16,847 INFO] Validation perplexity: 1.46246
[2021-05-15 19:14:16,847 INFO] Validation accuracy: 89.554
[2021-05-15 19:14:16,851 INFO] Saving checkpoint ../models/group2_params/basic_ops/model_step_15000.pt
[2021-05-15 19:14:36,020 INFO] Step 15050/50000; acc:  89.73; ppl:  1.45; xent: 0.37; lr: 0.00010; 2227/2763 tok/s;   5781 sec
[2021-05-15 19:14:55,115 INFO] Step 15100/50000; acc:  89.50; ppl:  1.46; xent: 0.38; lr: 0.00010; 5241/6625 tok/s;   5801 sec
[2021-05-15 19:15:14,138 INFO] Step 15150/50000; acc:  89.47; ppl:  1.46; xent: 0.38; lr: 0.00010; 5422/6604 tok/s;   5820 sec
[2021-05-15 19:15:32,426 INFO] Step 15200/50000; acc:  89.88; ppl:  1.44; xent: 0.36; lr: 0.00010; 5450/7029 tok/s;   5838 sec
[2021-05-15 19:15:51,476 INFO] Step 15250/50000; acc:  89.93; ppl:  1.44; xent: 0.37; lr: 0.00010; 5413/6808 tok/s;   5857 sec
[2021-05-15 19:16:08,433 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 19:16:10,465 INFO] Step 15300/50000; acc:  89.91; ppl:  1.44; xent: 0.36; lr: 0.00010; 5354/6893 tok/s;   5876 sec
[2021-05-15 19:16:28,974 INFO] Step 15350/50000; acc:  89.47; ppl:  1.46; xent: 0.38; lr: 0.00010; 5450/6726 tok/s;   5894 sec
[2021-05-15 19:16:48,362 INFO] Step 15400/50000; acc:  89.67; ppl:  1.46; xent: 0.38; lr: 0.00010; 5401/6708 tok/s;   5914 sec
[2021-05-15 19:17:06,354 INFO] Step 15450/50000; acc:  88.90; ppl:  1.49; xent: 0.40; lr: 0.00010; 5355/6677 tok/s;   5932 sec
[2021-05-15 19:17:26,134 INFO] Step 15500/50000; acc:  89.92; ppl:  1.44; xent: 0.37; lr: 0.00010; 5255/6676 tok/s;   5952 sec
[2021-05-15 19:17:44,798 INFO] Step 15550/50000; acc:  89.39; ppl:  1.46; xent: 0.38; lr: 0.00010; 5382/6698 tok/s;   5970 sec
[2021-05-15 19:18:04,249 INFO] Step 15600/50000; acc:  89.79; ppl:  1.45; xent: 0.37; lr: 0.00010; 5232/6558 tok/s;   5990 sec
[2021-05-15 19:18:23,138 INFO] Step 15650/50000; acc:  89.39; ppl:  1.46; xent: 0.38; lr: 0.00010; 5374/6511 tok/s;   6009 sec
[2021-05-15 19:18:41,938 INFO] Step 15700/50000; acc:  89.48; ppl:  1.46; xent: 0.38; lr: 0.00010; 5568/6774 tok/s;   6027 sec
[2021-05-15 19:19:01,731 INFO] Step 15750/50000; acc:  90.08; ppl:  1.43; xent: 0.36; lr: 0.00010; 5146/6581 tok/s;   6047 sec
[2021-05-15 19:19:21,042 INFO] Step 15800/50000; acc:  89.44; ppl:  1.46; xent: 0.38; lr: 0.00010; 5169/6468 tok/s;   6066 sec
[2021-05-15 19:19:40,282 INFO] Step 15850/50000; acc:  89.79; ppl:  1.45; xent: 0.37; lr: 0.00010; 5382/6779 tok/s;   6086 sec
[2021-05-15 19:19:59,274 INFO] Step 15900/50000; acc:  89.62; ppl:  1.45; xent: 0.37; lr: 0.00010; 5301/6541 tok/s;   6105 sec
[2021-05-15 19:20:17,775 INFO] Step 15950/50000; acc:  90.15; ppl:  1.43; xent: 0.36; lr: 0.00010; 5559/7158 tok/s;   6123 sec
[2021-05-15 19:20:36,092 INFO] Step 16000/50000; acc:  89.42; ppl:  1.46; xent: 0.38; lr: 0.00010; 5360/6682 tok/s;   6141 sec
[2021-05-15 19:20:48,055 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 19:20:54,849 INFO] Step 16050/50000; acc:  90.00; ppl:  1.43; xent: 0.36; lr: 0.00010; 5564/7014 tok/s;   6160 sec
[2021-05-15 19:21:14,273 INFO] Step 16100/50000; acc:  89.82; ppl:  1.45; xent: 0.37; lr: 0.00010; 5254/6570 tok/s;   6180 sec
[2021-05-15 19:21:32,451 INFO] Step 16150/50000; acc:  89.31; ppl:  1.46; xent: 0.38; lr: 0.00010; 5532/6960 tok/s;   6198 sec
[2021-05-15 19:21:51,819 INFO] Step 16200/50000; acc:  89.57; ppl:  1.45; xent: 0.37; lr: 0.00010; 5349/6677 tok/s;   6217 sec
[2021-05-15 19:22:10,785 INFO] Step 16250/50000; acc:  89.45; ppl:  1.46; xent: 0.38; lr: 0.00010; 5127/6460 tok/s;   6236 sec
[2021-05-15 19:22:29,659 INFO] Step 16300/50000; acc:  89.79; ppl:  1.45; xent: 0.37; lr: 0.00010; 5488/6756 tok/s;   6255 sec
[2021-05-15 19:22:48,869 INFO] Step 16350/50000; acc:  89.57; ppl:  1.45; xent: 0.37; lr: 0.00010; 5279/6551 tok/s;   6274 sec
[2021-05-15 19:23:07,552 INFO] Step 16400/50000; acc:  89.71; ppl:  1.44; xent: 0.37; lr: 0.00010; 5519/6773 tok/s;   6293 sec
[2021-05-15 19:23:26,918 INFO] Step 16450/50000; acc:  89.37; ppl:  1.46; xent: 0.38; lr: 0.00010; 5197/6460 tok/s;   6312 sec
[2021-05-15 19:23:46,624 INFO] Step 16500/50000; acc:  89.95; ppl:  1.44; xent: 0.36; lr: 0.00010; 5259/6580 tok/s;   6332 sec
[2021-05-15 19:24:05,678 INFO] Step 16550/50000; acc:  90.19; ppl:  1.42; xent: 0.35; lr: 0.00010; 5378/7009 tok/s;   6351 sec
[2021-05-15 19:24:24,249 INFO] Step 16600/50000; acc:  89.20; ppl:  1.47; xent: 0.39; lr: 0.00010; 5352/6513 tok/s;   6370 sec
[2021-05-15 19:24:42,838 INFO] Step 16650/50000; acc:  90.09; ppl:  1.43; xent: 0.36; lr: 0.00010; 5604/6969 tok/s;   6388 sec
[2021-05-15 19:25:00,775 INFO] Step 16700/50000; acc:  89.82; ppl:  1.44; xent: 0.37; lr: 0.00010; 5569/7119 tok/s;   6406 sec
[2021-05-15 19:25:19,681 INFO] Step 16750/50000; acc:  89.84; ppl:  1.44; xent: 0.37; lr: 0.00010; 5415/6712 tok/s;   6425 sec
[2021-05-15 19:25:26,052 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 19:25:38,522 INFO] Step 16800/50000; acc:  89.67; ppl:  1.45; xent: 0.37; lr: 0.00010; 5269/6766 tok/s;   6444 sec
[2021-05-15 19:25:57,343 INFO] Step 16850/50000; acc:  90.09; ppl:  1.43; xent: 0.36; lr: 0.00010; 5538/6939 tok/s;   6463 sec
[2021-05-15 19:26:15,692 INFO] Step 16900/50000; acc:  89.35; ppl:  1.46; xent: 0.38; lr: 0.00010; 5521/6758 tok/s;   6481 sec
[2021-05-15 19:26:34,865 INFO] Step 16950/50000; acc:  89.63; ppl:  1.45; xent: 0.37; lr: 0.00010; 5230/6659 tok/s;   6500 sec
[2021-05-15 19:26:54,370 INFO] Step 17000/50000; acc:  89.65; ppl:  1.45; xent: 0.37; lr: 0.00010; 5336/6562 tok/s;   6520 sec
[2021-05-15 19:27:12,758 INFO] Step 17050/50000; acc:  89.57; ppl:  1.45; xent: 0.37; lr: 0.00010; 5270/6679 tok/s;   6538 sec
[2021-05-15 19:27:32,188 INFO] Step 17100/50000; acc:  89.71; ppl:  1.45; xent: 0.37; lr: 0.00010; 5426/6542 tok/s;   6558 sec
[2021-05-15 19:27:50,718 INFO] Step 17150/50000; acc:  89.69; ppl:  1.44; xent: 0.37; lr: 0.00010; 5460/6823 tok/s;   6576 sec
[2021-05-15 19:28:10,130 INFO] Step 17200/50000; acc:  89.97; ppl:  1.43; xent: 0.36; lr: 0.00010; 5259/6624 tok/s;   6596 sec
[2021-05-15 19:28:29,284 INFO] Step 17250/50000; acc:  89.86; ppl:  1.43; xent: 0.36; lr: 0.00010; 5254/6679 tok/s;   6615 sec
[2021-05-15 19:28:48,677 INFO] Step 17300/50000; acc:  89.67; ppl:  1.45; xent: 0.37; lr: 0.00010; 5348/6530 tok/s;   6634 sec
[2021-05-15 19:29:07,349 INFO] Step 17350/50000; acc:  89.76; ppl:  1.44; xent: 0.37; lr: 0.00010; 5499/6778 tok/s;   6653 sec
[2021-05-15 19:29:25,632 INFO] Step 17400/50000; acc:  90.03; ppl:  1.43; xent: 0.35; lr: 0.00010; 5431/6968 tok/s;   6671 sec
[2021-05-15 19:29:44,249 INFO] Step 17450/50000; acc:  90.11; ppl:  1.43; xent: 0.36; lr: 0.00010; 5557/7011 tok/s;   6690 sec
[2021-05-15 19:30:02,886 INFO] Step 17500/50000; acc:  89.92; ppl:  1.43; xent: 0.36; lr: 0.00010; 5377/6866 tok/s;   6708 sec
[2021-05-15 19:30:04,195 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 19:30:22,308 INFO] Step 17550/50000; acc:  89.84; ppl:  1.44; xent: 0.37; lr: 0.00010; 5317/6541 tok/s;   6728 sec
[2021-05-15 19:30:40,401 INFO] Step 17600/50000; acc:  89.67; ppl:  1.45; xent: 0.37; lr: 0.00010; 5443/6910 tok/s;   6746 sec
[2021-05-15 19:30:59,521 INFO] Step 17650/50000; acc:  89.46; ppl:  1.46; xent: 0.38; lr: 0.00010; 5410/6724 tok/s;   6765 sec
[2021-05-15 19:31:19,003 INFO] Step 17700/50000; acc:  89.85; ppl:  1.44; xent: 0.36; lr: 0.00010; 5215/6572 tok/s;   6784 sec
[2021-05-15 19:31:37,689 INFO] Step 17750/50000; acc:  89.78; ppl:  1.44; xent: 0.36; lr: 0.00010; 5373/6735 tok/s;   6803 sec
[2021-05-15 19:31:56,904 INFO] Step 17800/50000; acc:  89.88; ppl:  1.44; xent: 0.36; lr: 0.00010; 5428/6589 tok/s;   6822 sec
[2021-05-15 19:32:15,336 INFO] Step 17850/50000; acc:  89.52; ppl:  1.45; xent: 0.37; lr: 0.00010; 5392/6671 tok/s;   6841 sec
[2021-05-15 19:32:34,501 INFO] Step 17900/50000; acc:  90.11; ppl:  1.43; xent: 0.36; lr: 0.00010; 5416/6766 tok/s;   6860 sec
[2021-05-15 19:32:53,820 INFO] Step 17950/50000; acc:  89.79; ppl:  1.44; xent: 0.36; lr: 0.00010; 5210/6573 tok/s;   6879 sec
[2021-05-15 19:33:13,062 INFO] Step 18000/50000; acc:  89.96; ppl:  1.43; xent: 0.36; lr: 0.00010; 5317/6746 tok/s;   6898 sec
[2021-05-15 19:33:32,128 INFO] Step 18050/50000; acc:  89.52; ppl:  1.45; xent: 0.37; lr: 0.00010; 5245/6485 tok/s;   6918 sec
[2021-05-15 19:33:51,026 INFO] Step 18100/50000; acc:  90.10; ppl:  1.43; xent: 0.35; lr: 0.00010; 5548/6746 tok/s;   6936 sec
[2021-05-15 19:34:09,990 INFO] Step 18150/50000; acc:  90.39; ppl:  1.41; xent: 0.35; lr: 0.00010; 5381/6964 tok/s;   6955 sec
[2021-05-15 19:34:28,283 INFO] Step 18200/50000; acc:  89.85; ppl:  1.44; xent: 0.36; lr: 0.00010; 5395/6836 tok/s;   6974 sec
[2021-05-15 19:34:31,347 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 19:34:47,480 INFO] Step 18250/50000; acc:  89.98; ppl:  1.43; xent: 0.36; lr: 0.00010; 5425/6821 tok/s;   6993 sec
[2021-05-15 19:35:05,826 INFO] Step 18300/50000; acc:  89.92; ppl:  1.44; xent: 0.36; lr: 0.00010; 5479/6819 tok/s;   7011 sec
[2021-05-15 19:35:24,274 INFO] Step 18350/50000; acc:  89.61; ppl:  1.45; xent: 0.37; lr: 0.00010; 5525/6790 tok/s;   7030 sec
[2021-05-15 19:35:43,138 INFO] Step 18400/50000; acc:  89.59; ppl:  1.45; xent: 0.37; lr: 0.00010; 5240/6704 tok/s;   7049 sec
[2021-05-15 19:36:02,501 INFO] Step 18450/50000; acc:  90.10; ppl:  1.43; xent: 0.36; lr: 0.00010; 5356/6650 tok/s;   7068 sec
[2021-05-15 19:36:21,777 INFO] Step 18500/50000; acc:  89.94; ppl:  1.43; xent: 0.36; lr: 0.00010; 5242/6649 tok/s;   7087 sec
[2021-05-15 19:36:40,608 INFO] Step 18550/50000; acc:  89.64; ppl:  1.45; xent: 0.37; lr: 0.00010; 5411/6446 tok/s;   7106 sec
[2021-05-15 19:36:59,807 INFO] Step 18600/50000; acc:  89.74; ppl:  1.44; xent: 0.36; lr: 0.00010; 5490/6753 tok/s;   7125 sec
[2021-05-15 19:37:18,603 INFO] Step 18650/50000; acc:  90.05; ppl:  1.42; xent: 0.35; lr: 0.00010; 5183/6675 tok/s;   7144 sec
[2021-05-15 19:37:37,872 INFO] Step 18700/50000; acc:  90.05; ppl:  1.43; xent: 0.36; lr: 0.00010; 5398/6671 tok/s;   7163 sec
[2021-05-15 19:37:56,870 INFO] Step 18750/50000; acc:  89.84; ppl:  1.44; xent: 0.36; lr: 0.00010; 5303/6716 tok/s;   7182 sec
[2021-05-15 19:38:15,815 INFO] Step 18800/50000; acc:  89.77; ppl:  1.44; xent: 0.36; lr: 0.00010; 5403/6607 tok/s;   7201 sec
[2021-05-15 19:38:34,348 INFO] Step 18850/50000; acc:  90.27; ppl:  1.41; xent: 0.34; lr: 0.00010; 5450/6971 tok/s;   7220 sec
[2021-05-15 19:38:53,273 INFO] Step 18900/50000; acc:  90.14; ppl:  1.42; xent: 0.35; lr: 0.00010; 5445/6839 tok/s;   7239 sec
[2021-05-15 19:39:09,421 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 19:39:12,181 INFO] Step 18950/50000; acc:  90.27; ppl:  1.41; xent: 0.35; lr: 0.00010; 5413/6966 tok/s;   7258 sec
[2021-05-15 19:39:30,616 INFO] Step 19000/50000; acc:  89.77; ppl:  1.44; xent: 0.36; lr: 0.00010; 5373/6688 tok/s;   7276 sec
[2021-05-15 19:39:49,462 INFO] Step 19050/50000; acc:  90.00; ppl:  1.43; xent: 0.36; lr: 0.00010; 5531/6906 tok/s;   7295 sec
[2021-05-15 19:40:08,064 INFO] Step 19100/50000; acc:  89.26; ppl:  1.46; xent: 0.38; lr: 0.00010; 5336/6521 tok/s;   7313 sec
[2021-05-15 19:40:27,767 INFO] Step 19150/50000; acc:  90.07; ppl:  1.42; xent: 0.35; lr: 0.00010; 5213/6618 tok/s;   7333 sec
[2021-05-15 19:40:46,390 INFO] Step 19200/50000; acc:  89.82; ppl:  1.44; xent: 0.36; lr: 0.00010; 5302/6709 tok/s;   7352 sec
[2021-05-15 19:41:05,894 INFO] Step 19250/50000; acc:  90.08; ppl:  1.43; xent: 0.35; lr: 0.00010; 5329/6627 tok/s;   7371 sec
[2021-05-15 19:41:25,120 INFO] Step 19300/50000; acc:  89.91; ppl:  1.43; xent: 0.36; lr: 0.00010; 5344/6503 tok/s;   7391 sec
[2021-05-15 19:41:43,651 INFO] Step 19350/50000; acc:  89.62; ppl:  1.44; xent: 0.37; lr: 0.00010; 5494/6686 tok/s;   7409 sec
[2021-05-15 19:42:03,550 INFO] Step 19400/50000; acc:  90.44; ppl:  1.41; xent: 0.34; lr: 0.00010; 5206/6656 tok/s;   7429 sec
[2021-05-15 19:42:22,809 INFO] Step 19450/50000; acc:  89.84; ppl:  1.44; xent: 0.36; lr: 0.00010; 5088/6453 tok/s;   7448 sec
[2021-05-15 19:42:41,965 INFO] Step 19500/50000; acc:  90.01; ppl:  1.43; xent: 0.36; lr: 0.00010; 5417/6742 tok/s;   7467 sec
[2021-05-15 19:43:00,609 INFO] Step 19550/50000; acc:  89.99; ppl:  1.42; xent: 0.35; lr: 0.00010; 5421/6698 tok/s;   7486 sec
[2021-05-15 19:43:19,162 INFO] Step 19600/50000; acc:  90.50; ppl:  1.41; xent: 0.34; lr: 0.00010; 5503/7098 tok/s;   7505 sec
[2021-05-15 19:43:37,785 INFO] Step 19650/50000; acc:  89.78; ppl:  1.43; xent: 0.36; lr: 0.00010; 5342/6602 tok/s;   7523 sec
[2021-05-15 19:43:48,860 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 19:43:56,574 INFO] Step 19700/50000; acc:  90.24; ppl:  1.42; xent: 0.35; lr: 0.00010; 5550/7011 tok/s;   7542 sec
[2021-05-15 19:44:15,833 INFO] Step 19750/50000; acc:  90.11; ppl:  1.43; xent: 0.36; lr: 0.00010; 5348/6663 tok/s;   7561 sec
[2021-05-15 19:44:33,558 INFO] Step 19800/50000; acc:  89.62; ppl:  1.45; xent: 0.37; lr: 0.00010; 5562/7047 tok/s;   7579 sec
[2021-05-15 19:44:52,938 INFO] Step 19850/50000; acc:  90.00; ppl:  1.43; xent: 0.36; lr: 0.00010; 5323/6644 tok/s;   7598 sec
[2021-05-15 19:45:12,303 INFO] Step 19900/50000; acc:  89.73; ppl:  1.44; xent: 0.36; lr: 0.00010; 5172/6414 tok/s;   7618 sec
[2021-05-15 19:45:30,943 INFO] Step 19950/50000; acc:  90.10; ppl:  1.42; xent: 0.35; lr: 0.00010; 5491/6847 tok/s;   7636 sec
[2021-05-15 19:45:50,306 INFO] Step 20000/50000; acc:  89.84; ppl:  1.43; xent: 0.36; lr: 0.00010; 5147/6470 tok/s;   7656 sec
[2021-05-15 19:45:50,307 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/basic/valid.txt, align=None)...
[2021-05-15 19:46:18,055 INFO] Validation perplexity: 1.4458
[2021-05-15 19:46:18,055 INFO] Validation accuracy: 89.7809
[2021-05-15 19:46:18,059 INFO] Saving checkpoint ../models/group2_params/basic_ops/model_step_20000.pt
[2021-05-15 19:46:36,739 INFO] Step 20050/50000; acc:  89.95; ppl:  1.43; xent: 0.36; lr: 0.00010; 2266/2736 tok/s;   7702 sec
[2021-05-15 19:46:56,436 INFO] Step 20100/50000; acc:  90.14; ppl:  1.42; xent: 0.35; lr: 0.00010; 5178/6522 tok/s;   7722 sec
[2021-05-15 19:47:15,978 INFO] Step 20150/50000; acc:  89.99; ppl:  1.42; xent: 0.35; lr: 0.00010; 5157/6421 tok/s;   7741 sec
[2021-05-15 19:47:35,603 INFO] Step 20200/50000; acc:  90.32; ppl:  1.41; xent: 0.35; lr: 0.00010; 5313/6801 tok/s;   7761 sec
[2021-05-15 19:47:54,117 INFO] Step 20250/50000; acc:  89.67; ppl:  1.44; xent: 0.37; lr: 0.00010; 5273/6567 tok/s;   7780 sec
[2021-05-15 19:48:12,743 INFO] Step 20300/50000; acc:  90.44; ppl:  1.40; xent: 0.34; lr: 0.00010; 5600/6939 tok/s;   7798 sec
[2021-05-15 19:48:30,921 INFO] Step 20350/50000; acc:  90.12; ppl:  1.42; xent: 0.35; lr: 0.00010; 5503/7051 tok/s;   7816 sec
[2021-05-15 19:48:49,475 INFO] Step 20400/50000; acc:  90.13; ppl:  1.42; xent: 0.35; lr: 0.00010; 5489/6781 tok/s;   7835 sec
[2021-05-15 19:48:55,274 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 19:49:08,608 INFO] Step 20450/50000; acc:  90.00; ppl:  1.43; xent: 0.35; lr: 0.00010; 5251/6700 tok/s;   7854 sec
[2021-05-15 19:49:27,036 INFO] Step 20500/50000; acc:  90.25; ppl:  1.42; xent: 0.35; lr: 0.00010; 5654/7008 tok/s;   7872 sec
[2021-05-15 19:49:45,701 INFO] Step 20550/50000; acc:  89.66; ppl:  1.44; xent: 0.37; lr: 0.00010; 5466/6748 tok/s;   7891 sec
[2021-05-15 19:50:05,000 INFO] Step 20600/50000; acc:  89.88; ppl:  1.42; xent: 0.35; lr: 0.00010; 5113/6577 tok/s;   7910 sec
[2021-05-15 19:50:24,331 INFO] Step 20650/50000; acc:  89.93; ppl:  1.43; xent: 0.36; lr: 0.00010; 5360/6546 tok/s;   7930 sec
[2021-05-15 19:50:42,919 INFO] Step 20700/50000; acc:  89.90; ppl:  1.43; xent: 0.35; lr: 0.00010; 5365/6715 tok/s;   7948 sec
[2021-05-15 19:51:02,036 INFO] Step 20750/50000; acc:  90.13; ppl:  1.42; xent: 0.35; lr: 0.00010; 5457/6699 tok/s;   7967 sec
[2021-05-15 19:51:20,474 INFO] Step 20800/50000; acc:  89.99; ppl:  1.42; xent: 0.35; lr: 0.00010; 5381/6725 tok/s;   7986 sec
[2021-05-15 19:51:39,757 INFO] Step 20850/50000; acc:  90.16; ppl:  1.42; xent: 0.35; lr: 0.00010; 5406/6716 tok/s;   8005 sec
[2021-05-15 19:51:59,158 INFO] Step 20900/50000; acc:  90.40; ppl:  1.41; xent: 0.34; lr: 0.00010; 5259/6746 tok/s;   8025 sec
[2021-05-15 19:52:18,180 INFO] Step 20950/50000; acc:  89.92; ppl:  1.43; xent: 0.36; lr: 0.00010; 5289/6548 tok/s;   8044 sec
[2021-05-15 19:52:36,653 INFO] Step 21000/50000; acc:  90.07; ppl:  1.42; xent: 0.35; lr: 0.00010; 5668/6878 tok/s;   8062 sec
[2021-05-15 19:52:55,064 INFO] Step 21050/50000; acc:  90.37; ppl:  1.40; xent: 0.34; lr: 0.00010; 5291/6871 tok/s;   8080 sec
[2021-05-15 19:53:13,792 INFO] Step 21100/50000; acc:  90.43; ppl:  1.41; xent: 0.34; lr: 0.00010; 5531/6947 tok/s;   8099 sec
[2021-05-15 19:53:21,364 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 19:53:32,448 INFO] Step 21150/50000; acc:  90.12; ppl:  1.42; xent: 0.35; lr: 0.00010; 5404/6846 tok/s;   8118 sec
[2021-05-15 19:53:51,709 INFO] Step 21200/50000; acc:  90.25; ppl:  1.42; xent: 0.35; lr: 0.00010; 5314/6597 tok/s;   8137 sec
[2021-05-15 19:54:09,955 INFO] Step 21250/50000; acc:  90.02; ppl:  1.42; xent: 0.35; lr: 0.00010; 5466/6931 tok/s;   8155 sec
[2021-05-15 19:54:29,110 INFO] Step 21300/50000; acc:  89.58; ppl:  1.45; xent: 0.37; lr: 0.00010; 5398/6607 tok/s;   8175 sec
[2021-05-15 19:54:48,400 INFO] Step 21350/50000; acc:  90.25; ppl:  1.41; xent: 0.34; lr: 0.00010; 5305/6714 tok/s;   8194 sec
[2021-05-15 19:55:07,064 INFO] Step 21400/50000; acc:  90.08; ppl:  1.42; xent: 0.35; lr: 0.00010; 5287/6695 tok/s;   8212 sec
[2021-05-15 19:55:26,356 INFO] Step 21450/50000; acc:  90.19; ppl:  1.42; xent: 0.35; lr: 0.00010; 5385/6509 tok/s;   8232 sec
[2021-05-15 19:55:44,906 INFO] Step 21500/50000; acc:  89.95; ppl:  1.42; xent: 0.35; lr: 0.00010; 5514/6739 tok/s;   8250 sec
[2021-05-15 19:56:03,806 INFO] Step 21550/50000; acc:  90.27; ppl:  1.41; xent: 0.34; lr: 0.00010; 5425/6808 tok/s;   8269 sec
[2021-05-15 19:56:23,112 INFO] Step 21600/50000; acc:  90.15; ppl:  1.41; xent: 0.35; lr: 0.00010; 5127/6525 tok/s;   8289 sec
[2021-05-15 19:56:42,666 INFO] Step 21650/50000; acc:  90.25; ppl:  1.41; xent: 0.35; lr: 0.00010; 5327/6712 tok/s;   8308 sec
[2021-05-15 19:57:01,857 INFO] Step 21700/50000; acc:  89.97; ppl:  1.42; xent: 0.35; lr: 0.00010; 5302/6569 tok/s;   8327 sec
[2021-05-15 19:57:20,223 INFO] Step 21750/50000; acc:  90.23; ppl:  1.41; xent: 0.34; lr: 0.00010; 5532/6812 tok/s;   8346 sec
[2021-05-15 19:57:39,163 INFO] Step 21800/50000; acc:  90.57; ppl:  1.40; xent: 0.33; lr: 0.00010; 5481/6980 tok/s;   8365 sec
[2021-05-15 19:57:57,380 INFO] Step 21850/50000; acc:  90.20; ppl:  1.41; xent: 0.34; lr: 0.00010; 5321/6826 tok/s;   8383 sec
[2021-05-15 19:57:59,694 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 19:58:16,570 INFO] Step 21900/50000; acc:  90.22; ppl:  1.41; xent: 0.35; lr: 0.00010; 5430/6806 tok/s;   8402 sec
[2021-05-15 19:58:35,144 INFO] Step 21950/50000; acc:  90.22; ppl:  1.42; xent: 0.35; lr: 0.00010; 5439/6793 tok/s;   8421 sec
[2021-05-15 19:58:53,643 INFO] Step 22000/50000; acc:  89.89; ppl:  1.43; xent: 0.36; lr: 0.00010; 5468/6769 tok/s;   8439 sec
[2021-05-15 19:59:12,615 INFO] Step 22050/50000; acc:  89.86; ppl:  1.43; xent: 0.35; lr: 0.00010; 5280/6699 tok/s;   8458 sec
[2021-05-15 19:59:32,247 INFO] Step 22100/50000; acc:  90.42; ppl:  1.41; xent: 0.34; lr: 0.00010; 5286/6540 tok/s;   8478 sec
[2021-05-15 19:59:51,565 INFO] Step 22150/50000; acc:  90.13; ppl:  1.41; xent: 0.35; lr: 0.00010; 5261/6634 tok/s;   8497 sec
[2021-05-15 20:00:10,113 INFO] Step 22200/50000; acc:  89.84; ppl:  1.42; xent: 0.35; lr: 0.00010; 5416/6541 tok/s;   8516 sec
[2021-05-15 20:00:28,931 INFO] Step 22250/50000; acc:  90.09; ppl:  1.42; xent: 0.35; lr: 0.00010; 5566/6827 tok/s;   8534 sec
[2021-05-15 20:00:48,478 INFO] Step 22300/50000; acc:  90.44; ppl:  1.40; xent: 0.34; lr: 0.00010; 5131/6540 tok/s;   8554 sec
[2021-05-15 20:01:07,603 INFO] Step 22350/50000; acc:  90.25; ppl:  1.41; xent: 0.34; lr: 0.00010; 5375/6679 tok/s;   8573 sec
[2021-05-15 20:01:26,023 INFO] Step 22400/50000; acc:  90.12; ppl:  1.42; xent: 0.35; lr: 0.00010; 5369/6825 tok/s;   8591 sec
[2021-05-15 20:01:45,137 INFO] Step 22450/50000; acc:  90.12; ppl:  1.42; xent: 0.35; lr: 0.00010; 5468/6640 tok/s;   8611 sec
[2021-05-15 20:02:03,865 INFO] Step 22500/50000; acc:  90.77; ppl:  1.39; xent: 0.33; lr: 0.00010; 5463/7079 tok/s;   8629 sec
[2021-05-15 20:02:22,573 INFO] Step 22550/50000; acc:  90.26; ppl:  1.41; xent: 0.35; lr: 0.00010; 5350/6713 tok/s;   8648 sec
[2021-05-15 20:02:37,957 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 20:02:41,509 INFO] Step 22600/50000; acc:  90.48; ppl:  1.40; xent: 0.34; lr: 0.00010; 5502/6933 tok/s;   8667 sec
[2021-05-15 20:03:00,006 INFO] Step 22650/50000; acc:  90.18; ppl:  1.42; xent: 0.35; lr: 0.00010; 5263/6656 tok/s;   8685 sec
[2021-05-15 20:03:18,920 INFO] Step 22700/50000; acc:  90.34; ppl:  1.41; xent: 0.34; lr: 0.00010; 5511/6845 tok/s;   8704 sec
[2021-05-15 20:03:37,212 INFO] Step 22750/50000; acc:  89.50; ppl:  1.44; xent: 0.37; lr: 0.00010; 5448/6671 tok/s;   8723 sec
[2021-05-15 20:03:57,141 INFO] Step 22800/50000; acc:  90.36; ppl:  1.40; xent: 0.34; lr: 0.00010; 5123/6527 tok/s;   8743 sec
[2021-05-15 20:04:15,878 INFO] Step 22850/50000; acc:  90.13; ppl:  1.41; xent: 0.35; lr: 0.00010; 5335/6652 tok/s;   8761 sec
[2021-05-15 20:04:35,298 INFO] Step 22900/50000; acc:  90.37; ppl:  1.41; xent: 0.34; lr: 0.00010; 5352/6680 tok/s;   8781 sec
[2021-05-15 20:04:54,023 INFO] Step 22950/50000; acc:  90.21; ppl:  1.41; xent: 0.34; lr: 0.00010; 5529/6723 tok/s;   8799 sec
[2021-05-15 20:05:12,893 INFO] Step 23000/50000; acc:  90.01; ppl:  1.42; xent: 0.35; lr: 0.00010; 5293/6566 tok/s;   8818 sec
[2021-05-15 20:05:32,555 INFO] Step 23050/50000; acc:  90.63; ppl:  1.39; xent: 0.33; lr: 0.00010; 5257/6650 tok/s;   8838 sec
[2021-05-15 20:05:52,189 INFO] Step 23100/50000; acc:  90.18; ppl:  1.41; xent: 0.35; lr: 0.00010; 5130/6496 tok/s;   8858 sec
[2021-05-15 20:06:11,104 INFO] Step 23150/50000; acc:  90.24; ppl:  1.41; xent: 0.34; lr: 0.00010; 5425/6784 tok/s;   8876 sec
[2021-05-15 20:06:29,755 INFO] Step 23200/50000; acc:  90.28; ppl:  1.40; xent: 0.34; lr: 0.00010; 5334/6604 tok/s;   8895 sec
[2021-05-15 20:06:48,157 INFO] Step 23250/50000; acc:  90.68; ppl:  1.39; xent: 0.33; lr: 0.00010; 5647/7187 tok/s;   8914 sec
[2021-05-15 20:07:07,081 INFO] Step 23300/50000; acc:  90.27; ppl:  1.41; xent: 0.34; lr: 0.00010; 5336/6598 tok/s;   8932 sec
[2021-05-15 20:07:17,184 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 20:07:25,583 INFO] Step 23350/50000; acc:  90.40; ppl:  1.40; xent: 0.34; lr: 0.00010; 5470/6951 tok/s;   8951 sec
[2021-05-15 20:07:44,886 INFO] Step 23400/50000; acc:  90.46; ppl:  1.40; xent: 0.34; lr: 0.00010; 5434/6796 tok/s;   8970 sec
[2021-05-15 20:08:02,550 INFO] Step 23450/50000; acc:  89.89; ppl:  1.42; xent: 0.35; lr: 0.00010; 5477/6927 tok/s;   8988 sec
[2021-05-15 20:08:22,039 INFO] Step 23500/50000; acc:  90.23; ppl:  1.41; xent: 0.34; lr: 0.00010; 5303/6602 tok/s;   9007 sec
[2021-05-15 20:08:41,321 INFO] Step 23550/50000; acc:  90.09; ppl:  1.41; xent: 0.35; lr: 0.00010; 5216/6534 tok/s;   9027 sec
[2021-05-15 20:09:00,094 INFO] Step 23600/50000; acc:  90.42; ppl:  1.40; xent: 0.34; lr: 0.00010; 5407/6734 tok/s;   9045 sec
[2021-05-15 20:09:19,486 INFO] Step 23650/50000; acc:  90.12; ppl:  1.41; xent: 0.35; lr: 0.00010; 5219/6480 tok/s;   9065 sec
[2021-05-15 20:09:37,908 INFO] Step 23700/50000; acc:  90.24; ppl:  1.41; xent: 0.34; lr: 0.00010; 5700/6817 tok/s;   9083 sec
[2021-05-15 20:09:57,635 INFO] Step 23750/50000; acc:  90.46; ppl:  1.40; xent: 0.34; lr: 0.00010; 5200/6611 tok/s;   9103 sec
[2021-05-15 20:10:16,863 INFO] Step 23800/50000; acc:  90.29; ppl:  1.40; xent: 0.34; lr: 0.00010; 5159/6512 tok/s;   9122 sec
[2021-05-15 20:10:36,567 INFO] Step 23850/50000; acc:  90.60; ppl:  1.40; xent: 0.33; lr: 0.00010; 5268/6697 tok/s;   9142 sec
[2021-05-15 20:10:55,125 INFO] Step 23900/50000; acc:  89.91; ppl:  1.42; xent: 0.35; lr: 0.00010; 5400/6673 tok/s;   9161 sec
[2021-05-15 20:11:13,957 INFO] Step 23950/50000; acc:  90.58; ppl:  1.39; xent: 0.33; lr: 0.00010; 5485/6762 tok/s;   9179 sec
[2021-05-15 20:11:32,407 INFO] Step 24000/50000; acc:  90.57; ppl:  1.39; xent: 0.33; lr: 0.00010; 5327/6972 tok/s;   9198 sec
[2021-05-15 20:11:50,950 INFO] Step 24050/50000; acc:  90.46; ppl:  1.40; xent: 0.34; lr: 0.00010; 5606/6891 tok/s;   9216 sec
[2021-05-15 20:11:56,051 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 20:12:10,430 INFO] Step 24100/50000; acc:  90.47; ppl:  1.40; xent: 0.33; lr: 0.00010; 5233/6652 tok/s;   9236 sec
[2021-05-15 20:12:28,360 INFO] Step 24150/50000; acc:  90.18; ppl:  1.41; xent: 0.35; lr: 0.00010; 5630/6986 tok/s;   9254 sec
[2021-05-15 20:12:47,473 INFO] Step 24200/50000; acc:  89.98; ppl:  1.42; xent: 0.35; lr: 0.00010; 5432/6760 tok/s;   9273 sec
[2021-05-15 20:13:06,536 INFO] Step 24250/50000; acc:  90.24; ppl:  1.40; xent: 0.34; lr: 0.00010; 5085/6578 tok/s;   9292 sec
[2021-05-15 20:13:25,781 INFO] Step 24300/50000; acc:  90.32; ppl:  1.41; xent: 0.34; lr: 0.00010; 5395/6499 tok/s;   9311 sec
[2021-05-15 20:13:44,357 INFO] Step 24350/50000; acc:  90.24; ppl:  1.40; xent: 0.34; lr: 0.00010; 5390/6824 tok/s;   9330 sec
[2021-05-15 20:14:03,435 INFO] Step 24400/50000; acc:  90.27; ppl:  1.41; xent: 0.34; lr: 0.00010; 5440/6599 tok/s;   9349 sec
[2021-05-15 20:14:22,048 INFO] Step 24450/50000; acc:  90.48; ppl:  1.40; xent: 0.33; lr: 0.00010; 5397/6795 tok/s;   9367 sec
[2021-05-15 20:14:41,472 INFO] Step 24500/50000; acc:  90.45; ppl:  1.40; xent: 0.34; lr: 0.00010; 5362/6657 tok/s;   9387 sec
[2021-05-15 20:15:00,664 INFO] Step 24550/50000; acc:  90.54; ppl:  1.39; xent: 0.33; lr: 0.00010; 5351/6815 tok/s;   9406 sec
[2021-05-15 20:15:19,668 INFO] Step 24600/50000; acc:  90.16; ppl:  1.41; xent: 0.34; lr: 0.00010; 5203/6487 tok/s;   9425 sec
[2021-05-15 20:15:38,269 INFO] Step 24650/50000; acc:  90.29; ppl:  1.40; xent: 0.34; lr: 0.00010; 5605/6808 tok/s;   9444 sec
[2021-05-15 20:15:56,806 INFO] Step 24700/50000; acc:  90.64; ppl:  1.39; xent: 0.33; lr: 0.00010; 5408/6931 tok/s;   9462 sec
[2021-05-15 20:16:15,352 INFO] Step 24750/50000; acc:  90.61; ppl:  1.39; xent: 0.33; lr: 0.00010; 5516/6929 tok/s;   9481 sec
[2021-05-15 20:16:22,215 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 20:16:33,983 INFO] Step 24800/50000; acc:  90.39; ppl:  1.40; xent: 0.34; lr: 0.00010; 5334/6870 tok/s;   9499 sec
[2021-05-15 20:16:53,361 INFO] Step 24850/50000; acc:  90.54; ppl:  1.40; xent: 0.34; lr: 0.00010; 5374/6576 tok/s;   9519 sec
[2021-05-15 20:17:12,023 INFO] Step 24900/50000; acc:  90.34; ppl:  1.40; xent: 0.34; lr: 0.00010; 5433/6850 tok/s;   9537 sec
[2021-05-15 20:17:31,078 INFO] Step 24950/50000; acc:  89.84; ppl:  1.42; xent: 0.35; lr: 0.00010; 5258/6543 tok/s;   9556 sec
[2021-05-15 20:17:50,346 INFO] Step 25000/50000; acc:  90.51; ppl:  1.40; xent: 0.33; lr: 0.00010; 5412/6764 tok/s;   9576 sec
[2021-05-15 20:17:50,347 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/basic/valid.txt, align=None)...
[2021-05-15 20:18:17,959 INFO] Validation perplexity: 1.43709
[2021-05-15 20:18:17,959 INFO] Validation accuracy: 89.9387
[2021-05-15 20:18:17,963 INFO] Saving checkpoint ../models/group2_params/basic_ops/model_step_25000.pt
[2021-05-15 20:18:36,713 INFO] Step 25050/50000; acc:  90.29; ppl:  1.40; xent: 0.34; lr: 0.00010; 2086/2658 tok/s;   9622 sec
[2021-05-15 20:18:56,126 INFO] Step 25100/50000; acc:  90.58; ppl:  1.39; xent: 0.33; lr: 0.00010; 5364/6507 tok/s;   9642 sec
[2021-05-15 20:19:14,512 INFO] Step 25150/50000; acc:  90.18; ppl:  1.41; xent: 0.34; lr: 0.00010; 5581/6823 tok/s;   9660 sec
[2021-05-15 20:19:33,288 INFO] Step 25200/50000; acc:  90.55; ppl:  1.39; xent: 0.33; lr: 0.00010; 5431/6804 tok/s;   9679 sec
[2021-05-15 20:19:52,762 INFO] Step 25250/50000; acc:  90.48; ppl:  1.40; xent: 0.33; lr: 0.00010; 5155/6571 tok/s;   9698 sec
[2021-05-15 20:20:12,274 INFO] Step 25300/50000; acc:  90.57; ppl:  1.40; xent: 0.33; lr: 0.00010; 5326/6685 tok/s;   9718 sec
[2021-05-15 20:20:31,652 INFO] Step 25350/50000; acc:  90.35; ppl:  1.40; xent: 0.34; lr: 0.00010; 5291/6510 tok/s;   9737 sec
[2021-05-15 20:20:50,055 INFO] Step 25400/50000; acc:  90.47; ppl:  1.39; xent: 0.33; lr: 0.00010; 5426/6734 tok/s;   9755 sec
[2021-05-15 20:21:08,982 INFO] Step 25450/50000; acc:  90.92; ppl:  1.38; xent: 0.32; lr: 0.00010; 5459/6971 tok/s;   9774 sec
[2021-05-15 20:21:27,404 INFO] Step 25500/50000; acc:  90.58; ppl:  1.39; xent: 0.33; lr: 0.00010; 5424/6900 tok/s;   9793 sec
[2021-05-15 20:23:01,887 INFO] Step 25750/50000; acc:  90.33; ppl:  1.40; xent: 0.34; lr: 0.00010; 5300/6529 tok/s;   9887 sec
[2021-05-15 20:23:21,578 INFO] Step 25800/50000; acc:  90.50; ppl:  1.39; xent: 0.33; lr: 0.00010; 5256/6585 tok/s;   9907 sec
[2021-05-15 20:23:39,985 INFO] Step 25850/50000; acc:  90.26; ppl:  1.40; xent: 0.34; lr: 0.00010; 5362/6577 tok/s;   9925 sec
[2021-05-15 20:23:59,103 INFO] Step 25900/50000; acc:  90.25; ppl:  1.40; xent: 0.34; lr: 0.00010; 5479/6711 tok/s;   9944 sec
[2021-05-15 20:24:18,535 INFO] Step 25950/50000; acc:  90.69; ppl:  1.38; xent: 0.32; lr: 0.00010; 5180/6534 tok/s;   9964 sec
[2021-05-15 20:24:37,914 INFO] Step 26000/50000; acc:  90.67; ppl:  1.39; xent: 0.33; lr: 0.00010; 5277/6624 tok/s;   9983 sec
[2021-05-15 20:24:56,547 INFO] Step 26050/50000; acc:  90.46; ppl:  1.39; xent: 0.33; lr: 0.00010; 5377/6821 tok/s;  10002 sec
[2021-05-15 20:25:15,900 INFO] Step 26100/50000; acc:  90.42; ppl:  1.40; xent: 0.33; lr: 0.00010; 5397/6482 tok/s;  10021 sec
[2021-05-15 20:25:34,347 INFO] Step 26150/50000; acc:  90.99; ppl:  1.37; xent: 0.32; lr: 0.00010; 5572/7244 tok/s;  10040 sec
[2021-05-15 20:25:52,809 INFO] Step 26200/50000; acc:  90.37; ppl:  1.40; xent: 0.33; lr: 0.00010; 5328/6697 tok/s;  10058 sec
[2021-05-15 20:26:07,696 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 20:26:11,807 INFO] Step 26250/50000; acc:  90.80; ppl:  1.38; xent: 0.32; lr: 0.00010; 5471/6916 tok/s;  10077 sec
[2021-05-15 20:26:30,553 INFO] Step 26300/50000; acc:  90.42; ppl:  1.40; xent: 0.33; lr: 0.00010; 5341/6670 tok/s;  10096 sec
[2021-05-15 20:26:49,301 INFO] Step 26350/50000; acc:  90.33; ppl:  1.40; xent: 0.34; lr: 0.00010; 5489/6832 tok/s;  10115 sec
[2021-05-15 20:27:07,712 INFO] Step 26400/50000; acc:  89.93; ppl:  1.42; xent: 0.35; lr: 0.00010; 5328/6615 tok/s;  10133 sec
[2021-05-15 20:27:27,486 INFO] Step 26450/50000; acc:  90.75; ppl:  1.39; xent: 0.33; lr: 0.00010; 5269/6626 tok/s;  10153 sec
[2021-05-15 20:27:46,571 INFO] Step 26500/50000; acc:  90.42; ppl:  1.39; xent: 0.33; lr: 0.00010; 5306/6611 tok/s;  10172 sec
[2021-05-15 20:28:05,534 INFO] Step 26550/50000; acc:  90.45; ppl:  1.40; xent: 0.33; lr: 0.00010; 5330/6672 tok/s;  10191 sec
[2021-05-15 20:28:24,738 INFO] Step 26600/50000; acc:  90.54; ppl:  1.39; xent: 0.33; lr: 0.00010; 5495/6689 tok/s;  10210 sec
[2021-05-15 20:28:43,770 INFO] Step 26650/50000; acc:  90.17; ppl:  1.40; xent: 0.34; lr: 0.00010; 5144/6427 tok/s;  10229 sec
[2021-05-15 20:29:03,306 INFO] Step 26700/50000; acc:  90.81; ppl:  1.38; xent: 0.32; lr: 0.00010; 5303/6688 tok/s;  10249 sec
[2021-05-15 20:29:22,779 INFO] Step 26750/50000; acc:  90.61; ppl:  1.39; xent: 0.33; lr: 0.00010; 5190/6556 tok/s;  10268 sec
[2021-05-15 20:29:41,782 INFO] Step 26800/50000; acc:  90.39; ppl:  1.40; xent: 0.33; lr: 0.00010; 5363/6715 tok/s;  10287 sec
[2021-05-15 20:30:00,812 INFO] Step 26850/50000; acc:  90.71; ppl:  1.38; xent: 0.32; lr: 0.00010; 5288/6607 tok/s;  10306 sec
[2021-05-15 20:30:19,094 INFO] Step 26900/50000; acc:  90.87; ppl:  1.38; xent: 0.32; lr: 0.00010; 5683/7176 tok/s;  10324 sec
[2021-05-15 20:30:37,874 INFO] Step 26950/50000; acc:  90.42; ppl:  1.39; xent: 0.33; lr: 0.00010; 5420/6653 tok/s;  10343 sec
[2021-05-15 20:30:47,163 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 20:30:56,485 INFO] Step 27000/50000; acc:  90.62; ppl:  1.39; xent: 0.33; lr: 0.00010; 5338/6868 tok/s;  10362 sec
[2021-05-15 20:31:15,788 INFO] Step 27050/50000; acc:  90.81; ppl:  1.38; xent: 0.32; lr: 0.00010; 5425/6811 tok/s;  10381 sec
[2021-05-15 20:31:33,872 INFO] Step 27100/50000; acc:  89.99; ppl:  1.41; xent: 0.35; lr: 0.00010; 5493/6732 tok/s;  10399 sec
[2021-05-15 20:31:53,416 INFO] Step 27150/50000; acc:  90.50; ppl:  1.39; xent: 0.33; lr: 0.00010; 5227/6626 tok/s;  10419 sec
[2021-05-15 20:32:12,825 INFO] Step 27200/50000; acc:  90.38; ppl:  1.40; xent: 0.33; lr: 0.00010; 5092/6413 tok/s;  10438 sec
[2021-05-15 20:32:31,576 INFO] Step 27250/50000; acc:  90.73; ppl:  1.38; xent: 0.32; lr: 0.00010; 5517/6867 tok/s;  10457 sec
[2021-05-15 20:32:51,055 INFO] Step 27300/50000; acc:  90.54; ppl:  1.39; xent: 0.33; lr: 0.00010; 5278/6487 tok/s;  10476 sec
[2021-05-15 20:33:09,223 INFO] Step 27350/50000; acc:  90.39; ppl:  1.39; xent: 0.33; lr: 0.00010; 5615/6801 tok/s;  10495 sec
[2021-05-15 20:33:28,957 INFO] Step 27400/50000; acc:  90.76; ppl:  1.38; xent: 0.32; lr: 0.00010; 5291/6634 tok/s;  10514 sec
[2021-05-15 20:33:47,765 INFO] Step 27450/50000; acc:  90.54; ppl:  1.39; xent: 0.33; lr: 0.00010; 5176/6600 tok/s;  10533 sec
[2021-05-15 20:34:07,271 INFO] Step 27500/50000; acc:  90.81; ppl:  1.38; xent: 0.32; lr: 0.00010; 5328/6765 tok/s;  10553 sec
[2021-05-15 20:34:25,941 INFO] Step 27550/50000; acc:  90.31; ppl:  1.40; xent: 0.33; lr: 0.00010; 5397/6705 tok/s;  10571 sec
[2021-05-15 20:34:44,413 INFO] Step 27600/50000; acc:  90.90; ppl:  1.37; xent: 0.32; lr: 0.00010; 5547/6836 tok/s;  10590 sec
[2021-05-15 20:35:02,699 INFO] Step 27650/50000; acc:  90.69; ppl:  1.38; xent: 0.32; lr: 0.00010; 5441/7028 tok/s;  10608 sec
[2021-05-15 20:35:21,383 INFO] Step 27700/50000; acc:  90.75; ppl:  1.38; xent: 0.32; lr: 0.00010; 5572/6887 tok/s;  10627 sec
[2021-05-15 20:35:25,515 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 20:35:40,666 INFO] Step 27750/50000; acc:  90.77; ppl:  1.38; xent: 0.32; lr: 0.00010; 5320/6730 tok/s;  10646 sec
[2021-05-15 20:35:58,573 INFO] Step 27800/50000; acc:  90.44; ppl:  1.40; xent: 0.33; lr: 0.00010; 5532/6958 tok/s;  10664 sec
[2021-05-15 20:36:17,633 INFO] Step 27850/50000; acc:  90.26; ppl:  1.40; xent: 0.34; lr: 0.00010; 5422/6707 tok/s;  10683 sec
[2021-05-15 20:36:36,954 INFO] Step 27900/50000; acc:  90.58; ppl:  1.38; xent: 0.32; lr: 0.00010; 5173/6606 tok/s;  10702 sec
[2021-05-15 20:36:56,003 INFO] Step 27950/50000; acc:  90.55; ppl:  1.39; xent: 0.33; lr: 0.00010; 5377/6577 tok/s;  10721 sec
[2021-05-15 20:37:14,507 INFO] Step 28000/50000; acc:  90.54; ppl:  1.38; xent: 0.33; lr: 0.00010; 5327/6752 tok/s;  10740 sec
[2021-05-15 20:37:33,601 INFO] Step 28050/50000; acc:  90.54; ppl:  1.39; xent: 0.33; lr: 0.00010; 5543/6666 tok/s;  10759 sec
[2021-05-15 20:37:52,845 INFO] Step 28100/50000; acc:  90.84; ppl:  1.37; xent: 0.32; lr: 0.00010; 5293/6704 tok/s;  10778 sec
[2021-05-15 20:38:12,135 INFO] Step 28150/50000; acc:  90.62; ppl:  1.39; xent: 0.33; lr: 0.00010; 5243/6502 tok/s;  10798 sec
[2021-05-15 20:38:31,470 INFO] Step 28200/50000; acc:  90.90; ppl:  1.38; xent: 0.32; lr: 0.00010; 5406/6872 tok/s;  10817 sec
[2021-05-15 20:38:50,357 INFO] Step 28250/50000; acc:  90.33; ppl:  1.40; xent: 0.34; lr: 0.00010; 5134/6395 tok/s;  10836 sec
[2021-05-15 20:39:09,182 INFO] Step 28300/50000; acc:  90.67; ppl:  1.38; xent: 0.32; lr: 0.00010; 5556/6727 tok/s;  10855 sec
[2021-05-15 20:39:27,632 INFO] Step 28350/50000; acc:  91.05; ppl:  1.37; xent: 0.31; lr: 0.00010; 5452/7087 tok/s;  10873 sec
[2021-05-15 20:39:46,129 INFO] Step 28400/50000; acc:  90.73; ppl:  1.38; xent: 0.32; lr: 0.00010; 5484/6802 tok/s;  10892 sec
[2021-05-15 20:39:52,495 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 20:40:05,192 INFO] Step 28450/50000; acc:  90.67; ppl:  1.38; xent: 0.32; lr: 0.00010; 5292/6831 tok/s;  10911 sec
[2021-05-15 20:40:24,275 INFO] Step 28500/50000; acc:  90.73; ppl:  1.38; xent: 0.32; lr: 0.00010; 5450/6625 tok/s;  10930 sec
[2021-05-15 20:40:42,718 INFO] Step 28550/50000; acc:  90.63; ppl:  1.39; xent: 0.33; lr: 0.00010; 5522/6936 tok/s;  10948 sec
[2021-05-15 20:41:01,627 INFO] Step 28600/50000; acc:  90.18; ppl:  1.40; xent: 0.34; lr: 0.00010; 5215/6589 tok/s;  10967 sec
[2021-05-15 20:41:20,782 INFO] Step 28650/50000; acc:  90.82; ppl:  1.38; xent: 0.32; lr: 0.00010; 5426/6777 tok/s;  10986 sec
[2021-05-15 20:41:39,966 INFO] Step 28700/50000; acc:  90.62; ppl:  1.38; xent: 0.32; lr: 0.00010; 5186/6556 tok/s;  11005 sec
[2021-05-15 20:41:58,872 INFO] Step 28750/50000; acc:  90.74; ppl:  1.38; xent: 0.32; lr: 0.00010; 5459/6625 tok/s;  11024 sec
[2021-05-15 20:42:17,271 INFO] Step 28800/50000; acc:  90.47; ppl:  1.39; xent: 0.33; lr: 0.00010; 5478/6763 tok/s;  11043 sec
[2021-05-15 20:42:36,249 INFO] Step 28850/50000; acc:  90.84; ppl:  1.37; xent: 0.32; lr: 0.00010; 5471/6867 tok/s;  11062 sec
[2021-05-15 20:42:55,978 INFO] Step 28900/50000; acc:  90.88; ppl:  1.37; xent: 0.32; lr: 0.00010; 5165/6523 tok/s;  11081 sec
[2021-05-15 20:43:15,046 INFO] Step 28950/50000; acc:  90.59; ppl:  1.39; xent: 0.33; lr: 0.00010; 5288/6672 tok/s;  11100 sec
[2021-05-15 20:43:34,433 INFO] Step 29000/50000; acc:  90.55; ppl:  1.38; xent: 0.33; lr: 0.00010; 5391/6572 tok/s;  11120 sec
[2021-05-15 20:43:52,691 INFO] Step 29050/50000; acc:  90.82; ppl:  1.37; xent: 0.32; lr: 0.00010; 5371/6744 tok/s;  11138 sec
[2021-05-15 20:44:11,894 INFO] Step 29100/50000; acc:  91.17; ppl:  1.36; xent: 0.31; lr: 0.00010; 5376/6910 tok/s;  11157 sec
[2021-05-15 20:44:30,131 INFO] Step 29150/50000; acc:  90.65; ppl:  1.38; xent: 0.32; lr: 0.00010; 5513/6930 tok/s;  11176 sec
[2021-05-15 20:44:30,803 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 20:44:49,011 INFO] Step 29200/50000; acc:  90.68; ppl:  1.38; xent: 0.32; lr: 0.00010; 5406/6773 tok/s;  11194 sec
[2021-05-15 20:45:07,586 INFO] Step 29250/50000; acc:  90.82; ppl:  1.38; xent: 0.32; lr: 0.00010; 5422/6790 tok/s;  11213 sec
[2021-05-15 20:45:26,823 INFO] Step 29300/50000; acc:  90.35; ppl:  1.40; xent: 0.34; lr: 0.00010; 5348/6606 tok/s;  11232 sec
[2021-05-15 20:45:46,103 INFO] Step 29350/50000; acc:  90.74; ppl:  1.38; xent: 0.32; lr: 0.00010; 5308/6769 tok/s;  11251 sec
[2021-05-15 20:46:04,960 INFO] Step 29400/50000; acc:  90.57; ppl:  1.39; xent: 0.33; lr: 0.00010; 5255/6459 tok/s;  11270 sec
[2021-05-15 20:46:24,880 INFO] Step 29450/50000; acc:  90.84; ppl:  1.37; xent: 0.32; lr: 0.00010; 5177/6524 tok/s;  11290 sec
[2021-05-15 20:46:43,477 INFO] Step 29500/50000; acc:  90.60; ppl:  1.38; xent: 0.33; lr: 0.00010; 5459/6612 tok/s;  11309 sec
[2021-05-15 20:47:02,460 INFO] Step 29550/50000; acc:  90.52; ppl:  1.39; xent: 0.33; lr: 0.00010; 5454/6725 tok/s;  11328 sec
[2021-05-15 20:47:21,861 INFO] Step 29600/50000; acc:  90.93; ppl:  1.37; xent: 0.31; lr: 0.00010; 5084/6470 tok/s;  11347 sec
[2021-05-15 20:47:41,460 INFO] Step 29650/50000; acc:  90.88; ppl:  1.38; xent: 0.32; lr: 0.00010; 5327/6660 tok/s;  11367 sec
[2021-05-15 20:48:00,709 INFO] Step 29700/50000; acc:  90.84; ppl:  1.37; xent: 0.32; lr: 0.00010; 5283/6734 tok/s;  11386 sec
[2021-05-15 20:48:19,601 INFO] Step 29750/50000; acc:  90.60; ppl:  1.38; xent: 0.32; lr: 0.00010; 5371/6467 tok/s;  11405 sec
[2021-05-15 20:48:38,394 INFO] Step 29800/50000; acc:  91.11; ppl:  1.36; xent: 0.31; lr: 0.00010; 5556/7129 tok/s;  11424 sec
[2021-05-15 20:48:56,401 INFO] Step 29850/50000; acc:  90.67; ppl:  1.38; xent: 0.32; lr: 0.00010; 5363/6801 tok/s;  11442 sec
[2021-05-15 20:49:10,574 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 20:49:15,281 INFO] Step 29900/50000; acc:  91.08; ppl:  1.36; xent: 0.31; lr: 0.00010; 5525/6966 tok/s;  11461 sec
[2021-05-15 20:49:34,159 INFO] Step 29950/50000; acc:  90.71; ppl:  1.38; xent: 0.32; lr: 0.00010; 5329/6643 tok/s;  11480 sec
[2021-05-15 20:49:52,887 INFO] Step 30000/50000; acc:  90.60; ppl:  1.38; xent: 0.32; lr: 0.00010; 5448/6837 tok/s;  11498 sec
[2021-05-15 20:49:52,888 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/basic/valid.txt, align=None)...
[2021-05-15 20:50:20,592 INFO] Validation perplexity: 1.44115
[2021-05-15 20:50:20,593 INFO] Validation accuracy: 89.9022
[2021-05-15 20:50:20,597 INFO] Saving checkpoint ../models/group2_params/basic_ops/model_step_30000.pt
[2021-05-15 20:50:39,437 INFO] Step 30050/50000; acc:  90.25; ppl:  1.40; xent: 0.34; lr: 0.00010; 2139/2631 tok/s;  11545 sec
[2021-05-15 20:50:58,979 INFO] Step 30100/50000; acc:  91.02; ppl:  1.37; xent: 0.31; lr: 0.00010; 5316/6727 tok/s;  11564 sec
[2021-05-15 20:51:18,186 INFO] Step 30150/50000; acc:  90.77; ppl:  1.38; xent: 0.32; lr: 0.00010; 5308/6574 tok/s;  11584 sec
[2021-05-15 20:51:37,065 INFO] Step 30200/50000; acc:  90.66; ppl:  1.38; xent: 0.32; lr: 0.00010; 5275/6615 tok/s;  11602 sec
[2021-05-15 20:51:56,211 INFO] Step 30250/50000; acc:  90.82; ppl:  1.37; xent: 0.32; lr: 0.00010; 5489/6669 tok/s;  11622 sec
[2021-05-15 20:52:15,539 INFO] Step 30300/50000; acc:  90.53; ppl:  1.38; xent: 0.33; lr: 0.00010; 5205/6479 tok/s;  11641 sec
[2021-05-15 20:52:35,073 INFO] Step 30350/50000; acc:  91.13; ppl:  1.36; xent: 0.31; lr: 0.00010; 5242/6672 tok/s;  11660 sec
[2021-05-15 20:52:53,880 INFO] Step 30400/50000; acc:  90.76; ppl:  1.37; xent: 0.32; lr: 0.00010; 5276/6711 tok/s;  11679 sec
[2021-05-15 20:53:12,886 INFO] Step 30450/50000; acc:  90.77; ppl:  1.38; xent: 0.32; lr: 0.00010; 5470/6723 tok/s;  11698 sec
[2021-05-15 20:53:32,078 INFO] Step 30500/50000; acc:  91.13; ppl:  1.36; xent: 0.30; lr: 0.00010; 5323/6646 tok/s;  11717 sec
[2021-05-15 20:53:50,103 INFO] Step 30550/50000; acc:  90.87; ppl:  1.37; xent: 0.32; lr: 0.00010; 5596/7158 tok/s;  11735 sec
[2021-05-15 20:54:09,186 INFO] Step 30600/50000; acc:  90.83; ppl:  1.37; xent: 0.32; lr: 0.00010; 5433/6657 tok/s;  11755 sec
[2021-05-15 20:54:17,739 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 20:54:27,646 INFO] Step 30650/50000; acc:  90.88; ppl:  1.37; xent: 0.31; lr: 0.00010; 5279/6852 tok/s;  11773 sec
[2021-05-15 20:54:46,926 INFO] Step 30700/50000; acc:  90.98; ppl:  1.37; xent: 0.31; lr: 0.00010; 5434/6745 tok/s;  11792 sec
[2021-05-15 20:55:04,919 INFO] Step 30750/50000; acc:  90.20; ppl:  1.40; xent: 0.34; lr: 0.00010; 5557/6824 tok/s;  11810 sec
[2021-05-15 20:55:24,367 INFO] Step 30800/50000; acc:  90.90; ppl:  1.37; xent: 0.31; lr: 0.00010; 5211/6642 tok/s;  11830 sec
[2021-05-15 20:55:43,887 INFO] Step 30850/50000; acc:  90.67; ppl:  1.38; xent: 0.32; lr: 0.00010; 5134/6412 tok/s;  11849 sec
[2021-05-15 20:56:02,892 INFO] Step 30900/50000; acc:  90.93; ppl:  1.37; xent: 0.32; lr: 0.00010; 5437/6717 tok/s;  11868 sec
[2021-05-15 20:56:22,300 INFO] Step 30950/50000; acc:  90.85; ppl:  1.37; xent: 0.32; lr: 0.00010; 5336/6593 tok/s;  11888 sec
[2021-05-15 20:56:40,440 INFO] Step 31000/50000; acc:  90.62; ppl:  1.38; xent: 0.32; lr: 0.00010; 5522/6713 tok/s;  11906 sec
[2021-05-15 20:56:59,986 INFO] Step 31050/50000; acc:  91.12; ppl:  1.36; xent: 0.31; lr: 0.00010; 5310/6724 tok/s;  11925 sec
[2021-05-15 20:57:18,934 INFO] Step 31100/50000; acc:  90.95; ppl:  1.37; xent: 0.31; lr: 0.00010; 5298/6675 tok/s;  11944 sec
[2021-05-15 20:57:38,218 INFO] Step 31150/50000; acc:  90.94; ppl:  1.37; xent: 0.32; lr: 0.00010; 5326/6744 tok/s;  11964 sec
[2021-05-15 20:57:56,443 INFO] Step 31200/50000; acc:  90.64; ppl:  1.38; xent: 0.32; lr: 0.00010; 5430/6763 tok/s;  11982 sec
[2021-05-15 20:58:15,158 INFO] Step 31250/50000; acc:  91.23; ppl:  1.35; xent: 0.30; lr: 0.00010; 5584/6937 tok/s;  12001 sec
[2021-05-15 20:58:33,976 INFO] Step 31300/50000; acc:  91.08; ppl:  1.36; xent: 0.31; lr: 0.00010; 5369/6876 tok/s;  12019 sec
[2021-05-15 20:58:52,121 INFO] Step 31350/50000; acc:  90.88; ppl:  1.37; xent: 0.32; lr: 0.00010; 5571/6950 tok/s;  12038 sec
[2021-05-15 20:58:55,594 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 20:59:11,493 INFO] Step 31400/50000; acc:  91.03; ppl:  1.36; xent: 0.31; lr: 0.00010; 5396/6726 tok/s;  12057 sec
[2021-05-15 20:59:29,473 INFO] Step 31450/50000; acc:  90.64; ppl:  1.38; xent: 0.32; lr: 0.00010; 5401/6874 tok/s;  12075 sec
[2021-05-15 20:59:48,298 INFO] Step 31500/50000; acc:  90.58; ppl:  1.39; xent: 0.33; lr: 0.00010; 5498/6820 tok/s;  12094 sec
[2021-05-15 21:00:07,563 INFO] Step 31550/50000; acc:  90.90; ppl:  1.37; xent: 0.31; lr: 0.00010; 5214/6592 tok/s;  12113 sec
[2021-05-15 21:00:26,678 INFO] Step 31600/50000; acc:  90.75; ppl:  1.37; xent: 0.32; lr: 0.00010; 5316/6564 tok/s;  12132 sec
[2021-05-15 21:00:45,813 INFO] Step 31650/50000; acc:  90.94; ppl:  1.37; xent: 0.31; lr: 0.00010; 5230/6574 tok/s;  12151 sec
[2021-05-15 21:01:04,775 INFO] Step 31700/50000; acc:  90.83; ppl:  1.37; xent: 0.32; lr: 0.00010; 5577/6667 tok/s;  12170 sec
[2021-05-15 21:01:24,045 INFO] Step 31750/50000; acc:  91.03; ppl:  1.36; xent: 0.31; lr: 0.00010; 5311/6698 tok/s;  12189 sec
[2021-05-15 21:01:43,302 INFO] Step 31800/50000; acc:  90.91; ppl:  1.37; xent: 0.31; lr: 0.00010; 5167/6547 tok/s;  12209 sec
[2021-05-15 21:02:03,009 INFO] Step 31850/50000; acc:  91.10; ppl:  1.36; xent: 0.31; lr: 0.00010; 5282/6685 tok/s;  12228 sec
[2021-05-15 21:02:21,763 INFO] Step 31900/50000; acc:  90.57; ppl:  1.38; xent: 0.32; lr: 0.00010; 5320/6528 tok/s;  12247 sec
[2021-05-15 21:02:40,319 INFO] Step 31950/50000; acc:  90.98; ppl:  1.36; xent: 0.31; lr: 0.00010; 5579/6803 tok/s;  12266 sec
[2021-05-15 21:02:58,828 INFO] Step 32000/50000; acc:  91.31; ppl:  1.35; xent: 0.30; lr: 0.00010; 5335/6969 tok/s;  12284 sec
[2021-05-15 21:03:17,441 INFO] Step 32050/50000; acc:  90.98; ppl:  1.37; xent: 0.31; lr: 0.00010; 5562/6927 tok/s;  12303 sec
[2021-05-15 21:03:22,951 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 21:03:36,642 INFO] Step 32100/50000; acc:  90.97; ppl:  1.36; xent: 0.31; lr: 0.00010; 5324/6832 tok/s;  12322 sec
[2021-05-15 21:03:55,229 INFO] Step 32150/50000; acc:  90.96; ppl:  1.37; xent: 0.31; lr: 0.00010; 5439/6678 tok/s;  12341 sec
[2021-05-15 21:04:13,725 INFO] Step 32200/50000; acc:  90.80; ppl:  1.38; xent: 0.32; lr: 0.00010; 5595/6906 tok/s;  12359 sec
[2021-05-15 21:04:32,913 INFO] Step 32250/50000; acc:  90.53; ppl:  1.38; xent: 0.32; lr: 0.00010; 5056/6538 tok/s;  12378 sec
[2021-05-15 21:04:52,202 INFO] Step 32300/50000; acc:  91.11; ppl:  1.36; xent: 0.31; lr: 0.00010; 5392/6647 tok/s;  12398 sec
[2021-05-15 21:05:11,086 INFO] Step 32350/50000; acc:  90.96; ppl:  1.36; xent: 0.31; lr: 0.00010; 5287/6734 tok/s;  12416 sec
[2021-05-15 21:05:30,123 INFO] Step 32400/50000; acc:  90.91; ppl:  1.37; xent: 0.31; lr: 0.00010; 5394/6501 tok/s;  12436 sec
[2021-05-15 21:05:48,542 INFO] Step 32450/50000; acc:  90.74; ppl:  1.37; xent: 0.32; lr: 0.00010; 5540/6835 tok/s;  12454 sec
[2021-05-15 21:06:07,625 INFO] Step 32500/50000; acc:  91.19; ppl:  1.36; xent: 0.30; lr: 0.00010; 5430/6754 tok/s;  12473 sec
[2021-05-15 21:06:27,083 INFO] Step 32550/50000; acc:  91.08; ppl:  1.36; xent: 0.31; lr: 0.00010; 5275/6670 tok/s;  12492 sec
[2021-05-15 21:06:45,947 INFO] Step 32600/50000; acc:  90.95; ppl:  1.37; xent: 0.31; lr: 0.00010; 5255/6707 tok/s;  12511 sec
[2021-05-15 21:07:05,287 INFO] Step 32650/50000; acc:  90.89; ppl:  1.37; xent: 0.31; lr: 0.00010; 5385/6514 tok/s;  12531 sec
[2021-05-15 21:07:23,668 INFO] Step 32700/50000; acc:  91.22; ppl:  1.35; xent: 0.30; lr: 0.00010; 5479/6952 tok/s;  12549 sec
[2021-05-15 21:07:42,594 INFO] Step 32750/50000; acc:  91.24; ppl:  1.35; xent: 0.30; lr: 0.00010; 5398/6847 tok/s;  12568 sec
[2021-05-15 21:08:00,795 INFO] Step 32800/50000; acc:  90.96; ppl:  1.36; xent: 0.31; lr: 0.00010; 5435/6925 tok/s;  12586 sec
[2021-05-15 21:08:00,808 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 21:08:19,795 INFO] Step 32850/50000; acc:  90.96; ppl:  1.37; xent: 0.31; lr: 0.00010; 5468/6740 tok/s;  12605 sec
[2021-05-15 21:08:39,139 INFO] Step 32900/50000; acc:  91.12; ppl:  1.36; xent: 0.31; lr: 0.00010; 5285/6672 tok/s;  12625 sec
[2021-05-15 21:08:57,827 INFO] Step 32950/50000; acc:  90.38; ppl:  1.39; xent: 0.33; lr: 0.00010; 5344/6576 tok/s;  12643 sec
[2021-05-15 21:09:17,361 INFO] Step 33000/50000; acc:  91.05; ppl:  1.36; xent: 0.31; lr: 0.00010; 5324/6780 tok/s;  12663 sec
[2021-05-15 21:09:35,981 INFO] Step 33050/50000; acc:  90.90; ppl:  1.37; xent: 0.31; lr: 0.00010; 5229/6522 tok/s;  12681 sec
[2021-05-15 21:09:55,807 INFO] Step 33100/50000; acc:  91.10; ppl:  1.36; xent: 0.31; lr: 0.00010; 5222/6544 tok/s;  12701 sec
[2021-05-15 21:10:14,375 INFO] Step 33150/50000; acc:  90.86; ppl:  1.37; xent: 0.31; lr: 0.00010; 5493/6629 tok/s;  12720 sec
[2021-05-15 21:10:33,439 INFO] Step 33200/50000; acc:  90.83; ppl:  1.37; xent: 0.31; lr: 0.00010; 5385/6667 tok/s;  12739 sec
[2021-05-15 21:10:52,569 INFO] Step 33250/50000; acc:  91.15; ppl:  1.36; xent: 0.30; lr: 0.00010; 5220/6588 tok/s;  12758 sec
[2021-05-15 21:11:12,509 INFO] Step 33300/50000; acc:  91.10; ppl:  1.36; xent: 0.31; lr: 0.00010; 5237/6556 tok/s;  12778 sec
[2021-05-15 21:11:31,900 INFO] Step 33350/50000; acc:  91.23; ppl:  1.35; xent: 0.30; lr: 0.00010; 5278/6715 tok/s;  12797 sec
[2021-05-15 21:11:50,553 INFO] Step 33400/50000; acc:  90.82; ppl:  1.37; xent: 0.31; lr: 0.00010; 5350/6485 tok/s;  12816 sec
[2021-05-15 21:12:09,003 INFO] Step 33450/50000; acc:  91.46; ppl:  1.34; xent: 0.29; lr: 0.00010; 5630/7270 tok/s;  12834 sec
[2021-05-15 21:12:27,347 INFO] Step 33500/50000; acc:  90.91; ppl:  1.37; xent: 0.31; lr: 0.00010; 5413/6734 tok/s;  12853 sec
[2021-05-15 21:12:40,750 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 21:12:46,230 INFO] Step 33550/50000; acc:  91.25; ppl:  1.35; xent: 0.30; lr: 0.00010; 5463/6944 tok/s;  12872 sec
[2021-05-15 21:13:05,148 INFO] Step 33600/50000; acc:  91.04; ppl:  1.36; xent: 0.31; lr: 0.00010; 5236/6604 tok/s;  12891 sec
[2021-05-15 21:13:23,641 INFO] Step 33650/50000; acc:  90.86; ppl:  1.37; xent: 0.31; lr: 0.00010; 5614/6950 tok/s;  12909 sec
[2021-05-15 21:13:42,732 INFO] Step 33700/50000; acc:  90.66; ppl:  1.37; xent: 0.32; lr: 0.00010; 5301/6610 tok/s;  12928 sec
[2021-05-15 21:14:02,046 INFO] Step 33750/50000; acc:  91.04; ppl:  1.36; xent: 0.31; lr: 0.00010; 5216/6549 tok/s;  12947 sec
[2021-05-15 21:14:21,366 INFO] Step 33800/50000; acc:  91.02; ppl:  1.36; xent: 0.31; lr: 0.00010; 5376/6632 tok/s;  12967 sec
[2021-05-15 21:14:39,947 INFO] Step 33850/50000; acc:  90.94; ppl:  1.36; xent: 0.31; lr: 0.00010; 5272/6704 tok/s;  12985 sec
[2021-05-15 21:14:58,838 INFO] Step 33900/50000; acc:  90.92; ppl:  1.36; xent: 0.31; lr: 0.00010; 5560/6675 tok/s;  13004 sec
[2021-05-15 21:15:17,898 INFO] Step 33950/50000; acc:  90.92; ppl:  1.37; xent: 0.31; lr: 0.00010; 5307/6617 tok/s;  13023 sec
[2021-05-15 21:15:37,763 INFO] Step 34000/50000; acc:  91.28; ppl:  1.35; xent: 0.30; lr: 0.00010; 5115/6493 tok/s;  13043 sec
[2021-05-15 21:15:56,805 INFO] Step 34050/50000; acc:  91.27; ppl:  1.35; xent: 0.30; lr: 0.00010; 5276/6809 tok/s;  13062 sec
[2021-05-15 21:16:16,013 INFO] Step 34100/50000; acc:  90.85; ppl:  1.37; xent: 0.31; lr: 0.00010; 5417/6552 tok/s;  13081 sec
[2021-05-15 21:16:34,756 INFO] Step 34150/50000; acc:  91.43; ppl:  1.34; xent: 0.29; lr: 0.00010; 5482/6848 tok/s;  13100 sec
[2021-05-15 21:16:52,533 INFO] Step 34200/50000; acc:  91.15; ppl:  1.35; xent: 0.30; lr: 0.00010; 5571/7190 tok/s;  13118 sec
[2021-05-15 21:17:11,427 INFO] Step 34250/50000; acc:  91.10; ppl:  1.36; xent: 0.30; lr: 0.00010; 5469/6714 tok/s;  13137 sec
[2021-05-15 21:17:19,328 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 21:17:30,266 INFO] Step 34300/50000; acc:  91.16; ppl:  1.35; xent: 0.30; lr: 0.00010; 5324/6769 tok/s;  13156 sec
[2021-05-15 21:17:49,233 INFO] Step 34350/50000; acc:  91.32; ppl:  1.35; xent: 0.30; lr: 0.00010; 5453/6914 tok/s;  13175 sec
[2021-05-15 21:18:07,106 INFO] Step 34400/50000; acc:  90.38; ppl:  1.39; xent: 0.33; lr: 0.00010; 5505/6730 tok/s;  13193 sec
[2021-05-15 21:18:26,625 INFO] Step 34450/50000; acc:  91.16; ppl:  1.35; xent: 0.30; lr: 0.00010; 5294/6675 tok/s;  13212 sec
[2021-05-15 21:18:46,191 INFO] Step 34500/50000; acc:  91.02; ppl:  1.36; xent: 0.31; lr: 0.00010; 5194/6514 tok/s;  13232 sec
[2021-05-15 21:19:04,904 INFO] Step 34550/50000; acc:  91.06; ppl:  1.36; xent: 0.31; lr: 0.00010; 5365/6698 tok/s;  13250 sec
[2021-05-15 21:19:24,556 INFO] Step 34600/50000; acc:  91.11; ppl:  1.36; xent: 0.30; lr: 0.00010; 5366/6522 tok/s;  13270 sec
[2021-05-15 21:19:42,753 INFO] Step 34650/50000; acc:  90.98; ppl:  1.36; xent: 0.31; lr: 0.00010; 5394/6703 tok/s;  13288 sec
[2021-05-15 21:20:02,330 INFO] Step 34700/50000; acc:  91.35; ppl:  1.35; xent: 0.30; lr: 0.00010; 5318/6672 tok/s;  13308 sec
[2021-05-15 21:20:21,154 INFO] Step 34750/50000; acc:  91.13; ppl:  1.35; xent: 0.30; lr: 0.00010; 5359/6759 tok/s;  13327 sec
[2021-05-15 21:20:40,553 INFO] Step 34800/50000; acc:  91.15; ppl:  1.36; xent: 0.31; lr: 0.00010; 5248/6595 tok/s;  13346 sec
[2021-05-15 21:20:59,392 INFO] Step 34850/50000; acc:  90.91; ppl:  1.36; xent: 0.31; lr: 0.00010; 5327/6660 tok/s;  13365 sec
[2021-05-15 21:21:17,929 INFO] Step 34900/50000; acc:  91.42; ppl:  1.34; xent: 0.29; lr: 0.00010; 5627/6972 tok/s;  13383 sec
[2021-05-15 21:21:36,749 INFO] Step 34950/50000; acc:  91.32; ppl:  1.35; xent: 0.30; lr: 0.00010; 5413/6917 tok/s;  13402 sec
[2021-05-15 21:21:55,288 INFO] Step 35000/50000; acc:  91.22; ppl:  1.35; xent: 0.30; lr: 0.00010; 5358/6779 tok/s;  13421 sec
[2021-05-15 21:21:55,290 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/basic/valid.txt, align=None)...
[2021-05-15 21:22:22,919 INFO] Validation perplexity: 1.43821
[2021-05-15 21:22:22,919 INFO] Validation accuracy: 90.0281
[2021-05-15 21:22:22,923 INFO] Saving checkpoint ../models/group2_params/basic_ops/model_step_35000.pt
[2021-05-15 21:22:25,874 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 21:22:42,337 INFO] Step 35050/50000; acc:  91.19; ppl:  1.35; xent: 0.30; lr: 0.00010; 2213/2740 tok/s;  13468 sec
[2021-05-15 21:23:00,568 INFO] Step 35100/50000; acc:  90.92; ppl:  1.36; xent: 0.31; lr: 0.00010; 5474/6898 tok/s;  13486 sec
[2021-05-15 21:23:19,814 INFO] Step 35150/50000; acc:  90.83; ppl:  1.37; xent: 0.31; lr: 0.00010; 5308/6684 tok/s;  13505 sec
[2021-05-15 21:23:38,639 INFO] Step 35200/50000; acc:  91.14; ppl:  1.35; xent: 0.30; lr: 0.00010; 5257/6613 tok/s;  13524 sec
[2021-05-15 21:23:57,836 INFO] Step 35250/50000; acc:  91.15; ppl:  1.36; xent: 0.31; lr: 0.00010; 5392/6628 tok/s;  13543 sec
[2021-05-15 21:24:17,116 INFO] Step 35300/50000; acc:  91.19; ppl:  1.35; xent: 0.30; lr: 0.00010; 5269/6592 tok/s;  13563 sec
[2021-05-15 21:24:35,737 INFO] Step 35350/50000; acc:  90.95; ppl:  1.36; xent: 0.31; lr: 0.00010; 5524/6651 tok/s;  13581 sec
[2021-05-15 21:24:54,925 INFO] Step 35400/50000; acc:  91.34; ppl:  1.35; xent: 0.30; lr: 0.00010; 5426/6806 tok/s;  13600 sec
[2021-05-15 21:25:14,043 INFO] Step 35450/50000; acc:  91.08; ppl:  1.36; xent: 0.30; lr: 0.00010; 5105/6507 tok/s;  13619 sec
[2021-05-15 21:25:33,719 INFO] Step 35500/50000; acc:  91.32; ppl:  1.35; xent: 0.30; lr: 0.00010; 5301/6688 tok/s;  13639 sec
[2021-05-15 21:25:52,101 INFO] Step 35550/50000; acc:  91.01; ppl:  1.36; xent: 0.31; lr: 0.00010; 5454/6716 tok/s;  13657 sec
[2021-05-15 21:26:10,898 INFO] Step 35600/50000; acc:  91.28; ppl:  1.35; xent: 0.30; lr: 0.00010; 5471/6705 tok/s;  13676 sec
[2021-05-15 21:26:29,749 INFO] Step 35650/50000; acc:  91.54; ppl:  1.34; xent: 0.29; lr: 0.00010; 5296/6926 tok/s;  13695 sec
[2021-05-15 21:26:48,569 INFO] Step 35700/50000; acc:  91.32; ppl:  1.35; xent: 0.30; lr: 0.00010; 5499/6828 tok/s;  13714 sec
[2021-05-15 21:26:53,061 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 21:27:07,548 INFO] Step 35750/50000; acc:  91.23; ppl:  1.35; xent: 0.30; lr: 0.00010; 5421/6871 tok/s;  13733 sec
[2021-05-15 21:27:26,275 INFO] Step 35800/50000; acc:  91.17; ppl:  1.35; xent: 0.30; lr: 0.00010; 5307/6592 tok/s;  13752 sec
[2021-05-15 21:27:45,073 INFO] Step 35850/50000; acc:  91.02; ppl:  1.36; xent: 0.31; lr: 0.00010; 5490/6766 tok/s;  13770 sec
[2021-05-15 21:28:04,229 INFO] Step 35900/50000; acc:  90.81; ppl:  1.37; xent: 0.31; lr: 0.00010; 5211/6577 tok/s;  13790 sec
[2021-05-15 21:28:23,423 INFO] Step 35950/50000; acc:  91.32; ppl:  1.35; xent: 0.30; lr: 0.00010; 5352/6723 tok/s;  13809 sec
[2021-05-15 21:28:42,227 INFO] Step 36000/50000; acc:  91.19; ppl:  1.35; xent: 0.30; lr: 0.00010; 5215/6656 tok/s;  13828 sec
[2021-05-15 21:29:01,384 INFO] Step 36050/50000; acc:  91.22; ppl:  1.35; xent: 0.30; lr: 0.00010; 5475/6559 tok/s;  13847 sec
[2021-05-15 21:29:20,196 INFO] Step 36100/50000; acc:  91.07; ppl:  1.35; xent: 0.30; lr: 0.00010; 5492/6781 tok/s;  13866 sec
[2021-05-15 21:29:39,149 INFO] Step 36150/50000; acc:  91.29; ppl:  1.34; xent: 0.30; lr: 0.00010; 5312/6708 tok/s;  13885 sec
[2021-05-15 21:29:58,503 INFO] Step 36200/50000; acc:  91.26; ppl:  1.35; xent: 0.30; lr: 0.00010; 5397/6717 tok/s;  13904 sec
[2021-05-15 21:30:16,929 INFO] Step 36250/50000; acc:  91.20; ppl:  1.35; xent: 0.30; lr: 0.00010; 5283/6790 tok/s;  13922 sec
[2021-05-15 21:30:36,460 INFO] Step 36300/50000; acc:  91.09; ppl:  1.36; xent: 0.30; lr: 0.00010; 5339/6431 tok/s;  13942 sec
[2021-05-15 21:30:54,970 INFO] Step 36350/50000; acc:  91.51; ppl:  1.33; xent: 0.29; lr: 0.00010; 5474/6990 tok/s;  13960 sec
[2021-05-15 21:31:13,807 INFO] Step 36400/50000; acc:  91.44; ppl:  1.34; xent: 0.29; lr: 0.00010; 5370/6815 tok/s;  13979 sec
[2021-05-15 21:31:31,625 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 21:31:32,533 INFO] Step 36450/50000; acc:  91.29; ppl:  1.34; xent: 0.30; lr: 0.00010; 5350/6824 tok/s;  13998 sec
[2021-05-15 21:31:51,500 INFO] Step 36500/50000; acc:  91.32; ppl:  1.35; xent: 0.30; lr: 0.00010; 5478/6798 tok/s;  14017 sec
[2021-05-15 21:32:10,763 INFO] Step 36550/50000; acc:  91.25; ppl:  1.35; xent: 0.30; lr: 0.00010; 5345/6679 tok/s;  14036 sec
[2021-05-15 21:32:29,204 INFO] Step 36600/50000; acc:  90.66; ppl:  1.37; xent: 0.32; lr: 0.00010; 5323/6591 tok/s;  14055 sec
[2021-05-15 21:32:48,902 INFO] Step 36650/50000; acc:  91.45; ppl:  1.34; xent: 0.29; lr: 0.00010; 5264/6687 tok/s;  14074 sec
[2021-05-15 21:33:07,490 INFO] Step 36700/50000; acc:  91.04; ppl:  1.36; xent: 0.31; lr: 0.00010; 5383/6695 tok/s;  14093 sec
[2021-05-15 21:33:27,100 INFO] Step 36750/50000; acc:  91.37; ppl:  1.34; xent: 0.29; lr: 0.00010; 5224/6547 tok/s;  14112 sec
[2021-05-15 21:33:45,540 INFO] Step 36800/50000; acc:  91.11; ppl:  1.35; xent: 0.30; lr: 0.00010; 5436/6557 tok/s;  14131 sec
[2021-05-15 21:34:04,681 INFO] Step 36850/50000; acc:  91.16; ppl:  1.35; xent: 0.30; lr: 0.00010; 5472/6708 tok/s;  14150 sec
[2021-05-15 21:34:24,562 INFO] Step 36900/50000; acc:  91.58; ppl:  1.33; xent: 0.29; lr: 0.00010; 5087/6514 tok/s;  14170 sec
[2021-05-15 21:34:43,983 INFO] Step 36950/50000; acc:  91.11; ppl:  1.36; xent: 0.30; lr: 0.00010; 5227/6522 tok/s;  14189 sec
[2021-05-15 21:35:03,617 INFO] Step 37000/50000; acc:  91.34; ppl:  1.34; xent: 0.29; lr: 0.00010; 5299/6673 tok/s;  14209 sec
[2021-05-15 21:35:22,200 INFO] Step 37050/50000; acc:  91.27; ppl:  1.35; xent: 0.30; lr: 0.00010; 5266/6507 tok/s;  14228 sec
[2021-05-15 21:35:41,122 INFO] Step 37100/50000; acc:  91.73; ppl:  1.33; xent: 0.28; lr: 0.00010; 5504/7118 tok/s;  14247 sec
[2021-05-15 21:35:59,243 INFO] Step 37150/50000; acc:  91.04; ppl:  1.35; xent: 0.30; lr: 0.00010; 5510/6803 tok/s;  14265 sec
[2021-05-15 21:36:11,888 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 21:36:17,956 INFO] Step 37200/50000; acc:  91.49; ppl:  1.34; xent: 0.29; lr: 0.00010; 5474/6930 tok/s;  14283 sec
[2021-05-15 21:36:37,107 INFO] Step 37250/50000; acc:  91.25; ppl:  1.35; xent: 0.30; lr: 0.00010; 5248/6605 tok/s;  14303 sec
[2021-05-15 21:36:55,537 INFO] Step 37300/50000; acc:  91.07; ppl:  1.36; xent: 0.31; lr: 0.00010; 5619/6979 tok/s;  14321 sec
[2021-05-15 21:37:14,857 INFO] Step 37350/50000; acc:  91.25; ppl:  1.35; xent: 0.30; lr: 0.00010; 5270/6626 tok/s;  14340 sec
[2021-05-15 21:37:33,986 INFO] Step 37400/50000; acc:  91.29; ppl:  1.35; xent: 0.30; lr: 0.00010; 5180/6501 tok/s;  14359 sec
[2021-05-15 21:37:52,962 INFO] Step 37450/50000; acc:  91.35; ppl:  1.35; xent: 0.30; lr: 0.00010; 5447/6691 tok/s;  14378 sec
[2021-05-15 21:38:12,228 INFO] Step 37500/50000; acc:  91.19; ppl:  1.35; xent: 0.30; lr: 0.00010; 5234/6523 tok/s;  14398 sec
[2021-05-15 21:38:30,694 INFO] Step 37550/50000; acc:  91.30; ppl:  1.34; xent: 0.30; lr: 0.00010; 5632/6896 tok/s;  14416 sec
[2021-05-15 21:38:49,996 INFO] Step 37600/50000; acc:  91.05; ppl:  1.35; xent: 0.30; lr: 0.00010; 5146/6436 tok/s;  14435 sec
[2021-05-15 21:39:09,693 INFO] Step 37650/50000; acc:  91.56; ppl:  1.34; xent: 0.29; lr: 0.00010; 5260/6573 tok/s;  14455 sec
[2021-05-15 21:39:28,801 INFO] Step 37700/50000; acc:  91.65; ppl:  1.33; xent: 0.29; lr: 0.00010; 5331/6911 tok/s;  14474 sec
[2021-05-15 21:39:47,396 INFO] Step 37750/50000; acc:  90.97; ppl:  1.36; xent: 0.31; lr: 0.00010; 5438/6613 tok/s;  14493 sec
[2021-05-15 21:40:06,240 INFO] Step 37800/50000; acc:  91.62; ppl:  1.33; xent: 0.29; lr: 0.00010; 5549/6877 tok/s;  14512 sec
[2021-05-15 21:40:23,816 INFO] Step 37850/50000; acc:  91.39; ppl:  1.34; xent: 0.29; lr: 0.00010; 5524/7201 tok/s;  14529 sec
[2021-05-15 21:40:42,835 INFO] Step 37900/50000; acc:  91.31; ppl:  1.34; xent: 0.29; lr: 0.00010; 5443/6666 tok/s;  14548 sec
[2021-05-15 21:40:50,050 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 21:41:01,794 INFO] Step 37950/50000; acc:  91.32; ppl:  1.34; xent: 0.29; lr: 0.00010; 5329/6789 tok/s;  14567 sec
[2021-05-15 21:41:20,771 INFO] Step 38000/50000; acc:  91.50; ppl:  1.34; xent: 0.29; lr: 0.00010; 5388/6825 tok/s;  14586 sec
[2021-05-15 21:41:38,936 INFO] Step 38050/50000; acc:  90.80; ppl:  1.37; xent: 0.31; lr: 0.00010; 5496/6680 tok/s;  14604 sec
[2021-05-15 21:41:58,391 INFO] Step 38100/50000; acc:  91.46; ppl:  1.34; xent: 0.29; lr: 0.00010; 5309/6732 tok/s;  14624 sec
[2021-05-15 21:42:17,770 INFO] Step 38150/50000; acc:  91.34; ppl:  1.34; xent: 0.30; lr: 0.00010; 5280/6585 tok/s;  14643 sec
[2021-05-15 21:42:36,191 INFO] Step 38200/50000; acc:  91.31; ppl:  1.34; xent: 0.30; lr: 0.00010; 5356/6707 tok/s;  14662 sec
[2021-05-15 21:42:55,460 INFO] Step 38250/50000; acc:  91.33; ppl:  1.34; xent: 0.29; lr: 0.00010; 5459/6627 tok/s;  14681 sec
[2021-05-15 21:43:14,057 INFO] Step 38300/50000; acc:  91.23; ppl:  1.34; xent: 0.30; lr: 0.00010; 5423/6763 tok/s;  14699 sec
[2021-05-15 21:43:33,477 INFO] Step 38350/50000; acc:  91.47; ppl:  1.34; xent: 0.29; lr: 0.00010; 5296/6627 tok/s;  14719 sec
[2021-05-15 21:43:52,217 INFO] Step 38400/50000; acc:  91.40; ppl:  1.34; xent: 0.29; lr: 0.00010; 5299/6726 tok/s;  14738 sec
[2021-05-15 21:44:11,678 INFO] Step 38450/50000; acc:  91.39; ppl:  1.34; xent: 0.30; lr: 0.00010; 5330/6599 tok/s;  14757 sec
[2021-05-15 21:44:30,563 INFO] Step 38500/50000; acc:  91.38; ppl:  1.34; xent: 0.29; lr: 0.00010; 5398/6681 tok/s;  14776 sec
[2021-05-15 21:44:48,833 INFO] Step 38550/50000; acc:  91.62; ppl:  1.33; xent: 0.28; lr: 0.00010; 5537/7013 tok/s;  14794 sec
[2021-05-15 21:45:07,885 INFO] Step 38600/50000; acc:  91.59; ppl:  1.34; xent: 0.29; lr: 0.00010; 5446/6896 tok/s;  14813 sec
[2021-05-15 21:45:26,046 INFO] Step 38650/50000; acc:  91.41; ppl:  1.34; xent: 0.29; lr: 0.00010; 5369/6866 tok/s;  14831 sec
[2021-05-15 21:45:28,265 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 21:45:45,602 INFO] Step 38700/50000; acc:  91.52; ppl:  1.34; xent: 0.29; lr: 0.00010; 5339/6571 tok/s;  14851 sec
[2021-05-15 21:46:03,517 INFO] Step 38750/50000; acc:  91.26; ppl:  1.35; xent: 0.30; lr: 0.00010; 5596/7028 tok/s;  14869 sec
[2021-05-15 21:46:22,564 INFO] Step 38800/50000; acc:  91.17; ppl:  1.35; xent: 0.30; lr: 0.00010; 5324/6724 tok/s;  14888 sec
[2021-05-15 21:46:41,723 INFO] Step 38850/50000; acc:  91.27; ppl:  1.34; xent: 0.30; lr: 0.00010; 5225/6537 tok/s;  14907 sec
[2021-05-15 21:47:00,860 INFO] Step 38900/50000; acc:  91.54; ppl:  1.34; xent: 0.29; lr: 0.00010; 5409/6726 tok/s;  14926 sec
[2021-05-15 21:47:19,883 INFO] Step 38950/50000; acc:  91.39; ppl:  1.34; xent: 0.29; lr: 0.00010; 5383/6615 tok/s;  14945 sec
[2021-05-15 21:47:38,295 INFO] Step 39000/50000; acc:  91.17; ppl:  1.34; xent: 0.30; lr: 0.00010; 5491/6733 tok/s;  14964 sec
[2021-05-15 21:47:57,327 INFO] Step 39050/50000; acc:  91.54; ppl:  1.33; xent: 0.29; lr: 0.00010; 5449/6827 tok/s;  14983 sec
[2021-05-15 21:48:16,754 INFO] Step 39100/50000; acc:  91.43; ppl:  1.34; xent: 0.29; lr: 0.00010; 5162/6523 tok/s;  15002 sec
[2021-05-15 21:48:36,067 INFO] Step 39150/50000; acc:  91.50; ppl:  1.34; xent: 0.29; lr: 0.00010; 5333/6758 tok/s;  15021 sec
[2021-05-15 21:48:54,977 INFO] Step 39200/50000; acc:  91.17; ppl:  1.35; xent: 0.30; lr: 0.00010; 5217/6486 tok/s;  15040 sec
[2021-05-15 21:49:13,946 INFO] Step 39250/50000; acc:  91.66; ppl:  1.33; xent: 0.29; lr: 0.00010; 5527/6742 tok/s;  15059 sec
[2021-05-15 21:49:33,042 INFO] Step 39300/50000; acc:  91.74; ppl:  1.32; xent: 0.28; lr: 0.00010; 5314/6841 tok/s;  15078 sec
[2021-05-15 21:49:51,536 INFO] Step 39350/50000; acc:  91.49; ppl:  1.34; xent: 0.29; lr: 0.00010; 5427/6860 tok/s;  15097 sec
[2021-05-15 21:49:55,248 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 21:50:10,799 INFO] Step 39400/50000; acc:  91.48; ppl:  1.34; xent: 0.29; lr: 0.00010; 5427/6836 tok/s;  15116 sec
[2021-05-15 21:50:28,822 INFO] Step 39450/50000; acc:  91.41; ppl:  1.34; xent: 0.29; lr: 0.00010; 5419/6756 tok/s;  15134 sec
[2021-05-15 21:50:47,553 INFO] Step 39500/50000; acc:  91.21; ppl:  1.35; xent: 0.30; lr: 0.00010; 5513/6779 tok/s;  15153 sec
[2021-05-15 21:51:06,539 INFO] Step 39550/50000; acc:  91.15; ppl:  1.35; xent: 0.30; lr: 0.00010; 5290/6702 tok/s;  15172 sec
[2021-05-15 21:51:25,752 INFO] Step 39600/50000; acc:  91.57; ppl:  1.33; xent: 0.29; lr: 0.00010; 5300/6622 tok/s;  15191 sec
[2021-05-15 21:51:44,759 INFO] Step 39650/50000; acc:  91.45; ppl:  1.34; xent: 0.29; lr: 0.00010; 5234/6678 tok/s;  15210 sec
[2021-05-15 21:52:03,954 INFO] Step 39700/50000; acc:  91.42; ppl:  1.34; xent: 0.29; lr: 0.00010; 5464/6474 tok/s;  15229 sec
[2021-05-15 21:52:22,754 INFO] Step 39750/50000; acc:  91.28; ppl:  1.34; xent: 0.30; lr: 0.00010; 5514/6808 tok/s;  15248 sec
[2021-05-15 21:52:41,729 INFO] Step 39800/50000; acc:  91.56; ppl:  1.33; xent: 0.29; lr: 0.00010; 5230/6681 tok/s;  15267 sec
[2021-05-15 21:53:00,918 INFO] Step 39850/50000; acc:  91.54; ppl:  1.33; xent: 0.29; lr: 0.00010; 5409/6724 tok/s;  15286 sec
[2021-05-15 21:53:19,945 INFO] Step 39900/50000; acc:  91.49; ppl:  1.33; xent: 0.29; lr: 0.00010; 5271/6656 tok/s;  15305 sec
[2021-05-15 21:53:39,317 INFO] Step 39950/50000; acc:  91.40; ppl:  1.34; xent: 0.29; lr: 0.00010; 5324/6510 tok/s;  15325 sec
[2021-05-15 21:53:57,567 INFO] Step 40000/50000; acc:  91.76; ppl:  1.32; xent: 0.28; lr: 0.00010; 5465/7002 tok/s;  15343 sec
[2021-05-15 21:53:57,568 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/basic/valid.txt, align=None)...
[2021-05-15 21:54:25,403 INFO] Validation perplexity: 1.44079
[2021-05-15 21:54:25,404 INFO] Validation accuracy: 90.09
[2021-05-15 21:54:25,408 INFO] Saving checkpoint ../models/group2_params/basic_ops/model_step_40000.pt
[2021-05-15 21:54:44,707 INFO] Step 40050/50000; acc:  91.76; ppl:  1.33; xent: 0.28; lr: 0.00010; 2185/2767 tok/s;  15390 sec
[2021-05-15 21:55:01,522 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 21:55:03,448 INFO] Step 40100/50000; acc:  91.78; ppl:  1.32; xent: 0.28; lr: 0.00010; 5429/6951 tok/s;  15409 sec
[2021-05-15 21:55:22,136 INFO] Step 40150/50000; acc:  91.36; ppl:  1.34; xent: 0.29; lr: 0.00010; 5395/6691 tok/s;  15428 sec
[2021-05-15 21:55:41,407 INFO] Step 40200/50000; acc:  91.53; ppl:  1.34; xent: 0.29; lr: 0.00010; 5428/6779 tok/s;  15447 sec
[2021-05-15 21:55:59,433 INFO] Step 40250/50000; acc:  90.98; ppl:  1.35; xent: 0.30; lr: 0.00010; 5355/6612 tok/s;  15465 sec
[2021-05-15 21:56:19,352 INFO] Step 40300/50000; acc:  91.58; ppl:  1.33; xent: 0.29; lr: 0.00010; 5213/6567 tok/s;  15485 sec
[2021-05-15 21:56:38,005 INFO] Step 40350/50000; acc:  91.38; ppl:  1.34; xent: 0.29; lr: 0.00010; 5384/6774 tok/s;  15503 sec
[2021-05-15 21:56:57,167 INFO] Step 40400/50000; acc:  91.65; ppl:  1.33; xent: 0.28; lr: 0.00010; 5319/6680 tok/s;  15523 sec
[2021-05-15 21:57:16,133 INFO] Step 40450/50000; acc:  91.35; ppl:  1.34; xent: 0.29; lr: 0.00010; 5345/6462 tok/s;  15542 sec
[2021-05-15 21:57:34,984 INFO] Step 40500/50000; acc:  91.37; ppl:  1.34; xent: 0.29; lr: 0.00010; 5553/6748 tok/s;  15560 sec
[2021-05-15 21:57:54,958 INFO] Step 40550/50000; acc:  91.81; ppl:  1.32; xent: 0.28; lr: 0.00010; 5100/6558 tok/s;  15580 sec
[2021-05-15 21:58:14,296 INFO] Step 40600/50000; acc:  91.47; ppl:  1.34; xent: 0.29; lr: 0.00010; 5159/6458 tok/s;  15600 sec
[2021-05-15 21:58:33,750 INFO] Step 40650/50000; acc:  91.64; ppl:  1.33; xent: 0.28; lr: 0.00010; 5327/6692 tok/s;  15619 sec
[2021-05-15 21:58:52,254 INFO] Step 40700/50000; acc:  91.56; ppl:  1.33; xent: 0.28; lr: 0.00010; 5436/6698 tok/s;  15638 sec
[2021-05-15 21:59:10,843 INFO] Step 40750/50000; acc:  91.85; ppl:  1.32; xent: 0.28; lr: 0.00010; 5537/7091 tok/s;  15656 sec
[2021-05-15 21:59:29,295 INFO] Step 40800/50000; acc:  91.29; ppl:  1.34; xent: 0.29; lr: 0.00010; 5314/6683 tok/s;  15675 sec
[2021-05-15 21:59:41,255 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 21:59:48,091 INFO] Step 40850/50000; acc:  91.74; ppl:  1.32; xent: 0.28; lr: 0.00010; 5554/6975 tok/s;  15693 sec
[2021-05-15 22:00:07,576 INFO] Step 40900/50000; acc:  91.61; ppl:  1.33; xent: 0.29; lr: 0.00010; 5251/6564 tok/s;  15713 sec
[2021-05-15 22:00:25,591 INFO] Step 40950/50000; acc:  91.25; ppl:  1.35; xent: 0.30; lr: 0.00010; 5570/7017 tok/s;  15731 sec
[2021-05-15 22:00:44,952 INFO] Step 41000/50000; acc:  91.43; ppl:  1.34; xent: 0.29; lr: 0.00010; 5347/6643 tok/s;  15750 sec
[2021-05-15 22:01:03,902 INFO] Step 41050/50000; acc:  91.47; ppl:  1.33; xent: 0.29; lr: 0.00010; 5139/6507 tok/s;  15769 sec
[2021-05-15 22:01:22,815 INFO] Step 41100/50000; acc:  91.58; ppl:  1.33; xent: 0.29; lr: 0.00010; 5478/6723 tok/s;  15788 sec
[2021-05-15 22:01:42,129 INFO] Step 41150/50000; acc:  91.47; ppl:  1.33; xent: 0.29; lr: 0.00010; 5243/6555 tok/s;  15808 sec
[2021-05-15 22:02:00,839 INFO] Step 41200/50000; acc:  91.64; ppl:  1.33; xent: 0.28; lr: 0.00010; 5521/6746 tok/s;  15826 sec
[2021-05-15 22:02:20,336 INFO] Step 41250/50000; acc:  91.42; ppl:  1.34; xent: 0.29; lr: 0.00010; 5154/6419 tok/s;  15846 sec
[2021-05-15 22:02:40,157 INFO] Step 41300/50000; acc:  91.73; ppl:  1.33; xent: 0.28; lr: 0.00010; 5234/6543 tok/s;  15866 sec
[2021-05-15 22:02:59,551 INFO] Step 41350/50000; acc:  91.90; ppl:  1.32; xent: 0.28; lr: 0.00010; 5281/6858 tok/s;  15885 sec
[2021-05-15 22:03:18,087 INFO] Step 41400/50000; acc:  91.22; ppl:  1.34; xent: 0.30; lr: 0.00010; 5364/6526 tok/s;  15903 sec
[2021-05-15 22:03:36,758 INFO] Step 41450/50000; acc:  91.89; ppl:  1.32; xent: 0.27; lr: 0.00010; 5577/6963 tok/s;  15922 sec
[2021-05-15 22:03:55,141 INFO] Step 41500/50000; acc:  91.64; ppl:  1.33; xent: 0.28; lr: 0.00010; 5425/6947 tok/s;  15941 sec
[2021-05-15 22:04:13,827 INFO] Step 41550/50000; acc:  91.73; ppl:  1.33; xent: 0.28; lr: 0.00010; 5484/6807 tok/s;  15959 sec
[2021-05-15 22:04:20,141 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 22:04:32,598 INFO] Step 41600/50000; acc:  91.49; ppl:  1.33; xent: 0.29; lr: 0.00010; 5278/6762 tok/s;  15978 sec
[2021-05-15 22:04:51,239 INFO] Step 41650/50000; acc:  91.73; ppl:  1.33; xent: 0.28; lr: 0.00010; 5599/6996 tok/s;  15997 sec
[2021-05-15 22:05:09,896 INFO] Step 41700/50000; acc:  91.27; ppl:  1.34; xent: 0.29; lr: 0.00010; 5432/6683 tok/s;  16015 sec
[2021-05-15 22:05:29,010 INFO] Step 41750/50000; acc:  91.44; ppl:  1.33; xent: 0.29; lr: 0.00010; 5248/6648 tok/s;  16034 sec
[2021-05-15 22:05:48,542 INFO] Step 41800/50000; acc:  91.56; ppl:  1.33; xent: 0.28; lr: 0.00010; 5331/6554 tok/s;  16054 sec
[2021-05-15 22:06:06,829 INFO] Step 41850/50000; acc:  91.45; ppl:  1.33; xent: 0.29; lr: 0.00010; 5297/6713 tok/s;  16072 sec
[2021-05-15 22:06:26,004 INFO] Step 41900/50000; acc:  91.64; ppl:  1.33; xent: 0.28; lr: 0.00010; 5502/6628 tok/s;  16091 sec
[2021-05-15 22:06:44,483 INFO] Step 41950/50000; acc:  91.53; ppl:  1.33; xent: 0.28; lr: 0.00010; 5464/6858 tok/s;  16110 sec
[2021-05-15 22:07:03,690 INFO] Step 42000/50000; acc:  91.75; ppl:  1.32; xent: 0.28; lr: 0.00010; 5327/6661 tok/s;  16129 sec
[2021-05-15 22:07:22,726 INFO] Step 42050/50000; acc:  91.60; ppl:  1.33; xent: 0.28; lr: 0.00010; 5280/6710 tok/s;  16148 sec
[2021-05-15 22:07:42,010 INFO] Step 42100/50000; acc:  91.64; ppl:  1.33; xent: 0.29; lr: 0.00010; 5375/6632 tok/s;  16167 sec
[2021-05-15 22:08:00,648 INFO] Step 42150/50000; acc:  91.69; ppl:  1.32; xent: 0.28; lr: 0.00010; 5515/6794 tok/s;  16186 sec
[2021-05-15 22:08:19,129 INFO] Step 42200/50000; acc:  91.73; ppl:  1.32; xent: 0.28; lr: 0.00010; 5376/6888 tok/s;  16205 sec
[2021-05-15 22:08:37,968 INFO] Step 42250/50000; acc:  91.87; ppl:  1.32; xent: 0.28; lr: 0.00010; 5485/6912 tok/s;  16223 sec
[2021-05-15 22:08:46,164 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 22:08:56,562 INFO] Step 42300/50000; acc:  91.71; ppl:  1.32; xent: 0.28; lr: 0.00010; 5397/6851 tok/s;  16242 sec
[2021-05-15 22:09:15,912 INFO] Step 42350/50000; acc:  91.67; ppl:  1.33; xent: 0.28; lr: 0.00010; 5328/6586 tok/s;  16261 sec
[2021-05-15 22:09:33,897 INFO] Step 42400/50000; acc:  91.43; ppl:  1.33; xent: 0.29; lr: 0.00010; 5476/6957 tok/s;  16279 sec
[2021-05-15 22:09:52,867 INFO] Step 42450/50000; acc:  91.32; ppl:  1.34; xent: 0.29; lr: 0.00010; 5452/6740 tok/s;  16298 sec
[2021-05-15 22:10:12,463 INFO] Step 42500/50000; acc:  91.67; ppl:  1.32; xent: 0.28; lr: 0.00010; 5189/6565 tok/s;  16318 sec
[2021-05-15 22:10:31,203 INFO] Step 42550/50000; acc:  91.70; ppl:  1.32; xent: 0.28; lr: 0.00010; 5359/6690 tok/s;  16337 sec
[2021-05-15 22:10:50,320 INFO] Step 42600/50000; acc:  91.75; ppl:  1.32; xent: 0.28; lr: 0.00010; 5451/6640 tok/s;  16356 sec
[2021-05-15 22:11:08,823 INFO] Step 42650/50000; acc:  91.38; ppl:  1.33; xent: 0.29; lr: 0.00010; 5375/6634 tok/s;  16374 sec
[2021-05-15 22:11:27,880 INFO] Step 42700/50000; acc:  91.80; ppl:  1.32; xent: 0.28; lr: 0.00010; 5446/6819 tok/s;  16393 sec
[2021-05-15 22:11:47,232 INFO] Step 42750/50000; acc:  91.62; ppl:  1.33; xent: 0.28; lr: 0.00010; 5206/6557 tok/s;  16413 sec
[2021-05-15 22:12:06,514 INFO] Step 42800/50000; acc:  91.80; ppl:  1.32; xent: 0.28; lr: 0.00010; 5301/6727 tok/s;  16432 sec
[2021-05-15 22:12:25,517 INFO] Step 42850/50000; acc:  91.46; ppl:  1.33; xent: 0.29; lr: 0.00010; 5272/6506 tok/s;  16451 sec
[2021-05-15 22:12:44,315 INFO] Step 42900/50000; acc:  91.92; ppl:  1.32; xent: 0.28; lr: 0.00010; 5565/6835 tok/s;  16470 sec
[2021-05-15 22:13:03,139 INFO] Step 42950/50000; acc:  91.91; ppl:  1.31; xent: 0.27; lr: 0.00010; 5422/6965 tok/s;  16489 sec
[2021-05-15 22:13:21,500 INFO] Step 43000/50000; acc:  91.71; ppl:  1.32; xent: 0.28; lr: 0.00010; 5376/6817 tok/s;  16507 sec
[2021-05-15 22:13:24,598 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 22:13:40,747 INFO] Step 43050/50000; acc:  91.70; ppl:  1.32; xent: 0.28; lr: 0.00010; 5405/6830 tok/s;  16526 sec
[2021-05-15 22:13:59,504 INFO] Step 43100/50000; acc:  91.69; ppl:  1.33; xent: 0.28; lr: 0.00010; 5358/6656 tok/s;  16545 sec
[2021-05-15 22:14:18,096 INFO] Step 43150/50000; acc:  91.39; ppl:  1.34; xent: 0.29; lr: 0.00010; 5488/6773 tok/s;  16563 sec
[2021-05-15 22:14:36,760 INFO] Step 43200/50000; acc:  91.47; ppl:  1.33; xent: 0.29; lr: 0.00010; 5294/6726 tok/s;  16582 sec
[2021-05-15 22:14:56,301 INFO] Step 43250/50000; acc:  91.88; ppl:  1.32; xent: 0.28; lr: 0.00010; 5312/6616 tok/s;  16602 sec
[2021-05-15 22:15:15,711 INFO] Step 43300/50000; acc:  91.70; ppl:  1.32; xent: 0.28; lr: 0.00010; 5199/6580 tok/s;  16621 sec
[2021-05-15 22:15:34,551 INFO] Step 43350/50000; acc:  91.50; ppl:  1.33; xent: 0.29; lr: 0.00010; 5424/6441 tok/s;  16640 sec
[2021-05-15 22:15:53,513 INFO] Step 43400/50000; acc:  91.62; ppl:  1.33; xent: 0.28; lr: 0.00010; 5545/6864 tok/s;  16659 sec
[2021-05-15 22:16:12,767 INFO] Step 43450/50000; acc:  91.89; ppl:  1.31; xent: 0.27; lr: 0.00010; 5069/6531 tok/s;  16678 sec
[2021-05-15 22:16:32,036 INFO] Step 43500/50000; acc:  91.70; ppl:  1.32; xent: 0.28; lr: 0.00010; 5397/6658 tok/s;  16697 sec
[2021-05-15 22:16:50,674 INFO] Step 43550/50000; acc:  91.70; ppl:  1.32; xent: 0.28; lr: 0.00010; 5395/6817 tok/s;  16716 sec
[2021-05-15 22:17:09,572 INFO] Step 43600/50000; acc:  91.67; ppl:  1.33; xent: 0.28; lr: 0.00010; 5424/6621 tok/s;  16735 sec
[2021-05-15 22:17:28,217 INFO] Step 43650/50000; acc:  92.07; ppl:  1.31; xent: 0.27; lr: 0.00010; 5411/6954 tok/s;  16754 sec
[2021-05-15 22:17:47,320 INFO] Step 43700/50000; acc:  91.84; ppl:  1.32; xent: 0.28; lr: 0.00010; 5397/6783 tok/s;  16773 sec
[2021-05-15 22:18:03,486 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 22:18:06,267 INFO] Step 43750/50000; acc:  91.96; ppl:  1.31; xent: 0.27; lr: 0.00010; 5404/6927 tok/s;  16792 sec
[2021-05-15 22:18:24,774 INFO] Step 43800/50000; acc:  91.60; ppl:  1.32; xent: 0.28; lr: 0.00010; 5358/6666 tok/s;  16810 sec
[2021-05-15 22:18:43,751 INFO] Step 43850/50000; acc:  91.67; ppl:  1.33; xent: 0.28; lr: 0.00010; 5485/6834 tok/s;  16829 sec
[2021-05-15 22:19:02,097 INFO] Step 43900/50000; acc:  91.20; ppl:  1.34; xent: 0.29; lr: 0.00010; 5408/6608 tok/s;  16847 sec
[2021-05-15 22:19:22,226 INFO] Step 43950/50000; acc:  91.83; ppl:  1.32; xent: 0.28; lr: 0.00010; 5107/6499 tok/s;  16868 sec
[2021-05-15 22:19:40,717 INFO] Step 44000/50000; acc:  91.65; ppl:  1.32; xent: 0.28; lr: 0.00010; 5337/6745 tok/s;  16886 sec
[2021-05-15 22:20:00,286 INFO] Step 44050/50000; acc:  91.84; ppl:  1.32; xent: 0.28; lr: 0.00010; 5313/6626 tok/s;  16906 sec
[2021-05-15 22:20:19,146 INFO] Step 44100/50000; acc:  91.73; ppl:  1.32; xent: 0.28; lr: 0.00010; 5452/6641 tok/s;  16925 sec
[2021-05-15 22:20:38,006 INFO] Step 44150/50000; acc:  91.49; ppl:  1.33; xent: 0.29; lr: 0.00010; 5388/6566 tok/s;  16943 sec
[2021-05-15 22:20:57,927 INFO] Step 44200/50000; acc:  92.07; ppl:  1.31; xent: 0.27; lr: 0.00010; 5213/6637 tok/s;  16963 sec
[2021-05-15 22:21:17,319 INFO] Step 44250/50000; acc:  91.68; ppl:  1.32; xent: 0.28; lr: 0.00010; 5049/6448 tok/s;  16983 sec
[2021-05-15 22:21:36,479 INFO] Step 44300/50000; acc:  91.83; ppl:  1.32; xent: 0.28; lr: 0.00010; 5417/6696 tok/s;  17002 sec
[2021-05-15 22:21:55,056 INFO] Step 44350/50000; acc:  91.85; ppl:  1.32; xent: 0.28; lr: 0.00010; 5446/6718 tok/s;  17020 sec
[2021-05-15 22:22:13,783 INFO] Step 44400/50000; acc:  92.08; ppl:  1.31; xent: 0.27; lr: 0.00010; 5444/7028 tok/s;  17039 sec
[2021-05-15 22:22:32,446 INFO] Step 44450/50000; acc:  91.67; ppl:  1.32; xent: 0.28; lr: 0.00010; 5332/6656 tok/s;  17058 sec
[2021-05-15 22:22:43,505 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 22:22:51,251 INFO] Step 44500/50000; acc:  91.95; ppl:  1.31; xent: 0.27; lr: 0.00010; 5543/6908 tok/s;  17077 sec
[2021-05-15 22:23:10,594 INFO] Step 44550/50000; acc:  91.84; ppl:  1.32; xent: 0.28; lr: 0.00010; 5334/6723 tok/s;  17096 sec
[2021-05-15 22:23:28,522 INFO] Step 44600/50000; acc:  91.41; ppl:  1.33; xent: 0.29; lr: 0.00010; 5491/6928 tok/s;  17114 sec
[2021-05-15 22:23:48,062 INFO] Step 44650/50000; acc:  91.75; ppl:  1.32; xent: 0.28; lr: 0.00010; 5282/6565 tok/s;  17133 sec
[2021-05-15 22:24:07,316 INFO] Step 44700/50000; acc:  91.65; ppl:  1.32; xent: 0.28; lr: 0.00010; 5201/6502 tok/s;  17153 sec
[2021-05-15 22:24:25,887 INFO] Step 44750/50000; acc:  91.87; ppl:  1.32; xent: 0.27; lr: 0.00010; 5505/6829 tok/s;  17171 sec
[2021-05-15 22:24:45,126 INFO] Step 44800/50000; acc:  91.75; ppl:  1.32; xent: 0.28; lr: 0.00010; 5191/6534 tok/s;  17191 sec
[2021-05-15 22:25:03,662 INFO] Step 44850/50000; acc:  91.73; ppl:  1.32; xent: 0.28; lr: 0.00010; 5672/6805 tok/s;  17209 sec
[2021-05-15 22:25:23,485 INFO] Step 44900/50000; acc:  91.83; ppl:  1.32; xent: 0.27; lr: 0.00010; 5144/6524 tok/s;  17229 sec
[2021-05-15 22:25:42,669 INFO] Step 44950/50000; acc:  91.70; ppl:  1.32; xent: 0.28; lr: 0.00010; 5255/6504 tok/s;  17248 sec
[2021-05-15 22:26:02,503 INFO] Step 45000/50000; acc:  92.13; ppl:  1.30; xent: 0.27; lr: 0.00010; 5254/6758 tok/s;  17268 sec
[2021-05-15 22:26:02,505 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/basic/valid.txt, align=None)...
[2021-05-15 22:26:29,958 INFO] Validation perplexity: 1.45278
[2021-05-15 22:26:29,958 INFO] Validation accuracy: 90.0763
[2021-05-15 22:26:29,962 INFO] Saving checkpoint ../models/group2_params/basic_ops/model_step_45000.pt
[2021-05-15 22:26:48,349 INFO] Step 45050/50000; acc:  91.57; ppl:  1.32; xent: 0.28; lr: 0.00010; 2125/2659 tok/s;  17314 sec
[2021-05-15 22:27:06,998 INFO] Step 45100/50000; acc:  92.05; ppl:  1.31; xent: 0.27; lr: 0.00010; 5603/6904 tok/s;  17332 sec
[2021-05-15 22:27:25,629 INFO] Step 45150/50000; acc:  91.99; ppl:  1.31; xent: 0.27; lr: 0.00010; 5372/6910 tok/s;  17351 sec
[2021-05-15 22:27:43,940 INFO] Step 45200/50000; acc:  91.89; ppl:  1.31; xent: 0.27; lr: 0.00010; 5564/6867 tok/s;  17369 sec
[2021-05-15 22:27:49,758 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 22:28:03,187 INFO] Step 45250/50000; acc:  91.86; ppl:  1.31; xent: 0.27; lr: 0.00010; 5219/6656 tok/s;  17389 sec
[2021-05-15 22:28:21,497 INFO] Step 45300/50000; acc:  91.87; ppl:  1.32; xent: 0.28; lr: 0.00010; 5684/7011 tok/s;  17407 sec
[2021-05-15 22:28:40,276 INFO] Step 45350/50000; acc:  91.45; ppl:  1.33; xent: 0.29; lr: 0.00010; 5429/6787 tok/s;  17426 sec
[2021-05-15 22:28:59,465 INFO] Step 45400/50000; acc:  91.85; ppl:  1.31; xent: 0.27; lr: 0.00010; 5141/6564 tok/s;  17445 sec
[2021-05-15 22:29:18,871 INFO] Step 45450/50000; acc:  91.83; ppl:  1.32; xent: 0.28; lr: 0.00010; 5347/6506 tok/s;  17464 sec
[2021-05-15 22:29:37,461 INFO] Step 45500/50000; acc:  91.72; ppl:  1.32; xent: 0.28; lr: 0.00010; 5358/6718 tok/s;  17483 sec
[2021-05-15 22:29:56,547 INFO] Step 45550/50000; acc:  91.93; ppl:  1.31; xent: 0.27; lr: 0.00010; 5475/6719 tok/s;  17502 sec
[2021-05-15 22:30:15,026 INFO] Step 45600/50000; acc:  91.76; ppl:  1.31; xent: 0.27; lr: 0.00010; 5368/6737 tok/s;  17520 sec
[2021-05-15 22:30:34,259 INFO] Step 45650/50000; acc:  91.87; ppl:  1.31; xent: 0.27; lr: 0.00010; 5416/6715 tok/s;  17540 sec
[2021-05-15 22:30:53,691 INFO] Step 45700/50000; acc:  92.06; ppl:  1.31; xent: 0.27; lr: 0.00010; 5255/6750 tok/s;  17559 sec
[2021-05-15 22:31:12,785 INFO] Step 45750/50000; acc:  91.78; ppl:  1.32; xent: 0.28; lr: 0.00010; 5267/6518 tok/s;  17578 sec
[2021-05-15 22:31:31,299 INFO] Step 45800/50000; acc:  91.82; ppl:  1.32; xent: 0.27; lr: 0.00010; 5652/6840 tok/s;  17597 sec
[2021-05-15 22:31:49,733 INFO] Step 45850/50000; acc:  92.06; ppl:  1.30; xent: 0.26; lr: 0.00010; 5287/6872 tok/s;  17615 sec
[2021-05-15 22:32:08,422 INFO] Step 45900/50000; acc:  92.11; ppl:  1.30; xent: 0.27; lr: 0.00010; 5534/6940 tok/s;  17634 sec
[2021-05-15 22:32:16,048 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 22:32:27,219 INFO] Step 45950/50000; acc:  91.87; ppl:  1.31; xent: 0.27; lr: 0.00010; 5376/6844 tok/s;  17653 sec
[2021-05-15 22:32:46,366 INFO] Step 46000/50000; acc:  91.99; ppl:  1.31; xent: 0.27; lr: 0.00010; 5337/6590 tok/s;  17672 sec
[2021-05-15 22:33:04,640 INFO] Step 46050/50000; acc:  91.76; ppl:  1.32; xent: 0.28; lr: 0.00010; 5469/6882 tok/s;  17690 sec
[2021-05-15 22:33:24,099 INFO] Step 46100/50000; acc:  91.50; ppl:  1.33; xent: 0.29; lr: 0.00010; 5306/6561 tok/s;  17709 sec
[2021-05-15 22:33:43,342 INFO] Step 46150/50000; acc:  91.95; ppl:  1.31; xent: 0.27; lr: 0.00010; 5326/6720 tok/s;  17729 sec
[2021-05-15 22:34:02,199 INFO] Step 46200/50000; acc:  91.92; ppl:  1.31; xent: 0.27; lr: 0.00010; 5226/6619 tok/s;  17748 sec
[2021-05-15 22:34:21,523 INFO] Step 46250/50000; acc:  92.01; ppl:  1.31; xent: 0.27; lr: 0.00010; 5378/6522 tok/s;  17767 sec
[2021-05-15 22:34:40,010 INFO] Step 46300/50000; acc:  91.72; ppl:  1.32; xent: 0.28; lr: 0.00010; 5521/6747 tok/s;  17785 sec
[2021-05-15 22:34:58,669 INFO] Step 46350/50000; acc:  92.07; ppl:  1.31; xent: 0.27; lr: 0.00010; 5508/6912 tok/s;  17804 sec
[2021-05-15 22:35:17,970 INFO] Step 46400/50000; acc:  91.85; ppl:  1.31; xent: 0.27; lr: 0.00010; 5133/6512 tok/s;  17823 sec
[2021-05-15 22:35:37,614 INFO] Step 46450/50000; acc:  92.04; ppl:  1.31; xent: 0.27; lr: 0.00010; 5294/6719 tok/s;  17843 sec
[2021-05-15 22:35:57,111 INFO] Step 46500/50000; acc:  91.78; ppl:  1.32; xent: 0.27; lr: 0.00010; 5222/6413 tok/s;  17863 sec
[2021-05-15 22:36:15,676 INFO] Step 46550/50000; acc:  92.01; ppl:  1.31; xent: 0.27; lr: 0.00010; 5471/6757 tok/s;  17881 sec
[2021-05-15 22:36:34,596 INFO] Step 46600/50000; acc:  92.27; ppl:  1.30; xent: 0.26; lr: 0.00010; 5483/6971 tok/s;  17900 sec
[2021-05-15 22:36:52,962 INFO] Step 46650/50000; acc:  92.05; ppl:  1.30; xent: 0.27; lr: 0.00010; 5290/6818 tok/s;  17918 sec
[2021-05-15 22:36:55,327 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 22:37:12,156 INFO] Step 46700/50000; acc:  91.85; ppl:  1.31; xent: 0.27; lr: 0.00010; 5423/6749 tok/s;  17938 sec
[2021-05-15 22:37:30,733 INFO] Step 46750/50000; acc:  92.02; ppl:  1.31; xent: 0.27; lr: 0.00010; 5442/6855 tok/s;  17956 sec
[2021-05-15 22:37:49,533 INFO] Step 46800/50000; acc:  91.56; ppl:  1.32; xent: 0.28; lr: 0.00010; 5372/6620 tok/s;  17975 sec
[2021-05-15 22:38:08,631 INFO] Step 46850/50000; acc:  91.78; ppl:  1.32; xent: 0.28; lr: 0.00010; 5245/6662 tok/s;  17994 sec
[2021-05-15 22:38:28,147 INFO] Step 46900/50000; acc:  91.99; ppl:  1.31; xent: 0.27; lr: 0.00010; 5320/6561 tok/s;  18014 sec
[2021-05-15 22:38:47,686 INFO] Step 46950/50000; acc:  92.05; ppl:  1.31; xent: 0.27; lr: 0.00010; 5200/6605 tok/s;  18033 sec
[2021-05-15 22:39:06,279 INFO] Step 47000/50000; acc:  91.74; ppl:  1.32; xent: 0.28; lr: 0.00010; 5405/6536 tok/s;  18052 sec
[2021-05-15 22:39:25,088 INFO] Step 47050/50000; acc:  91.73; ppl:  1.32; xent: 0.28; lr: 0.00010; 5562/6823 tok/s;  18070 sec
[2021-05-15 22:39:44,925 INFO] Step 47100/50000; acc:  92.19; ppl:  1.30; xent: 0.26; lr: 0.00010; 5052/6370 tok/s;  18090 sec
[2021-05-15 22:40:04,426 INFO] Step 47150/50000; acc:  91.98; ppl:  1.31; xent: 0.27; lr: 0.00010; 5281/6603 tok/s;  18110 sec
[2021-05-15 22:40:22,828 INFO] Step 47200/50000; acc:  91.89; ppl:  1.31; xent: 0.27; lr: 0.00010; 5373/6832 tok/s;  18128 sec
[2021-05-15 22:40:42,028 INFO] Step 47250/50000; acc:  91.98; ppl:  1.31; xent: 0.27; lr: 0.00010; 5443/6596 tok/s;  18147 sec
[2021-05-15 22:41:00,781 INFO] Step 47300/50000; acc:  92.35; ppl:  1.29; xent: 0.26; lr: 0.00010; 5446/7086 tok/s;  18166 sec
[2021-05-15 22:41:19,456 INFO] Step 47350/50000; acc:  91.93; ppl:  1.31; xent: 0.27; lr: 0.00010; 5360/6705 tok/s;  18185 sec
[2021-05-15 22:41:34,808 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 22:41:38,375 INFO] Step 47400/50000; acc:  92.22; ppl:  1.30; xent: 0.26; lr: 0.00010; 5515/6960 tok/s;  18204 sec
[2021-05-15 22:41:56,904 INFO] Step 47450/50000; acc:  91.95; ppl:  1.31; xent: 0.27; lr: 0.00010; 5258/6691 tok/s;  18222 sec
[2021-05-15 22:42:15,777 INFO] Step 47500/50000; acc:  91.98; ppl:  1.31; xent: 0.27; lr: 0.00010; 5517/6788 tok/s;  18241 sec
[2021-05-15 22:42:34,335 INFO] Step 47550/50000; acc:  91.50; ppl:  1.33; xent: 0.28; lr: 0.00010; 5376/6586 tok/s;  18260 sec
[2021-05-15 22:42:54,107 INFO] Step 47600/50000; acc:  92.07; ppl:  1.30; xent: 0.27; lr: 0.00010; 5166/6548 tok/s;  18280 sec
[2021-05-15 22:43:13,009 INFO] Step 47650/50000; acc:  91.89; ppl:  1.31; xent: 0.27; lr: 0.00010; 5283/6636 tok/s;  18298 sec
[2021-05-15 22:43:32,556 INFO] Step 47700/50000; acc:  92.04; ppl:  1.31; xent: 0.27; lr: 0.00010; 5318/6632 tok/s;  18318 sec
[2021-05-15 22:43:51,248 INFO] Step 47750/50000; acc:  91.97; ppl:  1.31; xent: 0.27; lr: 0.00010; 5553/6728 tok/s;  18337 sec
[2021-05-15 22:44:10,144 INFO] Step 47800/50000; acc:  91.75; ppl:  1.32; xent: 0.27; lr: 0.00010; 5271/6608 tok/s;  18356 sec
[2021-05-15 22:44:29,817 INFO] Step 47850/50000; acc:  92.24; ppl:  1.30; xent: 0.26; lr: 0.00010; 5262/6614 tok/s;  18375 sec
[2021-05-15 22:44:49,003 INFO] Step 47900/50000; acc:  92.07; ppl:  1.31; xent: 0.27; lr: 0.00010; 5241/6631 tok/s;  18394 sec
[2021-05-15 22:45:08,005 INFO] Step 47950/50000; acc:  91.97; ppl:  1.31; xent: 0.27; lr: 0.00010; 5405/6692 tok/s;  18413 sec
[2021-05-15 22:45:26,740 INFO] Step 48000/50000; acc:  92.09; ppl:  1.30; xent: 0.26; lr: 0.00010; 5302/6641 tok/s;  18432 sec
[2021-05-15 22:45:45,186 INFO] Step 48050/50000; acc:  92.24; ppl:  1.30; xent: 0.26; lr: 0.00010; 5638/7186 tok/s;  18451 sec
[2021-05-15 22:46:03,980 INFO] Step 48100/50000; acc:  92.04; ppl:  1.30; xent: 0.26; lr: 0.00010; 5377/6649 tok/s;  18469 sec
[2021-05-15 22:46:14,055 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 22:46:22,512 INFO] Step 48150/50000; acc:  92.06; ppl:  1.31; xent: 0.27; lr: 0.00010; 5455/6928 tok/s;  18488 sec
[2021-05-15 22:46:41,823 INFO] Step 48200/50000; acc:  92.16; ppl:  1.30; xent: 0.27; lr: 0.00010; 5445/6819 tok/s;  18507 sec
[2021-05-15 22:46:59,553 INFO] Step 48250/50000; acc:  91.60; ppl:  1.32; xent: 0.28; lr: 0.00010; 5448/6871 tok/s;  18525 sec
[2021-05-15 22:47:19,125 INFO] Step 48300/50000; acc:  92.02; ppl:  1.31; xent: 0.27; lr: 0.00010; 5277/6563 tok/s;  18545 sec
[2021-05-15 22:47:38,582 INFO] Step 48350/50000; acc:  91.97; ppl:  1.31; xent: 0.27; lr: 0.00010; 5171/6495 tok/s;  18564 sec
[2021-05-15 22:47:57,271 INFO] Step 48400/50000; acc:  92.10; ppl:  1.30; xent: 0.26; lr: 0.00010; 5426/6742 tok/s;  18583 sec
[2021-05-15 22:48:16,588 INFO] Step 48450/50000; acc:  92.02; ppl:  1.31; xent: 0.27; lr: 0.00010; 5243/6503 tok/s;  18602 sec
[2021-05-15 22:48:35,038 INFO] Step 48500/50000; acc:  91.98; ppl:  1.31; xent: 0.27; lr: 0.00010; 5695/6824 tok/s;  18620 sec
[2021-05-15 22:48:54,679 INFO] Step 48550/50000; acc:  92.14; ppl:  1.30; xent: 0.26; lr: 0.00010; 5227/6632 tok/s;  18640 sec
[2021-05-15 22:49:13,699 INFO] Step 48600/50000; acc:  92.07; ppl:  1.30; xent: 0.27; lr: 0.00010; 5209/6595 tok/s;  18659 sec
[2021-05-15 22:49:33,064 INFO] Step 48650/50000; acc:  92.28; ppl:  1.30; xent: 0.26; lr: 0.00010; 5358/6809 tok/s;  18678 sec
[2021-05-15 22:49:51,660 INFO] Step 48700/50000; acc:  91.93; ppl:  1.31; xent: 0.27; lr: 0.00010; 5391/6654 tok/s;  18697 sec
[2021-05-15 22:50:10,442 INFO] Step 48750/50000; acc:  92.30; ppl:  1.30; xent: 0.26; lr: 0.00010; 5499/6796 tok/s;  18716 sec
[2021-05-15 22:50:28,733 INFO] Step 48800/50000; acc:  92.21; ppl:  1.30; xent: 0.26; lr: 0.00010; 5367/7000 tok/s;  18734 sec
[2021-05-15 22:50:47,113 INFO] Step 48850/50000; acc:  92.08; ppl:  1.30; xent: 0.27; lr: 0.00010; 5667/6944 tok/s;  18753 sec
[2021-05-15 22:50:52,161 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 22:51:06,434 INFO] Step 48900/50000; acc:  92.22; ppl:  1.30; xent: 0.26; lr: 0.00010; 5277/6765 tok/s;  18772 sec
[2021-05-15 22:51:24,598 INFO] Step 48950/50000; acc:  91.98; ppl:  1.31; xent: 0.27; lr: 0.00010; 5552/6889 tok/s;  18790 sec
[2021-05-15 22:51:43,558 INFO] Step 49000/50000; acc:  91.72; ppl:  1.32; xent: 0.28; lr: 0.00010; 5471/6737 tok/s;  18809 sec
[2021-05-15 22:52:02,619 INFO] Step 49050/50000; acc:  92.07; ppl:  1.30; xent: 0.26; lr: 0.00010; 5093/6605 tok/s;  18828 sec
[2021-05-15 22:52:21,842 INFO] Step 49100/50000; acc:  92.07; ppl:  1.31; xent: 0.27; lr: 0.00010; 5395/6522 tok/s;  18847 sec
[2021-05-15 22:52:40,433 INFO] Step 49150/50000; acc:  92.06; ppl:  1.30; xent: 0.27; lr: 0.00010; 5392/6816 tok/s;  18866 sec
[2021-05-15 22:52:59,467 INFO] Step 49200/50000; acc:  92.11; ppl:  1.30; xent: 0.27; lr: 0.00010; 5450/6614 tok/s;  18885 sec
[2021-05-15 22:53:18,324 INFO] Step 49250/50000; acc:  92.24; ppl:  1.30; xent: 0.26; lr: 0.00010; 5327/6789 tok/s;  18904 sec
[2021-05-15 22:53:37,950 INFO] Step 49300/50000; acc:  92.15; ppl:  1.30; xent: 0.27; lr: 0.00010; 5308/6523 tok/s;  18923 sec
[2021-05-15 22:53:57,231 INFO] Step 49350/50000; acc:  92.32; ppl:  1.30; xent: 0.26; lr: 0.00010; 5329/6810 tok/s;  18943 sec
[2021-05-15 22:54:16,098 INFO] Step 49400/50000; acc:  91.97; ppl:  1.31; xent: 0.27; lr: 0.00010; 5233/6501 tok/s;  18961 sec
[2021-05-15 22:54:34,478 INFO] Step 49450/50000; acc:  92.18; ppl:  1.30; xent: 0.26; lr: 0.00010; 5677/6834 tok/s;  18980 sec
[2021-05-15 22:54:53,043 INFO] Step 49500/50000; acc:  92.36; ppl:  1.29; xent: 0.26; lr: 0.00010; 5402/7004 tok/s;  18998 sec
[2021-05-15 22:55:11,660 INFO] Step 49550/50000; acc:  92.20; ppl:  1.30; xent: 0.26; lr: 0.00010; 5485/6838 tok/s;  19017 sec
[2021-05-15 22:55:18,649 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/basic/train.txt, align=None)...
[2021-05-15 22:55:30,433 INFO] Step 49600/50000; acc:  92.14; ppl:  1.30; xent: 0.26; lr: 0.00010; 5304/6875 tok/s;  19036 sec
[2021-05-15 22:55:49,835 INFO] Step 49650/50000; acc:  92.19; ppl:  1.30; xent: 0.26; lr: 0.00010; 5363/6564 tok/s;  19055 sec
[2021-05-15 22:56:08,183 INFO] Step 49700/50000; acc:  92.04; ppl:  1.31; xent: 0.27; lr: 0.00010; 5513/6931 tok/s;  19074 sec
[2021-05-15 22:56:27,118 INFO] Step 49750/50000; acc:  91.72; ppl:  1.32; xent: 0.28; lr: 0.00010; 5302/6633 tok/s;  19093 sec
[2021-05-15 22:56:46,501 INFO] Step 49800/50000; acc:  92.21; ppl:  1.30; xent: 0.26; lr: 0.00010; 5380/6692 tok/s;  19112 sec
[2021-05-15 22:57:05,222 INFO] Step 49850/50000; acc:  92.08; ppl:  1.30; xent: 0.26; lr: 0.00010; 5168/6584 tok/s;  19131 sec
[2021-05-15 22:57:24,472 INFO] Step 49900/50000; acc:  92.24; ppl:  1.30; xent: 0.26; lr: 0.00010; 5416/6539 tok/s;  19150 sec
[2021-05-15 22:57:42,873 INFO] Step 49950/50000; acc:  91.89; ppl:  1.31; xent: 0.27; lr: 0.00010; 5571/6854 tok/s;  19168 sec
[2021-05-15 22:58:01,748 INFO] Step 50000/50000; acc:  92.26; ppl:  1.30; xent: 0.26; lr: 0.00005; 5400/6769 tok/s;  19187 sec
[2021-05-15 22:58:01,749 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/basic/valid.txt, align=None)...
[2021-05-15 22:58:29,586 INFO] Validation perplexity: 1.46358
[2021-05-15 22:58:29,587 INFO] Validation accuracy: 89.9593
[2021-05-15 22:58:29,591 INFO] Saving checkpoint ../models/group2_params/basic_ops/model_step_50000.pt

Strictly condensed EditOperations:

modelGroup2Strict = HephaestusModel(MODEL_GROUP2_STRICT)
modelGroup2Strict.train(
    DATA_SMALL_METHODS_TRAIN_BUGGY,
    DATA_SMALL_OPS_TYPED_STRICT_TRAIN,
    DATA_SMALL_METHODS_VALID_BUGGY,
    DATA_SMALL_OPS_TYPED_STRICT_VALID
)
[2021-05-15 17:12:31,163 INFO] Counter vocab from -1 samples.
[2021-05-15 17:12:31,163 INFO] n_sample=-1: Build vocab on full datasets.
[2021-05-15 17:12:31,166 INFO] corpus_1's transforms: TransformPipe()
[2021-05-15 17:12:31,167 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 17:12:31,769 INFO] Counters src:429
[2021-05-15 17:12:31,770 INFO] Counters tgt:448
[2021-05-15 17:12:31,770 WARNING] path ../models/group2_params/strict_ops/save_data.vocab.src exists, may overwrite...
[2021-05-15 17:12:31,772 WARNING] path ../models/group2_params/strict_ops/save_data.vocab.tgt exists, may overwrite...
[2021-05-15 17:12:32,444 INFO] Parsed 2 corpora from -data.
[2021-05-15 17:12:32,444 INFO] Get special vocabs from Transforms: {'src': set(), 'tgt': set()}.
[2021-05-15 17:12:32,444 INFO] Loading vocab from text file...
[2021-05-15 17:12:32,444 INFO] Loading src vocabulary from ../models/group2_params/strict_ops/save_data.vocab.src
[2021-05-15 17:12:32,445 INFO] Loaded src vocab has 429 tokens.
[2021-05-15 17:12:32,446 INFO] Loading tgt vocabulary from ../models/group2_params/strict_ops/save_data.vocab.tgt
[2021-05-15 17:12:32,447 INFO] Loaded tgt vocab has 448 tokens.
[2021-05-15 17:12:32,448 INFO] Building fields with vocab in counters...
[2021-05-15 17:12:32,448 INFO]  * tgt vocab size: 452.
[2021-05-15 17:12:32,449 INFO]  * src vocab size: 431.
[2021-05-15 17:12:32,449 INFO]  * src vocab size = 431
[2021-05-15 17:12:32,449 INFO]  * tgt vocab size = 452
[2021-05-15 17:12:32,450 INFO] Building model...
[2021-05-15 17:12:33,609 INFO] NMTModel(
  (encoder): RNNEncoder(
    (embeddings): Embeddings(
      (make_embedding): Sequential(
        (emb_luts): Elementwise(
          (0): Embedding(431, 512, padding_idx=1)
        )
      )
    )
    (rnn): LSTM(512, 256, num_layers=2, dropout=0.2)
  )
  (decoder): InputFeedRNNDecoder(
    (embeddings): Embeddings(
      (make_embedding): Sequential(
        (emb_luts): Elementwise(
          (0): Embedding(452, 512, padding_idx=1)
        )
      )
    )
    (dropout): Dropout(p=0.2, inplace=False)
    (rnn): StackedLSTM(
      (dropout): Dropout(p=0.2, inplace=False)
      (layers): ModuleList(
        (0): LSTMCell(768, 256)
        (1): LSTMCell(256, 256)
      )
    )
    (attn): GlobalAttention(
      (linear_context): Linear(in_features=256, out_features=256, bias=False)
      (linear_query): Linear(in_features=256, out_features=256, bias=True)
      (v): Linear(in_features=256, out_features=1, bias=False)
      (linear_out): Linear(in_features=512, out_features=256, bias=True)
    )
  )
  (generator): Sequential(
    (0): Linear(in_features=256, out_features=452, bias=True)
    (1): Cast()
    (2): LogSoftmax(dim=-1)
  )
)
[2021-05-15 17:12:33,610 INFO] encoder: 1535488
[2021-05-15 17:12:33,610 INFO] decoder: 2187460
[2021-05-15 17:12:33,610 INFO] * number of parameters: 3722948
[2021-05-15 17:12:33,611 INFO] Starting training on GPU: [0]
[2021-05-15 17:12:33,611 INFO] Start training loop and validate every 5000 steps...
[2021-05-15 17:12:33,611 INFO] corpus_1's transforms: TransformPipe()
[2021-05-15 17:12:33,611 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 17:12:44,733 INFO] Step 50/50000; acc:  14.00; ppl: 181.50; xent: 5.20; lr: 0.00010; 9216/3788 tok/s;     11 sec
[2021-05-15 17:12:55,200 INFO] Step 100/50000; acc:  16.66; ppl: 50.43; xent: 3.92; lr: 0.00010; 9435/4106 tok/s;     22 sec
[2021-05-15 17:13:06,444 INFO] Step 150/50000; acc:  19.97; ppl: 36.83; xent: 3.61; lr: 0.00010; 9158/3893 tok/s;     33 sec
[2021-05-15 17:13:16,741 INFO] Step 200/50000; acc:  26.45; ppl: 27.98; xent: 3.33; lr: 0.00010; 9634/4125 tok/s;     43 sec
[2021-05-15 17:13:27,389 INFO] Step 250/50000; acc:  32.47; ppl: 20.34; xent: 3.01; lr: 0.00010; 9800/4026 tok/s;     54 sec
[2021-05-15 17:13:37,756 INFO] Step 300/50000; acc:  35.95; ppl: 14.97; xent: 2.71; lr: 0.00010; 9562/4056 tok/s;     64 sec
[2021-05-15 17:13:49,075 INFO] Step 350/50000; acc:  40.05; ppl: 11.63; xent: 2.45; lr: 0.00010; 9371/3774 tok/s;     75 sec
[2021-05-15 17:13:59,850 INFO] Step 400/50000; acc:  40.67; ppl: 10.48; xent: 2.35; lr: 0.00010; 9205/3985 tok/s;     86 sec
[2021-05-15 17:14:10,756 INFO] Step 450/50000; acc:  41.86; ppl:  9.60; xent: 2.26; lr: 0.00010; 9414/3878 tok/s;     97 sec
[2021-05-15 17:14:21,472 INFO] Step 500/50000; acc:  41.99; ppl:  9.49; xent: 2.25; lr: 0.00010; 9545/4030 tok/s;    108 sec
[2021-05-15 17:14:32,259 INFO] Step 550/50000; acc:  42.09; ppl:  9.12; xent: 2.21; lr: 0.00010; 9297/3987 tok/s;    119 sec
[2021-05-15 17:14:43,491 INFO] Step 600/50000; acc:  43.32; ppl:  8.58; xent: 2.15; lr: 0.00010; 9241/3771 tok/s;    130 sec
[2021-05-15 17:14:53,520 INFO] Step 650/50000; acc:  42.92; ppl:  8.55; xent: 2.15; lr: 0.00010; 9773/4172 tok/s;    140 sec
[2021-05-15 17:15:04,453 INFO] Step 700/50000; acc:  43.77; ppl:  8.36; xent: 2.12; lr: 0.00010; 9557/3911 tok/s;    151 sec
[2021-05-15 17:15:05,320 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 17:15:15,235 INFO] Step 750/50000; acc:  43.79; ppl:  8.25; xent: 2.11; lr: 0.00010; 9314/4062 tok/s;    162 sec
[2021-05-15 17:15:25,813 INFO] Step 800/50000; acc:  44.18; ppl:  8.08; xent: 2.09; lr: 0.00010; 9781/3959 tok/s;    172 sec
[2021-05-15 17:15:37,573 INFO] Step 850/50000; acc:  44.56; ppl:  7.97; xent: 2.08; lr: 0.00010; 8606/3739 tok/s;    184 sec
[2021-05-15 17:15:47,893 INFO] Step 900/50000; acc:  45.14; ppl:  7.60; xent: 2.03; lr: 0.00010; 9630/4141 tok/s;    194 sec
[2021-05-15 17:15:58,252 INFO] Step 950/50000; acc:  45.98; ppl:  7.47; xent: 2.01; lr: 0.00010; 9956/4141 tok/s;    205 sec
[2021-05-15 17:16:08,470 INFO] Step 1000/50000; acc:  46.76; ppl:  7.08; xent: 1.96; lr: 0.00010; 9679/4139 tok/s;    215 sec
[2021-05-15 17:16:19,430 INFO] Step 1050/50000; acc:  46.91; ppl:  6.99; xent: 1.95; lr: 0.00010; 9669/3866 tok/s;    226 sec
[2021-05-15 17:16:30,669 INFO] Step 1100/50000; acc:  47.46; ppl:  6.68; xent: 1.90; lr: 0.00010; 8870/3788 tok/s;    237 sec
[2021-05-15 17:16:41,607 INFO] Step 1150/50000; acc:  48.16; ppl:  6.55; xent: 1.88; lr: 0.00010; 9532/3940 tok/s;    248 sec
[2021-05-15 17:16:52,227 INFO] Step 1200/50000; acc:  48.14; ppl:  6.42; xent: 1.86; lr: 0.00010; 9357/3959 tok/s;    259 sec
[2021-05-15 17:17:02,879 INFO] Step 1250/50000; acc:  48.45; ppl:  6.43; xent: 1.86; lr: 0.00010; 9645/4038 tok/s;    269 sec
[2021-05-15 17:17:13,877 INFO] Step 1300/50000; acc:  48.90; ppl:  6.17; xent: 1.82; lr: 0.00010; 9319/3901 tok/s;    280 sec
[2021-05-15 17:17:24,232 INFO] Step 1350/50000; acc:  49.42; ppl:  6.13; xent: 1.81; lr: 0.00010; 9729/4102 tok/s;    291 sec
[2021-05-15 17:17:34,513 INFO] Step 1400/50000; acc:  50.21; ppl:  5.87; xent: 1.77; lr: 0.00010; 9958/4089 tok/s;    301 sec
[2021-05-15 17:17:43,265 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 17:17:45,131 INFO] Step 1450/50000; acc:  49.86; ppl:  5.88; xent: 1.77; lr: 0.00010; 9297/4034 tok/s;    312 sec
[2021-05-15 17:17:56,193 INFO] Step 1500/50000; acc:  50.36; ppl:  5.82; xent: 1.76; lr: 0.00010; 9462/3895 tok/s;    323 sec
[2021-05-15 17:18:07,079 INFO] Step 1550/50000; acc:  50.20; ppl:  5.77; xent: 1.75; lr: 0.00010; 9222/3927 tok/s;    333 sec
[2021-05-15 17:18:18,239 INFO] Step 1600/50000; acc:  50.48; ppl:  5.70; xent: 1.74; lr: 0.00010; 9160/3921 tok/s;    345 sec
[2021-05-15 17:18:28,975 INFO] Step 1650/50000; acc:  50.97; ppl:  5.64; xent: 1.73; lr: 0.00010; 9511/3989 tok/s;    355 sec
[2021-05-15 17:18:39,242 INFO] Step 1700/50000; acc:  51.23; ppl:  5.54; xent: 1.71; lr: 0.00010; 9647/4160 tok/s;    366 sec
[2021-05-15 17:18:50,070 INFO] Step 1750/50000; acc:  51.42; ppl:  5.52; xent: 1.71; lr: 0.00010; 9571/3944 tok/s;    376 sec
[2021-05-15 17:19:00,462 INFO] Step 1800/50000; acc:  51.68; ppl:  5.37; xent: 1.68; lr: 0.00010; 9675/3982 tok/s;    387 sec
[2021-05-15 17:19:12,367 INFO] Step 1850/50000; acc:  51.50; ppl:  5.48; xent: 1.70; lr: 0.00010; 8861/3690 tok/s;    399 sec
[2021-05-15 17:19:22,138 INFO] Step 1900/50000; acc:  52.33; ppl:  5.21; xent: 1.65; lr: 0.00010; 10045/4233 tok/s;    409 sec
[2021-05-15 17:19:33,488 INFO] Step 1950/50000; acc:  51.79; ppl:  5.34; xent: 1.68; lr: 0.00010; 9236/3868 tok/s;    420 sec
[2021-05-15 17:19:44,441 INFO] Step 2000/50000; acc:  51.79; ppl:  5.33; xent: 1.67; lr: 0.00010; 9057/3877 tok/s;    431 sec
[2021-05-15 17:19:55,502 INFO] Step 2050/50000; acc:  52.60; ppl:  5.14; xent: 1.64; lr: 0.00010; 9342/3831 tok/s;    442 sec
[2021-05-15 17:20:05,768 INFO] Step 2100/50000; acc:  52.64; ppl:  5.21; xent: 1.65; lr: 0.00010; 9899/4153 tok/s;    452 sec
[2021-05-15 17:20:16,131 INFO] Step 2150/50000; acc:  53.45; ppl:  4.97; xent: 1.60; lr: 0.00010; 9633/4125 tok/s;    463 sec
[2021-05-15 17:20:22,032 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 17:20:27,206 INFO] Step 2200/50000; acc:  53.04; ppl:  5.10; xent: 1.63; lr: 0.00010; 9346/3857 tok/s;    474 sec
[2021-05-15 17:20:37,833 INFO] Step 2250/50000; acc:  53.77; ppl:  4.97; xent: 1.60; lr: 0.00010; 9323/4009 tok/s;    484 sec
[2021-05-15 17:20:49,595 INFO] Step 2300/50000; acc:  52.94; ppl:  5.09; xent: 1.63; lr: 0.00010; 8860/3688 tok/s;    496 sec
[2021-05-15 17:21:00,509 INFO] Step 2350/50000; acc:  53.18; ppl:  5.01; xent: 1.61; lr: 0.00010; 9129/3975 tok/s;    507 sec
[2021-05-15 17:21:11,031 INFO] Step 2400/50000; acc:  53.70; ppl:  4.94; xent: 1.60; lr: 0.00010; 9786/4064 tok/s;    517 sec
[2021-05-15 17:21:21,413 INFO] Step 2450/50000; acc:  54.02; ppl:  4.89; xent: 1.59; lr: 0.00010; 9781/4093 tok/s;    528 sec
[2021-05-15 17:21:31,623 INFO] Step 2500/50000; acc:  54.09; ppl:  4.87; xent: 1.58; lr: 0.00010; 9850/4133 tok/s;    538 sec
[2021-05-15 17:21:43,258 INFO] Step 2550/50000; acc:  54.04; ppl:  4.84; xent: 1.58; lr: 0.00010; 8973/3624 tok/s;    550 sec
[2021-05-15 17:21:53,955 INFO] Step 2600/50000; acc:  54.11; ppl:  4.83; xent: 1.57; lr: 0.00010; 9302/4038 tok/s;    560 sec
[2021-05-15 17:22:05,054 INFO] Step 2650/50000; acc:  54.31; ppl:  4.79; xent: 1.57; lr: 0.00010; 9449/3818 tok/s;    571 sec
[2021-05-15 17:22:15,373 INFO] Step 2700/50000; acc:  54.92; ppl:  4.74; xent: 1.56; lr: 0.00010; 9572/4105 tok/s;    582 sec
[2021-05-15 17:22:26,890 INFO] Step 2750/50000; acc:  54.44; ppl:  4.80; xent: 1.57; lr: 0.00010; 9052/3795 tok/s;    593 sec
[2021-05-15 17:22:37,350 INFO] Step 2800/50000; acc:  55.51; ppl:  4.57; xent: 1.52; lr: 0.00010; 9538/3984 tok/s;    604 sec
[2021-05-15 17:22:47,846 INFO] Step 2850/50000; acc:  55.13; ppl:  4.70; xent: 1.55; lr: 0.00010; 9733/4116 tok/s;    614 sec
[2021-05-15 17:22:58,336 INFO] Step 2900/50000; acc:  55.53; ppl:  4.56; xent: 1.52; lr: 0.00010; 9715/4093 tok/s;    625 sec
[2021-05-15 17:23:01,184 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 17:23:09,001 INFO] Step 2950/50000; acc:  55.33; ppl:  4.60; xent: 1.53; lr: 0.00010; 9436/3999 tok/s;    635 sec
[2021-05-15 17:23:20,055 INFO] Step 3000/50000; acc:  55.43; ppl:  4.65; xent: 1.54; lr: 0.00010; 9336/3852 tok/s;    646 sec
[2021-05-15 17:23:31,020 INFO] Step 3050/50000; acc:  55.84; ppl:  4.55; xent: 1.52; lr: 0.00010; 8939/3972 tok/s;    657 sec
[2021-05-15 17:23:41,995 INFO] Step 3100/50000; acc:  55.67; ppl:  4.60; xent: 1.52; lr: 0.00010; 9517/3903 tok/s;    668 sec
[2021-05-15 17:23:52,540 INFO] Step 3150/50000; acc:  55.95; ppl:  4.54; xent: 1.51; lr: 0.00010; 9461/4069 tok/s;    679 sec
[2021-05-15 17:24:03,079 INFO] Step 3200/50000; acc:  56.15; ppl:  4.47; xent: 1.50; lr: 0.00010; 9756/4042 tok/s;    689 sec
[2021-05-15 17:24:14,126 INFO] Step 3250/50000; acc:  55.95; ppl:  4.53; xent: 1.51; lr: 0.00010; 9410/3785 tok/s;    701 sec
[2021-05-15 17:24:24,898 INFO] Step 3300/50000; acc:  56.48; ppl:  4.41; xent: 1.48; lr: 0.00010; 9248/4006 tok/s;    711 sec
[2021-05-15 17:24:35,774 INFO] Step 3350/50000; acc:  56.18; ppl:  4.48; xent: 1.50; lr: 0.00010; 9528/3903 tok/s;    722 sec
[2021-05-15 17:24:46,211 INFO] Step 3400/50000; acc:  56.31; ppl:  4.43; xent: 1.49; lr: 0.00010; 9539/4103 tok/s;    733 sec
[2021-05-15 17:24:57,338 INFO] Step 3450/50000; acc:  56.31; ppl:  4.51; xent: 1.51; lr: 0.00010; 9398/3864 tok/s;    744 sec
[2021-05-15 17:25:08,320 INFO] Step 3500/50000; acc:  56.96; ppl:  4.33; xent: 1.47; lr: 0.00010; 9034/3871 tok/s;    755 sec
[2021-05-15 17:25:18,527 INFO] Step 3550/50000; acc:  57.10; ppl:  4.36; xent: 1.47; lr: 0.00010; 10206/4120 tok/s;    765 sec
[2021-05-15 17:25:29,015 INFO] Step 3600/50000; acc:  57.54; ppl:  4.27; xent: 1.45; lr: 0.00010; 9438/4073 tok/s;    775 sec
[2021-05-15 17:25:32,576 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 17:25:39,956 INFO] Step 3650/50000; acc:  57.25; ppl:  4.33; xent: 1.47; lr: 0.00010; 9425/3949 tok/s;    786 sec
[2021-05-15 17:25:50,865 INFO] Step 3700/50000; acc:  57.52; ppl:  4.28; xent: 1.45; lr: 0.00010; 9379/3868 tok/s;    797 sec
[2021-05-15 17:26:01,762 INFO] Step 3750/50000; acc:  57.41; ppl:  4.32; xent: 1.46; lr: 0.00010; 9144/3976 tok/s;    808 sec
[2021-05-15 17:26:12,984 INFO] Step 3800/50000; acc:  57.59; ppl:  4.26; xent: 1.45; lr: 0.00010; 9150/3873 tok/s;    819 sec
[2021-05-15 17:26:23,046 INFO] Step 3850/50000; acc:  57.51; ppl:  4.28; xent: 1.45; lr: 0.00010; 9783/4275 tok/s;    829 sec
[2021-05-15 17:26:33,478 INFO] Step 3900/50000; acc:  57.86; ppl:  4.23; xent: 1.44; lr: 0.00010; 9984/4054 tok/s;    840 sec
[2021-05-15 17:26:43,952 INFO] Step 3950/50000; acc:  57.77; ppl:  4.22; xent: 1.44; lr: 0.00010; 9609/4080 tok/s;    850 sec
[2021-05-15 17:26:55,099 INFO] Step 4000/50000; acc:  57.76; ppl:  4.23; xent: 1.44; lr: 0.00010; 9414/3795 tok/s;    861 sec
[2021-05-15 17:27:06,016 INFO] Step 4050/50000; acc:  58.03; ppl:  4.20; xent: 1.44; lr: 0.00010; 9353/3941 tok/s;    872 sec
[2021-05-15 17:27:16,654 INFO] Step 4100/50000; acc:  58.48; ppl:  4.11; xent: 1.41; lr: 0.00010; 9346/3963 tok/s;    883 sec
[2021-05-15 17:27:27,510 INFO] Step 4150/50000; acc:  58.03; ppl:  4.28; xent: 1.45; lr: 0.00010; 9529/3989 tok/s;    894 sec
[2021-05-15 17:27:38,232 INFO] Step 4200/50000; acc:  58.42; ppl:  4.16; xent: 1.42; lr: 0.00010; 9278/3975 tok/s;    905 sec
[2021-05-15 17:27:49,544 INFO] Step 4250/50000; acc:  58.60; ppl:  4.11; xent: 1.41; lr: 0.00010; 9326/3749 tok/s;    916 sec
[2021-05-15 17:27:59,699 INFO] Step 4300/50000; acc:  59.18; ppl:  4.03; xent: 1.39; lr: 0.00010; 9666/4126 tok/s;    926 sec
[2021-05-15 17:28:10,509 INFO] Step 4350/50000; acc:  58.82; ppl:  4.11; xent: 1.41; lr: 0.00010; 9625/3988 tok/s;    937 sec
[2021-05-15 17:28:10,961 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 17:28:21,348 INFO] Step 4400/50000; acc:  58.99; ppl:  4.06; xent: 1.40; lr: 0.00010; 9177/3993 tok/s;    948 sec
[2021-05-15 17:28:32,048 INFO] Step 4450/50000; acc:  58.99; ppl:  4.09; xent: 1.41; lr: 0.00010; 9643/3954 tok/s;    958 sec
[2021-05-15 17:28:43,696 INFO] Step 4500/50000; acc:  59.03; ppl:  4.11; xent: 1.41; lr: 0.00010; 8668/3760 tok/s;    970 sec
[2021-05-15 17:28:54,037 INFO] Step 4550/50000; acc:  59.08; ppl:  4.03; xent: 1.39; lr: 0.00010; 9687/4140 tok/s;    980 sec
[2021-05-15 17:29:04,558 INFO] Step 4600/50000; acc:  59.41; ppl:  4.06; xent: 1.40; lr: 0.00010; 9790/4049 tok/s;    991 sec
[2021-05-15 17:29:14,943 INFO] Step 4650/50000; acc:  59.92; ppl:  3.97; xent: 1.38; lr: 0.00010; 9438/4087 tok/s;   1001 sec
[2021-05-15 17:29:25,996 INFO] Step 4700/50000; acc:  58.96; ppl:  4.09; xent: 1.41; lr: 0.00010; 9593/3858 tok/s;   1012 sec
[2021-05-15 17:29:37,193 INFO] Step 4750/50000; acc:  59.49; ppl:  3.98; xent: 1.38; lr: 0.00010; 9026/3819 tok/s;   1024 sec
[2021-05-15 17:29:47,868 INFO] Step 4800/50000; acc:  59.88; ppl:  3.97; xent: 1.38; lr: 0.00010; 9648/3978 tok/s;   1034 sec
[2021-05-15 17:29:58,760 INFO] Step 4850/50000; acc:  59.63; ppl:  3.97; xent: 1.38; lr: 0.00010; 9397/3935 tok/s;   1045 sec
[2021-05-15 17:30:09,326 INFO] Step 4900/50000; acc:  59.95; ppl:  3.99; xent: 1.38; lr: 0.00010; 9402/4013 tok/s;   1056 sec
[2021-05-15 17:30:20,531 INFO] Step 4950/50000; acc:  59.82; ppl:  3.94; xent: 1.37; lr: 0.00010; 9273/3845 tok/s;   1067 sec
[2021-05-15 17:30:30,588 INFO] Step 5000/50000; acc:  60.39; ppl:  3.91; xent: 1.36; lr: 0.00010; 9910/4176 tok/s;   1077 sec
[2021-05-15 17:30:30,588 INFO] valid's transforms: TransformPipe()
[2021-05-15 17:30:30,591 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/strict/valid.txt, align=None)...
[2021-05-15 17:30:39,361 INFO] Validation perplexity: 3.74558
[2021-05-15 17:30:39,361 INFO] Validation accuracy: 62.0656
[2021-05-15 17:30:39,363 INFO] Saving checkpoint ../models/group2_params/strict_ops/model_step_5000.pt
[2021-05-15 17:30:50,475 INFO] Step 5050/50000; acc:  60.36; ppl:  3.90; xent: 1.36; lr: 0.00010; 5232/2146 tok/s;   1097 sec
[2021-05-15 17:30:58,542 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 17:31:01,063 INFO] Step 5100/50000; acc:  60.43; ppl:  3.88; xent: 1.36; lr: 0.00010; 9353/4042 tok/s;   1107 sec
[2021-05-15 17:31:11,984 INFO] Step 5150/50000; acc:  60.42; ppl:  3.92; xent: 1.37; lr: 0.00010; 9532/3935 tok/s;   1118 sec
[2021-05-15 17:31:22,962 INFO] Step 5200/50000; acc:  60.12; ppl:  3.92; xent: 1.37; lr: 0.00010; 9049/3887 tok/s;   1129 sec
[2021-05-15 17:31:34,050 INFO] Step 5250/50000; acc:  60.30; ppl:  3.89; xent: 1.36; lr: 0.00010; 9202/3946 tok/s;   1140 sec
[2021-05-15 17:31:44,663 INFO] Step 5300/50000; acc:  60.32; ppl:  3.91; xent: 1.36; lr: 0.00010; 9620/4037 tok/s;   1151 sec
[2021-05-15 17:31:55,058 INFO] Step 5350/50000; acc:  60.80; ppl:  3.87; xent: 1.35; lr: 0.00010; 9607/4100 tok/s;   1161 sec
[2021-05-15 17:32:05,823 INFO] Step 5400/50000; acc:  60.41; ppl:  3.91; xent: 1.36; lr: 0.00010; 9615/3977 tok/s;   1172 sec
[2021-05-15 17:32:16,290 INFO] Step 5450/50000; acc:  61.06; ppl:  3.78; xent: 1.33; lr: 0.00010; 9541/3944 tok/s;   1183 sec
[2021-05-15 17:32:27,928 INFO] Step 5500/50000; acc:  60.33; ppl:  3.90; xent: 1.36; lr: 0.00010; 9043/3759 tok/s;   1194 sec
[2021-05-15 17:32:37,975 INFO] Step 5550/50000; acc:  60.95; ppl:  3.79; xent: 1.33; lr: 0.00010; 9919/4160 tok/s;   1204 sec
[2021-05-15 17:32:49,109 INFO] Step 5600/50000; acc:  60.77; ppl:  3.84; xent: 1.35; lr: 0.00010; 9312/3922 tok/s;   1215 sec
[2021-05-15 17:33:00,268 INFO] Step 5650/50000; acc:  60.99; ppl:  3.88; xent: 1.36; lr: 0.00010; 9138/3819 tok/s;   1227 sec
[2021-05-15 17:33:10,927 INFO] Step 5700/50000; acc:  61.42; ppl:  3.72; xent: 1.31; lr: 0.00010; 9361/3956 tok/s;   1237 sec
[2021-05-15 17:33:21,272 INFO] Step 5750/50000; acc:  61.01; ppl:  3.82; xent: 1.34; lr: 0.00010; 9966/4136 tok/s;   1248 sec
[2021-05-15 17:33:31,603 INFO] Step 5800/50000; acc:  61.76; ppl:  3.69; xent: 1.30; lr: 0.00010; 9591/4115 tok/s;   1258 sec
[2021-05-15 17:33:37,204 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 17:33:42,990 INFO] Step 5850/50000; acc:  60.93; ppl:  3.82; xent: 1.34; lr: 0.00010; 9223/3767 tok/s;   1269 sec
[2021-05-15 17:33:53,630 INFO] Step 5900/50000; acc:  61.74; ppl:  3.73; xent: 1.32; lr: 0.00010; 9350/4007 tok/s;   1280 sec
[2021-05-15 17:34:05,357 INFO] Step 5950/50000; acc:  61.07; ppl:  3.81; xent: 1.34; lr: 0.00010; 8821/3711 tok/s;   1292 sec
[2021-05-15 17:34:16,218 INFO] Step 6000/50000; acc:  61.81; ppl:  3.73; xent: 1.32; lr: 0.00010; 9100/3961 tok/s;   1303 sec
[2021-05-15 17:34:26,897 INFO] Step 6050/50000; acc:  61.46; ppl:  3.76; xent: 1.33; lr: 0.00010; 9604/4029 tok/s;   1313 sec
[2021-05-15 17:34:37,176 INFO] Step 6100/50000; acc:  61.89; ppl:  3.71; xent: 1.31; lr: 0.00010; 9862/4131 tok/s;   1324 sec
[2021-05-15 17:34:47,405 INFO] Step 6150/50000; acc:  61.63; ppl:  3.74; xent: 1.32; lr: 0.00010; 9940/4127 tok/s;   1334 sec
[2021-05-15 17:34:59,063 INFO] Step 6200/50000; acc:  61.36; ppl:  3.74; xent: 1.32; lr: 0.00010; 8926/3648 tok/s;   1345 sec
[2021-05-15 17:35:09,590 INFO] Step 6250/50000; acc:  61.99; ppl:  3.70; xent: 1.31; lr: 0.00010; 9375/4059 tok/s;   1356 sec
[2021-05-15 17:35:20,473 INFO] Step 6300/50000; acc:  61.87; ppl:  3.70; xent: 1.31; lr: 0.00010; 9633/3897 tok/s;   1367 sec
[2021-05-15 17:35:30,893 INFO] Step 6350/50000; acc:  61.99; ppl:  3.74; xent: 1.32; lr: 0.00010; 9604/4094 tok/s;   1377 sec
[2021-05-15 17:35:42,124 INFO] Step 6400/50000; acc:  61.82; ppl:  3.71; xent: 1.31; lr: 0.00010; 9196/3849 tok/s;   1389 sec
[2021-05-15 17:35:52,792 INFO] Step 6450/50000; acc:  62.48; ppl:  3.62; xent: 1.29; lr: 0.00010; 9613/3913 tok/s;   1399 sec
[2021-05-15 17:36:03,104 INFO] Step 6500/50000; acc:  62.41; ppl:  3.66; xent: 1.30; lr: 0.00010; 9568/4184 tok/s;   1409 sec
[2021-05-15 17:36:13,879 INFO] Step 6550/50000; acc:  62.61; ppl:  3.63; xent: 1.29; lr: 0.00010; 9612/4004 tok/s;   1420 sec
[2021-05-15 17:36:16,180 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 17:36:24,363 INFO] Step 6600/50000; acc:  62.21; ppl:  3.66; xent: 1.30; lr: 0.00010; 9509/4037 tok/s;   1431 sec
[2021-05-15 17:36:35,412 INFO] Step 6650/50000; acc:  61.85; ppl:  3.73; xent: 1.32; lr: 0.00010; 9470/3859 tok/s;   1442 sec
[2021-05-15 17:36:46,485 INFO] Step 6700/50000; acc:  62.43; ppl:  3.63; xent: 1.29; lr: 0.00010; 8870/3964 tok/s;   1453 sec
[2021-05-15 17:36:57,384 INFO] Step 6750/50000; acc:  62.29; ppl:  3.66; xent: 1.30; lr: 0.00010; 9551/3922 tok/s;   1464 sec
[2021-05-15 17:37:08,013 INFO] Step 6800/50000; acc:  62.86; ppl:  3.64; xent: 1.29; lr: 0.00010; 9295/4023 tok/s;   1474 sec
[2021-05-15 17:37:18,418 INFO] Step 6850/50000; acc:  62.88; ppl:  3.59; xent: 1.28; lr: 0.00010; 9862/4094 tok/s;   1485 sec
[2021-05-15 17:37:29,521 INFO] Step 6900/50000; acc:  62.31; ppl:  3.64; xent: 1.29; lr: 0.00010; 9350/3759 tok/s;   1496 sec
[2021-05-15 17:37:40,123 INFO] Step 6950/50000; acc:  62.84; ppl:  3.59; xent: 1.28; lr: 0.00010; 9480/4095 tok/s;   1507 sec
[2021-05-15 17:37:51,273 INFO] Step 7000/50000; acc:  62.48; ppl:  3.64; xent: 1.29; lr: 0.00010; 9263/3810 tok/s;   1518 sec
[2021-05-15 17:38:01,430 INFO] Step 7050/50000; acc:  62.83; ppl:  3.58; xent: 1.28; lr: 0.00010; 9735/4183 tok/s;   1528 sec
[2021-05-15 17:38:12,681 INFO] Step 7100/50000; acc:  62.36; ppl:  3.69; xent: 1.31; lr: 0.00010; 9271/3848 tok/s;   1539 sec
[2021-05-15 17:38:23,599 INFO] Step 7150/50000; acc:  63.33; ppl:  3.51; xent: 1.26; lr: 0.00010; 9234/3869 tok/s;   1550 sec
[2021-05-15 17:38:33,790 INFO] Step 7200/50000; acc:  62.98; ppl:  3.58; xent: 1.28; lr: 0.00010; 10106/4150 tok/s;   1560 sec
[2021-05-15 17:38:44,516 INFO] Step 7250/50000; acc:  63.23; ppl:  3.55; xent: 1.27; lr: 0.00010; 9479/3978 tok/s;   1571 sec
[2021-05-15 17:38:47,706 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 17:38:55,380 INFO] Step 7300/50000; acc:  63.23; ppl:  3.55; xent: 1.27; lr: 0.00010; 9202/3991 tok/s;   1582 sec
[2021-05-15 17:39:06,195 INFO] Step 7350/50000; acc:  63.21; ppl:  3.56; xent: 1.27; lr: 0.00010; 9578/3895 tok/s;   1593 sec
[2021-05-15 17:39:16,901 INFO] Step 7400/50000; acc:  62.96; ppl:  3.57; xent: 1.27; lr: 0.00010; 9213/4007 tok/s;   1603 sec
[2021-05-15 17:39:28,263 INFO] Step 7450/50000; acc:  63.05; ppl:  3.57; xent: 1.27; lr: 0.00010; 9186/3873 tok/s;   1615 sec
[2021-05-15 17:39:38,534 INFO] Step 7500/50000; acc:  63.43; ppl:  3.57; xent: 1.27; lr: 0.00010; 9612/4182 tok/s;   1625 sec
[2021-05-15 17:39:48,895 INFO] Step 7550/50000; acc:  63.39; ppl:  3.54; xent: 1.26; lr: 0.00010; 9993/4075 tok/s;   1635 sec
[2021-05-15 17:39:59,182 INFO] Step 7600/50000; acc:  63.50; ppl:  3.51; xent: 1.25; lr: 0.00010; 9719/4133 tok/s;   1646 sec
[2021-05-15 17:40:10,760 INFO] Step 7650/50000; acc:  63.17; ppl:  3.54; xent: 1.26; lr: 0.00010; 9028/3675 tok/s;   1657 sec
[2021-05-15 17:40:21,373 INFO] Step 7700/50000; acc:  63.44; ppl:  3.52; xent: 1.26; lr: 0.00010; 9601/4032 tok/s;   1668 sec
[2021-05-15 17:40:32,117 INFO] Step 7750/50000; acc:  63.43; ppl:  3.49; xent: 1.25; lr: 0.00010; 9340/3952 tok/s;   1679 sec
[2021-05-15 17:40:43,006 INFO] Step 7800/50000; acc:  63.60; ppl:  3.54; xent: 1.26; lr: 0.00010; 9470/3960 tok/s;   1689 sec
[2021-05-15 17:40:53,622 INFO] Step 7850/50000; acc:  63.65; ppl:  3.50; xent: 1.25; lr: 0.00010; 9307/4003 tok/s;   1700 sec
[2021-05-15 17:41:04,935 INFO] Step 7900/50000; acc:  63.76; ppl:  3.47; xent: 1.24; lr: 0.00010; 9329/3747 tok/s;   1711 sec
[2021-05-15 17:41:15,069 INFO] Step 7950/50000; acc:  64.07; ppl:  3.44; xent: 1.24; lr: 0.00010; 9794/4154 tok/s;   1721 sec
[2021-05-15 17:41:25,986 INFO] Step 8000/50000; acc:  63.80; ppl:  3.49; xent: 1.25; lr: 0.00010; 9455/3954 tok/s;   1732 sec
[2021-05-15 17:41:25,996 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 17:41:36,992 INFO] Step 8050/50000; acc:  63.79; ppl:  3.47; xent: 1.24; lr: 0.00010; 9285/3919 tok/s;   1743 sec
[2021-05-15 17:41:47,362 INFO] Step 8100/50000; acc:  64.04; ppl:  3.46; xent: 1.24; lr: 0.00010; 9621/4069 tok/s;   1754 sec
[2021-05-15 17:41:59,214 INFO] Step 8150/50000; acc:  63.41; ppl:  3.52; xent: 1.26; lr: 0.00010; 8636/3693 tok/s;   1766 sec
[2021-05-15 17:42:09,646 INFO] Step 8200/50000; acc:  64.10; ppl:  3.42; xent: 1.23; lr: 0.00010; 9530/4100 tok/s;   1776 sec
[2021-05-15 17:42:20,554 INFO] Step 8250/50000; acc:  63.80; ppl:  3.51; xent: 1.25; lr: 0.00010; 9598/3923 tok/s;   1787 sec
[2021-05-15 17:42:30,719 INFO] Step 8300/50000; acc:  64.29; ppl:  3.41; xent: 1.23; lr: 0.00010; 9665/4180 tok/s;   1797 sec
[2021-05-15 17:42:41,790 INFO] Step 8350/50000; acc:  63.62; ppl:  3.50; xent: 1.25; lr: 0.00010; 9520/3817 tok/s;   1808 sec
[2021-05-15 17:42:52,950 INFO] Step 8400/50000; acc:  64.05; ppl:  3.41; xent: 1.23; lr: 0.00010; 8974/3867 tok/s;   1819 sec
[2021-05-15 17:43:03,348 INFO] Step 8450/50000; acc:  64.18; ppl:  3.43; xent: 1.23; lr: 0.00010; 9847/4059 tok/s;   1830 sec
[2021-05-15 17:43:14,270 INFO] Step 8500/50000; acc:  63.89; ppl:  3.45; xent: 1.24; lr: 0.00010; 9386/3943 tok/s;   1841 sec
[2021-05-15 17:43:25,144 INFO] Step 8550/50000; acc:  64.06; ppl:  3.46; xent: 1.24; lr: 0.00010; 9222/3928 tok/s;   1852 sec
[2021-05-15 17:43:36,368 INFO] Step 8600/50000; acc:  64.41; ppl:  3.40; xent: 1.22; lr: 0.00010; 9231/3801 tok/s;   1863 sec
[2021-05-15 17:43:46,229 INFO] Step 8650/50000; acc:  64.61; ppl:  3.38; xent: 1.22; lr: 0.00010; 9986/4281 tok/s;   1873 sec
[2021-05-15 17:43:56,797 INFO] Step 8700/50000; acc:  64.37; ppl:  3.40; xent: 1.22; lr: 0.00010; 9854/4020 tok/s;   1883 sec
[2021-05-15 17:44:04,623 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 17:44:07,685 INFO] Step 8750/50000; acc:  64.59; ppl:  3.37; xent: 1.22; lr: 0.00010; 9237/3948 tok/s;   1894 sec
[2021-05-15 17:44:18,452 INFO] Step 8800/50000; acc:  64.39; ppl:  3.42; xent: 1.23; lr: 0.00010; 9577/3971 tok/s;   1905 sec
[2021-05-15 17:44:29,861 INFO] Step 8850/50000; acc:  63.95; ppl:  3.44; xent: 1.24; lr: 0.00010; 8948/3781 tok/s;   1916 sec
[2021-05-15 17:44:40,444 INFO] Step 8900/50000; acc:  64.68; ppl:  3.36; xent: 1.21; lr: 0.00010; 9325/4069 tok/s;   1927 sec
[2021-05-15 17:44:51,076 INFO] Step 8950/50000; acc:  63.97; ppl:  3.43; xent: 1.23; lr: 0.00010; 9720/4068 tok/s;   1937 sec
[2021-05-15 17:45:01,345 INFO] Step 9000/50000; acc:  64.92; ppl:  3.37; xent: 1.21; lr: 0.00010; 9641/4114 tok/s;   1948 sec
[2021-05-15 17:45:12,316 INFO] Step 9050/50000; acc:  64.00; ppl:  3.46; xent: 1.24; lr: 0.00010; 9601/3917 tok/s;   1959 sec
[2021-05-15 17:45:23,153 INFO] Step 9100/50000; acc:  64.62; ppl:  3.32; xent: 1.20; lr: 0.00010; 9234/3861 tok/s;   1970 sec
[2021-05-15 17:45:34,530 INFO] Step 9150/50000; acc:  64.16; ppl:  3.42; xent: 1.23; lr: 0.00010; 9195/3824 tok/s;   1981 sec
[2021-05-15 17:45:44,618 INFO] Step 9200/50000; acc:  65.04; ppl:  3.32; xent: 1.20; lr: 0.00010; 9795/4135 tok/s;   1991 sec
[2021-05-15 17:45:55,781 INFO] Step 9250/50000; acc:  64.77; ppl:  3.36; xent: 1.21; lr: 0.00010; 9239/3893 tok/s;   2002 sec
[2021-05-15 17:46:06,992 INFO] Step 9300/50000; acc:  64.63; ppl:  3.41; xent: 1.23; lr: 0.00010; 9094/3842 tok/s;   2013 sec
[2021-05-15 17:46:17,641 INFO] Step 9350/50000; acc:  64.92; ppl:  3.31; xent: 1.20; lr: 0.00010; 9467/3943 tok/s;   2024 sec
[2021-05-15 17:46:28,024 INFO] Step 9400/50000; acc:  64.69; ppl:  3.37; xent: 1.21; lr: 0.00010; 9905/4142 tok/s;   2034 sec
[2021-05-15 17:46:38,160 INFO] Step 9450/50000; acc:  65.47; ppl:  3.25; xent: 1.18; lr: 0.00010; 9685/4189 tok/s;   2045 sec
[2021-05-15 17:46:43,359 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 17:46:49,455 INFO] Step 9500/50000; acc:  64.54; ppl:  3.39; xent: 1.22; lr: 0.00010; 9292/3798 tok/s;   2056 sec
[2021-05-15 17:47:00,152 INFO] Step 9550/50000; acc:  65.21; ppl:  3.33; xent: 1.20; lr: 0.00010; 9421/3950 tok/s;   2067 sec
[2021-05-15 17:47:11,885 INFO] Step 9600/50000; acc:  64.39; ppl:  3.36; xent: 1.21; lr: 0.00010; 8744/3724 tok/s;   2078 sec
[2021-05-15 17:47:22,476 INFO] Step 9650/50000; acc:  65.00; ppl:  3.33; xent: 1.20; lr: 0.00010; 9586/4032 tok/s;   2089 sec
[2021-05-15 17:47:33,144 INFO] Step 9700/50000; acc:  64.87; ppl:  3.34; xent: 1.21; lr: 0.00010; 9289/4061 tok/s;   2100 sec
[2021-05-15 17:47:43,529 INFO] Step 9750/50000; acc:  65.32; ppl:  3.30; xent: 1.19; lr: 0.00010; 9909/4065 tok/s;   2110 sec
[2021-05-15 17:47:53,665 INFO] Step 9800/50000; acc:  64.79; ppl:  3.31; xent: 1.20; lr: 0.00010; 9951/4143 tok/s;   2120 sec
[2021-05-15 17:48:05,638 INFO] Step 9850/50000; acc:  64.46; ppl:  3.35; xent: 1.21; lr: 0.00010; 8816/3607 tok/s;   2132 sec
[2021-05-15 17:48:16,168 INFO] Step 9900/50000; acc:  65.05; ppl:  3.30; xent: 1.19; lr: 0.00010; 9381/4047 tok/s;   2143 sec
[2021-05-15 17:48:26,880 INFO] Step 9950/50000; acc:  65.10; ppl:  3.30; xent: 1.19; lr: 0.00010; 9744/3963 tok/s;   2153 sec
[2021-05-15 17:48:37,282 INFO] Step 10000/50000; acc:  65.25; ppl:  3.34; xent: 1.21; lr: 0.00010; 9539/4077 tok/s;   2164 sec
[2021-05-15 17:48:37,286 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/strict/valid.txt, align=None)...
[2021-05-15 17:48:46,046 INFO] Validation perplexity: 3.23319
[2021-05-15 17:48:46,047 INFO] Validation accuracy: 66.0682
[2021-05-15 17:48:46,049 INFO] Saving checkpoint ../models/group2_params/strict_ops/model_step_10000.pt
[2021-05-15 17:48:57,892 INFO] Step 10050/50000; acc:  65.20; ppl:  3.28; xent: 1.19; lr: 0.00010; 4991/2109 tok/s;   2184 sec
[2021-05-15 17:49:08,466 INFO] Step 10100/50000; acc:  65.21; ppl:  3.25; xent: 1.18; lr: 0.00010; 9684/4003 tok/s;   2195 sec
[2021-05-15 17:49:18,722 INFO] Step 10150/50000; acc:  65.58; ppl:  3.26; xent: 1.18; lr: 0.00010; 9725/4130 tok/s;   2205 sec
[2021-05-15 17:49:29,508 INFO] Step 10200/50000; acc:  65.28; ppl:  3.28; xent: 1.19; lr: 0.00010; 9580/3994 tok/s;   2216 sec
[2021-05-15 17:49:31,392 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 17:49:39,942 INFO] Step 10250/50000; acc:  65.27; ppl:  3.25; xent: 1.18; lr: 0.00010; 9484/4070 tok/s;   2226 sec
[2021-05-15 17:49:51,060 INFO] Step 10300/50000; acc:  64.76; ppl:  3.34; xent: 1.21; lr: 0.00010; 9390/3828 tok/s;   2237 sec
[2021-05-15 17:50:02,002 INFO] Step 10350/50000; acc:  65.37; ppl:  3.26; xent: 1.18; lr: 0.00010; 9109/4011 tok/s;   2248 sec
[2021-05-15 17:50:12,820 INFO] Step 10400/50000; acc:  65.13; ppl:  3.30; xent: 1.19; lr: 0.00010; 9514/3974 tok/s;   2259 sec
[2021-05-15 17:50:23,603 INFO] Step 10450/50000; acc:  65.37; ppl:  3.29; xent: 1.19; lr: 0.00010; 9424/3970 tok/s;   2270 sec
[2021-05-15 17:50:33,721 INFO] Step 10500/50000; acc:  65.64; ppl:  3.21; xent: 1.17; lr: 0.00010; 9823/4175 tok/s;   2280 sec
[2021-05-15 17:50:45,025 INFO] Step 10550/50000; acc:  65.04; ppl:  3.29; xent: 1.19; lr: 0.00010; 9311/3691 tok/s;   2291 sec
[2021-05-15 17:50:55,480 INFO] Step 10600/50000; acc:  65.68; ppl:  3.21; xent: 1.17; lr: 0.00010; 9509/4151 tok/s;   2302 sec
[2021-05-15 17:51:06,800 INFO] Step 10650/50000; acc:  65.23; ppl:  3.32; xent: 1.20; lr: 0.00010; 9279/3779 tok/s;   2313 sec
[2021-05-15 17:51:17,070 INFO] Step 10700/50000; acc:  65.63; ppl:  3.24; xent: 1.17; lr: 0.00010; 9641/4161 tok/s;   2323 sec
[2021-05-15 17:51:28,025 INFO] Step 10750/50000; acc:  65.15; ppl:  3.32; xent: 1.20; lr: 0.00010; 9465/3920 tok/s;   2334 sec
[2021-05-15 17:51:38,837 INFO] Step 10800/50000; acc:  66.10; ppl:  3.14; xent: 1.14; lr: 0.00010; 9258/3886 tok/s;   2345 sec
[2021-05-15 17:51:49,087 INFO] Step 10850/50000; acc:  65.73; ppl:  3.23; xent: 1.17; lr: 0.00010; 10004/4132 tok/s;   2355 sec
[2021-05-15 17:51:59,716 INFO] Step 10900/50000; acc:  65.71; ppl:  3.23; xent: 1.17; lr: 0.00010; 9565/4050 tok/s;   2366 sec
[2021-05-15 17:52:02,405 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 17:52:10,581 INFO] Step 10950/50000; acc:  65.74; ppl:  3.23; xent: 1.17; lr: 0.00010; 9278/3958 tok/s;   2377 sec
[2021-05-15 17:52:21,395 INFO] Step 11000/50000; acc:  65.41; ppl:  3.23; xent: 1.17; lr: 0.00010; 9558/3916 tok/s;   2388 sec
[2021-05-15 17:52:32,049 INFO] Step 11050/50000; acc:  65.88; ppl:  3.22; xent: 1.17; lr: 0.00010; 9171/4025 tok/s;   2398 sec
[2021-05-15 17:52:43,481 INFO] Step 11100/50000; acc:  65.63; ppl:  3.24; xent: 1.18; lr: 0.00010; 9132/3826 tok/s;   2410 sec
[2021-05-15 17:52:53,888 INFO] Step 11150/50000; acc:  65.65; ppl:  3.24; xent: 1.17; lr: 0.00010; 9614/4124 tok/s;   2420 sec
[2021-05-15 17:53:04,318 INFO] Step 11200/50000; acc:  65.91; ppl:  3.21; xent: 1.16; lr: 0.00010; 9829/4089 tok/s;   2431 sec
[2021-05-15 17:53:14,651 INFO] Step 11250/50000; acc:  65.50; ppl:  3.23; xent: 1.17; lr: 0.00010; 9954/4082 tok/s;   2441 sec
[2021-05-15 17:53:26,052 INFO] Step 11300/50000; acc:  66.01; ppl:  3.19; xent: 1.16; lr: 0.00010; 8868/3708 tok/s;   2452 sec
[2021-05-15 17:53:36,859 INFO] Step 11350/50000; acc:  65.70; ppl:  3.23; xent: 1.17; lr: 0.00010; 9548/3993 tok/s;   2463 sec
[2021-05-15 17:53:47,374 INFO] Step 11400/50000; acc:  66.32; ppl:  3.16; xent: 1.15; lr: 0.00010; 9465/4011 tok/s;   2474 sec
[2021-05-15 17:53:58,616 INFO] Step 11450/50000; acc:  65.88; ppl:  3.25; xent: 1.18; lr: 0.00010; 9319/3851 tok/s;   2485 sec
[2021-05-15 17:54:09,267 INFO] Step 11500/50000; acc:  65.95; ppl:  3.19; xent: 1.16; lr: 0.00010; 9299/3997 tok/s;   2496 sec
[2021-05-15 17:54:20,547 INFO] Step 11550/50000; acc:  65.78; ppl:  3.19; xent: 1.16; lr: 0.00010; 9298/3805 tok/s;   2507 sec
[2021-05-15 17:54:30,513 INFO] Step 11600/50000; acc:  66.52; ppl:  3.13; xent: 1.14; lr: 0.00010; 9881/4166 tok/s;   2517 sec
[2021-05-15 17:54:40,961 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 17:54:41,361 INFO] Step 11650/50000; acc:  65.88; ppl:  3.20; xent: 1.16; lr: 0.00010; 9486/3981 tok/s;   2528 sec
[2021-05-15 17:54:52,352 INFO] Step 11700/50000; acc:  66.15; ppl:  3.18; xent: 1.16; lr: 0.00010; 9277/3934 tok/s;   2539 sec
[2021-05-15 17:55:03,012 INFO] Step 11750/50000; acc:  65.99; ppl:  3.19; xent: 1.16; lr: 0.00010; 9442/3976 tok/s;   2549 sec
[2021-05-15 17:55:14,934 INFO] Step 11800/50000; acc:  65.70; ppl:  3.22; xent: 1.17; lr: 0.00010; 8570/3671 tok/s;   2561 sec
[2021-05-15 17:55:25,202 INFO] Step 11850/50000; acc:  66.28; ppl:  3.13; xent: 1.14; lr: 0.00010; 9594/4138 tok/s;   2572 sec
[2021-05-15 17:55:36,255 INFO] Step 11900/50000; acc:  65.94; ppl:  3.21; xent: 1.17; lr: 0.00010; 9470/3897 tok/s;   2583 sec
[2021-05-15 17:55:46,424 INFO] Step 11950/50000; acc:  66.31; ppl:  3.16; xent: 1.15; lr: 0.00010; 9819/4194 tok/s;   2593 sec
[2021-05-15 17:55:57,250 INFO] Step 12000/50000; acc:  65.83; ppl:  3.20; xent: 1.16; lr: 0.00010; 9644/3864 tok/s;   2604 sec
[2021-05-15 17:56:08,710 INFO] Step 12050/50000; acc:  65.96; ppl:  3.16; xent: 1.15; lr: 0.00010; 8972/3771 tok/s;   2615 sec
[2021-05-15 17:56:18,930 INFO] Step 12100/50000; acc:  66.33; ppl:  3.12; xent: 1.14; lr: 0.00010; 9672/4141 tok/s;   2625 sec
[2021-05-15 17:56:30,105 INFO] Step 12150/50000; acc:  65.76; ppl:  3.20; xent: 1.16; lr: 0.00010; 9315/3861 tok/s;   2636 sec
[2021-05-15 17:56:40,836 INFO] Step 12200/50000; acc:  66.47; ppl:  3.17; xent: 1.15; lr: 0.00010; 9253/3955 tok/s;   2647 sec
[2021-05-15 17:56:52,236 INFO] Step 12250/50000; acc:  66.26; ppl:  3.14; xent: 1.14; lr: 0.00010; 9246/3750 tok/s;   2659 sec
[2021-05-15 17:57:02,098 INFO] Step 12300/50000; acc:  66.51; ppl:  3.13; xent: 1.14; lr: 0.00010; 9982/4297 tok/s;   2668 sec
[2021-05-15 17:57:12,725 INFO] Step 12350/50000; acc:  66.45; ppl:  3.13; xent: 1.14; lr: 0.00010; 9745/3995 tok/s;   2679 sec
[2021-05-15 17:57:20,019 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 17:57:23,524 INFO] Step 12400/50000; acc:  66.43; ppl:  3.12; xent: 1.14; lr: 0.00010; 9230/4003 tok/s;   2690 sec
[2021-05-15 17:57:34,237 INFO] Step 12450/50000; acc:  66.22; ppl:  3.15; xent: 1.15; lr: 0.00010; 9613/3963 tok/s;   2701 sec
[2021-05-15 17:57:45,693 INFO] Step 12500/50000; acc:  65.75; ppl:  3.18; xent: 1.16; lr: 0.00010; 8888/3774 tok/s;   2712 sec
[2021-05-15 17:57:56,570 INFO] Step 12550/50000; acc:  66.60; ppl:  3.12; xent: 1.14; lr: 0.00010; 9169/3978 tok/s;   2723 sec
[2021-05-15 17:58:07,314 INFO] Step 12600/50000; acc:  66.13; ppl:  3.16; xent: 1.15; lr: 0.00010; 9584/4013 tok/s;   2734 sec
[2021-05-15 17:58:17,552 INFO] Step 12650/50000; acc:  66.81; ppl:  3.10; xent: 1.13; lr: 0.00010; 9595/4100 tok/s;   2744 sec
[2021-05-15 17:58:28,671 INFO] Step 12700/50000; acc:  65.99; ppl:  3.19; xent: 1.16; lr: 0.00010; 9485/3884 tok/s;   2755 sec
[2021-05-15 17:58:39,500 INFO] Step 12750/50000; acc:  66.55; ppl:  3.08; xent: 1.13; lr: 0.00010; 9345/3840 tok/s;   2766 sec
[2021-05-15 17:58:50,767 INFO] Step 12800/50000; acc:  66.22; ppl:  3.15; xent: 1.15; lr: 0.00010; 9196/3871 tok/s;   2777 sec
[2021-05-15 17:59:01,696 INFO] Step 12850/50000; acc:  66.91; ppl:  3.12; xent: 1.14; lr: 0.00010; 9301/3894 tok/s;   2788 sec
[2021-05-15 17:59:12,089 INFO] Step 12900/50000; acc:  66.80; ppl:  3.08; xent: 1.12; lr: 0.00010; 9583/4093 tok/s;   2798 sec
[2021-05-15 17:59:23,391 INFO] Step 12950/50000; acc:  66.23; ppl:  3.16; xent: 1.15; lr: 0.00010; 9170/3805 tok/s;   2810 sec
[2021-05-15 17:59:34,035 INFO] Step 13000/50000; acc:  66.91; ppl:  3.07; xent: 1.12; lr: 0.00010; 9374/3939 tok/s;   2820 sec
[2021-05-15 17:59:44,483 INFO] Step 13050/50000; acc:  66.44; ppl:  3.13; xent: 1.14; lr: 0.00010; 10000/4140 tok/s;   2831 sec
[2021-05-15 17:59:54,555 INFO] Step 13100/50000; acc:  67.13; ppl:  3.03; xent: 1.11; lr: 0.00010; 9768/4194 tok/s;   2841 sec
[2021-05-15 17:59:59,333 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 18:00:05,872 INFO] Step 13150/50000; acc:  66.14; ppl:  3.14; xent: 1.14; lr: 0.00010; 9218/3797 tok/s;   2852 sec
[2021-05-15 18:00:16,712 INFO] Step 13200/50000; acc:  66.91; ppl:  3.07; xent: 1.12; lr: 0.00010; 9215/3903 tok/s;   2863 sec
[2021-05-15 18:00:28,341 INFO] Step 13250/50000; acc:  66.30; ppl:  3.11; xent: 1.13; lr: 0.00010; 8797/3758 tok/s;   2875 sec
[2021-05-15 18:00:39,014 INFO] Step 13300/50000; acc:  66.83; ppl:  3.09; xent: 1.13; lr: 0.00010; 9509/3997 tok/s;   2885 sec
[2021-05-15 18:00:49,731 INFO] Step 13350/50000; acc:  66.53; ppl:  3.11; xent: 1.14; lr: 0.00010; 9326/4056 tok/s;   2896 sec
[2021-05-15 18:01:00,187 INFO] Step 13400/50000; acc:  66.84; ppl:  3.08; xent: 1.12; lr: 0.00010; 9824/4016 tok/s;   2907 sec
[2021-05-15 18:01:10,252 INFO] Step 13450/50000; acc:  66.96; ppl:  3.06; xent: 1.12; lr: 0.00010; 9944/4199 tok/s;   2917 sec
[2021-05-15 18:01:21,959 INFO] Step 13500/50000; acc:  66.58; ppl:  3.12; xent: 1.14; lr: 0.00010; 8992/3661 tok/s;   2928 sec
[2021-05-15 18:01:32,654 INFO] Step 13550/50000; acc:  66.79; ppl:  3.07; xent: 1.12; lr: 0.00010; 9377/4003 tok/s;   2939 sec
[2021-05-15 18:01:43,182 INFO] Step 13600/50000; acc:  66.94; ppl:  3.08; xent: 1.12; lr: 0.00010; 9826/4068 tok/s;   2950 sec
[2021-05-15 18:01:53,809 INFO] Step 13650/50000; acc:  66.60; ppl:  3.13; xent: 1.14; lr: 0.00010; 9590/3973 tok/s;   2960 sec
[2021-05-15 18:02:04,947 INFO] Step 13700/50000; acc:  67.25; ppl:  3.01; xent: 1.10; lr: 0.00010; 8938/3903 tok/s;   2971 sec
[2021-05-15 18:02:15,355 INFO] Step 13750/50000; acc:  67.17; ppl:  3.04; xent: 1.11; lr: 0.00010; 9958/4020 tok/s;   2982 sec
[2021-05-15 18:02:25,543 INFO] Step 13800/50000; acc:  66.81; ppl:  3.06; xent: 1.12; lr: 0.00010; 9724/4158 tok/s;   2992 sec
[2021-05-15 18:02:36,498 INFO] Step 13850/50000; acc:  66.63; ppl:  3.09; xent: 1.13; lr: 0.00010; 9575/3973 tok/s;   3003 sec
[2021-05-15 18:02:37,802 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 18:02:47,028 INFO] Step 13900/50000; acc:  67.07; ppl:  3.04; xent: 1.11; lr: 0.00010; 9412/3997 tok/s;   3013 sec
[2021-05-15 18:02:58,095 INFO] Step 13950/50000; acc:  66.40; ppl:  3.13; xent: 1.14; lr: 0.00010; 9376/3876 tok/s;   3024 sec
[2021-05-15 18:03:09,171 INFO] Step 14000/50000; acc:  67.51; ppl:  3.01; xent: 1.10; lr: 0.00010; 8912/3950 tok/s;   3036 sec
[2021-05-15 18:03:19,804 INFO] Step 14050/50000; acc:  66.69; ppl:  3.09; xent: 1.13; lr: 0.00010; 9665/4027 tok/s;   3046 sec
[2021-05-15 18:03:30,438 INFO] Step 14100/50000; acc:  66.95; ppl:  3.07; xent: 1.12; lr: 0.00010; 9536/4038 tok/s;   3057 sec
[2021-05-15 18:03:40,793 INFO] Step 14150/50000; acc:  67.40; ppl:  3.01; xent: 1.10; lr: 0.00010; 9709/4086 tok/s;   3067 sec
[2021-05-15 18:03:52,086 INFO] Step 14200/50000; acc:  66.73; ppl:  3.07; xent: 1.12; lr: 0.00010; 9291/3709 tok/s;   3078 sec
[2021-05-15 18:04:02,769 INFO] Step 14250/50000; acc:  67.30; ppl:  3.02; xent: 1.11; lr: 0.00010; 9237/4044 tok/s;   3089 sec
[2021-05-15 18:04:14,057 INFO] Step 14300/50000; acc:  66.79; ppl:  3.07; xent: 1.12; lr: 0.00010; 9282/3778 tok/s;   3100 sec
[2021-05-15 18:04:24,532 INFO] Step 14350/50000; acc:  67.21; ppl:  3.05; xent: 1.12; lr: 0.00010; 9587/4109 tok/s;   3111 sec
[2021-05-15 18:04:35,298 INFO] Step 14400/50000; acc:  66.96; ppl:  3.08; xent: 1.13; lr: 0.00010; 9550/3973 tok/s;   3122 sec
[2021-05-15 18:04:46,479 INFO] Step 14450/50000; acc:  67.42; ppl:  2.98; xent: 1.09; lr: 0.00010; 9200/3770 tok/s;   3133 sec
[2021-05-15 18:04:56,487 INFO] Step 14500/50000; acc:  67.64; ppl:  2.99; xent: 1.09; lr: 0.00010; 9882/4239 tok/s;   3143 sec
[2021-05-15 18:05:07,014 INFO] Step 14550/50000; acc:  67.12; ppl:  3.04; xent: 1.11; lr: 0.00010; 9796/4056 tok/s;   3153 sec
[2021-05-15 18:05:09,290 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 18:05:18,017 INFO] Step 14600/50000; acc:  67.33; ppl:  3.01; xent: 1.10; lr: 0.00010; 9080/3916 tok/s;   3164 sec
[2021-05-15 18:05:28,865 INFO] Step 14650/50000; acc:  66.88; ppl:  3.07; xent: 1.12; lr: 0.00010; 9685/3930 tok/s;   3175 sec
[2021-05-15 18:05:39,828 INFO] Step 14700/50000; acc:  67.32; ppl:  3.01; xent: 1.10; lr: 0.00010; 8938/3922 tok/s;   3186 sec
[2021-05-15 18:05:50,928 INFO] Step 14750/50000; acc:  67.03; ppl:  3.04; xent: 1.11; lr: 0.00010; 9361/3925 tok/s;   3197 sec
[2021-05-15 18:06:01,136 INFO] Step 14800/50000; acc:  67.39; ppl:  3.01; xent: 1.10; lr: 0.00010; 9708/4199 tok/s;   3208 sec
[2021-05-15 18:06:11,426 INFO] Step 14850/50000; acc:  67.30; ppl:  3.02; xent: 1.11; lr: 0.00010; 9924/4130 tok/s;   3218 sec
[2021-05-15 18:06:21,889 INFO] Step 14900/50000; acc:  67.23; ppl:  3.05; xent: 1.11; lr: 0.00010; 9839/4018 tok/s;   3228 sec
[2021-05-15 18:06:33,372 INFO] Step 14950/50000; acc:  67.32; ppl:  3.00; xent: 1.10; lr: 0.00010; 8864/3698 tok/s;   3240 sec
[2021-05-15 18:06:44,107 INFO] Step 15000/50000; acc:  67.37; ppl:  3.02; xent: 1.11; lr: 0.00010; 9595/4033 tok/s;   3250 sec
[2021-05-15 18:06:44,110 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/strict/valid.txt, align=None)...
[2021-05-15 18:06:52,870 INFO] Validation perplexity: 3.02465
[2021-05-15 18:06:52,870 INFO] Validation accuracy: 67.4263
[2021-05-15 18:06:52,873 INFO] Saving checkpoint ../models/group2_params/strict_ops/model_step_15000.pt
[2021-05-15 18:07:03,930 INFO] Step 15050/50000; acc:  67.71; ppl:  2.96; xent: 1.09; lr: 0.00010; 4978/2115 tok/s;   3270 sec
[2021-05-15 18:07:14,957 INFO] Step 15100/50000; acc:  67.01; ppl:  3.06; xent: 1.12; lr: 0.00010; 9501/3917 tok/s;   3281 sec
[2021-05-15 18:07:26,065 INFO] Step 15150/50000; acc:  67.51; ppl:  3.00; xent: 1.10; lr: 0.00010; 9039/3851 tok/s;   3292 sec
[2021-05-15 18:07:37,034 INFO] Step 15200/50000; acc:  67.46; ppl:  2.99; xent: 1.10; lr: 0.00010; 9477/3895 tok/s;   3303 sec
[2021-05-15 18:07:47,124 INFO] Step 15250/50000; acc:  67.54; ppl:  2.99; xent: 1.10; lr: 0.00010; 10025/4145 tok/s;   3314 sec
[2021-05-15 18:07:57,109 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 18:07:57,787 INFO] Step 15300/50000; acc:  67.67; ppl:  2.97; xent: 1.09; lr: 0.00010; 9323/4046 tok/s;   3324 sec
[2021-05-15 18:08:08,878 INFO] Step 15350/50000; acc:  67.18; ppl:  3.01; xent: 1.10; lr: 0.00010; 9329/3876 tok/s;   3335 sec
[2021-05-15 18:08:19,300 INFO] Step 15400/50000; acc:  67.54; ppl:  3.00; xent: 1.10; lr: 0.00010; 9578/4058 tok/s;   3346 sec
[2021-05-15 18:08:30,998 INFO] Step 15450/50000; acc:  67.09; ppl:  3.04; xent: 1.11; lr: 0.00010; 8869/3775 tok/s;   3357 sec
[2021-05-15 18:08:41,337 INFO] Step 15500/50000; acc:  67.68; ppl:  2.96; xent: 1.08; lr: 0.00010; 9549/4100 tok/s;   3368 sec
[2021-05-15 18:08:52,224 INFO] Step 15550/50000; acc:  67.28; ppl:  3.03; xent: 1.11; lr: 0.00010; 9552/3941 tok/s;   3379 sec
[2021-05-15 18:09:02,547 INFO] Step 15600/50000; acc:  67.73; ppl:  2.96; xent: 1.09; lr: 0.00010; 9601/4132 tok/s;   3389 sec
[2021-05-15 18:09:13,335 INFO] Step 15650/50000; acc:  67.37; ppl:  3.01; xent: 1.10; lr: 0.00010; 9645/3907 tok/s;   3400 sec
[2021-05-15 18:09:24,708 INFO] Step 15700/50000; acc:  67.50; ppl:  2.99; xent: 1.10; lr: 0.00010; 9043/3777 tok/s;   3411 sec
[2021-05-15 18:09:34,979 INFO] Step 15750/50000; acc:  67.91; ppl:  2.95; xent: 1.08; lr: 0.00010; 9695/4136 tok/s;   3421 sec
[2021-05-15 18:09:46,164 INFO] Step 15800/50000; acc:  67.45; ppl:  3.02; xent: 1.10; lr: 0.00010; 9283/3868 tok/s;   3433 sec
[2021-05-15 18:09:56,800 INFO] Step 15850/50000; acc:  67.97; ppl:  2.98; xent: 1.09; lr: 0.00010; 9252/3974 tok/s;   3443 sec
[2021-05-15 18:10:08,138 INFO] Step 15900/50000; acc:  67.72; ppl:  2.94; xent: 1.08; lr: 0.00010; 9275/3755 tok/s;   3455 sec
[2021-05-15 18:10:18,072 INFO] Step 15950/50000; acc:  67.78; ppl:  2.97; xent: 1.09; lr: 0.00010; 10070/4273 tok/s;   3464 sec
[2021-05-15 18:10:28,617 INFO] Step 16000/50000; acc:  67.78; ppl:  2.96; xent: 1.09; lr: 0.00010; 9730/4037 tok/s;   3475 sec
[2021-05-15 18:10:35,709 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 18:10:39,714 INFO] Step 16050/50000; acc:  67.88; ppl:  2.97; xent: 1.09; lr: 0.00010; 9235/3892 tok/s;   3486 sec
[2021-05-15 18:10:50,179 INFO] Step 16100/50000; acc:  67.87; ppl:  2.95; xent: 1.08; lr: 0.00010; 9522/4058 tok/s;   3497 sec
[2021-05-15 18:11:01,928 INFO] Step 16150/50000; acc:  66.98; ppl:  3.02; xent: 1.11; lr: 0.00010; 8778/3649 tok/s;   3508 sec
[2021-05-15 18:11:12,598 INFO] Step 16200/50000; acc:  68.41; ppl:  2.93; xent: 1.07; lr: 0.00010; 9264/4053 tok/s;   3519 sec
[2021-05-15 18:11:23,469 INFO] Step 16250/50000; acc:  67.40; ppl:  3.00; xent: 1.10; lr: 0.00010; 9626/3984 tok/s;   3530 sec
[2021-05-15 18:11:33,744 INFO] Step 16300/50000; acc:  67.90; ppl:  2.95; xent: 1.08; lr: 0.00010; 9567/4106 tok/s;   3540 sec
[2021-05-15 18:11:44,649 INFO] Step 16350/50000; acc:  67.55; ppl:  2.99; xent: 1.10; lr: 0.00010; 9628/3940 tok/s;   3551 sec
[2021-05-15 18:11:55,736 INFO] Step 16400/50000; acc:  68.07; ppl:  2.90; xent: 1.07; lr: 0.00010; 9059/3780 tok/s;   3562 sec
[2021-05-15 18:12:06,982 INFO] Step 16450/50000; acc:  67.48; ppl:  2.99; xent: 1.09; lr: 0.00010; 9171/3877 tok/s;   3573 sec
[2021-05-15 18:12:17,769 INFO] Step 16500/50000; acc:  68.15; ppl:  2.94; xent: 1.08; lr: 0.00010; 9424/3920 tok/s;   3584 sec
[2021-05-15 18:12:28,142 INFO] Step 16550/50000; acc:  68.10; ppl:  2.93; xent: 1.08; lr: 0.00010; 9689/4106 tok/s;   3595 sec
[2021-05-15 18:12:39,549 INFO] Step 16600/50000; acc:  67.77; ppl:  2.98; xent: 1.09; lr: 0.00010; 9058/3775 tok/s;   3606 sec
[2021-05-15 18:12:50,053 INFO] Step 16650/50000; acc:  68.15; ppl:  2.88; xent: 1.06; lr: 0.00010; 9419/3975 tok/s;   3616 sec
[2021-05-15 18:13:00,488 INFO] Step 16700/50000; acc:  67.85; ppl:  2.97; xent: 1.09; lr: 0.00010; 10001/4147 tok/s;   3627 sec
[2021-05-15 18:13:10,908 INFO] Step 16750/50000; acc:  68.31; ppl:  2.91; xent: 1.07; lr: 0.00010; 9576/4102 tok/s;   3637 sec
[2021-05-15 18:13:15,053 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 18:13:21,799 INFO] Step 16800/50000; acc:  67.65; ppl:  2.97; xent: 1.09; lr: 0.00010; 9505/3940 tok/s;   3648 sec
[2021-05-15 18:13:33,042 INFO] Step 16850/50000; acc:  67.84; ppl:  2.96; xent: 1.08; lr: 0.00010; 9109/3775 tok/s;   3659 sec
[2021-05-15 18:13:44,149 INFO] Step 16900/50000; acc:  68.08; ppl:  2.91; xent: 1.07; lr: 0.00010; 8898/3884 tok/s;   3671 sec
[2021-05-15 18:13:54,967 INFO] Step 16950/50000; acc:  68.12; ppl:  2.93; xent: 1.07; lr: 0.00010; 9519/3939 tok/s;   3681 sec
[2021-05-15 18:14:05,634 INFO] Step 17000/50000; acc:  67.93; ppl:  2.93; xent: 1.08; lr: 0.00010; 9288/4076 tok/s;   3692 sec
[2021-05-15 18:14:16,456 INFO] Step 17050/50000; acc:  67.80; ppl:  2.95; xent: 1.08; lr: 0.00010; 9650/3938 tok/s;   3703 sec
[2021-05-15 18:14:26,225 INFO] Step 17100/50000; acc:  68.22; ppl:  2.89; xent: 1.06; lr: 0.00010; 10272/4264 tok/s;   3713 sec
[2021-05-15 18:14:38,098 INFO] Step 17150/50000; acc:  67.56; ppl:  2.96; xent: 1.09; lr: 0.00010; 8808/3643 tok/s;   3724 sec
[2021-05-15 18:14:48,723 INFO] Step 17200/50000; acc:  68.16; ppl:  2.91; xent: 1.07; lr: 0.00010; 9356/4006 tok/s;   3735 sec
[2021-05-15 18:14:59,331 INFO] Step 17250/50000; acc:  68.01; ppl:  2.92; xent: 1.07; lr: 0.00010; 9722/4052 tok/s;   3746 sec
[2021-05-15 18:15:10,177 INFO] Step 17300/50000; acc:  67.78; ppl:  2.97; xent: 1.09; lr: 0.00010; 9387/3903 tok/s;   3757 sec
[2021-05-15 18:15:21,116 INFO] Step 17350/50000; acc:  68.39; ppl:  2.87; xent: 1.05; lr: 0.00010; 9192/3937 tok/s;   3768 sec
[2021-05-15 18:15:31,569 INFO] Step 17400/50000; acc:  68.36; ppl:  2.89; xent: 1.06; lr: 0.00010; 9877/4009 tok/s;   3778 sec
[2021-05-15 18:15:41,861 INFO] Step 17450/50000; acc:  68.43; ppl:  2.90; xent: 1.06; lr: 0.00010; 9553/4146 tok/s;   3788 sec
[2021-05-15 18:15:52,899 INFO] Step 17500/50000; acc:  68.03; ppl:  2.94; xent: 1.08; lr: 0.00010; 9494/3909 tok/s;   3799 sec
[2021-05-15 18:15:53,823 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 18:16:03,769 INFO] Step 17550/50000; acc:  68.17; ppl:  2.90; xent: 1.06; lr: 0.00010; 9259/3906 tok/s;   3810 sec
[2021-05-15 18:16:14,440 INFO] Step 17600/50000; acc:  67.83; ppl:  2.95; xent: 1.08; lr: 0.00010; 9619/4015 tok/s;   3821 sec
[2021-05-15 18:16:25,589 INFO] Step 17650/50000; acc:  68.25; ppl:  2.90; xent: 1.06; lr: 0.00010; 9112/3915 tok/s;   3832 sec
[2021-05-15 18:16:36,244 INFO] Step 17700/50000; acc:  68.28; ppl:  2.90; xent: 1.07; lr: 0.00010; 9304/4030 tok/s;   3843 sec
[2021-05-15 18:16:46,754 INFO] Step 17750/50000; acc:  68.17; ppl:  2.92; xent: 1.07; lr: 0.00010; 9790/4058 tok/s;   3853 sec
[2021-05-15 18:16:56,969 INFO] Step 17800/50000; acc:  68.56; ppl:  2.88; xent: 1.06; lr: 0.00010; 9759/4131 tok/s;   3863 sec
[2021-05-15 18:17:08,350 INFO] Step 17850/50000; acc:  67.80; ppl:  2.94; xent: 1.08; lr: 0.00010; 9373/3725 tok/s;   3875 sec
[2021-05-15 18:17:18,979 INFO] Step 17900/50000; acc:  68.47; ppl:  2.88; xent: 1.06; lr: 0.00010; 9294/4083 tok/s;   3885 sec
[2021-05-15 18:17:29,982 INFO] Step 17950/50000; acc:  68.21; ppl:  2.91; xent: 1.07; lr: 0.00010; 9453/3851 tok/s;   3896 sec
[2021-05-15 18:17:40,419 INFO] Step 18000/50000; acc:  68.20; ppl:  2.91; xent: 1.07; lr: 0.00010; 9547/4088 tok/s;   3907 sec
[2021-05-15 18:17:51,210 INFO] Step 18050/50000; acc:  68.34; ppl:  2.91; xent: 1.07; lr: 0.00010; 9503/3979 tok/s;   3918 sec
[2021-05-15 18:18:02,444 INFO] Step 18100/50000; acc:  68.61; ppl:  2.86; xent: 1.05; lr: 0.00010; 9157/3781 tok/s;   3929 sec
[2021-05-15 18:18:12,619 INFO] Step 18150/50000; acc:  68.95; ppl:  2.84; xent: 1.05; lr: 0.00010; 9814/4137 tok/s;   3939 sec
[2021-05-15 18:18:23,063 INFO] Step 18200/50000; acc:  68.35; ppl:  2.91; xent: 1.07; lr: 0.00010; 9846/4087 tok/s;   3949 sec
[2021-05-15 18:18:25,074 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 18:18:34,056 INFO] Step 18250/50000; acc:  68.55; ppl:  2.88; xent: 1.06; lr: 0.00010; 9003/3927 tok/s;   3960 sec
[2021-05-15 18:18:44,784 INFO] Step 18300/50000; acc:  68.22; ppl:  2.92; xent: 1.07; lr: 0.00010; 9790/3965 tok/s;   3971 sec
[2021-05-15 18:18:55,982 INFO] Step 18350/50000; acc:  68.47; ppl:  2.87; xent: 1.06; lr: 0.00010; 8859/3853 tok/s;   3982 sec
[2021-05-15 18:19:06,732 INFO] Step 18400/50000; acc:  68.21; ppl:  2.90; xent: 1.07; lr: 0.00010; 9592/4071 tok/s;   3993 sec
[2021-05-15 18:19:17,211 INFO] Step 18450/50000; acc:  68.56; ppl:  2.89; xent: 1.06; lr: 0.00010; 9710/4060 tok/s;   4004 sec
[2021-05-15 18:19:27,289 INFO] Step 18500/50000; acc:  68.73; ppl:  2.85; xent: 1.05; lr: 0.00010; 9805/4199 tok/s;   4014 sec
[2021-05-15 18:19:37,844 INFO] Step 18550/50000; acc:  68.12; ppl:  2.92; xent: 1.07; lr: 0.00010; 9889/3999 tok/s;   4024 sec
[2021-05-15 18:19:49,314 INFO] Step 18600/50000; acc:  68.33; ppl:  2.84; xent: 1.04; lr: 0.00010; 8767/3695 tok/s;   4036 sec
[2021-05-15 18:20:00,382 INFO] Step 18650/50000; acc:  68.28; ppl:  2.90; xent: 1.07; lr: 0.00010; 9471/3917 tok/s;   4047 sec
[2021-05-15 18:20:10,759 INFO] Step 18700/50000; acc:  68.95; ppl:  2.83; xent: 1.04; lr: 0.00010; 9524/4004 tok/s;   4057 sec
[2021-05-15 18:20:21,867 INFO] Step 18750/50000; acc:  68.12; ppl:  2.92; xent: 1.07; lr: 0.00010; 9387/3928 tok/s;   4068 sec
[2021-05-15 18:20:32,750 INFO] Step 18800/50000; acc:  68.80; ppl:  2.83; xent: 1.04; lr: 0.00010; 9155/3928 tok/s;   4079 sec
[2021-05-15 18:20:43,685 INFO] Step 18850/50000; acc:  68.65; ppl:  2.85; xent: 1.05; lr: 0.00010; 9467/3899 tok/s;   4090 sec
[2021-05-15 18:20:53,850 INFO] Step 18900/50000; acc:  68.70; ppl:  2.86; xent: 1.05; lr: 0.00010; 9948/4112 tok/s;   4100 sec
[2021-05-15 18:21:03,426 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 18:21:04,542 INFO] Step 18950/50000; acc:  68.91; ppl:  2.85; xent: 1.05; lr: 0.00010; 9382/4045 tok/s;   4111 sec
[2021-05-15 18:21:15,549 INFO] Step 19000/50000; acc:  68.47; ppl:  2.87; xent: 1.06; lr: 0.00010; 9364/3894 tok/s;   4122 sec
[2021-05-15 18:21:25,951 INFO] Step 19050/50000; acc:  68.46; ppl:  2.87; xent: 1.05; lr: 0.00010; 9514/4073 tok/s;   4132 sec
[2021-05-15 18:21:37,811 INFO] Step 19100/50000; acc:  68.31; ppl:  2.88; xent: 1.06; lr: 0.00010; 8756/3720 tok/s;   4144 sec
[2021-05-15 18:21:48,305 INFO] Step 19150/50000; acc:  68.70; ppl:  2.85; xent: 1.05; lr: 0.00010; 9534/4071 tok/s;   4155 sec
[2021-05-15 18:21:58,870 INFO] Step 19200/50000; acc:  68.54; ppl:  2.88; xent: 1.06; lr: 0.00010; 9748/4048 tok/s;   4165 sec
[2021-05-15 18:22:09,515 INFO] Step 19250/50000; acc:  68.66; ppl:  2.86; xent: 1.05; lr: 0.00010; 9582/4034 tok/s;   4176 sec
[2021-05-15 18:22:19,822 INFO] Step 19300/50000; acc:  68.86; ppl:  2.82; xent: 1.04; lr: 0.00010; 9754/4018 tok/s;   4186 sec
[2021-05-15 18:22:31,510 INFO] Step 19350/50000; acc:  68.43; ppl:  2.87; xent: 1.05; lr: 0.00010; 8927/3709 tok/s;   4198 sec
[2021-05-15 18:22:41,513 INFO] Step 19400/50000; acc:  69.08; ppl:  2.81; xent: 1.03; lr: 0.00010; 9856/4210 tok/s;   4208 sec
[2021-05-15 18:22:52,914 INFO] Step 19450/50000; acc:  68.34; ppl:  2.89; xent: 1.06; lr: 0.00010; 9252/3821 tok/s;   4219 sec
[2021-05-15 18:23:03,556 INFO] Step 19500/50000; acc:  68.98; ppl:  2.83; xent: 1.04; lr: 0.00010; 9271/3983 tok/s;   4230 sec
[2021-05-15 18:23:14,689 INFO] Step 19550/50000; acc:  68.50; ppl:  2.83; xent: 1.04; lr: 0.00010; 9388/3827 tok/s;   4241 sec
[2021-05-15 18:23:24,728 INFO] Step 19600/50000; acc:  68.88; ppl:  2.84; xent: 1.04; lr: 0.00010; 9882/4211 tok/s;   4251 sec
[2021-05-15 18:23:35,441 INFO] Step 19650/50000; acc:  69.06; ppl:  2.82; xent: 1.04; lr: 0.00010; 9545/3988 tok/s;   4262 sec
[2021-05-15 18:23:42,158 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 18:23:46,551 INFO] Step 19700/50000; acc:  68.77; ppl:  2.85; xent: 1.05; lr: 0.00010; 9214/3893 tok/s;   4273 sec
[2021-05-15 18:23:57,092 INFO] Step 19750/50000; acc:  69.07; ppl:  2.83; xent: 1.04; lr: 0.00010; 9563/4042 tok/s;   4283 sec
[2021-05-15 18:24:08,758 INFO] Step 19800/50000; acc:  68.18; ppl:  2.90; xent: 1.06; lr: 0.00010; 8800/3650 tok/s;   4295 sec
[2021-05-15 18:24:19,499 INFO] Step 19850/50000; acc:  69.37; ppl:  2.78; xent: 1.02; lr: 0.00010; 9119/4039 tok/s;   4306 sec
[2021-05-15 18:24:30,387 INFO] Step 19900/50000; acc:  68.57; ppl:  2.86; xent: 1.05; lr: 0.00010; 9608/3964 tok/s;   4317 sec
[2021-05-15 18:24:40,703 INFO] Step 19950/50000; acc:  68.96; ppl:  2.83; xent: 1.04; lr: 0.00010; 9669/4098 tok/s;   4327 sec
[2021-05-15 18:24:51,390 INFO] Step 20000/50000; acc:  68.69; ppl:  2.86; xent: 1.05; lr: 0.00010; 9723/4020 tok/s;   4338 sec
[2021-05-15 18:24:51,393 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/strict/valid.txt, align=None)...
[2021-05-15 18:25:00,167 INFO] Validation perplexity: 2.91986
[2021-05-15 18:25:00,167 INFO] Validation accuracy: 68.3792
[2021-05-15 18:25:00,169 INFO] Saving checkpoint ../models/group2_params/strict_ops/model_step_20000.pt
[2021-05-15 18:25:12,068 INFO] Step 20050/50000; acc:  68.58; ppl:  2.83; xent: 1.04; lr: 0.00010; 4998/2019 tok/s;   4358 sec
[2021-05-15 18:25:22,921 INFO] Step 20100/50000; acc:  68.90; ppl:  2.82; xent: 1.04; lr: 0.00010; 9176/4005 tok/s;   4369 sec
[2021-05-15 18:25:33,833 INFO] Step 20150/50000; acc:  68.99; ppl:  2.83; xent: 1.04; lr: 0.00010; 9460/3861 tok/s;   4380 sec
[2021-05-15 18:25:44,226 INFO] Step 20200/50000; acc:  68.86; ppl:  2.82; xent: 1.04; lr: 0.00010; 9580/4085 tok/s;   4391 sec
[2021-05-15 18:25:55,929 INFO] Step 20250/50000; acc:  68.56; ppl:  2.87; xent: 1.05; lr: 0.00010; 8976/3737 tok/s;   4402 sec
[2021-05-15 18:26:06,407 INFO] Step 20300/50000; acc:  69.26; ppl:  2.74; xent: 1.01; lr: 0.00010; 9455/3951 tok/s;   4413 sec
[2021-05-15 18:26:16,774 INFO] Step 20350/50000; acc:  69.07; ppl:  2.83; xent: 1.04; lr: 0.00010; 9988/4202 tok/s;   4423 sec
[2021-05-15 18:26:27,211 INFO] Step 20400/50000; acc:  69.48; ppl:  2.78; xent: 1.02; lr: 0.00010; 9500/4084 tok/s;   4434 sec
[2021-05-15 18:26:30,884 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 18:26:37,998 INFO] Step 20450/50000; acc:  68.75; ppl:  2.83; xent: 1.04; lr: 0.00010; 9554/3981 tok/s;   4444 sec
[2021-05-15 18:26:48,981 INFO] Step 20500/50000; acc:  68.90; ppl:  2.85; xent: 1.05; lr: 0.00010; 9314/3840 tok/s;   4455 sec
[2021-05-15 18:27:00,329 INFO] Step 20550/50000; acc:  68.96; ppl:  2.79; xent: 1.03; lr: 0.00010; 8794/3844 tok/s;   4467 sec
[2021-05-15 18:27:11,076 INFO] Step 20600/50000; acc:  68.93; ppl:  2.82; xent: 1.03; lr: 0.00010; 9566/3953 tok/s;   4477 sec
[2021-05-15 18:27:21,710 INFO] Step 20650/50000; acc:  69.29; ppl:  2.79; xent: 1.03; lr: 0.00010; 9234/4103 tok/s;   4488 sec
[2021-05-15 18:27:32,448 INFO] Step 20700/50000; acc:  69.01; ppl:  2.82; xent: 1.04; lr: 0.00010; 9720/3951 tok/s;   4499 sec
[2021-05-15 18:27:42,576 INFO] Step 20750/50000; acc:  69.05; ppl:  2.79; xent: 1.03; lr: 0.00010; 10055/4107 tok/s;   4509 sec
[2021-05-15 18:27:53,986 INFO] Step 20800/50000; acc:  68.89; ppl:  2.82; xent: 1.04; lr: 0.00010; 9056/3800 tok/s;   4520 sec
[2021-05-15 18:28:04,692 INFO] Step 20850/50000; acc:  69.00; ppl:  2.82; xent: 1.04; lr: 0.00010; 9566/3980 tok/s;   4531 sec
[2021-05-15 18:28:15,112 INFO] Step 20900/50000; acc:  69.33; ppl:  2.77; xent: 1.02; lr: 0.00010; 9558/4088 tok/s;   4542 sec
[2021-05-15 18:28:26,038 INFO] Step 20950/50000; acc:  68.80; ppl:  2.84; xent: 1.05; lr: 0.00010; 9448/3906 tok/s;   4552 sec
[2021-05-15 18:28:37,093 INFO] Step 21000/50000; acc:  69.75; ppl:  2.74; xent: 1.01; lr: 0.00010; 9024/3876 tok/s;   4563 sec
[2021-05-15 18:28:47,583 INFO] Step 21050/50000; acc:  69.14; ppl:  2.80; xent: 1.03; lr: 0.00010; 10001/4018 tok/s;   4574 sec
[2021-05-15 18:28:58,147 INFO] Step 21100/50000; acc:  69.41; ppl:  2.78; xent: 1.02; lr: 0.00010; 9318/4054 tok/s;   4585 sec
[2021-05-15 18:29:02,578 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 18:29:09,228 INFO] Step 21150/50000; acc:  68.74; ppl:  2.83; xent: 1.04; lr: 0.00010; 9416/3889 tok/s;   4596 sec
[2021-05-15 18:29:20,126 INFO] Step 21200/50000; acc:  69.39; ppl:  2.76; xent: 1.01; lr: 0.00010; 9148/3873 tok/s;   4607 sec
[2021-05-15 18:29:30,685 INFO] Step 21250/50000; acc:  68.98; ppl:  2.83; xent: 1.04; lr: 0.00010; 9676/4078 tok/s;   4617 sec
[2021-05-15 18:29:41,860 INFO] Step 21300/50000; acc:  69.29; ppl:  2.80; xent: 1.03; lr: 0.00010; 9089/3919 tok/s;   4628 sec
[2021-05-15 18:29:52,211 INFO] Step 21350/50000; acc:  69.10; ppl:  2.78; xent: 1.02; lr: 0.00010; 9673/4116 tok/s;   4639 sec
[2021-05-15 18:30:02,619 INFO] Step 21400/50000; acc:  69.31; ppl:  2.80; xent: 1.03; lr: 0.00010; 9860/4113 tok/s;   4649 sec
[2021-05-15 18:30:12,970 INFO] Step 21450/50000; acc:  69.75; ppl:  2.73; xent: 1.00; lr: 0.00010; 9553/4078 tok/s;   4659 sec
[2021-05-15 18:30:24,290 INFO] Step 21500/50000; acc:  68.91; ppl:  2.82; xent: 1.04; lr: 0.00010; 9426/3736 tok/s;   4671 sec
[2021-05-15 18:30:35,050 INFO] Step 21550/50000; acc:  69.22; ppl:  2.78; xent: 1.02; lr: 0.00010; 9302/4003 tok/s;   4681 sec
[2021-05-15 18:30:46,104 INFO] Step 21600/50000; acc:  69.25; ppl:  2.79; xent: 1.03; lr: 0.00010; 9321/3841 tok/s;   4692 sec
[2021-05-15 18:30:56,804 INFO] Step 21650/50000; acc:  69.00; ppl:  2.81; xent: 1.03; lr: 0.00010; 9566/4025 tok/s;   4703 sec
[2021-05-15 18:31:07,417 INFO] Step 21700/50000; acc:  69.73; ppl:  2.74; xent: 1.01; lr: 0.00010; 9357/4009 tok/s;   4714 sec
[2021-05-15 18:31:18,814 INFO] Step 21750/50000; acc:  69.43; ppl:  2.76; xent: 1.01; lr: 0.00010; 9133/3764 tok/s;   4725 sec
[2021-05-15 18:31:28,879 INFO] Step 21800/50000; acc:  69.88; ppl:  2.71; xent: 1.00; lr: 0.00010; 9835/4148 tok/s;   4735 sec
[2021-05-15 18:31:39,675 INFO] Step 21850/50000; acc:  69.09; ppl:  2.82; xent: 1.04; lr: 0.00010; 9673/3979 tok/s;   4746 sec
[2021-05-15 18:31:40,956 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 18:31:50,474 INFO] Step 21900/50000; acc:  69.71; ppl:  2.74; xent: 1.01; lr: 0.00010; 9174/4003 tok/s;   4757 sec
[2021-05-15 18:32:01,282 INFO] Step 21950/50000; acc:  68.86; ppl:  2.81; xent: 1.03; lr: 0.00010; 9667/3926 tok/s;   4768 sec
[2021-05-15 18:32:12,620 INFO] Step 22000/50000; acc:  69.45; ppl:  2.76; xent: 1.01; lr: 0.00010; 8677/3835 tok/s;   4779 sec
[2021-05-15 18:32:23,301 INFO] Step 22050/50000; acc:  69.29; ppl:  2.78; xent: 1.02; lr: 0.00010; 9620/4045 tok/s;   4790 sec
[2021-05-15 18:32:33,494 INFO] Step 22100/50000; acc:  69.32; ppl:  2.77; xent: 1.02; lr: 0.00010; 9982/4179 tok/s;   4800 sec
[2021-05-15 18:32:43,836 INFO] Step 22150/50000; acc:  69.63; ppl:  2.75; xent: 1.01; lr: 0.00010; 9644/4121 tok/s;   4810 sec
[2021-05-15 18:32:54,689 INFO] Step 22200/50000; acc:  69.11; ppl:  2.79; xent: 1.03; lr: 0.00010; 9616/3876 tok/s;   4821 sec
[2021-05-15 18:33:05,979 INFO] Step 22250/50000; acc:  69.73; ppl:  2.71; xent: 1.00; lr: 0.00010; 8816/3794 tok/s;   4832 sec
[2021-05-15 18:33:16,775 INFO] Step 22300/50000; acc:  69.24; ppl:  2.79; xent: 1.03; lr: 0.00010; 9713/3956 tok/s;   4843 sec
[2021-05-15 18:33:27,588 INFO] Step 22350/50000; acc:  69.77; ppl:  2.74; xent: 1.01; lr: 0.00010; 9262/3909 tok/s;   4854 sec
[2021-05-15 18:33:38,334 INFO] Step 22400/50000; acc:  69.01; ppl:  2.80; xent: 1.03; lr: 0.00010; 9595/4007 tok/s;   4865 sec
[2021-05-15 18:33:49,241 INFO] Step 22450/50000; acc:  69.82; ppl:  2.74; xent: 1.01; lr: 0.00010; 9404/3925 tok/s;   4876 sec
[2021-05-15 18:33:59,695 INFO] Step 22500/50000; acc:  69.72; ppl:  2.72; xent: 1.00; lr: 0.00010; 9558/4063 tok/s;   4886 sec
[2021-05-15 18:34:09,987 INFO] Step 22550/50000; acc:  69.71; ppl:  2.75; xent: 1.01; lr: 0.00010; 9971/4080 tok/s;   4896 sec
[2021-05-15 18:34:19,081 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 18:34:20,564 INFO] Step 22600/50000; acc:  69.77; ppl:  2.73; xent: 1.00; lr: 0.00010; 9406/4059 tok/s;   4907 sec
[2021-05-15 18:34:31,633 INFO] Step 22650/50000; acc:  69.36; ppl:  2.78; xent: 1.02; lr: 0.00010; 9467/3895 tok/s;   4918 sec
[2021-05-15 18:34:42,245 INFO] Step 22700/50000; acc:  69.59; ppl:  2.75; xent: 1.01; lr: 0.00010; 9327/3992 tok/s;   4929 sec
[2021-05-15 18:34:53,763 INFO] Step 22750/50000; acc:  69.44; ppl:  2.76; xent: 1.01; lr: 0.00010; 8963/3842 tok/s;   4940 sec
[2021-05-15 18:35:04,313 INFO] Step 22800/50000; acc:  69.58; ppl:  2.74; xent: 1.01; lr: 0.00010; 9413/4033 tok/s;   4951 sec
[2021-05-15 18:35:14,791 INFO] Step 22850/50000; acc:  69.70; ppl:  2.74; xent: 1.01; lr: 0.00010; 9787/4079 tok/s;   4961 sec
[2021-05-15 18:35:25,577 INFO] Step 22900/50000; acc:  69.78; ppl:  2.75; xent: 1.01; lr: 0.00010; 9463/3983 tok/s;   4972 sec
[2021-05-15 18:35:35,988 INFO] Step 22950/50000; acc:  69.74; ppl:  2.72; xent: 1.00; lr: 0.00010; 9751/3976 tok/s;   4982 sec
[2021-05-15 18:35:47,632 INFO] Step 23000/50000; acc:  69.49; ppl:  2.76; xent: 1.01; lr: 0.00010; 8918/3737 tok/s;   4994 sec
[2021-05-15 18:35:57,684 INFO] Step 23050/50000; acc:  69.99; ppl:  2.70; xent: 0.99; lr: 0.00010; 9744/4159 tok/s;   5004 sec
[2021-05-15 18:36:09,059 INFO] Step 23100/50000; acc:  69.49; ppl:  2.78; xent: 1.02; lr: 0.00010; 9268/3831 tok/s;   5015 sec
[2021-05-15 18:36:20,060 INFO] Step 23150/50000; acc:  69.64; ppl:  2.74; xent: 1.01; lr: 0.00010; 9094/3879 tok/s;   5026 sec
[2021-05-15 18:36:31,040 INFO] Step 23200/50000; acc:  69.72; ppl:  2.71; xent: 1.00; lr: 0.00010; 9438/3867 tok/s;   5037 sec
[2021-05-15 18:36:41,296 INFO] Step 23250/50000; acc:  69.55; ppl:  2.76; xent: 1.02; lr: 0.00010; 9930/4118 tok/s;   5048 sec
[2021-05-15 18:36:51,717 INFO] Step 23300/50000; acc:  70.16; ppl:  2.67; xent: 0.98; lr: 0.00010; 9481/4106 tok/s;   5058 sec
[2021-05-15 18:36:57,996 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 18:37:02,655 INFO] Step 23350/50000; acc:  69.58; ppl:  2.75; xent: 1.01; lr: 0.00010; 9490/3918 tok/s;   5069 sec
[2021-05-15 18:37:13,442 INFO] Step 23400/50000; acc:  70.03; ppl:  2.71; xent: 1.00; lr: 0.00010; 9269/3945 tok/s;   5080 sec
[2021-05-15 18:37:25,123 INFO] Step 23450/50000; acc:  68.81; ppl:  2.78; xent: 1.02; lr: 0.00010; 8923/3713 tok/s;   5092 sec
[2021-05-15 18:37:35,900 INFO] Step 23500/50000; acc:  70.28; ppl:  2.68; xent: 0.99; lr: 0.00010; 9112/4027 tok/s;   5102 sec
[2021-05-15 18:37:46,564 INFO] Step 23550/50000; acc:  69.48; ppl:  2.76; xent: 1.01; lr: 0.00010; 9752/4031 tok/s;   5113 sec
[2021-05-15 18:37:56,732 INFO] Step 23600/50000; acc:  70.33; ppl:  2.70; xent: 0.99; lr: 0.00010; 9713/4162 tok/s;   5123 sec
[2021-05-15 18:38:07,360 INFO] Step 23650/50000; acc:  69.51; ppl:  2.74; xent: 1.01; lr: 0.00010; 9774/4022 tok/s;   5134 sec
[2021-05-15 18:38:18,675 INFO] Step 23700/50000; acc:  69.90; ppl:  2.71; xent: 1.00; lr: 0.00010; 9113/3675 tok/s;   5145 sec
[2021-05-15 18:38:29,559 INFO] Step 23750/50000; acc:  69.78; ppl:  2.71; xent: 1.00; lr: 0.00010; 9228/3997 tok/s;   5156 sec
[2021-05-15 18:38:40,361 INFO] Step 23800/50000; acc:  69.84; ppl:  2.73; xent: 1.00; lr: 0.00010; 9545/3883 tok/s;   5167 sec
[2021-05-15 18:38:50,614 INFO] Step 23850/50000; acc:  69.86; ppl:  2.70; xent: 0.99; lr: 0.00010; 9619/4139 tok/s;   5177 sec
[2021-05-15 18:39:02,391 INFO] Step 23900/50000; acc:  69.56; ppl:  2.76; xent: 1.01; lr: 0.00010; 8900/3723 tok/s;   5189 sec
[2021-05-15 18:39:12,915 INFO] Step 23950/50000; acc:  70.20; ppl:  2.65; xent: 0.98; lr: 0.00010; 9559/3979 tok/s;   5199 sec
[2021-05-15 18:39:23,212 INFO] Step 24000/50000; acc:  70.10; ppl:  2.73; xent: 1.00; lr: 0.00010; 9958/4183 tok/s;   5210 sec
[2021-05-15 18:39:33,827 INFO] Step 24050/50000; acc:  70.09; ppl:  2.70; xent: 0.99; lr: 0.00010; 9612/4019 tok/s;   5220 sec
[2021-05-15 18:39:37,027 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 18:39:44,463 INFO] Step 24100/50000; acc:  69.90; ppl:  2.70; xent: 0.99; lr: 0.00010; 9370/4030 tok/s;   5231 sec
[2021-05-15 18:39:55,479 INFO] Step 24150/50000; acc:  69.46; ppl:  2.76; xent: 1.01; lr: 0.00010; 9399/3851 tok/s;   5242 sec
[2021-05-15 18:40:06,637 INFO] Step 24200/50000; acc:  69.87; ppl:  2.71; xent: 1.00; lr: 0.00010; 8865/3913 tok/s;   5253 sec
[2021-05-15 18:40:17,476 INFO] Step 24250/50000; acc:  69.59; ppl:  2.72; xent: 1.00; lr: 0.00010; 9634/3957 tok/s;   5264 sec
[2021-05-15 18:40:28,061 INFO] Step 24300/50000; acc:  70.27; ppl:  2.68; xent: 0.99; lr: 0.00010; 9299/4064 tok/s;   5274 sec
[2021-05-15 18:40:38,689 INFO] Step 24350/50000; acc:  70.08; ppl:  2.70; xent: 0.99; lr: 0.00010; 9764/4001 tok/s;   5285 sec
[2021-05-15 18:40:49,220 INFO] Step 24400/50000; acc:  70.07; ppl:  2.68; xent: 0.98; lr: 0.00010; 9607/3951 tok/s;   5296 sec
[2021-05-15 18:41:00,566 INFO] Step 24450/50000; acc:  69.94; ppl:  2.71; xent: 1.00; lr: 0.00010; 9073/3834 tok/s;   5307 sec
[2021-05-15 18:41:11,492 INFO] Step 24500/50000; acc:  70.03; ppl:  2.70; xent: 0.99; lr: 0.00010; 9360/3871 tok/s;   5318 sec
[2021-05-15 18:41:21,951 INFO] Step 24550/50000; acc:  69.91; ppl:  2.69; xent: 0.99; lr: 0.00010; 9614/4107 tok/s;   5328 sec
[2021-05-15 18:41:32,927 INFO] Step 24600/50000; acc:  70.03; ppl:  2.73; xent: 1.00; lr: 0.00010; 9370/3880 tok/s;   5339 sec
[2021-05-15 18:41:43,812 INFO] Step 24650/50000; acc:  70.46; ppl:  2.64; xent: 0.97; lr: 0.00010; 9094/3909 tok/s;   5350 sec
[2021-05-15 18:41:54,233 INFO] Step 24700/50000; acc:  70.14; ppl:  2.70; xent: 0.99; lr: 0.00010; 10059/4054 tok/s;   5361 sec
[2021-05-15 18:42:04,847 INFO] Step 24750/50000; acc:  70.22; ppl:  2.68; xent: 0.99; lr: 0.00010; 9399/4039 tok/s;   5371 sec
[2021-05-15 18:42:08,777 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 18:42:15,893 INFO] Step 24800/50000; acc:  69.78; ppl:  2.71; xent: 1.00; lr: 0.00010; 9371/3921 tok/s;   5382 sec
[2021-05-15 18:42:26,671 INFO] Step 24850/50000; acc:  70.16; ppl:  2.68; xent: 0.99; lr: 0.00010; 9491/3900 tok/s;   5393 sec
[2021-05-15 18:42:37,564 INFO] Step 24900/50000; acc:  70.22; ppl:  2.69; xent: 0.99; lr: 0.00010; 9079/3964 tok/s;   5404 sec
[2021-05-15 18:42:48,845 INFO] Step 24950/50000; acc:  70.22; ppl:  2.70; xent: 0.99; lr: 0.00010; 9117/3871 tok/s;   5415 sec
[2021-05-15 18:42:59,078 INFO] Step 25000/50000; acc:  69.77; ppl:  2.69; xent: 0.99; lr: 0.00010; 9709/4182 tok/s;   5425 sec
[2021-05-15 18:42:59,080 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/strict/valid.txt, align=None)...
[2021-05-15 18:43:07,856 INFO] Validation perplexity: 2.87064
[2021-05-15 18:43:07,856 INFO] Validation accuracy: 68.8473
[2021-05-15 18:43:07,858 INFO] Saving checkpoint ../models/group2_params/strict_ops/model_step_25000.pt
[2021-05-15 18:43:18,977 INFO] Step 25050/50000; acc:  70.05; ppl:  2.71; xent: 1.00; lr: 0.00010; 5237/2131 tok/s;   5445 sec
[2021-05-15 18:43:29,306 INFO] Step 25100/50000; acc:  70.68; ppl:  2.63; xent: 0.97; lr: 0.00010; 9596/4102 tok/s;   5456 sec
[2021-05-15 18:43:40,989 INFO] Step 25150/50000; acc:  69.86; ppl:  2.72; xent: 1.00; lr: 0.00010; 9068/3646 tok/s;   5467 sec
[2021-05-15 18:43:51,541 INFO] Step 25200/50000; acc:  70.35; ppl:  2.66; xent: 0.98; lr: 0.00010; 9420/4065 tok/s;   5478 sec
[2021-05-15 18:44:02,474 INFO] Step 25250/50000; acc:  70.23; ppl:  2.69; xent: 0.99; lr: 0.00010; 9403/3884 tok/s;   5489 sec
[2021-05-15 18:44:13,236 INFO] Step 25300/50000; acc:  70.07; ppl:  2.71; xent: 1.00; lr: 0.00010; 9485/4023 tok/s;   5500 sec
[2021-05-15 18:44:24,000 INFO] Step 25350/50000; acc:  70.35; ppl:  2.67; xent: 0.98; lr: 0.00010; 9323/3963 tok/s;   5510 sec
[2021-05-15 18:44:35,212 INFO] Step 25400/50000; acc:  70.31; ppl:  2.66; xent: 0.98; lr: 0.00010; 9261/3780 tok/s;   5522 sec
[2021-05-15 18:44:45,233 INFO] Step 25450/50000; acc:  70.97; ppl:  2.61; xent: 0.96; lr: 0.00010; 9775/4166 tok/s;   5532 sec
[2021-05-15 18:44:56,244 INFO] Step 25500/50000; acc:  69.96; ppl:  2.71; xent: 1.00; lr: 0.00010; 9511/3899 tok/s;   5543 sec
[2021-05-15 18:44:57,124 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 18:45:06,952 INFO] Step 25550/50000; acc:  70.19; ppl:  2.65; xent: 0.97; lr: 0.00010; 9360/4079 tok/s;   5553 sec
[2021-05-15 18:45:17,611 INFO] Step 25600/50000; acc:  69.95; ppl:  2.71; xent: 1.00; lr: 0.00010; 9714/3942 tok/s;   5564 sec
[2021-05-15 18:45:29,151 INFO] Step 25650/50000; acc:  70.23; ppl:  2.69; xent: 0.99; lr: 0.00010; 8758/3813 tok/s;   5576 sec
[2021-05-15 18:45:39,620 INFO] Step 25700/50000; acc:  70.33; ppl:  2.65; xent: 0.97; lr: 0.00010; 9484/4061 tok/s;   5586 sec
[2021-05-15 18:45:50,247 INFO] Step 25750/50000; acc:  70.06; ppl:  2.69; xent: 0.99; lr: 0.00010; 9718/4027 tok/s;   5597 sec
[2021-05-15 18:46:00,612 INFO] Step 25800/50000; acc:  70.90; ppl:  2.63; xent: 0.97; lr: 0.00010; 9530/4109 tok/s;   5607 sec
[2021-05-15 18:46:11,627 INFO] Step 25850/50000; acc:  69.77; ppl:  2.72; xent: 1.00; lr: 0.00010; 9631/3840 tok/s;   5618 sec
[2021-05-15 18:46:22,699 INFO] Step 25900/50000; acc:  70.43; ppl:  2.62; xent: 0.96; lr: 0.00010; 9000/3857 tok/s;   5629 sec
[2021-05-15 18:46:33,435 INFO] Step 25950/50000; acc:  70.36; ppl:  2.67; xent: 0.98; lr: 0.00010; 9696/4005 tok/s;   5640 sec
[2021-05-15 18:46:44,011 INFO] Step 26000/50000; acc:  70.67; ppl:  2.64; xent: 0.97; lr: 0.00010; 9416/3987 tok/s;   5650 sec
[2021-05-15 18:46:54,827 INFO] Step 26050/50000; acc:  69.98; ppl:  2.70; xent: 0.99; lr: 0.00010; 9501/3968 tok/s;   5661 sec
[2021-05-15 18:47:05,847 INFO] Step 26100/50000; acc:  70.45; ppl:  2.64; xent: 0.97; lr: 0.00010; 9298/3889 tok/s;   5672 sec
[2021-05-15 18:47:16,093 INFO] Step 26150/50000; acc:  70.65; ppl:  2.64; xent: 0.97; lr: 0.00010; 9810/4138 tok/s;   5682 sec
[2021-05-15 18:47:26,294 INFO] Step 26200/50000; acc:  70.52; ppl:  2.65; xent: 0.97; lr: 0.00010; 10041/4138 tok/s;   5693 sec
[2021-05-15 18:47:34,894 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 18:47:36,762 INFO] Step 26250/50000; acc:  70.72; ppl:  2.63; xent: 0.97; lr: 0.00010; 9441/4079 tok/s;   5703 sec
[2021-05-15 18:47:48,001 INFO] Step 26300/50000; acc:  70.07; ppl:  2.68; xent: 0.99; lr: 0.00010; 9320/3853 tok/s;   5714 sec
[2021-05-15 18:47:58,921 INFO] Step 26350/50000; acc:  70.29; ppl:  2.67; xent: 0.98; lr: 0.00010; 9173/3904 tok/s;   5725 sec
[2021-05-15 18:48:10,068 INFO] Step 26400/50000; acc:  70.69; ppl:  2.65; xent: 0.98; lr: 0.00010; 9183/3915 tok/s;   5736 sec
[2021-05-15 18:48:20,755 INFO] Step 26450/50000; acc:  70.31; ppl:  2.68; xent: 0.98; lr: 0.00010; 9560/4020 tok/s;   5747 sec
[2021-05-15 18:48:30,955 INFO] Step 26500/50000; acc:  70.77; ppl:  2.63; xent: 0.97; lr: 0.00010; 9703/4162 tok/s;   5757 sec
[2021-05-15 18:48:41,812 INFO] Step 26550/50000; acc:  70.22; ppl:  2.68; xent: 0.99; lr: 0.00010; 9547/3955 tok/s;   5768 sec
[2021-05-15 18:48:52,241 INFO] Step 26600/50000; acc:  70.76; ppl:  2.60; xent: 0.96; lr: 0.00010; 9663/3956 tok/s;   5779 sec
[2021-05-15 18:49:04,205 INFO] Step 26650/50000; acc:  70.02; ppl:  2.70; xent: 0.99; lr: 0.00010; 8798/3667 tok/s;   5791 sec
[2021-05-15 18:49:14,052 INFO] Step 26700/50000; acc:  70.91; ppl:  2.59; xent: 0.95; lr: 0.00010; 9983/4228 tok/s;   5800 sec
[2021-05-15 18:49:25,303 INFO] Step 26750/50000; acc:  70.34; ppl:  2.67; xent: 0.98; lr: 0.00010; 9305/3880 tok/s;   5812 sec
[2021-05-15 18:49:36,325 INFO] Step 26800/50000; acc:  70.60; ppl:  2.63; xent: 0.97; lr: 0.00010; 9003/3860 tok/s;   5823 sec
[2021-05-15 18:49:47,367 INFO] Step 26850/50000; acc:  70.56; ppl:  2.62; xent: 0.96; lr: 0.00010; 9345/3845 tok/s;   5834 sec
[2021-05-15 18:49:57,622 INFO] Step 26900/50000; acc:  70.28; ppl:  2.66; xent: 0.98; lr: 0.00010; 9916/4161 tok/s;   5844 sec
[2021-05-15 18:50:07,992 INFO] Step 26950/50000; acc:  70.90; ppl:  2.60; xent: 0.96; lr: 0.00010; 9636/4111 tok/s;   5854 sec
[2021-05-15 18:50:13,920 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 18:50:19,104 INFO] Step 27000/50000; acc:  70.48; ppl:  2.65; xent: 0.97; lr: 0.00010; 9306/3844 tok/s;   5865 sec
[2021-05-15 18:50:29,786 INFO] Step 27050/50000; acc:  70.79; ppl:  2.62; xent: 0.96; lr: 0.00010; 9298/3979 tok/s;   5876 sec
[2021-05-15 18:50:41,599 INFO] Step 27100/50000; acc:  70.23; ppl:  2.68; xent: 0.98; lr: 0.00010; 8808/3688 tok/s;   5888 sec
[2021-05-15 18:50:52,574 INFO] Step 27150/50000; acc:  70.80; ppl:  2.63; xent: 0.97; lr: 0.00010; 9071/3953 tok/s;   5899 sec
[2021-05-15 18:51:03,131 INFO] Step 27200/50000; acc:  70.33; ppl:  2.66; xent: 0.98; lr: 0.00010; 9753/4054 tok/s;   5910 sec
[2021-05-15 18:51:13,525 INFO] Step 27250/50000; acc:  70.55; ppl:  2.63; xent: 0.97; lr: 0.00010; 9763/4092 tok/s;   5920 sec
[2021-05-15 18:51:23,730 INFO] Step 27300/50000; acc:  71.10; ppl:  2.60; xent: 0.96; lr: 0.00010; 9863/4115 tok/s;   5930 sec
[2021-05-15 18:51:35,283 INFO] Step 27350/50000; acc:  70.53; ppl:  2.64; xent: 0.97; lr: 0.00010; 9040/3672 tok/s;   5942 sec
[2021-05-15 18:51:45,957 INFO] Step 27400/50000; acc:  70.67; ppl:  2.62; xent: 0.96; lr: 0.00010; 9331/4033 tok/s;   5952 sec
[2021-05-15 18:51:56,797 INFO] Step 27450/50000; acc:  70.43; ppl:  2.65; xent: 0.97; lr: 0.00010; 9665/3913 tok/s;   5963 sec
[2021-05-15 18:52:07,096 INFO] Step 27500/50000; acc:  70.87; ppl:  2.62; xent: 0.96; lr: 0.00010; 9587/4125 tok/s;   5973 sec
[2021-05-15 18:52:18,503 INFO] Step 27550/50000; acc:  70.53; ppl:  2.64; xent: 0.97; lr: 0.00010; 9143/3814 tok/s;   5985 sec
[2021-05-15 18:52:28,898 INFO] Step 27600/50000; acc:  71.31; ppl:  2.55; xent: 0.94; lr: 0.00010; 9596/4000 tok/s;   5995 sec
[2021-05-15 18:52:39,444 INFO] Step 27650/50000; acc:  70.76; ppl:  2.63; xent: 0.97; lr: 0.00010; 9681/4098 tok/s;   6006 sec
[2021-05-15 18:52:50,074 INFO] Step 27700/50000; acc:  71.04; ppl:  2.61; xent: 0.96; lr: 0.00010; 9606/4055 tok/s;   6016 sec
[2021-05-15 18:52:52,898 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 18:53:00,760 INFO] Step 27750/50000; acc:  70.57; ppl:  2.61; xent: 0.96; lr: 0.00010; 9413/3991 tok/s;   6027 sec
[2021-05-15 18:53:11,727 INFO] Step 27800/50000; acc:  70.52; ppl:  2.65; xent: 0.97; lr: 0.00010; 9400/3868 tok/s;   6038 sec
[2021-05-15 18:53:22,659 INFO] Step 27850/50000; acc:  70.98; ppl:  2.59; xent: 0.95; lr: 0.00010; 8965/3984 tok/s;   6049 sec
[2021-05-15 18:53:33,610 INFO] Step 27900/50000; acc:  70.44; ppl:  2.63; xent: 0.97; lr: 0.00010; 9552/3915 tok/s;   6060 sec
[2021-05-15 18:53:44,284 INFO] Step 27950/50000; acc:  71.11; ppl:  2.61; xent: 0.96; lr: 0.00010; 9339/4017 tok/s;   6071 sec
[2021-05-15 18:53:54,651 INFO] Step 28000/50000; acc:  70.76; ppl:  2.61; xent: 0.96; lr: 0.00010; 9923/4104 tok/s;   6081 sec
[2021-05-15 18:54:05,736 INFO] Step 28050/50000; acc:  70.72; ppl:  2.62; xent: 0.96; lr: 0.00010; 9370/3789 tok/s;   6092 sec
[2021-05-15 18:54:16,340 INFO] Step 28100/50000; acc:  71.18; ppl:  2.57; xent: 0.95; lr: 0.00010; 9397/4071 tok/s;   6103 sec
[2021-05-15 18:54:27,418 INFO] Step 28150/50000; acc:  70.68; ppl:  2.63; xent: 0.97; lr: 0.00010; 9356/3820 tok/s;   6114 sec
[2021-05-15 18:54:37,731 INFO] Step 28200/50000; acc:  70.95; ppl:  2.60; xent: 0.95; lr: 0.00010; 9660/4142 tok/s;   6124 sec
[2021-05-15 18:54:49,011 INFO] Step 28250/50000; acc:  70.56; ppl:  2.65; xent: 0.98; lr: 0.00010; 9261/3823 tok/s;   6135 sec
[2021-05-15 18:54:59,769 INFO] Step 28300/50000; acc:  71.39; ppl:  2.54; xent: 0.93; lr: 0.00010; 9234/3936 tok/s;   6146 sec
[2021-05-15 18:55:10,125 INFO] Step 28350/50000; acc:  70.72; ppl:  2.62; xent: 0.96; lr: 0.00010; 10055/4081 tok/s;   6157 sec
[2021-05-15 18:55:20,599 INFO] Step 28400/50000; acc:  71.06; ppl:  2.58; xent: 0.95; lr: 0.00010; 9431/4080 tok/s;   6167 sec
[2021-05-15 18:55:24,316 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 18:55:31,689 INFO] Step 28450/50000; acc:  70.57; ppl:  2.63; xent: 0.97; lr: 0.00010; 9321/3921 tok/s;   6178 sec
[2021-05-15 18:55:42,451 INFO] Step 28500/50000; acc:  70.81; ppl:  2.60; xent: 0.96; lr: 0.00010; 9490/3896 tok/s;   6189 sec
[2021-05-15 18:55:53,153 INFO] Step 28550/50000; acc:  71.18; ppl:  2.60; xent: 0.96; lr: 0.00010; 9306/4021 tok/s;   6200 sec
[2021-05-15 18:56:04,426 INFO] Step 28600/50000; acc:  70.63; ppl:  2.61; xent: 0.96; lr: 0.00010; 9113/3873 tok/s;   6211 sec
[2021-05-15 18:56:14,522 INFO] Step 28650/50000; acc:  71.18; ppl:  2.59; xent: 0.95; lr: 0.00010; 9763/4261 tok/s;   6221 sec
[2021-05-15 18:56:25,044 INFO] Step 28700/50000; acc:  71.11; ppl:  2.60; xent: 0.96; lr: 0.00010; 9893/4015 tok/s;   6231 sec
[2021-05-15 18:56:35,520 INFO] Step 28750/50000; acc:  71.32; ppl:  2.57; xent: 0.94; lr: 0.00010; 9621/4072 tok/s;   6242 sec
[2021-05-15 18:56:47,064 INFO] Step 28800/50000; acc:  70.81; ppl:  2.60; xent: 0.96; lr: 0.00010; 9083/3702 tok/s;   6253 sec
[2021-05-15 18:56:57,835 INFO] Step 28850/50000; acc:  70.99; ppl:  2.60; xent: 0.96; lr: 0.00010; 9472/3977 tok/s;   6264 sec
[2021-05-15 18:57:08,329 INFO] Step 28900/50000; acc:  71.47; ppl:  2.55; xent: 0.94; lr: 0.00010; 9474/4007 tok/s;   6275 sec
[2021-05-15 18:57:19,271 INFO] Step 28950/50000; acc:  70.79; ppl:  2.62; xent: 0.96; lr: 0.00010; 9454/3961 tok/s;   6286 sec
[2021-05-15 18:57:29,909 INFO] Step 29000/50000; acc:  71.17; ppl:  2.58; xent: 0.95; lr: 0.00010; 9362/3997 tok/s;   6296 sec
[2021-05-15 18:57:41,160 INFO] Step 29050/50000; acc:  71.20; ppl:  2.58; xent: 0.95; lr: 0.00010; 9383/3762 tok/s;   6308 sec
[2021-05-15 18:57:51,286 INFO] Step 29100/50000; acc:  71.68; ppl:  2.54; xent: 0.93; lr: 0.00010; 9671/4145 tok/s;   6318 sec
[2021-05-15 18:58:02,196 INFO] Step 29150/50000; acc:  70.61; ppl:  2.63; xent: 0.97; lr: 0.00010; 9552/3957 tok/s;   6329 sec
[2021-05-15 18:58:02,647 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 18:58:13,032 INFO] Step 29200/50000; acc:  71.39; ppl:  2.54; xent: 0.93; lr: 0.00010; 9174/3978 tok/s;   6339 sec
[2021-05-15 18:58:23,882 INFO] Step 29250/50000; acc:  70.66; ppl:  2.62; xent: 0.96; lr: 0.00010; 9510/3912 tok/s;   6350 sec
[2021-05-15 18:58:35,623 INFO] Step 29300/50000; acc:  70.65; ppl:  2.61; xent: 0.96; lr: 0.00010; 8597/3748 tok/s;   6362 sec
[2021-05-15 18:58:45,912 INFO] Step 29350/50000; acc:  71.48; ppl:  2.55; xent: 0.94; lr: 0.00010; 9750/4132 tok/s;   6372 sec
[2021-05-15 18:58:56,589 INFO] Step 29400/50000; acc:  70.87; ppl:  2.60; xent: 0.96; lr: 0.00010; 9643/4005 tok/s;   6383 sec
[2021-05-15 18:59:07,043 INFO] Step 29450/50000; acc:  71.59; ppl:  2.54; xent: 0.93; lr: 0.00010; 9376/4081 tok/s;   6393 sec
[2021-05-15 18:59:17,969 INFO] Step 29500/50000; acc:  70.91; ppl:  2.61; xent: 0.96; lr: 0.00010; 9706/3866 tok/s;   6404 sec
[2021-05-15 18:59:29,130 INFO] Step 29550/50000; acc:  71.26; ppl:  2.55; xent: 0.94; lr: 0.00010; 9050/3855 tok/s;   6416 sec
[2021-05-15 18:59:39,541 INFO] Step 29600/50000; acc:  71.16; ppl:  2.58; xent: 0.95; lr: 0.00010; 9867/4062 tok/s;   6426 sec
[2021-05-15 18:59:50,360 INFO] Step 29650/50000; acc:  71.08; ppl:  2.60; xent: 0.95; lr: 0.00010; 9481/3969 tok/s;   6437 sec
[2021-05-15 19:00:00,952 INFO] Step 29700/50000; acc:  71.26; ppl:  2.56; xent: 0.94; lr: 0.00010; 9381/4001 tok/s;   6447 sec
[2021-05-15 19:00:12,188 INFO] Step 29750/50000; acc:  71.37; ppl:  2.56; xent: 0.94; lr: 0.00010; 9248/3820 tok/s;   6459 sec
[2021-05-15 19:00:22,344 INFO] Step 29800/50000; acc:  71.49; ppl:  2.54; xent: 0.93; lr: 0.00010; 9791/4174 tok/s;   6469 sec
[2021-05-15 19:00:32,743 INFO] Step 29850/50000; acc:  71.24; ppl:  2.58; xent: 0.95; lr: 0.00010; 10013/4078 tok/s;   6479 sec
[2021-05-15 19:00:40,949 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 19:00:43,465 INFO] Step 29900/50000; acc:  71.55; ppl:  2.54; xent: 0.93; lr: 0.00010; 9248/3995 tok/s;   6490 sec
[2021-05-15 19:00:54,267 INFO] Step 29950/50000; acc:  71.06; ppl:  2.59; xent: 0.95; lr: 0.00010; 9643/3984 tok/s;   6501 sec
[2021-05-15 19:01:05,317 INFO] Step 30000/50000; acc:  71.15; ppl:  2.57; xent: 0.94; lr: 0.00010; 8984/3857 tok/s;   6512 sec
[2021-05-15 19:01:05,319 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/strict/valid.txt, align=None)...
[2021-05-15 19:01:14,083 INFO] Validation perplexity: 2.85479
[2021-05-15 19:01:14,083 INFO] Validation accuracy: 69.0269
[2021-05-15 19:01:14,085 INFO] Saving checkpoint ../models/group2_params/strict_ops/model_step_30000.pt
[2021-05-15 19:01:25,576 INFO] Step 30050/50000; acc:  71.21; ppl:  2.57; xent: 0.94; lr: 0.00010; 5040/2151 tok/s;   6532 sec
[2021-05-15 19:01:36,269 INFO] Step 30100/50000; acc:  71.13; ppl:  2.58; xent: 0.95; lr: 0.00010; 9531/4018 tok/s;   6543 sec
[2021-05-15 19:01:46,808 INFO] Step 30150/50000; acc:  71.42; ppl:  2.56; xent: 0.94; lr: 0.00010; 9479/4054 tok/s;   6553 sec
[2021-05-15 19:01:57,594 INFO] Step 30200/50000; acc:  71.21; ppl:  2.58; xent: 0.95; lr: 0.00010; 9604/3946 tok/s;   6564 sec
[2021-05-15 19:02:08,242 INFO] Step 30250/50000; acc:  71.76; ppl:  2.51; xent: 0.92; lr: 0.00010; 9386/3905 tok/s;   6575 sec
[2021-05-15 19:02:19,764 INFO] Step 30300/50000; acc:  70.88; ppl:  2.59; xent: 0.95; lr: 0.00010; 9130/3789 tok/s;   6586 sec
[2021-05-15 19:02:30,006 INFO] Step 30350/50000; acc:  71.67; ppl:  2.54; xent: 0.93; lr: 0.00010; 9724/4103 tok/s;   6596 sec
[2021-05-15 19:02:41,073 INFO] Step 30400/50000; acc:  71.13; ppl:  2.57; xent: 0.94; lr: 0.00010; 9354/3924 tok/s;   6607 sec
[2021-05-15 19:02:52,384 INFO] Step 30450/50000; acc:  71.44; ppl:  2.56; xent: 0.94; lr: 0.00010; 9026/3794 tok/s;   6619 sec
[2021-05-15 19:03:02,897 INFO] Step 30500/50000; acc:  71.80; ppl:  2.50; xent: 0.92; lr: 0.00010; 9497/3975 tok/s;   6629 sec
[2021-05-15 19:03:13,287 INFO] Step 30550/50000; acc:  71.04; ppl:  2.58; xent: 0.95; lr: 0.00010; 9928/4140 tok/s;   6640 sec
[2021-05-15 19:03:23,566 INFO] Step 30600/50000; acc:  71.89; ppl:  2.51; xent: 0.92; lr: 0.00010; 9632/4142 tok/s;   6650 sec
[2021-05-15 19:03:29,095 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 19:03:34,693 INFO] Step 30650/50000; acc:  71.12; ppl:  2.58; xent: 0.95; lr: 0.00010; 9436/3860 tok/s;   6661 sec
[2021-05-15 19:03:45,388 INFO] Step 30700/50000; acc:  71.73; ppl:  2.54; xent: 0.93; lr: 0.00010; 9298/3956 tok/s;   6672 sec
[2021-05-15 19:03:57,404 INFO] Step 30750/50000; acc:  70.90; ppl:  2.58; xent: 0.95; lr: 0.00010; 8622/3638 tok/s;   6684 sec
[2021-05-15 19:04:07,882 INFO] Step 30800/50000; acc:  71.88; ppl:  2.52; xent: 0.92; lr: 0.00010; 9419/4092 tok/s;   6694 sec
[2021-05-15 19:04:18,647 INFO] Step 30850/50000; acc:  71.26; ppl:  2.56; xent: 0.94; lr: 0.00010; 9534/4000 tok/s;   6705 sec
[2021-05-15 19:04:28,911 INFO] Step 30900/50000; acc:  71.61; ppl:  2.54; xent: 0.93; lr: 0.00010; 9876/4132 tok/s;   6715 sec
[2021-05-15 19:04:39,034 INFO] Step 30950/50000; acc:  71.59; ppl:  2.53; xent: 0.93; lr: 0.00010; 10045/4152 tok/s;   6725 sec
[2021-05-15 19:04:50,882 INFO] Step 31000/50000; acc:  71.28; ppl:  2.56; xent: 0.94; lr: 0.00010; 8779/3626 tok/s;   6737 sec
[2021-05-15 19:05:01,195 INFO] Step 31050/50000; acc:  71.77; ppl:  2.52; xent: 0.92; lr: 0.00010; 9558/4129 tok/s;   6748 sec
[2021-05-15 19:05:12,008 INFO] Step 31100/50000; acc:  71.26; ppl:  2.57; xent: 0.94; lr: 0.00010; 9704/3914 tok/s;   6758 sec
[2021-05-15 19:05:22,353 INFO] Step 31150/50000; acc:  71.67; ppl:  2.55; xent: 0.93; lr: 0.00010; 9677/4122 tok/s;   6769 sec
[2021-05-15 19:05:33,756 INFO] Step 31200/50000; acc:  71.43; ppl:  2.54; xent: 0.93; lr: 0.00010; 9047/3794 tok/s;   6780 sec
[2021-05-15 19:05:44,361 INFO] Step 31250/50000; acc:  71.79; ppl:  2.50; xent: 0.92; lr: 0.00010; 9671/3956 tok/s;   6791 sec
[2021-05-15 19:05:54,674 INFO] Step 31300/50000; acc:  71.66; ppl:  2.53; xent: 0.93; lr: 0.00010; 9580/4156 tok/s;   6801 sec
[2021-05-15 19:06:05,321 INFO] Step 31350/50000; acc:  71.73; ppl:  2.54; xent: 0.93; lr: 0.00010; 9724/4046 tok/s;   6812 sec
[2021-05-15 19:06:07,646 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 19:06:15,884 INFO] Step 31400/50000; acc:  71.44; ppl:  2.53; xent: 0.93; lr: 0.00010; 9450/4030 tok/s;   6822 sec
[2021-05-15 19:06:26,915 INFO] Step 31450/50000; acc:  71.13; ppl:  2.58; xent: 0.95; lr: 0.00010; 9479/3845 tok/s;   6833 sec
[2021-05-15 19:06:37,818 INFO] Step 31500/50000; acc:  72.07; ppl:  2.51; xent: 0.92; lr: 0.00010; 9007/4036 tok/s;   6844 sec
[2021-05-15 19:06:48,771 INFO] Step 31550/50000; acc:  71.29; ppl:  2.54; xent: 0.93; lr: 0.00010; 9494/3910 tok/s;   6855 sec
[2021-05-15 19:06:59,401 INFO] Step 31600/50000; acc:  71.72; ppl:  2.52; xent: 0.92; lr: 0.00010; 9298/4025 tok/s;   6866 sec
[2021-05-15 19:07:09,861 INFO] Step 31650/50000; acc:  71.79; ppl:  2.52; xent: 0.93; lr: 0.00010; 9821/4072 tok/s;   6876 sec
[2021-05-15 19:07:21,008 INFO] Step 31700/50000; acc:  71.44; ppl:  2.53; xent: 0.93; lr: 0.00010; 9310/3742 tok/s;   6887 sec
[2021-05-15 19:07:31,451 INFO] Step 31750/50000; acc:  71.80; ppl:  2.50; xent: 0.92; lr: 0.00010; 9608/4155 tok/s;   6898 sec
[2021-05-15 19:07:42,727 INFO] Step 31800/50000; acc:  71.43; ppl:  2.55; xent: 0.93; lr: 0.00010; 9173/3773 tok/s;   6909 sec
[2021-05-15 19:07:52,984 INFO] Step 31850/50000; acc:  71.72; ppl:  2.50; xent: 0.92; lr: 0.00010; 9631/4143 tok/s;   6919 sec
[2021-05-15 19:08:04,099 INFO] Step 31900/50000; acc:  71.33; ppl:  2.56; xent: 0.94; lr: 0.00010; 9391/3894 tok/s;   6930 sec
[2021-05-15 19:08:14,965 INFO] Step 31950/50000; acc:  72.31; ppl:  2.46; xent: 0.90; lr: 0.00010; 9281/3880 tok/s;   6941 sec
[2021-05-15 19:08:25,097 INFO] Step 32000/50000; acc:  71.62; ppl:  2.54; xent: 0.93; lr: 0.00010; 10166/4163 tok/s;   6951 sec
[2021-05-15 19:08:35,511 INFO] Step 32050/50000; acc:  71.80; ppl:  2.51; xent: 0.92; lr: 0.00010; 9762/4115 tok/s;   6962 sec
[2021-05-15 19:08:38,699 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 19:08:46,215 INFO] Step 32100/50000; acc:  71.69; ppl:  2.51; xent: 0.92; lr: 0.00010; 9332/4029 tok/s;   6973 sec
[2021-05-15 19:08:57,224 INFO] Step 32150/50000; acc:  71.57; ppl:  2.54; xent: 0.93; lr: 0.00010; 9411/3853 tok/s;   6984 sec
[2021-05-15 19:09:07,845 INFO] Step 32200/50000; acc:  71.79; ppl:  2.51; xent: 0.92; lr: 0.00010; 9285/4028 tok/s;   6994 sec
[2021-05-15 19:09:19,203 INFO] Step 32250/50000; acc:  71.59; ppl:  2.54; xent: 0.93; lr: 0.00010; 9200/3863 tok/s;   7006 sec
[2021-05-15 19:09:29,457 INFO] Step 32300/50000; acc:  71.76; ppl:  2.51; xent: 0.92; lr: 0.00010; 9617/4177 tok/s;   7016 sec
[2021-05-15 19:09:40,015 INFO] Step 32350/50000; acc:  71.79; ppl:  2.52; xent: 0.92; lr: 0.00010; 9805/4055 tok/s;   7026 sec
[2021-05-15 19:09:50,045 INFO] Step 32400/50000; acc:  72.06; ppl:  2.49; xent: 0.91; lr: 0.00010; 9977/4203 tok/s;   7036 sec
[2021-05-15 19:10:01,621 INFO] Step 32450/50000; acc:  71.47; ppl:  2.52; xent: 0.92; lr: 0.00010; 9029/3673 tok/s;   7048 sec
[2021-05-15 19:10:12,415 INFO] Step 32500/50000; acc:  71.59; ppl:  2.52; xent: 0.92; lr: 0.00010; 9428/3987 tok/s;   7059 sec
[2021-05-15 19:10:22,992 INFO] Step 32550/50000; acc:  72.18; ppl:  2.48; xent: 0.91; lr: 0.00010; 9491/4005 tok/s;   7069 sec
[2021-05-15 19:10:33,993 INFO] Step 32600/50000; acc:  72.02; ppl:  2.51; xent: 0.92; lr: 0.00010; 9381/3898 tok/s;   7080 sec
[2021-05-15 19:10:44,584 INFO] Step 32650/50000; acc:  72.02; ppl:  2.48; xent: 0.91; lr: 0.00010; 9323/4022 tok/s;   7091 sec
[2021-05-15 19:10:55,846 INFO] Step 32700/50000; acc:  71.88; ppl:  2.50; xent: 0.92; lr: 0.00010; 9363/3778 tok/s;   7102 sec
[2021-05-15 19:11:06,018 INFO] Step 32750/50000; acc:  72.10; ppl:  2.49; xent: 0.91; lr: 0.00010; 9772/4135 tok/s;   7112 sec
[2021-05-15 19:11:16,882 INFO] Step 32800/50000; acc:  71.77; ppl:  2.52; xent: 0.93; lr: 0.00010; 9502/3958 tok/s;   7123 sec
[2021-05-15 19:11:16,892 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 19:11:28,042 INFO] Step 32850/50000; acc:  71.97; ppl:  2.49; xent: 0.91; lr: 0.00010; 9148/3875 tok/s;   7134 sec
[2021-05-15 19:11:38,647 INFO] Step 32900/50000; acc:  71.89; ppl:  2.50; xent: 0.92; lr: 0.00010; 9410/3982 tok/s;   7145 sec
[2021-05-15 19:11:50,391 INFO] Step 32950/50000; acc:  71.49; ppl:  2.54; xent: 0.93; lr: 0.00010; 8720/3746 tok/s;   7157 sec
[2021-05-15 19:12:00,693 INFO] Step 33000/50000; acc:  72.32; ppl:  2.46; xent: 0.90; lr: 0.00010; 9638/4137 tok/s;   7167 sec
[2021-05-15 19:12:11,671 INFO] Step 33050/50000; acc:  71.51; ppl:  2.53; xent: 0.93; lr: 0.00010; 9538/3912 tok/s;   7178 sec
[2021-05-15 19:12:21,637 INFO] Step 33100/50000; acc:  72.28; ppl:  2.47; xent: 0.91; lr: 0.00010; 9879/4249 tok/s;   7188 sec
[2021-05-15 19:12:32,602 INFO] Step 33150/50000; acc:  71.77; ppl:  2.51; xent: 0.92; lr: 0.00010; 9612/3845 tok/s;   7199 sec
[2021-05-15 19:12:43,739 INFO] Step 33200/50000; acc:  72.20; ppl:  2.47; xent: 0.91; lr: 0.00010; 8987/3865 tok/s;   7210 sec
[2021-05-15 19:12:54,218 INFO] Step 33250/50000; acc:  72.10; ppl:  2.48; xent: 0.91; lr: 0.00010; 9758/4043 tok/s;   7221 sec
[2021-05-15 19:13:05,136 INFO] Step 33300/50000; acc:  71.83; ppl:  2.50; xent: 0.92; lr: 0.00010; 9396/3926 tok/s;   7232 sec
[2021-05-15 19:13:16,074 INFO] Step 33350/50000; acc:  72.26; ppl:  2.49; xent: 0.91; lr: 0.00010; 9169/3924 tok/s;   7242 sec
[2021-05-15 19:13:27,355 INFO] Step 33400/50000; acc:  72.34; ppl:  2.47; xent: 0.90; lr: 0.00010; 9189/3775 tok/s;   7254 sec
[2021-05-15 19:13:37,245 INFO] Step 33450/50000; acc:  72.21; ppl:  2.46; xent: 0.90; lr: 0.00010; 9949/4277 tok/s;   7264 sec
[2021-05-15 19:13:47,746 INFO] Step 33500/50000; acc:  71.87; ppl:  2.51; xent: 0.92; lr: 0.00010; 9915/4030 tok/s;   7274 sec
[2021-05-15 19:13:55,693 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 19:13:58,706 INFO] Step 33550/50000; acc:  72.21; ppl:  2.47; xent: 0.91; lr: 0.00010; 9172/3953 tok/s;   7285 sec
[2021-05-15 19:14:09,486 INFO] Step 33600/50000; acc:  71.90; ppl:  2.50; xent: 0.92; lr: 0.00010; 9581/3950 tok/s;   7296 sec
[2021-05-15 19:14:21,049 INFO] Step 33650/50000; acc:  71.57; ppl:  2.53; xent: 0.93; lr: 0.00010; 8817/3721 tok/s;   7307 sec
[2021-05-15 19:14:31,751 INFO] Step 33700/50000; acc:  72.39; ppl:  2.46; xent: 0.90; lr: 0.00010; 9234/4048 tok/s;   7318 sec
[2021-05-15 19:14:42,581 INFO] Step 33750/50000; acc:  72.16; ppl:  2.50; xent: 0.92; lr: 0.00010; 9529/3989 tok/s;   7329 sec
[2021-05-15 19:14:52,901 INFO] Step 33800/50000; acc:  72.27; ppl:  2.48; xent: 0.91; lr: 0.00010; 9602/4091 tok/s;   7339 sec
[2021-05-15 19:15:03,952 INFO] Step 33850/50000; acc:  72.00; ppl:  2.51; xent: 0.92; lr: 0.00010; 9538/3895 tok/s;   7350 sec
[2021-05-15 19:15:14,693 INFO] Step 33900/50000; acc:  72.63; ppl:  2.43; xent: 0.89; lr: 0.00010; 9301/3883 tok/s;   7361 sec
[2021-05-15 19:15:25,997 INFO] Step 33950/50000; acc:  71.57; ppl:  2.52; xent: 0.92; lr: 0.00010; 9258/3834 tok/s;   7372 sec
[2021-05-15 19:15:36,348 INFO] Step 34000/50000; acc:  72.70; ppl:  2.44; xent: 0.89; lr: 0.00010; 9550/4045 tok/s;   7383 sec
[2021-05-15 19:15:47,405 INFO] Step 34050/50000; acc:  72.06; ppl:  2.49; xent: 0.91; lr: 0.00010; 9319/3926 tok/s;   7394 sec
[2021-05-15 19:15:58,668 INFO] Step 34100/50000; acc:  72.25; ppl:  2.49; xent: 0.91; lr: 0.00010; 9072/3820 tok/s;   7405 sec
[2021-05-15 19:16:09,405 INFO] Step 34150/50000; acc:  72.45; ppl:  2.44; xent: 0.89; lr: 0.00010; 9372/3924 tok/s;   7416 sec
[2021-05-15 19:16:19,702 INFO] Step 34200/50000; acc:  72.10; ppl:  2.49; xent: 0.91; lr: 0.00010; 9991/4158 tok/s;   7426 sec
[2021-05-15 19:16:29,839 INFO] Step 34250/50000; acc:  72.77; ppl:  2.43; xent: 0.89; lr: 0.00010; 9680/4200 tok/s;   7436 sec
[2021-05-15 19:16:35,098 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 19:16:41,246 INFO] Step 34300/50000; acc:  72.00; ppl:  2.49; xent: 0.91; lr: 0.00010; 9208/3762 tok/s;   7448 sec
[2021-05-15 19:16:51,869 INFO] Step 34350/50000; acc:  72.21; ppl:  2.48; xent: 0.91; lr: 0.00010; 9481/3979 tok/s;   7458 sec
[2021-05-15 19:17:03,629 INFO] Step 34400/50000; acc:  72.06; ppl:  2.49; xent: 0.91; lr: 0.00010; 8726/3712 tok/s;   7470 sec
[2021-05-15 19:17:14,423 INFO] Step 34450/50000; acc:  72.36; ppl:  2.47; xent: 0.90; lr: 0.00010; 9407/3950 tok/s;   7481 sec
[2021-05-15 19:17:25,030 INFO] Step 34500/50000; acc:  72.34; ppl:  2.46; xent: 0.90; lr: 0.00010; 9343/4087 tok/s;   7491 sec
[2021-05-15 19:17:35,392 INFO] Step 34550/50000; acc:  72.43; ppl:  2.47; xent: 0.90; lr: 0.00010; 9935/4076 tok/s;   7502 sec
[2021-05-15 19:17:45,597 INFO] Step 34600/50000; acc:  72.57; ppl:  2.45; xent: 0.89; lr: 0.00010; 9882/4107 tok/s;   7512 sec
[2021-05-15 19:17:57,445 INFO] Step 34650/50000; acc:  72.03; ppl:  2.49; xent: 0.91; lr: 0.00010; 8899/3637 tok/s;   7524 sec
[2021-05-15 19:18:07,988 INFO] Step 34700/50000; acc:  72.55; ppl:  2.43; xent: 0.89; lr: 0.00010; 9380/4060 tok/s;   7534 sec
[2021-05-15 19:18:18,664 INFO] Step 34750/50000; acc:  72.17; ppl:  2.47; xent: 0.90; lr: 0.00010; 9782/3984 tok/s;   7545 sec
[2021-05-15 19:18:28,996 INFO] Step 34800/50000; acc:  72.50; ppl:  2.46; xent: 0.90; lr: 0.00010; 9592/4091 tok/s;   7555 sec
[2021-05-15 19:18:40,474 INFO] Step 34850/50000; acc:  72.44; ppl:  2.44; xent: 0.89; lr: 0.00010; 8965/3790 tok/s;   7567 sec
[2021-05-15 19:18:51,029 INFO] Step 34900/50000; acc:  72.63; ppl:  2.43; xent: 0.89; lr: 0.00010; 9694/3983 tok/s;   7577 sec
[2021-05-15 19:19:01,293 INFO] Step 34950/50000; acc:  72.21; ppl:  2.46; xent: 0.90; lr: 0.00010; 9736/4137 tok/s;   7588 sec
[2021-05-15 19:19:12,211 INFO] Step 35000/50000; acc:  72.18; ppl:  2.48; xent: 0.91; lr: 0.00010; 9454/3977 tok/s;   7599 sec
[2021-05-15 19:19:12,215 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/strict/valid.txt, align=None)...
[2021-05-15 19:19:20,996 INFO] Validation perplexity: 2.8596
[2021-05-15 19:19:20,996 INFO] Validation accuracy: 69.2051
[2021-05-15 19:19:20,998 INFO] Saving checkpoint ../models/group2_params/strict_ops/model_step_35000.pt
[2021-05-15 19:19:23,349 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 19:19:32,211 INFO] Step 35050/50000; acc:  72.60; ppl:  2.44; xent: 0.89; lr: 0.00010; 4947/2110 tok/s;   7619 sec
[2021-05-15 19:19:43,234 INFO] Step 35100/50000; acc:  71.91; ppl:  2.50; xent: 0.92; lr: 0.00010; 9472/3893 tok/s;   7630 sec
[2021-05-15 19:19:54,422 INFO] Step 35150/50000; acc:  72.58; ppl:  2.44; xent: 0.89; lr: 0.00010; 8897/3901 tok/s;   7641 sec
[2021-05-15 19:20:05,016 INFO] Step 35200/50000; acc:  72.31; ppl:  2.47; xent: 0.90; lr: 0.00010; 9734/4051 tok/s;   7651 sec
[2021-05-15 19:20:15,688 INFO] Step 35250/50000; acc:  72.49; ppl:  2.45; xent: 0.90; lr: 0.00010; 9508/4002 tok/s;   7662 sec
[2021-05-15 19:20:25,979 INFO] Step 35300/50000; acc:  72.87; ppl:  2.42; xent: 0.88; lr: 0.00010; 9668/4122 tok/s;   7672 sec
[2021-05-15 19:20:37,311 INFO] Step 35350/50000; acc:  72.33; ppl:  2.46; xent: 0.90; lr: 0.00010; 9286/3698 tok/s;   7684 sec
[2021-05-15 19:20:47,963 INFO] Step 35400/50000; acc:  72.84; ppl:  2.42; xent: 0.88; lr: 0.00010; 9343/4054 tok/s;   7694 sec
[2021-05-15 19:20:59,232 INFO] Step 35450/50000; acc:  72.16; ppl:  2.48; xent: 0.91; lr: 0.00010; 9306/3787 tok/s;   7706 sec
[2021-05-15 19:21:09,751 INFO] Step 35500/50000; acc:  72.75; ppl:  2.43; xent: 0.89; lr: 0.00010; 9421/4099 tok/s;   7716 sec
[2021-05-15 19:21:20,634 INFO] Step 35550/50000; acc:  72.26; ppl:  2.48; xent: 0.91; lr: 0.00010; 9536/3921 tok/s;   7727 sec
[2021-05-15 19:21:31,521 INFO] Step 35600/50000; acc:  73.20; ppl:  2.38; xent: 0.87; lr: 0.00010; 9198/3858 tok/s;   7738 sec
[2021-05-15 19:21:41,722 INFO] Step 35650/50000; acc:  72.50; ppl:  2.45; xent: 0.89; lr: 0.00010; 10031/4159 tok/s;   7748 sec
[2021-05-15 19:21:52,130 INFO] Step 35700/50000; acc:  72.41; ppl:  2.45; xent: 0.90; lr: 0.00010; 9765/4126 tok/s;   7759 sec
[2021-05-15 19:21:54,920 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 19:22:03,136 INFO] Step 35750/50000; acc:  72.54; ppl:  2.44; xent: 0.89; lr: 0.00010; 9161/3924 tok/s;   7770 sec
[2021-05-15 19:22:14,128 INFO] Step 35800/50000; acc:  72.31; ppl:  2.46; xent: 0.90; lr: 0.00010; 9402/3852 tok/s;   7781 sec
[2021-05-15 19:22:24,738 INFO] Step 35850/50000; acc:  72.79; ppl:  2.42; xent: 0.88; lr: 0.00010; 9220/4040 tok/s;   7791 sec
[2021-05-15 19:22:36,001 INFO] Step 35900/50000; acc:  72.16; ppl:  2.47; xent: 0.90; lr: 0.00010; 9279/3872 tok/s;   7802 sec
[2021-05-15 19:22:46,225 INFO] Step 35950/50000; acc:  72.77; ppl:  2.43; xent: 0.89; lr: 0.00010; 9774/4195 tok/s;   7813 sec
[2021-05-15 19:22:56,715 INFO] Step 36000/50000; acc:  72.68; ppl:  2.43; xent: 0.89; lr: 0.00010; 9763/4064 tok/s;   7823 sec
[2021-05-15 19:23:07,177 INFO] Step 36050/50000; acc:  72.69; ppl:  2.44; xent: 0.89; lr: 0.00010; 9848/4041 tok/s;   7834 sec
[2021-05-15 19:23:18,570 INFO] Step 36100/50000; acc:  72.97; ppl:  2.40; xent: 0.87; lr: 0.00010; 8857/3709 tok/s;   7845 sec
[2021-05-15 19:23:29,452 INFO] Step 36150/50000; acc:  72.42; ppl:  2.45; xent: 0.90; lr: 0.00010; 9492/3960 tok/s;   7856 sec
[2021-05-15 19:23:39,870 INFO] Step 36200/50000; acc:  73.01; ppl:  2.40; xent: 0.87; lr: 0.00010; 9550/4027 tok/s;   7866 sec
[2021-05-15 19:23:50,924 INFO] Step 36250/50000; acc:  72.17; ppl:  2.47; xent: 0.90; lr: 0.00010; 9490/3920 tok/s;   7877 sec
[2021-05-15 19:24:01,708 INFO] Step 36300/50000; acc:  73.10; ppl:  2.40; xent: 0.88; lr: 0.00010; 9177/3962 tok/s;   7888 sec
[2021-05-15 19:24:12,844 INFO] Step 36350/50000; acc:  72.43; ppl:  2.43; xent: 0.89; lr: 0.00010; 9424/3863 tok/s;   7899 sec
[2021-05-15 19:24:22,817 INFO] Step 36400/50000; acc:  72.87; ppl:  2.40; xent: 0.88; lr: 0.00010; 9867/4169 tok/s;   7909 sec
[2021-05-15 19:24:33,349 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 19:24:33,763 INFO] Step 36450/50000; acc:  72.50; ppl:  2.46; xent: 0.90; lr: 0.00010; 9392/3953 tok/s;   7920 sec
[2021-05-15 19:24:44,604 INFO] Step 36500/50000; acc:  72.93; ppl:  2.41; xent: 0.88; lr: 0.00010; 9416/3964 tok/s;   7931 sec
[2021-05-15 19:24:55,238 INFO] Step 36550/50000; acc:  72.72; ppl:  2.43; xent: 0.89; lr: 0.00010; 9473/3987 tok/s;   7942 sec
[2021-05-15 19:25:06,906 INFO] Step 36600/50000; acc:  72.60; ppl:  2.44; xent: 0.89; lr: 0.00010; 8755/3771 tok/s;   7953 sec
[2021-05-15 19:25:17,270 INFO] Step 36650/50000; acc:  73.24; ppl:  2.39; xent: 0.87; lr: 0.00010; 9502/4090 tok/s;   7964 sec
[2021-05-15 19:25:28,209 INFO] Step 36700/50000; acc:  72.32; ppl:  2.45; xent: 0.90; lr: 0.00010; 9560/3935 tok/s;   7975 sec
[2021-05-15 19:25:38,511 INFO] Step 36750/50000; acc:  72.92; ppl:  2.41; xent: 0.88; lr: 0.00010; 9702/4157 tok/s;   7985 sec
[2021-05-15 19:25:49,261 INFO] Step 36800/50000; acc:  72.63; ppl:  2.43; xent: 0.89; lr: 0.00010; 9708/3872 tok/s;   7996 sec
[2021-05-15 19:26:00,742 INFO] Step 36850/50000; acc:  72.68; ppl:  2.42; xent: 0.88; lr: 0.00010; 8968/3761 tok/s;   8007 sec
[2021-05-15 19:26:10,994 INFO] Step 36900/50000; acc:  73.35; ppl:  2.38; xent: 0.87; lr: 0.00010; 9621/4137 tok/s;   8017 sec
[2021-05-15 19:26:22,032 INFO] Step 36950/50000; acc:  72.79; ppl:  2.44; xent: 0.89; lr: 0.00010; 9431/3910 tok/s;   8028 sec
[2021-05-15 19:26:32,854 INFO] Step 37000/50000; acc:  72.87; ppl:  2.41; xent: 0.88; lr: 0.00010; 9173/3925 tok/s;   8039 sec
[2021-05-15 19:26:44,353 INFO] Step 37050/50000; acc:  73.03; ppl:  2.40; xent: 0.87; lr: 0.00010; 9152/3722 tok/s;   8051 sec
[2021-05-15 19:26:54,146 INFO] Step 37100/50000; acc:  72.89; ppl:  2.40; xent: 0.88; lr: 0.00010; 10078/4310 tok/s;   8061 sec
[2021-05-15 19:27:04,630 INFO] Step 37150/50000; acc:  72.94; ppl:  2.42; xent: 0.88; lr: 0.00010; 9881/4046 tok/s;   8071 sec
[2021-05-15 19:27:12,118 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 19:27:15,570 INFO] Step 37200/50000; acc:  73.30; ppl:  2.39; xent: 0.87; lr: 0.00010; 9117/3959 tok/s;   8082 sec
[2021-05-15 19:27:26,338 INFO] Step 37250/50000; acc:  72.76; ppl:  2.42; xent: 0.88; lr: 0.00010; 9569/3962 tok/s;   8093 sec
[2021-05-15 19:27:37,874 INFO] Step 37300/50000; acc:  72.38; ppl:  2.44; xent: 0.89; lr: 0.00010; 8811/3717 tok/s;   8104 sec
[2021-05-15 19:27:48,510 INFO] Step 37350/50000; acc:  73.25; ppl:  2.39; xent: 0.87; lr: 0.00010; 9377/4069 tok/s;   8115 sec
[2021-05-15 19:27:59,272 INFO] Step 37400/50000; acc:  72.57; ppl:  2.43; xent: 0.89; lr: 0.00010; 9571/4008 tok/s;   8126 sec
[2021-05-15 19:28:09,532 INFO] Step 37450/50000; acc:  72.96; ppl:  2.40; xent: 0.87; lr: 0.00010; 9566/4098 tok/s;   8136 sec
[2021-05-15 19:28:20,587 INFO] Step 37500/50000; acc:  72.85; ppl:  2.43; xent: 0.89; lr: 0.00010; 9543/3907 tok/s;   8147 sec
[2021-05-15 19:28:31,231 INFO] Step 37550/50000; acc:  73.25; ppl:  2.36; xent: 0.86; lr: 0.00010; 9525/3924 tok/s;   8158 sec
[2021-05-15 19:28:42,588 INFO] Step 37600/50000; acc:  72.70; ppl:  2.43; xent: 0.89; lr: 0.00010; 9112/3822 tok/s;   8169 sec
[2021-05-15 19:28:53,482 INFO] Step 37650/50000; acc:  73.16; ppl:  2.39; xent: 0.87; lr: 0.00010; 9330/3896 tok/s;   8180 sec
[2021-05-15 19:29:03,850 INFO] Step 37700/50000; acc:  73.15; ppl:  2.38; xent: 0.87; lr: 0.00010; 9608/4116 tok/s;   8190 sec
[2021-05-15 19:29:15,217 INFO] Step 37750/50000; acc:  73.09; ppl:  2.42; xent: 0.88; lr: 0.00010; 9116/3788 tok/s;   8202 sec
[2021-05-15 19:29:25,794 INFO] Step 37800/50000; acc:  73.31; ppl:  2.35; xent: 0.86; lr: 0.00010; 9434/3962 tok/s;   8212 sec
[2021-05-15 19:29:35,996 INFO] Step 37850/50000; acc:  72.61; ppl:  2.43; xent: 0.89; lr: 0.00010; 10238/4225 tok/s;   8222 sec
[2021-05-15 19:29:46,407 INFO] Step 37900/50000; acc:  73.43; ppl:  2.36; xent: 0.86; lr: 0.00010; 9442/4078 tok/s;   8233 sec
[2021-05-15 19:29:51,053 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 19:29:57,458 INFO] Step 37950/50000; acc:  72.58; ppl:  2.43; xent: 0.89; lr: 0.00010; 9469/3884 tok/s;   8244 sec
[2021-05-15 19:30:08,331 INFO] Step 38000/50000; acc:  73.25; ppl:  2.38; xent: 0.87; lr: 0.00010; 9154/3910 tok/s;   8255 sec
[2021-05-15 19:30:19,894 INFO] Step 38050/50000; acc:  72.82; ppl:  2.42; xent: 0.88; lr: 0.00010; 8850/3757 tok/s;   8266 sec
[2021-05-15 19:30:30,529 INFO] Step 38100/50000; acc:  73.27; ppl:  2.38; xent: 0.87; lr: 0.00010; 9543/4011 tok/s;   8277 sec
[2021-05-15 19:30:41,063 INFO] Step 38150/50000; acc:  73.01; ppl:  2.40; xent: 0.88; lr: 0.00010; 9493/4141 tok/s;   8287 sec
[2021-05-15 19:30:51,645 INFO] Step 38200/50000; acc:  73.33; ppl:  2.39; xent: 0.87; lr: 0.00010; 9709/3979 tok/s;   8298 sec
[2021-05-15 19:31:01,489 INFO] Step 38250/50000; acc:  73.49; ppl:  2.36; xent: 0.86; lr: 0.00010; 10168/4250 tok/s;   8308 sec
[2021-05-15 19:31:13,418 INFO] Step 38300/50000; acc:  72.85; ppl:  2.42; xent: 0.88; lr: 0.00010; 8831/3613 tok/s;   8320 sec
[2021-05-15 19:31:24,137 INFO] Step 38350/50000; acc:  73.41; ppl:  2.37; xent: 0.86; lr: 0.00010; 9350/3987 tok/s;   8331 sec
[2021-05-15 19:31:34,691 INFO] Step 38400/50000; acc:  73.20; ppl:  2.39; xent: 0.87; lr: 0.00010; 9805/4047 tok/s;   8341 sec
[2021-05-15 19:31:45,514 INFO] Step 38450/50000; acc:  73.20; ppl:  2.40; xent: 0.88; lr: 0.00010; 9410/3928 tok/s;   8352 sec
[2021-05-15 19:31:56,466 INFO] Step 38500/50000; acc:  73.72; ppl:  2.33; xent: 0.84; lr: 0.00010; 9092/3934 tok/s;   8363 sec
[2021-05-15 19:32:07,023 INFO] Step 38550/50000; acc:  73.21; ppl:  2.37; xent: 0.86; lr: 0.00010; 9815/3974 tok/s;   8373 sec
[2021-05-15 19:32:17,459 INFO] Step 38600/50000; acc:  73.23; ppl:  2.38; xent: 0.87; lr: 0.00010; 9495/4100 tok/s;   8384 sec
[2021-05-15 19:32:28,425 INFO] Step 38650/50000; acc:  73.14; ppl:  2.40; xent: 0.88; lr: 0.00010; 9563/3931 tok/s;   8395 sec
[2021-05-15 19:32:29,727 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 19:32:38,998 INFO] Step 38700/50000; acc:  73.39; ppl:  2.37; xent: 0.86; lr: 0.00010; 9384/4007 tok/s;   8405 sec
[2021-05-15 19:32:49,981 INFO] Step 38750/50000; acc:  72.86; ppl:  2.42; xent: 0.88; lr: 0.00010; 9448/3896 tok/s;   8416 sec
[2021-05-15 19:33:01,067 INFO] Step 38800/50000; acc:  73.53; ppl:  2.36; xent: 0.86; lr: 0.00010; 8906/3950 tok/s;   8427 sec
[2021-05-15 19:33:11,696 INFO] Step 38850/50000; acc:  73.08; ppl:  2.39; xent: 0.87; lr: 0.00010; 9650/4023 tok/s;   8438 sec
[2021-05-15 19:33:22,358 INFO] Step 38900/50000; acc:  73.27; ppl:  2.37; xent: 0.86; lr: 0.00010; 9519/4019 tok/s;   8449 sec
[2021-05-15 19:33:32,569 INFO] Step 38950/50000; acc:  73.56; ppl:  2.35; xent: 0.85; lr: 0.00010; 9850/4144 tok/s;   8459 sec
[2021-05-15 19:33:43,709 INFO] Step 39000/50000; acc:  73.12; ppl:  2.39; xent: 0.87; lr: 0.00010; 9411/3762 tok/s;   8470 sec
[2021-05-15 19:33:54,435 INFO] Step 39050/50000; acc:  73.72; ppl:  2.34; xent: 0.85; lr: 0.00010; 9206/4054 tok/s;   8481 sec
[2021-05-15 19:34:05,496 INFO] Step 39100/50000; acc:  73.31; ppl:  2.38; xent: 0.87; lr: 0.00010; 9464/3844 tok/s;   8492 sec
[2021-05-15 19:34:16,017 INFO] Step 39150/50000; acc:  73.13; ppl:  2.38; xent: 0.87; lr: 0.00010; 9543/4066 tok/s;   8502 sec
[2021-05-15 19:34:26,863 INFO] Step 39200/50000; acc:  73.24; ppl:  2.38; xent: 0.87; lr: 0.00010; 9482/3965 tok/s;   8513 sec
[2021-05-15 19:34:38,158 INFO] Step 39250/50000; acc:  73.77; ppl:  2.33; xent: 0.84; lr: 0.00010; 9113/3751 tok/s;   8525 sec
[2021-05-15 19:34:48,153 INFO] Step 39300/50000; acc:  73.59; ppl:  2.34; xent: 0.85; lr: 0.00010; 9909/4211 tok/s;   8535 sec
[2021-05-15 19:34:58,670 INFO] Step 39350/50000; acc:  73.32; ppl:  2.40; xent: 0.87; lr: 0.00010; 9797/4060 tok/s;   8545 sec
[2021-05-15 19:35:01,067 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 19:35:09,574 INFO] Step 39400/50000; acc:  73.70; ppl:  2.34; xent: 0.85; lr: 0.00010; 9157/3962 tok/s;   8556 sec
[2021-05-15 19:35:20,398 INFO] Step 39450/50000; acc:  72.97; ppl:  2.41; xent: 0.88; lr: 0.00010; 9701/3927 tok/s;   8567 sec
[2021-05-15 19:35:31,191 INFO] Step 39500/50000; acc:  73.77; ppl:  2.33; xent: 0.85; lr: 0.00010; 9071/3988 tok/s;   8578 sec
[2021-05-15 19:35:42,150 INFO] Step 39550/50000; acc:  72.92; ppl:  2.39; xent: 0.87; lr: 0.00010; 9500/3982 tok/s;   8589 sec
[2021-05-15 19:35:52,498 INFO] Step 39600/50000; acc:  73.53; ppl:  2.35; xent: 0.85; lr: 0.00010; 9570/4130 tok/s;   8599 sec
[2021-05-15 19:36:02,805 INFO] Step 39650/50000; acc:  73.61; ppl:  2.36; xent: 0.86; lr: 0.00010; 9909/4135 tok/s;   8609 sec
[2021-05-15 19:36:13,195 INFO] Step 39700/50000; acc:  73.22; ppl:  2.37; xent: 0.86; lr: 0.00010; 9910/4017 tok/s;   8620 sec
[2021-05-15 19:36:24,687 INFO] Step 39750/50000; acc:  73.82; ppl:  2.33; xent: 0.85; lr: 0.00010; 8836/3727 tok/s;   8631 sec
[2021-05-15 19:36:35,566 INFO] Step 39800/50000; acc:  73.37; ppl:  2.37; xent: 0.86; lr: 0.00010; 9488/3963 tok/s;   8642 sec
[2021-05-15 19:36:46,115 INFO] Step 39850/50000; acc:  73.95; ppl:  2.32; xent: 0.84; lr: 0.00010; 9339/3968 tok/s;   8653 sec
[2021-05-15 19:36:57,084 INFO] Step 39900/50000; acc:  73.11; ppl:  2.39; xent: 0.87; lr: 0.00010; 9566/3926 tok/s;   8663 sec
[2021-05-15 19:37:08,219 INFO] Step 39950/50000; acc:  74.06; ppl:  2.33; xent: 0.85; lr: 0.00010; 9021/3877 tok/s;   8675 sec
[2021-05-15 19:37:19,037 INFO] Step 40000/50000; acc:  73.47; ppl:  2.35; xent: 0.85; lr: 0.00010; 9612/3921 tok/s;   8685 sec
[2021-05-15 19:37:19,041 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/strict/valid.txt, align=None)...
[2021-05-15 19:37:27,799 INFO] Validation perplexity: 2.90595
[2021-05-15 19:37:27,799 INFO] Validation accuracy: 69.241
[2021-05-15 19:37:27,801 INFO] Saving checkpoint ../models/group2_params/strict_ops/model_step_40000.pt
[2021-05-15 19:37:38,633 INFO] Step 40050/50000; acc:  73.70; ppl:  2.36; xent: 0.86; lr: 0.00010; 5157/2140 tok/s;   8705 sec
[2021-05-15 19:37:48,598 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 19:37:49,263 INFO] Step 40100/50000; acc:  73.77; ppl:  2.34; xent: 0.85; lr: 0.00010; 9355/4052 tok/s;   8716 sec
[2021-05-15 19:38:00,368 INFO] Step 40150/50000; acc:  73.45; ppl:  2.36; xent: 0.86; lr: 0.00010; 9313/3879 tok/s;   8727 sec
[2021-05-15 19:38:10,765 INFO] Step 40200/50000; acc:  73.46; ppl:  2.36; xent: 0.86; lr: 0.00010; 9593/4071 tok/s;   8737 sec
[2021-05-15 19:38:22,458 INFO] Step 40250/50000; acc:  73.18; ppl:  2.38; xent: 0.87; lr: 0.00010; 8887/3779 tok/s;   8749 sec
[2021-05-15 19:38:32,793 INFO] Step 40300/50000; acc:  74.05; ppl:  2.33; xent: 0.84; lr: 0.00010; 9543/4122 tok/s;   8759 sec
[2021-05-15 19:38:43,547 INFO] Step 40350/50000; acc:  73.14; ppl:  2.37; xent: 0.86; lr: 0.00010; 9670/3977 tok/s;   8770 sec
[2021-05-15 19:38:53,831 INFO] Step 40400/50000; acc:  73.98; ppl:  2.32; xent: 0.84; lr: 0.00010; 9652/4149 tok/s;   8780 sec
[2021-05-15 19:39:04,503 INFO] Step 40450/50000; acc:  73.63; ppl:  2.34; xent: 0.85; lr: 0.00010; 9738/3933 tok/s;   8791 sec
[2021-05-15 19:39:15,889 INFO] Step 40500/50000; acc:  73.38; ppl:  2.35; xent: 0.85; lr: 0.00010; 9031/3775 tok/s;   8802 sec
[2021-05-15 19:39:26,084 INFO] Step 40550/50000; acc:  74.10; ppl:  2.31; xent: 0.84; lr: 0.00010; 9768/4162 tok/s;   8812 sec
[2021-05-15 19:39:37,246 INFO] Step 40600/50000; acc:  73.41; ppl:  2.37; xent: 0.86; lr: 0.00010; 9301/3877 tok/s;   8824 sec
[2021-05-15 19:39:47,828 INFO] Step 40650/50000; acc:  73.82; ppl:  2.33; xent: 0.84; lr: 0.00010; 9306/4000 tok/s;   8834 sec
[2021-05-15 19:39:59,121 INFO] Step 40700/50000; acc:  73.74; ppl:  2.32; xent: 0.84; lr: 0.00010; 9307/3766 tok/s;   8846 sec
[2021-05-15 19:40:09,186 INFO] Step 40750/50000; acc:  73.52; ppl:  2.35; xent: 0.85; lr: 0.00010; 9941/4227 tok/s;   8856 sec
[2021-05-15 19:40:19,915 INFO] Step 40800/50000; acc:  73.75; ppl:  2.35; xent: 0.85; lr: 0.00010; 9556/3976 tok/s;   8866 sec
[2021-05-15 19:40:26,925 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 19:40:30,995 INFO] Step 40850/50000; acc:  73.94; ppl:  2.34; xent: 0.85; lr: 0.00010; 9253/3901 tok/s;   8877 sec
[2021-05-15 19:40:41,346 INFO] Step 40900/50000; acc:  73.88; ppl:  2.33; xent: 0.85; lr: 0.00010; 9652/4099 tok/s;   8888 sec
[2021-05-15 19:40:53,065 INFO] Step 40950/50000; acc:  73.14; ppl:  2.37; xent: 0.86; lr: 0.00010; 8782/3666 tok/s;   8899 sec
[2021-05-15 19:41:03,859 INFO] Step 41000/50000; acc:  73.92; ppl:  2.32; xent: 0.84; lr: 0.00010; 9152/4003 tok/s;   8910 sec
[2021-05-15 19:41:14,679 INFO] Step 41050/50000; acc:  73.14; ppl:  2.37; xent: 0.86; lr: 0.00010; 9683/3992 tok/s;   8921 sec
[2021-05-15 19:41:24,873 INFO] Step 41100/50000; acc:  73.96; ppl:  2.31; xent: 0.84; lr: 0.00010; 9641/4142 tok/s;   8931 sec
[2021-05-15 19:41:35,666 INFO] Step 41150/50000; acc:  73.72; ppl:  2.35; xent: 0.86; lr: 0.00010; 9718/3982 tok/s;   8942 sec
[2021-05-15 19:41:46,386 INFO] Step 41200/50000; acc:  74.20; ppl:  2.29; xent: 0.83; lr: 0.00010; 9387/3883 tok/s;   8953 sec
[2021-05-15 19:41:57,825 INFO] Step 41250/50000; acc:  73.35; ppl:  2.35; xent: 0.86; lr: 0.00010; 9002/3819 tok/s;   8964 sec
[2021-05-15 19:42:08,648 INFO] Step 41300/50000; acc:  73.97; ppl:  2.32; xent: 0.84; lr: 0.00010; 9399/3895 tok/s;   8975 sec
[2021-05-15 19:42:19,016 INFO] Step 41350/50000; acc:  73.74; ppl:  2.33; xent: 0.84; lr: 0.00010; 9690/4110 tok/s;   8985 sec
[2021-05-15 19:42:30,550 INFO] Step 41400/50000; acc:  73.70; ppl:  2.35; xent: 0.85; lr: 0.00010; 8966/3767 tok/s;   8997 sec
[2021-05-15 19:42:41,114 INFO] Step 41450/50000; acc:  74.12; ppl:  2.28; xent: 0.83; lr: 0.00010; 9359/3937 tok/s;   9008 sec
[2021-05-15 19:42:51,509 INFO] Step 41500/50000; acc:  73.74; ppl:  2.35; xent: 0.85; lr: 0.00010; 10028/4146 tok/s;   9018 sec
[2021-05-15 19:43:02,026 INFO] Step 41550/50000; acc:  74.09; ppl:  2.31; xent: 0.84; lr: 0.00010; 9498/4081 tok/s;   9028 sec
[2021-05-15 19:43:06,169 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 19:43:12,849 INFO] Step 41600/50000; acc:  73.41; ppl:  2.34; xent: 0.85; lr: 0.00010; 9547/3964 tok/s;   9039 sec
[2021-05-15 19:43:23,833 INFO] Step 41650/50000; acc:  73.78; ppl:  2.35; xent: 0.85; lr: 0.00010; 9339/3855 tok/s;   9050 sec
[2021-05-15 19:43:34,961 INFO] Step 41700/50000; acc:  74.07; ppl:  2.31; xent: 0.84; lr: 0.00010; 8882/3887 tok/s;   9061 sec
[2021-05-15 19:43:45,682 INFO] Step 41750/50000; acc:  73.84; ppl:  2.32; xent: 0.84; lr: 0.00010; 9609/3977 tok/s;   9072 sec
[2021-05-15 19:43:56,325 INFO] Step 41800/50000; acc:  73.80; ppl:  2.32; xent: 0.84; lr: 0.00010; 9315/4089 tok/s;   9083 sec
[2021-05-15 19:44:07,071 INFO] Step 41850/50000; acc:  73.80; ppl:  2.34; xent: 0.85; lr: 0.00010; 9711/3952 tok/s;   9093 sec
[2021-05-15 19:44:16,908 INFO] Step 41900/50000; acc:  74.29; ppl:  2.29; xent: 0.83; lr: 0.00010; 10205/4243 tok/s;   9103 sec
[2021-05-15 19:44:28,628 INFO] Step 41950/50000; acc:  73.80; ppl:  2.34; xent: 0.85; lr: 0.00010; 8909/3688 tok/s;   9115 sec
[2021-05-15 19:44:39,186 INFO] Step 42000/50000; acc:  74.30; ppl:  2.30; xent: 0.83; lr: 0.00010; 9436/4018 tok/s;   9126 sec
[2021-05-15 19:44:49,816 INFO] Step 42050/50000; acc:  73.94; ppl:  2.32; xent: 0.84; lr: 0.00010; 9689/4047 tok/s;   9136 sec
[2021-05-15 19:45:00,714 INFO] Step 42100/50000; acc:  73.80; ppl:  2.33; xent: 0.85; lr: 0.00010; 9343/3901 tok/s;   9147 sec
[2021-05-15 19:45:11,705 INFO] Step 42150/50000; acc:  74.50; ppl:  2.26; xent: 0.81; lr: 0.00010; 9156/3910 tok/s;   9158 sec
[2021-05-15 19:45:22,191 INFO] Step 42200/50000; acc:  74.01; ppl:  2.32; xent: 0.84; lr: 0.00010; 9849/4016 tok/s;   9169 sec
[2021-05-15 19:45:32,607 INFO] Step 42250/50000; acc:  74.16; ppl:  2.29; xent: 0.83; lr: 0.00010; 9428/4093 tok/s;   9179 sec
[2021-05-15 19:45:37,563 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 19:45:43,654 INFO] Step 42300/50000; acc:  73.67; ppl:  2.34; xent: 0.85; lr: 0.00010; 9500/3892 tok/s;   9190 sec
[2021-05-15 19:45:54,530 INFO] Step 42350/50000; acc:  74.12; ppl:  2.30; xent: 0.83; lr: 0.00010; 9243/3909 tok/s;   9201 sec
[2021-05-15 19:46:05,037 INFO] Step 42400/50000; acc:  73.82; ppl:  2.34; xent: 0.85; lr: 0.00010; 9764/4089 tok/s;   9211 sec
[2021-05-15 19:46:16,189 INFO] Step 42450/50000; acc:  73.80; ppl:  2.32; xent: 0.84; lr: 0.00010; 9109/3899 tok/s;   9223 sec
[2021-05-15 19:46:26,722 INFO] Step 42500/50000; acc:  74.02; ppl:  2.30; xent: 0.83; lr: 0.00010; 9422/4086 tok/s;   9233 sec
[2021-05-15 19:46:37,101 INFO] Step 42550/50000; acc:  74.10; ppl:  2.31; xent: 0.84; lr: 0.00010; 9911/4103 tok/s;   9243 sec
[2021-05-15 19:46:47,413 INFO] Step 42600/50000; acc:  74.38; ppl:  2.28; xent: 0.83; lr: 0.00010; 9661/4081 tok/s;   9254 sec
[2021-05-15 19:46:58,771 INFO] Step 42650/50000; acc:  73.74; ppl:  2.33; xent: 0.84; lr: 0.00010; 9400/3733 tok/s;   9265 sec
[2021-05-15 19:47:09,395 INFO] Step 42700/50000; acc:  74.33; ppl:  2.28; xent: 0.82; lr: 0.00010; 9298/4076 tok/s;   9276 sec
[2021-05-15 19:47:20,470 INFO] Step 42750/50000; acc:  73.96; ppl:  2.32; xent: 0.84; lr: 0.00010; 9398/3842 tok/s;   9287 sec
[2021-05-15 19:47:31,017 INFO] Step 42800/50000; acc:  74.19; ppl:  2.29; xent: 0.83; lr: 0.00010; 9438/4058 tok/s;   9297 sec
[2021-05-15 19:47:41,943 INFO] Step 42850/50000; acc:  74.15; ppl:  2.30; xent: 0.83; lr: 0.00010; 9403/3913 tok/s;   9308 sec
[2021-05-15 19:47:53,212 INFO] Step 42900/50000; acc:  74.56; ppl:  2.28; xent: 0.82; lr: 0.00010; 9108/3773 tok/s;   9320 sec
[2021-05-15 19:48:03,379 INFO] Step 42950/50000; acc:  74.46; ppl:  2.27; xent: 0.82; lr: 0.00010; 9825/4140 tok/s;   9330 sec
[2021-05-15 19:48:13,965 INFO] Step 43000/50000; acc:  73.79; ppl:  2.33; xent: 0.85; lr: 0.00010; 9716/4035 tok/s;   9340 sec
[2021-05-15 19:48:15,887 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 19:48:24,931 INFO] Step 43050/50000; acc:  74.54; ppl:  2.26; xent: 0.82; lr: 0.00010; 9015/3952 tok/s;   9351 sec
[2021-05-15 19:48:35,715 INFO] Step 43100/50000; acc:  73.64; ppl:  2.34; xent: 0.85; lr: 0.00010; 9734/3925 tok/s;   9362 sec
[2021-05-15 19:48:46,997 INFO] Step 43150/50000; acc:  74.22; ppl:  2.29; xent: 0.83; lr: 0.00010; 8803/3845 tok/s;   9373 sec
[2021-05-15 19:48:57,805 INFO] Step 43200/50000; acc:  73.82; ppl:  2.31; xent: 0.84; lr: 0.00010; 9539/4017 tok/s;   9384 sec
[2021-05-15 19:49:08,136 INFO] Step 43250/50000; acc:  74.15; ppl:  2.30; xent: 0.83; lr: 0.00010; 9857/4137 tok/s;   9395 sec
[2021-05-15 19:49:18,307 INFO] Step 43300/50000; acc:  74.69; ppl:  2.26; xent: 0.81; lr: 0.00010; 9706/4162 tok/s;   9405 sec
[2021-05-15 19:49:29,040 INFO] Step 43350/50000; acc:  73.91; ppl:  2.32; xent: 0.84; lr: 0.00010; 9750/3939 tok/s;   9415 sec
[2021-05-15 19:49:40,505 INFO] Step 43400/50000; acc:  74.52; ppl:  2.25; xent: 0.81; lr: 0.00010; 8753/3708 tok/s;   9427 sec
[2021-05-15 19:49:51,362 INFO] Step 43450/50000; acc:  73.94; ppl:  2.32; xent: 0.84; lr: 0.00010; 9668/3973 tok/s;   9438 sec
[2021-05-15 19:50:01,953 INFO] Step 43500/50000; acc:  74.93; ppl:  2.25; xent: 0.81; lr: 0.00010; 9327/3964 tok/s;   9448 sec
[2021-05-15 19:50:12,782 INFO] Step 43550/50000; acc:  73.58; ppl:  2.33; xent: 0.85; lr: 0.00010; 9614/3989 tok/s;   9459 sec
[2021-05-15 19:50:23,523 INFO] Step 43600/50000; acc:  74.90; ppl:  2.26; xent: 0.81; lr: 0.00010; 9293/3973 tok/s;   9470 sec
[2021-05-15 19:50:34,247 INFO] Step 43650/50000; acc:  74.13; ppl:  2.28; xent: 0.82; lr: 0.00010; 9637/3989 tok/s;   9481 sec
[2021-05-15 19:50:44,510 INFO] Step 43700/50000; acc:  74.53; ppl:  2.28; xent: 0.83; lr: 0.00010; 9858/4076 tok/s;   9491 sec
[2021-05-15 19:50:53,989 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 19:50:55,109 INFO] Step 43750/50000; acc:  74.48; ppl:  2.27; xent: 0.82; lr: 0.00010; 9470/4075 tok/s;   9501 sec
[2021-05-15 19:51:06,166 INFO] Step 43800/50000; acc:  74.07; ppl:  2.29; xent: 0.83; lr: 0.00010; 9334/3866 tok/s;   9513 sec
[2021-05-15 19:51:16,587 INFO] Step 43850/50000; acc:  74.43; ppl:  2.28; xent: 0.82; lr: 0.00010; 9476/4071 tok/s;   9523 sec
[2021-05-15 19:51:28,106 INFO] Step 43900/50000; acc:  74.00; ppl:  2.30; xent: 0.83; lr: 0.00010; 9014/3819 tok/s;   9534 sec
[2021-05-15 19:51:38,811 INFO] Step 43950/50000; acc:  74.36; ppl:  2.28; xent: 0.82; lr: 0.00010; 9349/4029 tok/s;   9545 sec
[2021-05-15 19:51:49,247 INFO] Step 44000/50000; acc:  74.05; ppl:  2.30; xent: 0.83; lr: 0.00010; 9866/4055 tok/s;   9556 sec
[2021-05-15 19:52:00,146 INFO] Step 44050/50000; acc:  74.73; ppl:  2.28; xent: 0.82; lr: 0.00010; 9364/3969 tok/s;   9567 sec
[2021-05-15 19:52:10,472 INFO] Step 44100/50000; acc:  74.70; ppl:  2.24; xent: 0.81; lr: 0.00010; 9743/3984 tok/s;   9577 sec
[2021-05-15 19:52:22,129 INFO] Step 44150/50000; acc:  74.34; ppl:  2.28; xent: 0.83; lr: 0.00010; 8934/3732 tok/s;   9589 sec
[2021-05-15 19:52:32,226 INFO] Step 44200/50000; acc:  74.82; ppl:  2.25; xent: 0.81; lr: 0.00010; 9785/4149 tok/s;   9599 sec
[2021-05-15 19:52:43,516 INFO] Step 44250/50000; acc:  74.16; ppl:  2.31; xent: 0.84; lr: 0.00010; 9341/3876 tok/s;   9610 sec
[2021-05-15 19:52:54,481 INFO] Step 44300/50000; acc:  74.83; ppl:  2.25; xent: 0.81; lr: 0.00010; 8999/3889 tok/s;   9621 sec
[2021-05-15 19:53:05,523 INFO] Step 44350/50000; acc:  74.64; ppl:  2.26; xent: 0.82; lr: 0.00010; 9475/3819 tok/s;   9632 sec
[2021-05-15 19:53:15,578 INFO] Step 44400/50000; acc:  74.55; ppl:  2.27; xent: 0.82; lr: 0.00010; 9852/4209 tok/s;   9642 sec
[2021-05-15 19:53:26,403 INFO] Step 44450/50000; acc:  74.50; ppl:  2.27; xent: 0.82; lr: 0.00010; 9448/3971 tok/s;   9653 sec
[2021-05-15 19:53:32,930 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 19:53:37,339 INFO] Step 44500/50000; acc:  74.39; ppl:  2.28; xent: 0.82; lr: 0.00010; 9359/3928 tok/s;   9664 sec
[2021-05-15 19:53:47,977 INFO] Step 44550/50000; acc:  74.56; ppl:  2.27; xent: 0.82; lr: 0.00010; 9490/3997 tok/s;   9674 sec
[2021-05-15 19:53:59,577 INFO] Step 44600/50000; acc:  73.79; ppl:  2.30; xent: 0.83; lr: 0.00010; 8838/3700 tok/s;   9686 sec
[2021-05-15 19:54:10,417 INFO] Step 44650/50000; acc:  74.67; ppl:  2.24; xent: 0.81; lr: 0.00010; 9040/4011 tok/s;   9697 sec
[2021-05-15 19:54:21,169 INFO] Step 44700/50000; acc:  73.99; ppl:  2.31; xent: 0.84; lr: 0.00010; 9726/4000 tok/s;   9708 sec
[2021-05-15 19:54:31,338 INFO] Step 44750/50000; acc:  74.85; ppl:  2.25; xent: 0.81; lr: 0.00010; 9793/4164 tok/s;   9718 sec
[2021-05-15 19:54:42,039 INFO] Step 44800/50000; acc:  74.62; ppl:  2.28; xent: 0.82; lr: 0.00010; 9736/4006 tok/s;   9728 sec
[2021-05-15 19:54:53,267 INFO] Step 44850/50000; acc:  74.55; ppl:  2.26; xent: 0.82; lr: 0.00010; 9197/3720 tok/s;   9740 sec
[2021-05-15 19:55:04,095 INFO] Step 44900/50000; acc:  74.56; ppl:  2.26; xent: 0.81; lr: 0.00010; 9195/4011 tok/s;   9750 sec
[2021-05-15 19:55:15,013 INFO] Step 44950/50000; acc:  74.83; ppl:  2.26; xent: 0.82; lr: 0.00010; 9459/3858 tok/s;   9761 sec
[2021-05-15 19:55:25,361 INFO] Step 45000/50000; acc:  74.60; ppl:  2.24; xent: 0.81; lr: 0.00010; 9616/4100 tok/s;   9772 sec
[2021-05-15 19:55:25,365 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/strict/valid.txt, align=None)...
[2021-05-15 19:55:34,125 INFO] Validation perplexity: 2.9625
[2021-05-15 19:55:34,125 INFO] Validation accuracy: 69.0564
[2021-05-15 19:55:34,127 INFO] Saving checkpoint ../models/group2_params/strict_ops/model_step_45000.pt
[2021-05-15 19:55:46,325 INFO] Step 45050/50000; acc:  74.36; ppl:  2.29; xent: 0.83; lr: 0.00010; 5002/2089 tok/s;   9793 sec
[2021-05-15 19:55:56,805 INFO] Step 45100/50000; acc:  75.15; ppl:  2.20; xent: 0.79; lr: 0.00010; 9472/3967 tok/s;   9803 sec
[2021-05-15 19:56:07,194 INFO] Step 45150/50000; acc:  74.43; ppl:  2.29; xent: 0.83; lr: 0.00010; 9972/4170 tok/s;   9814 sec
[2021-05-15 19:56:17,718 INFO] Step 45200/50000; acc:  74.74; ppl:  2.24; xent: 0.81; lr: 0.00010; 9425/4050 tok/s;   9824 sec
[2021-05-15 19:56:21,381 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 19:56:28,428 INFO] Step 45250/50000; acc:  74.42; ppl:  2.26; xent: 0.82; lr: 0.00010; 9620/4004 tok/s;   9835 sec
[2021-05-15 19:56:39,331 INFO] Step 45300/50000; acc:  74.48; ppl:  2.28; xent: 0.83; lr: 0.00010; 9375/3905 tok/s;   9846 sec
[2021-05-15 19:56:50,471 INFO] Step 45350/50000; acc:  74.60; ppl:  2.25; xent: 0.81; lr: 0.00010; 8952/3902 tok/s;   9857 sec
[2021-05-15 19:57:01,231 INFO] Step 45400/50000; acc:  74.64; ppl:  2.26; xent: 0.82; lr: 0.00010; 9551/3940 tok/s;   9868 sec
[2021-05-15 19:57:11,858 INFO] Step 45450/50000; acc:  74.76; ppl:  2.24; xent: 0.81; lr: 0.00010; 9251/4095 tok/s;   9878 sec
[2021-05-15 19:57:22,532 INFO] Step 45500/50000; acc:  74.75; ppl:  2.27; xent: 0.82; lr: 0.00010; 9770/3971 tok/s;   9889 sec
[2021-05-15 19:57:33,037 INFO] Step 45550/50000; acc:  74.78; ppl:  2.23; xent: 0.80; lr: 0.00010; 9709/3974 tok/s;   9899 sec
[2021-05-15 19:57:44,497 INFO] Step 45600/50000; acc:  74.58; ppl:  2.27; xent: 0.82; lr: 0.00010; 9016/3781 tok/s;   9911 sec
[2021-05-15 19:57:55,308 INFO] Step 45650/50000; acc:  74.55; ppl:  2.26; xent: 0.82; lr: 0.00010; 9471/3924 tok/s;   9922 sec
[2021-05-15 19:58:05,711 INFO] Step 45700/50000; acc:  75.05; ppl:  2.23; xent: 0.80; lr: 0.00010; 9577/4127 tok/s;   9932 sec
[2021-05-15 19:58:16,646 INFO] Step 45750/50000; acc:  74.54; ppl:  2.27; xent: 0.82; lr: 0.00010; 9431/3898 tok/s;   9943 sec
[2021-05-15 19:58:27,638 INFO] Step 45800/50000; acc:  75.36; ppl:  2.20; xent: 0.79; lr: 0.00010; 9076/3877 tok/s;   9954 sec
[2021-05-15 19:58:38,132 INFO] Step 45850/50000; acc:  74.54; ppl:  2.27; xent: 0.82; lr: 0.00010; 10000/4037 tok/s;   9965 sec
[2021-05-15 19:58:48,590 INFO] Step 45900/50000; acc:  75.06; ppl:  2.23; xent: 0.80; lr: 0.00010; 9400/4058 tok/s;   9975 sec
[2021-05-15 19:58:53,012 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 19:58:59,836 INFO] Step 45950/50000; acc:  74.44; ppl:  2.27; xent: 0.82; lr: 0.00010; 9293/3855 tok/s;   9986 sec
[2021-05-15 19:59:10,492 INFO] Step 46000/50000; acc:  75.11; ppl:  2.22; xent: 0.80; lr: 0.00010; 9342/3955 tok/s;   9997 sec
[2021-05-15 19:59:21,507 INFO] Step 46050/50000; acc:  74.61; ppl:  2.26; xent: 0.82; lr: 0.00010; 9292/3923 tok/s;  10008 sec
[2021-05-15 19:59:32,774 INFO] Step 46100/50000; acc:  74.78; ppl:  2.25; xent: 0.81; lr: 0.00010; 8999/3886 tok/s;  10019 sec
[2021-05-15 19:59:43,117 INFO] Step 46150/50000; acc:  74.90; ppl:  2.24; xent: 0.81; lr: 0.00010; 9694/4144 tok/s;  10030 sec
[2021-05-15 19:59:53,736 INFO] Step 46200/50000; acc:  74.80; ppl:  2.26; xent: 0.81; lr: 0.00010; 9656/4001 tok/s;  10040 sec
[2021-05-15 20:00:03,988 INFO] Step 46250/50000; acc:  75.24; ppl:  2.20; xent: 0.79; lr: 0.00010; 9647/4129 tok/s;  10050 sec
[2021-05-15 20:00:15,372 INFO] Step 46300/50000; acc:  74.70; ppl:  2.26; xent: 0.81; lr: 0.00010; 9355/3705 tok/s;  10062 sec
[2021-05-15 20:00:25,968 INFO] Step 46350/50000; acc:  75.09; ppl:  2.23; xent: 0.80; lr: 0.00010; 9467/4073 tok/s;  10072 sec
[2021-05-15 20:00:36,861 INFO] Step 46400/50000; acc:  75.06; ppl:  2.24; xent: 0.80; lr: 0.00010; 9470/3901 tok/s;  10083 sec
[2021-05-15 20:00:47,637 INFO] Step 46450/50000; acc:  74.61; ppl:  2.25; xent: 0.81; lr: 0.00010; 9483/4003 tok/s;  10094 sec
[2021-05-15 20:00:58,345 INFO] Step 46500/50000; acc:  75.33; ppl:  2.21; xent: 0.79; lr: 0.00010; 9279/3949 tok/s;  10105 sec
[2021-05-15 20:01:09,696 INFO] Step 46550/50000; acc:  74.84; ppl:  2.24; xent: 0.80; lr: 0.00010; 9172/3780 tok/s;  10116 sec
[2021-05-15 20:01:19,796 INFO] Step 46600/50000; acc:  75.51; ppl:  2.19; xent: 0.78; lr: 0.00010; 9791/4126 tok/s;  10126 sec
[2021-05-15 20:01:30,684 INFO] Step 46650/50000; acc:  74.45; ppl:  2.28; xent: 0.82; lr: 0.00010; 9613/3959 tok/s;  10137 sec
[2021-05-15 20:01:31,975 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 20:01:41,459 INFO] Step 46700/50000; acc:  75.38; ppl:  2.19; xent: 0.78; lr: 0.00010; 9183/4007 tok/s;  10148 sec
[2021-05-15 20:01:52,248 INFO] Step 46750/50000; acc:  74.51; ppl:  2.27; xent: 0.82; lr: 0.00010; 9690/3934 tok/s;  10159 sec
[2021-05-15 20:02:03,586 INFO] Step 46800/50000; acc:  75.08; ppl:  2.22; xent: 0.80; lr: 0.00010; 8664/3853 tok/s;  10170 sec
[2021-05-15 20:02:14,405 INFO] Step 46850/50000; acc:  74.74; ppl:  2.24; xent: 0.81; lr: 0.00010; 9496/3968 tok/s;  10181 sec
[2021-05-15 20:02:24,863 INFO] Step 46900/50000; acc:  75.07; ppl:  2.24; xent: 0.80; lr: 0.00010; 9740/4058 tok/s;  10191 sec
[2021-05-15 20:02:35,301 INFO] Step 46950/50000; acc:  75.42; ppl:  2.21; xent: 0.79; lr: 0.00010; 9547/4111 tok/s;  10202 sec
[2021-05-15 20:02:46,158 INFO] Step 47000/50000; acc:  74.67; ppl:  2.25; xent: 0.81; lr: 0.00010; 9618/3871 tok/s;  10213 sec
[2021-05-15 20:02:57,243 INFO] Step 47050/50000; acc:  75.29; ppl:  2.19; xent: 0.78; lr: 0.00010; 8970/3855 tok/s;  10224 sec
[2021-05-15 20:03:07,916 INFO] Step 47100/50000; acc:  74.82; ppl:  2.25; xent: 0.81; lr: 0.00010; 9813/4010 tok/s;  10234 sec
[2021-05-15 20:03:18,563 INFO] Step 47150/50000; acc:  75.31; ppl:  2.21; xent: 0.79; lr: 0.00010; 9428/3970 tok/s;  10245 sec
[2021-05-15 20:03:29,506 INFO] Step 47200/50000; acc:  74.69; ppl:  2.25; xent: 0.81; lr: 0.00010; 9417/3945 tok/s;  10256 sec
[2021-05-15 20:03:40,431 INFO] Step 47250/50000; acc:  75.45; ppl:  2.20; xent: 0.79; lr: 0.00010; 9390/3902 tok/s;  10267 sec
[2021-05-15 20:03:50,776 INFO] Step 47300/50000; acc:  75.53; ppl:  2.19; xent: 0.78; lr: 0.00010; 9637/4115 tok/s;  10277 sec
[2021-05-15 20:04:01,069 INFO] Step 47350/50000; acc:  75.17; ppl:  2.23; xent: 0.80; lr: 0.00010; 9974/4086 tok/s;  10287 sec
[2021-05-15 20:04:10,028 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 20:04:11,512 INFO] Step 47400/50000; acc:  75.14; ppl:  2.21; xent: 0.79; lr: 0.00010; 9539/4105 tok/s;  10298 sec
[2021-05-15 20:04:22,864 INFO] Step 47450/50000; acc:  74.84; ppl:  2.24; xent: 0.81; lr: 0.00010; 9236/3821 tok/s;  10309 sec
[2021-05-15 20:04:33,463 INFO] Step 47500/50000; acc:  74.98; ppl:  2.22; xent: 0.80; lr: 0.00010; 9324/3980 tok/s;  10320 sec
[2021-05-15 20:04:45,030 INFO] Step 47550/50000; acc:  74.81; ppl:  2.23; xent: 0.80; lr: 0.00010; 8938/3803 tok/s;  10331 sec
[2021-05-15 20:04:55,514 INFO] Step 47600/50000; acc:  75.42; ppl:  2.20; xent: 0.79; lr: 0.00010; 9473/4077 tok/s;  10342 sec
[2021-05-15 20:05:05,910 INFO] Step 47650/50000; acc:  74.98; ppl:  2.23; xent: 0.80; lr: 0.00010; 9858/4096 tok/s;  10352 sec
[2021-05-15 20:05:16,786 INFO] Step 47700/50000; acc:  75.32; ppl:  2.22; xent: 0.80; lr: 0.00010; 9387/3967 tok/s;  10363 sec
[2021-05-15 20:05:27,103 INFO] Step 47750/50000; acc:  75.60; ppl:  2.17; xent: 0.78; lr: 0.00010; 9864/3994 tok/s;  10373 sec
[2021-05-15 20:05:38,614 INFO] Step 47800/50000; acc:  74.92; ppl:  2.24; xent: 0.81; lr: 0.00010; 8996/3786 tok/s;  10385 sec
[2021-05-15 20:05:48,598 INFO] Step 47850/50000; acc:  75.71; ppl:  2.18; xent: 0.78; lr: 0.00010; 9825/4212 tok/s;  10395 sec
[2021-05-15 20:05:59,763 INFO] Step 47900/50000; acc:  75.18; ppl:  2.23; xent: 0.80; lr: 0.00010; 9429/3872 tok/s;  10406 sec
[2021-05-15 20:06:10,906 INFO] Step 47950/50000; acc:  75.39; ppl:  2.20; xent: 0.79; lr: 0.00010; 8982/3842 tok/s;  10417 sec
[2021-05-15 20:06:21,907 INFO] Step 48000/50000; acc:  75.32; ppl:  2.20; xent: 0.79; lr: 0.00010; 9408/3867 tok/s;  10428 sec
[2021-05-15 20:06:32,193 INFO] Step 48050/50000; acc:  74.86; ppl:  2.24; xent: 0.81; lr: 0.00010; 9909/4107 tok/s;  10439 sec
[2021-05-15 20:06:42,625 INFO] Step 48100/50000; acc:  75.69; ppl:  2.17; xent: 0.77; lr: 0.00010; 9478/4095 tok/s;  10449 sec
[2021-05-15 20:06:48,918 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 20:06:53,648 INFO] Step 48150/50000; acc:  74.67; ppl:  2.24; xent: 0.81; lr: 0.00010; 9411/3898 tok/s;  10460 sec
[2021-05-15 20:07:04,386 INFO] Step 48200/50000; acc:  75.48; ppl:  2.19; xent: 0.78; lr: 0.00010; 9330/3934 tok/s;  10471 sec
[2021-05-15 20:07:16,134 INFO] Step 48250/50000; acc:  74.48; ppl:  2.25; xent: 0.81; lr: 0.00010; 8859/3715 tok/s;  10483 sec
[2021-05-15 20:07:26,987 INFO] Step 48300/50000; acc:  75.74; ppl:  2.17; xent: 0.78; lr: 0.00010; 9040/3996 tok/s;  10493 sec
[2021-05-15 20:07:37,732 INFO] Step 48350/50000; acc:  75.07; ppl:  2.23; xent: 0.80; lr: 0.00010; 9684/4001 tok/s;  10504 sec
[2021-05-15 20:07:47,836 INFO] Step 48400/50000; acc:  75.67; ppl:  2.18; xent: 0.78; lr: 0.00010; 9765/4180 tok/s;  10514 sec
[2021-05-15 20:07:58,359 INFO] Step 48450/50000; acc:  75.23; ppl:  2.20; xent: 0.79; lr: 0.00010; 9878/4046 tok/s;  10525 sec
[2021-05-15 20:08:09,643 INFO] Step 48500/50000; acc:  75.18; ppl:  2.20; xent: 0.79; lr: 0.00010; 9141/3707 tok/s;  10536 sec
[2021-05-15 20:08:20,544 INFO] Step 48550/50000; acc:  75.33; ppl:  2.20; xent: 0.79; lr: 0.00010; 9221/3990 tok/s;  10547 sec
[2021-05-15 20:08:31,130 INFO] Step 48600/50000; acc:  75.69; ppl:  2.19; xent: 0.78; lr: 0.00010; 9730/3968 tok/s;  10558 sec
[2021-05-15 20:08:41,382 INFO] Step 48650/50000; acc:  75.42; ppl:  2.19; xent: 0.78; lr: 0.00010; 9619/4150 tok/s;  10568 sec
[2021-05-15 20:08:52,937 INFO] Step 48700/50000; acc:  75.32; ppl:  2.21; xent: 0.79; lr: 0.00010; 9071/3774 tok/s;  10579 sec
[2021-05-15 20:09:03,442 INFO] Step 48750/50000; acc:  75.80; ppl:  2.15; xent: 0.77; lr: 0.00010; 9576/3989 tok/s;  10590 sec
[2021-05-15 20:09:13,813 INFO] Step 48800/50000; acc:  75.29; ppl:  2.21; xent: 0.79; lr: 0.00010; 9874/4133 tok/s;  10600 sec
[2021-05-15 20:09:24,501 INFO] Step 48850/50000; acc:  75.61; ppl:  2.20; xent: 0.79; lr: 0.00010; 9564/4023 tok/s;  10611 sec
[2021-05-15 20:09:27,667 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 20:09:34,986 INFO] Step 48900/50000; acc:  75.35; ppl:  2.18; xent: 0.78; lr: 0.00010; 9509/4077 tok/s;  10621 sec
[2021-05-15 20:09:46,036 INFO] Step 48950/50000; acc:  75.22; ppl:  2.22; xent: 0.80; lr: 0.00010; 9359/3833 tok/s;  10632 sec
[2021-05-15 20:09:57,068 INFO] Step 49000/50000; acc:  75.42; ppl:  2.19; xent: 0.78; lr: 0.00010; 8961/3964 tok/s;  10643 sec
[2021-05-15 20:10:07,772 INFO] Step 49050/50000; acc:  75.17; ppl:  2.20; xent: 0.79; lr: 0.00010; 9768/4000 tok/s;  10654 sec
[2021-05-15 20:10:18,407 INFO] Step 49100/50000; acc:  75.63; ppl:  2.17; xent: 0.78; lr: 0.00010; 9247/4056 tok/s;  10665 sec
[2021-05-15 20:10:28,874 INFO] Step 49150/50000; acc:  75.50; ppl:  2.20; xent: 0.79; lr: 0.00010; 9923/4051 tok/s;  10675 sec
[2021-05-15 20:10:39,418 INFO] Step 49200/50000; acc:  75.83; ppl:  2.16; xent: 0.77; lr: 0.00010; 9588/3951 tok/s;  10686 sec
[2021-05-15 20:10:50,569 INFO] Step 49250/50000; acc:  75.51; ppl:  2.19; xent: 0.78; lr: 0.00010; 9234/3906 tok/s;  10697 sec
[2021-05-15 20:11:01,474 INFO] Step 49300/50000; acc:  75.54; ppl:  2.19; xent: 0.78; lr: 0.00010; 9381/3868 tok/s;  10708 sec
[2021-05-15 20:11:11,894 INFO] Step 49350/50000; acc:  75.77; ppl:  2.17; xent: 0.77; lr: 0.00010; 9652/4112 tok/s;  10718 sec
[2021-05-15 20:11:22,887 INFO] Step 49400/50000; acc:  75.26; ppl:  2.21; xent: 0.79; lr: 0.00010; 9346/3899 tok/s;  10729 sec
[2021-05-15 20:11:33,593 INFO] Step 49450/50000; acc:  76.06; ppl:  2.13; xent: 0.76; lr: 0.00010; 9253/3966 tok/s;  10740 sec
[2021-05-15 20:11:43,988 INFO] Step 49500/50000; acc:  75.05; ppl:  2.20; xent: 0.79; lr: 0.00010; 10089/4059 tok/s;  10750 sec
[2021-05-15 20:11:54,612 INFO] Step 49550/50000; acc:  75.66; ppl:  2.18; xent: 0.78; lr: 0.00010; 9372/4047 tok/s;  10761 sec
[2021-05-15 20:11:58,674 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/strict/train.txt, align=None)...
[2021-05-15 20:12:05,515 INFO] Step 49600/50000; acc:  75.18; ppl:  2.20; xent: 0.79; lr: 0.00010; 9513/3975 tok/s;  10772 sec
[2021-05-15 20:12:16,517 INFO] Step 49650/50000; acc:  75.58; ppl:  2.17; xent: 0.78; lr: 0.00010; 9290/3819 tok/s;  10783 sec
[2021-05-15 20:12:26,881 INFO] Step 49700/50000; acc:  75.66; ppl:  2.17; xent: 0.78; lr: 0.00010; 9522/4160 tok/s;  10793 sec
[2021-05-15 20:12:38,169 INFO] Step 49750/50000; acc:  75.37; ppl:  2.19; xent: 0.79; lr: 0.00010; 9127/3839 tok/s;  10805 sec
[2021-05-15 20:12:48,472 INFO] Step 49800/50000; acc:  75.68; ppl:  2.17; xent: 0.78; lr: 0.00010; 9645/4184 tok/s;  10815 sec
[2021-05-15 20:12:58,901 INFO] Step 49850/50000; acc:  75.46; ppl:  2.20; xent: 0.79; lr: 0.00010; 9995/4061 tok/s;  10825 sec
[2021-05-15 20:13:09,191 INFO] Step 49900/50000; acc:  76.20; ppl:  2.14; xent: 0.76; lr: 0.00010; 9640/4125 tok/s;  10836 sec
[2021-05-15 20:13:20,913 INFO] Step 49950/50000; acc:  75.10; ppl:  2.20; xent: 0.79; lr: 0.00010; 9035/3654 tok/s;  10847 sec
[2021-05-15 20:13:31,486 INFO] Step 50000/50000; acc:  75.72; ppl:  2.17; xent: 0.77; lr: 0.00005; 9396/4042 tok/s;  10858 sec
[2021-05-15 20:13:31,494 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/strict/valid.txt, align=None)...
[2021-05-15 20:13:40,274 INFO] Validation perplexity: 3.01988
[2021-05-15 20:13:40,274 INFO] Validation accuracy: 68.8088
[2021-05-15 20:13:40,276 INFO] Saving checkpoint ../models/group2_params/strict_ops/model_step_50000.pt

Loosely condensed EditOperations:

modelGroup2Loose = HephaestusModel(MODEL_GROUP2_LOOSE)
modelGroup2Loose.train(
    DATA_SMALL_METHODS_TRAIN_BUGGY,
    DATA_SMALL_OPS_TYPED_LOOSE_TRAIN,
    DATA_SMALL_METHODS_VALID_BUGGY,
    DATA_SMALL_OPS_TYPED_LOOSE_VALID
)
[2021-05-15 13:32:13,749 INFO] Counter vocab from -1 samples.
[2021-05-15 13:32:13,749 INFO] n_sample=-1: Build vocab on full datasets.
[2021-05-15 13:32:13,753 INFO] corpus_1's transforms: TransformPipe()
[2021-05-15 13:32:13,753 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 13:32:14,254 INFO] Counters src:429
[2021-05-15 13:32:14,254 INFO] Counters tgt:448
[2021-05-15 13:32:14,254 WARNING] path ../models/group2_params/loose_ops/save_data.vocab.src exists, may overwrite...
[2021-05-15 13:32:14,256 WARNING] path ../models/group2_params/loose_ops/save_data.vocab.tgt exists, may overwrite...
[2021-05-15 13:32:14,963 INFO] Parsed 2 corpora from -data.
[2021-05-15 13:32:14,963 INFO] Get special vocabs from Transforms: {'src': set(), 'tgt': set()}.
[2021-05-15 13:32:14,963 INFO] Loading vocab from text file...
[2021-05-15 13:32:14,963 INFO] Loading src vocabulary from ../models/group2_params/loose_ops/save_data.vocab.src
[2021-05-15 13:32:14,965 INFO] Loaded src vocab has 429 tokens.
[2021-05-15 13:32:14,965 INFO] Loading tgt vocabulary from ../models/group2_params/loose_ops/save_data.vocab.tgt
[2021-05-15 13:32:14,967 INFO] Loaded tgt vocab has 448 tokens.
[2021-05-15 13:32:14,967 INFO] Building fields with vocab in counters...
[2021-05-15 13:32:14,967 INFO]  * tgt vocab size: 452.
[2021-05-15 13:32:14,968 INFO]  * src vocab size: 431.
[2021-05-15 13:32:14,968 INFO]  * src vocab size = 431
[2021-05-15 13:32:14,968 INFO]  * tgt vocab size = 452
[2021-05-15 13:32:14,969 INFO] Building model...
[2021-05-15 13:32:16,136 INFO] NMTModel(
  (encoder): RNNEncoder(
    (embeddings): Embeddings(
      (make_embedding): Sequential(
        (emb_luts): Elementwise(
          (0): Embedding(431, 512, padding_idx=1)
        )
      )
    )
    (rnn): LSTM(512, 256, num_layers=2, dropout=0.2)
  )
  (decoder): InputFeedRNNDecoder(
    (embeddings): Embeddings(
      (make_embedding): Sequential(
        (emb_luts): Elementwise(
          (0): Embedding(452, 512, padding_idx=1)
        )
      )
    )
    (dropout): Dropout(p=0.2, inplace=False)
    (rnn): StackedLSTM(
      (dropout): Dropout(p=0.2, inplace=False)
      (layers): ModuleList(
        (0): LSTMCell(768, 256)
        (1): LSTMCell(256, 256)
      )
    )
    (attn): GlobalAttention(
      (linear_context): Linear(in_features=256, out_features=256, bias=False)
      (linear_query): Linear(in_features=256, out_features=256, bias=True)
      (v): Linear(in_features=256, out_features=1, bias=False)
      (linear_out): Linear(in_features=512, out_features=256, bias=True)
    )
  )
  (generator): Sequential(
    (0): Linear(in_features=256, out_features=452, bias=True)
    (1): Cast()
    (2): LogSoftmax(dim=-1)
  )
)
[2021-05-15 13:32:16,136 INFO] encoder: 1535488
[2021-05-15 13:32:16,137 INFO] decoder: 2187460
[2021-05-15 13:32:16,137 INFO] * number of parameters: 3722948
[2021-05-15 13:32:16,138 INFO] Starting training on GPU: [0]
[2021-05-15 13:32:16,138 INFO] Start training loop and validate every 5000 steps...
[2021-05-15 13:32:16,139 INFO] corpus_1's transforms: TransformPipe()
[2021-05-15 13:32:16,139 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 13:32:25,899 INFO] Step 50/50000; acc:  11.74; ppl: 198.29; xent: 5.29; lr: 0.00010; 10316/4079 tok/s;     10 sec
[2021-05-15 13:32:35,683 INFO] Step 100/50000; acc:  16.52; ppl: 51.68; xent: 3.95; lr: 0.00010; 10302/4160 tok/s;     20 sec
[2021-05-15 13:32:45,382 INFO] Step 150/50000; acc:  18.70; ppl: 37.24; xent: 3.62; lr: 0.00010; 10442/4231 tok/s;     29 sec
[2021-05-15 13:32:54,715 INFO] Step 200/50000; acc:  25.07; ppl: 28.89; xent: 3.36; lr: 0.00010; 10949/4232 tok/s;     39 sec
[2021-05-15 13:33:04,081 INFO] Step 250/50000; acc:  31.59; ppl: 22.04; xent: 3.09; lr: 0.00010; 10760/4278 tok/s;     48 sec
[2021-05-15 13:33:13,749 INFO] Step 300/50000; acc:  35.71; ppl: 16.50; xent: 2.80; lr: 0.00010; 10492/4180 tok/s;     58 sec
[2021-05-15 13:33:23,673 INFO] Step 350/50000; acc:  39.76; ppl: 12.24; xent: 2.51; lr: 0.00010; 10584/4014 tok/s;     68 sec
[2021-05-15 13:33:33,312 INFO] Step 400/50000; acc:  40.56; ppl: 10.61; xent: 2.36; lr: 0.00010; 10266/4216 tok/s;     77 sec
[2021-05-15 13:33:43,104 INFO] Step 450/50000; acc:  41.65; ppl:  9.69; xent: 2.27; lr: 0.00010; 10681/4066 tok/s;     87 sec
[2021-05-15 13:33:52,324 INFO] Step 500/50000; acc:  41.60; ppl:  9.55; xent: 2.26; lr: 0.00010; 10749/4405 tok/s;     96 sec
[2021-05-15 13:34:02,181 INFO] Step 550/50000; acc:  42.05; ppl:  9.18; xent: 2.22; lr: 0.00010; 10390/4054 tok/s;    106 sec
[2021-05-15 13:34:11,723 INFO] Step 600/50000; acc:  43.06; ppl:  8.71; xent: 2.16; lr: 0.00010; 10812/4207 tok/s;    116 sec
[2021-05-15 13:34:20,879 INFO] Step 650/50000; acc:  43.53; ppl:  8.64; xent: 2.16; lr: 0.00010; 11008/4310 tok/s;    125 sec
[2021-05-15 13:34:30,238 INFO] Step 700/50000; acc:  43.88; ppl:  8.33; xent: 2.12; lr: 0.00010; 10933/4294 tok/s;    134 sec
[2021-05-15 13:34:31,071 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 13:34:39,737 INFO] Step 750/50000; acc:  43.89; ppl:  8.30; xent: 2.12; lr: 0.00010; 10460/4284 tok/s;    144 sec
[2021-05-15 13:34:49,635 INFO] Step 800/50000; acc:  44.66; ppl:  8.00; xent: 2.08; lr: 0.00010; 10577/4066 tok/s;    153 sec
[2021-05-15 13:34:59,851 INFO] Step 850/50000; acc:  44.65; ppl:  7.98; xent: 2.08; lr: 0.00010; 9724/4005 tok/s;    164 sec
[2021-05-15 13:35:09,361 INFO] Step 900/50000; acc:  45.27; ppl:  7.67; xent: 2.04; lr: 0.00010; 10669/4250 tok/s;    173 sec
[2021-05-15 13:35:18,524 INFO] Step 950/50000; acc:  45.96; ppl:  7.39; xent: 2.00; lr: 0.00010; 11071/4352 tok/s;    182 sec
[2021-05-15 13:35:27,983 INFO] Step 1000/50000; acc:  46.53; ppl:  7.22; xent: 1.98; lr: 0.00010; 10759/4270 tok/s;    192 sec
[2021-05-15 13:35:37,709 INFO] Step 1050/50000; acc:  46.71; ppl:  7.08; xent: 1.96; lr: 0.00010; 10541/4102 tok/s;    202 sec
[2021-05-15 13:35:47,842 INFO] Step 1100/50000; acc:  47.04; ppl:  6.84; xent: 1.92; lr: 0.00010; 10066/3978 tok/s;    212 sec
[2021-05-15 13:35:57,663 INFO] Step 1150/50000; acc:  47.47; ppl:  6.70; xent: 1.90; lr: 0.00010; 10507/4092 tok/s;    222 sec
[2021-05-15 13:36:06,905 INFO] Step 1200/50000; acc:  47.96; ppl:  6.60; xent: 1.89; lr: 0.00010; 10737/4344 tok/s;    231 sec
[2021-05-15 13:36:16,659 INFO] Step 1250/50000; acc:  48.05; ppl:  6.61; xent: 1.89; lr: 0.00010; 10721/4113 tok/s;    241 sec
[2021-05-15 13:36:26,323 INFO] Step 1300/50000; acc:  48.08; ppl:  6.36; xent: 1.85; lr: 0.00010; 10289/4148 tok/s;    250 sec
[2021-05-15 13:36:35,447 INFO] Step 1350/50000; acc:  49.13; ppl:  6.22; xent: 1.83; lr: 0.00010; 11281/4363 tok/s;    259 sec
[2021-05-15 13:36:44,635 INFO] Step 1400/50000; acc:  49.35; ppl:  6.10; xent: 1.81; lr: 0.00010; 11050/4407 tok/s;    268 sec
[2021-05-15 13:36:52,442 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 13:36:54,356 INFO] Step 1450/50000; acc:  49.39; ppl:  6.02; xent: 1.79; lr: 0.00010; 10453/4094 tok/s;    278 sec
[2021-05-15 13:37:04,051 INFO] Step 1500/50000; acc:  49.61; ppl:  6.02; xent: 1.80; lr: 0.00010; 10567/4183 tok/s;    288 sec
[2021-05-15 13:37:13,935 INFO] Step 1550/50000; acc:  49.48; ppl:  6.03; xent: 1.80; lr: 0.00010; 10044/4089 tok/s;    298 sec
[2021-05-15 13:37:23,906 INFO] Step 1600/50000; acc:  49.73; ppl:  5.93; xent: 1.78; lr: 0.00010; 10379/4112 tok/s;    308 sec
[2021-05-15 13:37:33,389 INFO] Step 1650/50000; acc:  50.02; ppl:  5.84; xent: 1.77; lr: 0.00010; 10575/4214 tok/s;    317 sec
[2021-05-15 13:37:42,952 INFO] Step 1700/50000; acc:  50.16; ppl:  5.78; xent: 1.75; lr: 0.00010; 10572/4233 tok/s;    327 sec
[2021-05-15 13:37:52,226 INFO] Step 1750/50000; acc:  50.72; ppl:  5.75; xent: 1.75; lr: 0.00010; 11008/4351 tok/s;    336 sec
[2021-05-15 13:38:01,803 INFO] Step 1800/50000; acc:  51.22; ppl:  5.55; xent: 1.71; lr: 0.00010; 10786/4074 tok/s;    346 sec
[2021-05-15 13:38:12,299 INFO] Step 1850/50000; acc:  50.67; ppl:  5.64; xent: 1.73; lr: 0.00010; 9716/3911 tok/s;    356 sec
[2021-05-15 13:38:21,350 INFO] Step 1900/50000; acc:  51.33; ppl:  5.42; xent: 1.69; lr: 0.00010; 11102/4385 tok/s;    365 sec
[2021-05-15 13:38:31,298 INFO] Step 1950/50000; acc:  51.17; ppl:  5.53; xent: 1.71; lr: 0.00010; 10430/4101 tok/s;    375 sec
[2021-05-15 13:38:40,642 INFO] Step 2000/50000; acc:  51.45; ppl:  5.47; xent: 1.70; lr: 0.00010; 10602/4205 tok/s;    385 sec
[2021-05-15 13:38:50,725 INFO] Step 2050/50000; acc:  51.45; ppl:  5.41; xent: 1.69; lr: 0.00010; 10431/4025 tok/s;    395 sec
[2021-05-15 13:38:59,475 INFO] Step 2100/50000; acc:  52.21; ppl:  5.29; xent: 1.66; lr: 0.00010; 11268/4519 tok/s;    403 sec
[2021-05-15 13:39:09,049 INFO] Step 2150/50000; acc:  51.96; ppl:  5.26; xent: 1.66; lr: 0.00010; 10634/4237 tok/s;    413 sec
[2021-05-15 13:39:14,166 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 13:39:18,826 INFO] Step 2200/50000; acc:  52.13; ppl:  5.25; xent: 1.66; lr: 0.00010; 10520/4160 tok/s;    423 sec
[2021-05-15 13:39:28,569 INFO] Step 2250/50000; acc:  52.41; ppl:  5.22; xent: 1.65; lr: 0.00010; 10456/4149 tok/s;    432 sec
[2021-05-15 13:39:38,839 INFO] Step 2300/50000; acc:  52.45; ppl:  5.18; xent: 1.64; lr: 0.00010; 9936/3927 tok/s;    443 sec
[2021-05-15 13:39:48,086 INFO] Step 2350/50000; acc:  52.59; ppl:  5.18; xent: 1.64; lr: 0.00010; 10668/4393 tok/s;    452 sec
[2021-05-15 13:39:57,842 INFO] Step 2400/50000; acc:  52.79; ppl:  5.15; xent: 1.64; lr: 0.00010; 10673/4138 tok/s;    462 sec
[2021-05-15 13:40:07,032 INFO] Step 2450/50000; acc:  53.55; ppl:  5.02; xent: 1.61; lr: 0.00010; 10856/4340 tok/s;    471 sec
[2021-05-15 13:40:16,582 INFO] Step 2500/50000; acc:  53.04; ppl:  5.11; xent: 1.63; lr: 0.00010; 10738/4187 tok/s;    480 sec
[2021-05-15 13:40:26,509 INFO] Step 2550/50000; acc:  53.38; ppl:  5.01; xent: 1.61; lr: 0.00010; 10352/4051 tok/s;    490 sec
[2021-05-15 13:40:36,544 INFO] Step 2600/50000; acc:  53.18; ppl:  4.99; xent: 1.61; lr: 0.00010; 10195/3988 tok/s;    500 sec
[2021-05-15 13:40:46,046 INFO] Step 2650/50000; acc:  53.53; ppl:  4.94; xent: 1.60; lr: 0.00010; 10672/4245 tok/s;    510 sec
[2021-05-15 13:40:55,379 INFO] Step 2700/50000; acc:  53.59; ppl:  4.98; xent: 1.61; lr: 0.00010; 10835/4309 tok/s;    519 sec
[2021-05-15 13:41:05,272 INFO] Step 2750/50000; acc:  53.66; ppl:  4.96; xent: 1.60; lr: 0.00010; 10424/4061 tok/s;    529 sec
[2021-05-15 13:41:14,535 INFO] Step 2800/50000; acc:  54.63; ppl:  4.79; xent: 1.57; lr: 0.00010; 10763/4240 tok/s;    538 sec
[2021-05-15 13:41:23,890 INFO] Step 2850/50000; acc:  54.18; ppl:  4.85; xent: 1.58; lr: 0.00010; 11118/4341 tok/s;    548 sec
[2021-05-15 13:41:33,166 INFO] Step 2900/50000; acc:  54.88; ppl:  4.73; xent: 1.55; lr: 0.00010; 10655/4349 tok/s;    557 sec
[2021-05-15 13:41:35,616 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 13:41:42,586 INFO] Step 2950/50000; acc:  54.85; ppl:  4.75; xent: 1.56; lr: 0.00010; 10901/4287 tok/s;    566 sec
[2021-05-15 13:41:52,365 INFO] Step 3000/50000; acc:  54.72; ppl:  4.81; xent: 1.57; lr: 0.00010; 10477/4116 tok/s;    576 sec
[2021-05-15 13:42:02,554 INFO] Step 3050/50000; acc:  54.51; ppl:  4.79; xent: 1.57; lr: 0.00010; 9904/4022 tok/s;    586 sec
[2021-05-15 13:42:11,927 INFO] Step 3100/50000; acc:  54.72; ppl:  4.73; xent: 1.55; lr: 0.00010; 10905/4308 tok/s;    596 sec
[2021-05-15 13:42:21,447 INFO] Step 3150/50000; acc:  55.39; ppl:  4.68; xent: 1.54; lr: 0.00010; 10388/4177 tok/s;    605 sec
[2021-05-15 13:42:30,845 INFO] Step 3200/50000; acc:  55.16; ppl:  4.69; xent: 1.55; lr: 0.00010; 11055/4304 tok/s;    615 sec
[2021-05-15 13:42:40,399 INFO] Step 3250/50000; acc:  55.61; ppl:  4.66; xent: 1.54; lr: 0.00010; 10706/4182 tok/s;    624 sec
[2021-05-15 13:42:50,224 INFO] Step 3300/50000; acc:  55.60; ppl:  4.63; xent: 1.53; lr: 0.00010; 10323/4079 tok/s;    634 sec
[2021-05-15 13:42:59,754 INFO] Step 3350/50000; acc:  55.38; ppl:  4.65; xent: 1.54; lr: 0.00010; 10694/4225 tok/s;    644 sec
[2021-05-15 13:43:09,303 INFO] Step 3400/50000; acc:  55.74; ppl:  4.60; xent: 1.53; lr: 0.00010; 10744/4231 tok/s;    653 sec
[2021-05-15 13:43:18,774 INFO] Step 3450/50000; acc:  55.80; ppl:  4.64; xent: 1.53; lr: 0.00010; 10668/4206 tok/s;    663 sec
[2021-05-15 13:43:28,568 INFO] Step 3500/50000; acc:  56.04; ppl:  4.50; xent: 1.50; lr: 0.00010; 10369/4139 tok/s;    672 sec
[2021-05-15 13:43:37,439 INFO] Step 3550/50000; acc:  56.45; ppl:  4.53; xent: 1.51; lr: 0.00010; 11623/4393 tok/s;    681 sec
[2021-05-15 13:43:46,851 INFO] Step 3600/50000; acc:  56.63; ppl:  4.48; xent: 1.50; lr: 0.00010; 10496/4285 tok/s;    691 sec
[2021-05-15 13:43:49,877 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 13:43:56,649 INFO] Step 3650/50000; acc:  56.39; ppl:  4.48; xent: 1.50; lr: 0.00010; 10716/4175 tok/s;    701 sec
[2021-05-15 13:44:06,150 INFO] Step 3700/50000; acc:  57.04; ppl:  4.39; xent: 1.48; lr: 0.00010; 10446/4190 tok/s;    710 sec
[2021-05-15 13:44:16,327 INFO] Step 3750/50000; acc:  55.96; ppl:  4.55; xent: 1.51; lr: 0.00010; 10008/4021 tok/s;    720 sec
[2021-05-15 13:44:25,950 INFO] Step 3800/50000; acc:  56.64; ppl:  4.43; xent: 1.49; lr: 0.00010; 10589/4244 tok/s;    730 sec
[2021-05-15 13:44:35,277 INFO] Step 3850/50000; acc:  56.92; ppl:  4.43; xent: 1.49; lr: 0.00010; 10861/4256 tok/s;    739 sec
[2021-05-15 13:44:44,738 INFO] Step 3900/50000; acc:  56.97; ppl:  4.41; xent: 1.48; lr: 0.00010; 10775/4240 tok/s;    749 sec
[2021-05-15 13:44:54,014 INFO] Step 3950/50000; acc:  57.22; ppl:  4.40; xent: 1.48; lr: 0.00010; 10735/4334 tok/s;    758 sec
[2021-05-15 13:45:03,900 INFO] Step 4000/50000; acc:  56.68; ppl:  4.45; xent: 1.49; lr: 0.00010; 10734/4033 tok/s;    768 sec
[2021-05-15 13:45:13,721 INFO] Step 4050/50000; acc:  57.14; ppl:  4.36; xent: 1.47; lr: 0.00010; 10209/4162 tok/s;    778 sec
[2021-05-15 13:45:23,242 INFO] Step 4100/50000; acc:  57.94; ppl:  4.28; xent: 1.45; lr: 0.00010; 10659/4175 tok/s;    787 sec
[2021-05-15 13:45:32,563 INFO] Step 4150/50000; acc:  57.05; ppl:  4.41; xent: 1.48; lr: 0.00010; 10910/4334 tok/s;    796 sec
[2021-05-15 13:45:42,314 INFO] Step 4200/50000; acc:  57.57; ppl:  4.38; xent: 1.48; lr: 0.00010; 10509/4097 tok/s;    806 sec
[2021-05-15 13:45:51,947 INFO] Step 4250/50000; acc:  57.79; ppl:  4.26; xent: 1.45; lr: 0.00010; 10584/4187 tok/s;    816 sec
[2021-05-15 13:46:00,965 INFO] Step 4300/50000; acc:  58.14; ppl:  4.22; xent: 1.44; lr: 0.00010; 11135/4372 tok/s;    825 sec
[2021-05-15 13:46:10,458 INFO] Step 4350/50000; acc:  58.00; ppl:  4.24; xent: 1.44; lr: 0.00010; 10856/4242 tok/s;    834 sec
[2021-05-15 13:46:10,888 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 13:46:20,025 INFO] Step 4400/50000; acc:  58.39; ppl:  4.25; xent: 1.45; lr: 0.00010; 10375/4248 tok/s;    844 sec
[2021-05-15 13:46:30,002 INFO] Step 4450/50000; acc:  58.24; ppl:  4.27; xent: 1.45; lr: 0.00010; 10534/4047 tok/s;    854 sec
[2021-05-15 13:46:40,036 INFO] Step 4500/50000; acc:  58.32; ppl:  4.21; xent: 1.44; lr: 0.00010; 9748/4053 tok/s;    864 sec
[2021-05-15 13:46:49,690 INFO] Step 4550/50000; acc:  57.82; ppl:  4.27; xent: 1.45; lr: 0.00010; 10605/4173 tok/s;    874 sec
[2021-05-15 13:46:59,016 INFO] Step 4600/50000; acc:  58.42; ppl:  4.21; xent: 1.44; lr: 0.00010; 10967/4305 tok/s;    883 sec
[2021-05-15 13:47:08,478 INFO] Step 4650/50000; acc:  58.22; ppl:  4.20; xent: 1.43; lr: 0.00010; 10654/4266 tok/s;    892 sec
[2021-05-15 13:47:18,354 INFO] Step 4700/50000; acc:  58.23; ppl:  4.27; xent: 1.45; lr: 0.00010; 10521/4012 tok/s;    902 sec
[2021-05-15 13:47:28,385 INFO] Step 4750/50000; acc:  58.80; ppl:  4.14; xent: 1.42; lr: 0.00010; 9971/4004 tok/s;    912 sec
[2021-05-15 13:47:38,049 INFO] Step 4800/50000; acc:  58.48; ppl:  4.16; xent: 1.43; lr: 0.00010; 10779/4182 tok/s;    922 sec
[2021-05-15 13:47:47,400 INFO] Step 4850/50000; acc:  59.10; ppl:  4.14; xent: 1.42; lr: 0.00010; 10751/4322 tok/s;    931 sec
[2021-05-15 13:47:57,170 INFO] Step 4900/50000; acc:  58.56; ppl:  4.19; xent: 1.43; lr: 0.00010; 10370/4072 tok/s;    941 sec
[2021-05-15 13:48:07,020 INFO] Step 4950/50000; acc:  58.96; ppl:  4.13; xent: 1.42; lr: 0.00010; 10390/4092 tok/s;    951 sec
[2021-05-15 13:48:15,997 INFO] Step 5000/50000; acc:  59.34; ppl:  4.10; xent: 1.41; lr: 0.00010; 11432/4424 tok/s;    960 sec
[2021-05-15 13:48:15,997 INFO] valid's transforms: TransformPipe()
[2021-05-15 13:48:16,000 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/loose/valid.txt, align=None)...
[2021-05-15 13:48:23,924 INFO] Validation perplexity: 3.91308
[2021-05-15 13:48:23,924 INFO] Validation accuracy: 61.327
[2021-05-15 13:48:23,927 INFO] Saving checkpoint ../models/group2_params/loose_ops/model_step_5000.pt
[2021-05-15 13:48:33,581 INFO] Step 5050/50000; acc:  59.23; ppl:  4.05; xent: 1.40; lr: 0.00010; 5709/2299 tok/s;    977 sec
[2021-05-15 13:48:40,845 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 13:48:43,097 INFO] Step 5100/50000; acc:  59.74; ppl:  4.01; xent: 1.39; lr: 0.00010; 10662/4201 tok/s;    987 sec
[2021-05-15 13:48:53,015 INFO] Step 5150/50000; acc:  59.30; ppl:  4.12; xent: 1.41; lr: 0.00010; 10385/4101 tok/s;    997 sec
[2021-05-15 13:49:02,691 INFO] Step 5200/50000; acc:  59.56; ppl:  4.06; xent: 1.40; lr: 0.00010; 10241/4164 tok/s;   1007 sec
[2021-05-15 13:49:12,903 INFO] Step 5250/50000; acc:  59.01; ppl:  4.12; xent: 1.42; lr: 0.00010; 10180/4016 tok/s;   1017 sec
[2021-05-15 13:49:22,065 INFO] Step 5300/50000; acc:  59.49; ppl:  4.05; xent: 1.40; lr: 0.00010; 10798/4391 tok/s;   1026 sec
[2021-05-15 13:49:31,802 INFO] Step 5350/50000; acc:  59.54; ppl:  4.06; xent: 1.40; lr: 0.00010; 10484/4134 tok/s;   1036 sec
[2021-05-15 13:49:41,133 INFO] Step 5400/50000; acc:  59.18; ppl:  4.11; xent: 1.41; lr: 0.00010; 11025/4328 tok/s;   1045 sec
[2021-05-15 13:49:50,776 INFO] Step 5450/50000; acc:  59.93; ppl:  3.96; xent: 1.38; lr: 0.00010; 10634/4060 tok/s;   1055 sec
[2021-05-15 13:50:01,124 INFO] Step 5500/50000; acc:  59.30; ppl:  4.07; xent: 1.40; lr: 0.00010; 9960/3931 tok/s;   1065 sec
[2021-05-15 13:50:10,128 INFO] Step 5550/50000; acc:  60.12; ppl:  3.92; xent: 1.37; lr: 0.00010; 10958/4437 tok/s;   1074 sec
[2021-05-15 13:50:20,039 INFO] Step 5600/50000; acc:  59.62; ppl:  4.07; xent: 1.40; lr: 0.00010; 10580/4093 tok/s;   1084 sec
[2021-05-15 13:50:29,459 INFO] Step 5650/50000; acc:  60.27; ppl:  4.00; xent: 1.39; lr: 0.00010; 10633/4178 tok/s;   1093 sec
[2021-05-15 13:50:39,310 INFO] Step 5700/50000; acc:  60.05; ppl:  3.96; xent: 1.38; lr: 0.00010; 10326/4104 tok/s;   1103 sec
[2021-05-15 13:50:48,247 INFO] Step 5750/50000; acc:  60.26; ppl:  3.96; xent: 1.38; lr: 0.00010; 11363/4486 tok/s;   1112 sec
[2021-05-15 13:50:57,863 INFO] Step 5800/50000; acc:  60.61; ppl:  3.92; xent: 1.37; lr: 0.00010; 10600/4185 tok/s;   1122 sec
[2021-05-15 13:51:02,493 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 13:51:07,448 INFO] Step 5850/50000; acc:  60.31; ppl:  3.93; xent: 1.37; lr: 0.00010; 10589/4234 tok/s;   1131 sec
[2021-05-15 13:51:17,229 INFO] Step 5900/50000; acc:  60.75; ppl:  3.93; xent: 1.37; lr: 0.00010; 10401/4140 tok/s;   1141 sec
[2021-05-15 13:51:27,611 INFO] Step 5950/50000; acc:  60.24; ppl:  3.94; xent: 1.37; lr: 0.00010; 9869/3859 tok/s;   1151 sec
[2021-05-15 13:51:36,982 INFO] Step 6000/50000; acc:  60.61; ppl:  3.91; xent: 1.36; lr: 0.00010; 10529/4364 tok/s;   1161 sec
[2021-05-15 13:51:46,762 INFO] Step 6050/50000; acc:  60.33; ppl:  3.96; xent: 1.38; lr: 0.00010; 10681/4140 tok/s;   1171 sec
[2021-05-15 13:51:55,716 INFO] Step 6100/50000; acc:  61.06; ppl:  3.84; xent: 1.35; lr: 0.00010; 10977/4418 tok/s;   1180 sec
[2021-05-15 13:52:05,502 INFO] Step 6150/50000; acc:  60.51; ppl:  3.97; xent: 1.38; lr: 0.00010; 10602/4096 tok/s;   1189 sec
[2021-05-15 13:52:15,514 INFO] Step 6200/50000; acc:  60.76; ppl:  3.90; xent: 1.36; lr: 0.00010; 10327/4026 tok/s;   1199 sec
[2021-05-15 13:52:25,351 INFO] Step 6250/50000; acc:  60.31; ppl:  3.89; xent: 1.36; lr: 0.00010; 10305/4089 tok/s;   1209 sec
[2021-05-15 13:52:34,716 INFO] Step 6300/50000; acc:  61.09; ppl:  3.85; xent: 1.35; lr: 0.00010; 10968/4259 tok/s;   1219 sec
[2021-05-15 13:52:44,074 INFO] Step 6350/50000; acc:  61.11; ppl:  3.87; xent: 1.35; lr: 0.00010; 10590/4303 tok/s;   1228 sec
[2021-05-15 13:52:53,942 INFO] Step 6400/50000; acc:  60.42; ppl:  3.93; xent: 1.37; lr: 0.00010; 10584/4073 tok/s;   1238 sec
[2021-05-15 13:53:03,170 INFO] Step 6450/50000; acc:  61.46; ppl:  3.78; xent: 1.33; lr: 0.00010; 10918/4278 tok/s;   1247 sec
[2021-05-15 13:53:12,507 INFO] Step 6500/50000; acc:  61.56; ppl:  3.81; xent: 1.34; lr: 0.00010; 10780/4319 tok/s;   1256 sec
[2021-05-15 13:53:22,110 INFO] Step 6550/50000; acc:  61.19; ppl:  3.82; xent: 1.34; lr: 0.00010; 10623/4260 tok/s;   1266 sec
[2021-05-15 13:53:24,068 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 13:53:31,423 INFO] Step 6600/50000; acc:  61.56; ppl:  3.80; xent: 1.34; lr: 0.00010; 11004/4305 tok/s;   1275 sec
[2021-05-15 13:53:41,237 INFO] Step 6650/50000; acc:  61.46; ppl:  3.83; xent: 1.34; lr: 0.00010; 10304/4040 tok/s;   1285 sec
[2021-05-15 13:53:51,360 INFO] Step 6700/50000; acc:  61.31; ppl:  3.82; xent: 1.34; lr: 0.00010; 9938/4101 tok/s;   1295 sec
[2021-05-15 13:54:00,779 INFO] Step 6750/50000; acc:  61.32; ppl:  3.82; xent: 1.34; lr: 0.00010; 10936/4272 tok/s;   1305 sec
[2021-05-15 13:54:10,341 INFO] Step 6800/50000; acc:  61.57; ppl:  3.78; xent: 1.33; lr: 0.00010; 10325/4180 tok/s;   1314 sec
[2021-05-15 13:54:19,911 INFO] Step 6850/50000; acc:  61.42; ppl:  3.81; xent: 1.34; lr: 0.00010; 10904/4201 tok/s;   1324 sec
[2021-05-15 13:54:29,373 INFO] Step 6900/50000; acc:  61.51; ppl:  3.80; xent: 1.34; lr: 0.00010; 10659/4250 tok/s;   1333 sec
[2021-05-15 13:54:39,125 INFO] Step 6950/50000; acc:  61.65; ppl:  3.78; xent: 1.33; lr: 0.00010; 10504/4110 tok/s;   1343 sec
[2021-05-15 13:54:48,881 INFO] Step 7000/50000; acc:  61.60; ppl:  3.78; xent: 1.33; lr: 0.00010; 10515/4108 tok/s;   1353 sec
[2021-05-15 13:54:58,273 INFO] Step 7050/50000; acc:  61.79; ppl:  3.77; xent: 1.33; lr: 0.00010; 10831/4314 tok/s;   1362 sec
[2021-05-15 13:55:08,038 INFO] Step 7100/50000; acc:  61.49; ppl:  3.82; xent: 1.34; lr: 0.00010; 10462/4072 tok/s;   1372 sec
[2021-05-15 13:55:17,537 INFO] Step 7150/50000; acc:  62.26; ppl:  3.67; xent: 1.30; lr: 0.00010; 10507/4250 tok/s;   1381 sec
[2021-05-15 13:55:26,513 INFO] Step 7200/50000; acc:  61.91; ppl:  3.75; xent: 1.32; lr: 0.00010; 11605/4390 tok/s;   1390 sec
[2021-05-15 13:55:36,055 INFO] Step 7250/50000; acc:  61.92; ppl:  3.71; xent: 1.31; lr: 0.00010; 10461/4227 tok/s;   1400 sec
[2021-05-15 13:55:38,737 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 13:55:45,740 INFO] Step 7300/50000; acc:  62.24; ppl:  3.70; xent: 1.31; lr: 0.00010; 10532/4207 tok/s;   1410 sec
[2021-05-15 13:55:55,420 INFO] Step 7350/50000; acc:  62.38; ppl:  3.69; xent: 1.31; lr: 0.00010; 10533/4139 tok/s;   1419 sec
[2021-05-15 13:56:05,354 INFO] Step 7400/50000; acc:  62.02; ppl:  3.77; xent: 1.33; lr: 0.00010; 10219/4098 tok/s;   1429 sec
[2021-05-15 13:56:14,967 INFO] Step 7450/50000; acc:  62.49; ppl:  3.68; xent: 1.30; lr: 0.00010; 10495/4223 tok/s;   1439 sec
[2021-05-15 13:56:24,246 INFO] Step 7500/50000; acc:  62.25; ppl:  3.71; xent: 1.31; lr: 0.00010; 10892/4293 tok/s;   1448 sec
[2021-05-15 13:56:33,650 INFO] Step 7550/50000; acc:  62.24; ppl:  3.72; xent: 1.31; lr: 0.00010; 10894/4262 tok/s;   1458 sec
[2021-05-15 13:56:43,177 INFO] Step 7600/50000; acc:  62.48; ppl:  3.67; xent: 1.30; lr: 0.00010; 10477/4241 tok/s;   1467 sec
[2021-05-15 13:56:53,380 INFO] Step 7650/50000; acc:  62.08; ppl:  3.76; xent: 1.32; lr: 0.00010; 10421/3912 tok/s;   1477 sec
[2021-05-15 13:57:02,867 INFO] Step 7700/50000; acc:  62.56; ppl:  3.64; xent: 1.29; lr: 0.00010; 10415/4270 tok/s;   1487 sec
[2021-05-15 13:57:12,571 INFO] Step 7750/50000; acc:  62.65; ppl:  3.64; xent: 1.29; lr: 0.00010; 10569/4108 tok/s;   1496 sec
[2021-05-15 13:57:21,948 INFO] Step 7800/50000; acc:  62.18; ppl:  3.75; xent: 1.32; lr: 0.00010; 10912/4309 tok/s;   1506 sec
[2021-05-15 13:57:31,568 INFO] Step 7850/50000; acc:  62.61; ppl:  3.67; xent: 1.30; lr: 0.00010; 10573/4120 tok/s;   1515 sec
[2021-05-15 13:57:41,286 INFO] Step 7900/50000; acc:  62.54; ppl:  3.65; xent: 1.29; lr: 0.00010; 10624/4182 tok/s;   1525 sec
[2021-05-15 13:57:50,115 INFO] Step 7950/50000; acc:  63.41; ppl:  3.56; xent: 1.27; lr: 0.00010; 11133/4459 tok/s;   1534 sec
[2021-05-15 13:57:59,930 INFO] Step 8000/50000; acc:  62.56; ppl:  3.65; xent: 1.29; lr: 0.00010; 10636/4134 tok/s;   1544 sec
[2021-05-15 13:57:59,940 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 13:58:09,468 INFO] Step 8050/50000; acc:  63.04; ppl:  3.62; xent: 1.29; lr: 0.00010; 10516/4245 tok/s;   1553 sec
[2021-05-15 13:58:19,341 INFO] Step 8100/50000; acc:  62.87; ppl:  3.62; xent: 1.29; lr: 0.00010; 10313/4072 tok/s;   1563 sec
[2021-05-15 13:58:29,589 INFO] Step 8150/50000; acc:  62.77; ppl:  3.63; xent: 1.29; lr: 0.00010; 9823/3993 tok/s;   1573 sec
[2021-05-15 13:58:39,168 INFO] Step 8200/50000; acc:  62.57; ppl:  3.66; xent: 1.30; lr: 0.00010; 10688/4185 tok/s;   1583 sec
[2021-05-15 13:58:48,484 INFO] Step 8250/50000; acc:  62.97; ppl:  3.61; xent: 1.28; lr: 0.00010; 10857/4307 tok/s;   1592 sec
[2021-05-15 13:58:58,183 INFO] Step 8300/50000; acc:  63.09; ppl:  3.59; xent: 1.28; lr: 0.00010; 10370/4193 tok/s;   1602 sec
[2021-05-15 13:59:08,035 INFO] Step 8350/50000; acc:  62.76; ppl:  3.67; xent: 1.30; lr: 0.00010; 10593/3972 tok/s;   1612 sec
[2021-05-15 13:59:17,931 INFO] Step 8400/50000; acc:  63.08; ppl:  3.58; xent: 1.27; lr: 0.00010; 10104/4096 tok/s;   1622 sec
[2021-05-15 13:59:27,485 INFO] Step 8450/50000; acc:  63.01; ppl:  3.59; xent: 1.28; lr: 0.00010; 10911/4210 tok/s;   1631 sec
[2021-05-15 13:59:36,877 INFO] Step 8500/50000; acc:  63.37; ppl:  3.58; xent: 1.28; lr: 0.00010; 10586/4287 tok/s;   1641 sec
[2021-05-15 13:59:46,606 INFO] Step 8550/50000; acc:  63.02; ppl:  3.65; xent: 1.30; lr: 0.00010; 10515/4097 tok/s;   1650 sec
[2021-05-15 13:59:56,546 INFO] Step 8600/50000; acc:  63.36; ppl:  3.55; xent: 1.27; lr: 0.00010; 10368/4044 tok/s;   1660 sec
[2021-05-15 14:00:05,432 INFO] Step 8650/50000; acc:  63.37; ppl:  3.56; xent: 1.27; lr: 0.00010; 11405/4506 tok/s;   1669 sec
[2021-05-15 14:00:14,684 INFO] Step 8700/50000; acc:  63.43; ppl:  3.53; xent: 1.26; lr: 0.00010; 11007/4360 tok/s;   1679 sec
[2021-05-15 14:00:21,630 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 14:00:24,137 INFO] Step 8750/50000; acc:  63.64; ppl:  3.52; xent: 1.26; lr: 0.00010; 10537/4214 tok/s;   1688 sec
[2021-05-15 14:00:34,056 INFO] Step 8800/50000; acc:  63.34; ppl:  3.57; xent: 1.27; lr: 0.00010; 10510/4111 tok/s;   1698 sec
[2021-05-15 14:00:43,956 INFO] Step 8850/50000; acc:  63.23; ppl:  3.57; xent: 1.27; lr: 0.00010; 10114/4106 tok/s;   1708 sec
[2021-05-15 14:00:53,666 INFO] Step 8900/50000; acc:  63.37; ppl:  3.56; xent: 1.27; lr: 0.00010; 10390/4174 tok/s;   1718 sec
[2021-05-15 14:01:02,923 INFO] Step 8950/50000; acc:  63.30; ppl:  3.54; xent: 1.26; lr: 0.00010; 10974/4373 tok/s;   1727 sec
[2021-05-15 14:01:12,667 INFO] Step 9000/50000; acc:  63.39; ppl:  3.56; xent: 1.27; lr: 0.00010; 10465/4126 tok/s;   1737 sec
[2021-05-15 14:01:22,017 INFO] Step 9050/50000; acc:  63.09; ppl:  3.60; xent: 1.28; lr: 0.00010; 10893/4312 tok/s;   1746 sec
[2021-05-15 14:01:31,850 INFO] Step 9100/50000; acc:  63.80; ppl:  3.48; xent: 1.25; lr: 0.00010; 10411/3988 tok/s;   1756 sec
[2021-05-15 14:01:42,006 INFO] Step 9150/50000; acc:  63.25; ppl:  3.57; xent: 1.27; lr: 0.00010; 10192/4009 tok/s;   1766 sec
[2021-05-15 14:01:51,012 INFO] Step 9200/50000; acc:  64.01; ppl:  3.46; xent: 1.24; lr: 0.00010; 10956/4448 tok/s;   1775 sec
[2021-05-15 14:02:01,051 INFO] Step 9250/50000; acc:  63.32; ppl:  3.57; xent: 1.27; lr: 0.00010; 10458/4020 tok/s;   1785 sec
[2021-05-15 14:02:10,338 INFO] Step 9300/50000; acc:  63.80; ppl:  3.49; xent: 1.25; lr: 0.00010; 10649/4261 tok/s;   1794 sec
[2021-05-15 14:02:20,155 INFO] Step 9350/50000; acc:  63.59; ppl:  3.51; xent: 1.26; lr: 0.00010; 10479/4077 tok/s;   1804 sec
[2021-05-15 14:02:29,264 INFO] Step 9400/50000; acc:  63.76; ppl:  3.51; xent: 1.26; lr: 0.00010; 11222/4434 tok/s;   1813 sec
[2021-05-15 14:02:38,671 INFO] Step 9450/50000; acc:  63.77; ppl:  3.48; xent: 1.25; lr: 0.00010; 10735/4308 tok/s;   1823 sec
[2021-05-15 14:02:43,002 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 14:02:48,392 INFO] Step 9500/50000; acc:  64.00; ppl:  3.47; xent: 1.24; lr: 0.00010; 10565/4160 tok/s;   1832 sec
[2021-05-15 14:02:57,949 INFO] Step 9550/50000; acc:  64.20; ppl:  3.47; xent: 1.24; lr: 0.00010; 10434/4155 tok/s;   1842 sec
[2021-05-15 14:03:08,424 INFO] Step 9600/50000; acc:  63.48; ppl:  3.50; xent: 1.25; lr: 0.00010; 9914/3919 tok/s;   1852 sec
[2021-05-15 14:03:17,989 INFO] Step 9650/50000; acc:  63.79; ppl:  3.48; xent: 1.25; lr: 0.00010; 10422/4255 tok/s;   1862 sec
[2021-05-15 14:03:27,428 INFO] Step 9700/50000; acc:  64.04; ppl:  3.47; xent: 1.24; lr: 0.00010; 10723/4249 tok/s;   1871 sec
[2021-05-15 14:03:36,730 INFO] Step 9750/50000; acc:  64.02; ppl:  3.46; xent: 1.24; lr: 0.00010; 10880/4315 tok/s;   1881 sec
[2021-05-15 14:03:46,314 INFO] Step 9800/50000; acc:  63.89; ppl:  3.51; xent: 1.26; lr: 0.00010; 10825/4169 tok/s;   1890 sec
[2021-05-15 14:03:56,278 INFO] Step 9850/50000; acc:  64.13; ppl:  3.46; xent: 1.24; lr: 0.00010; 10241/3992 tok/s;   1900 sec
[2021-05-15 14:04:06,182 INFO] Step 9900/50000; acc:  63.89; ppl:  3.45; xent: 1.24; lr: 0.00010; 10199/4080 tok/s;   1910 sec
[2021-05-15 14:04:15,512 INFO] Step 9950/50000; acc:  64.12; ppl:  3.45; xent: 1.24; lr: 0.00010; 11084/4297 tok/s;   1919 sec
[2021-05-15 14:04:24,836 INFO] Step 10000/50000; acc:  64.29; ppl:  3.46; xent: 1.24; lr: 0.00010; 10626/4315 tok/s;   1929 sec
[2021-05-15 14:04:24,839 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/loose/valid.txt, align=None)...
[2021-05-15 14:04:32,744 INFO] Validation perplexity: 3.37963
[2021-05-15 14:04:32,744 INFO] Validation accuracy: 64.6038
[2021-05-15 14:04:32,746 INFO] Saving checkpoint ../models/group2_params/loose_ops/model_step_10000.pt
[2021-05-15 14:04:43,266 INFO] Step 10050/50000; acc:  63.37; ppl:  3.51; xent: 1.26; lr: 0.00010; 5683/2188 tok/s;   1947 sec
[2021-05-15 14:04:52,360 INFO] Step 10100/50000; acc:  64.62; ppl:  3.37; xent: 1.21; lr: 0.00010; 10920/4348 tok/s;   1956 sec
[2021-05-15 14:05:01,699 INFO] Step 10150/50000; acc:  64.17; ppl:  3.43; xent: 1.23; lr: 0.00010; 10898/4283 tok/s;   1966 sec
[2021-05-15 14:05:11,221 INFO] Step 10200/50000; acc:  64.15; ppl:  3.44; xent: 1.24; lr: 0.00010; 10792/4282 tok/s;   1975 sec
[2021-05-15 14:05:12,774 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 14:05:20,468 INFO] Step 10250/50000; acc:  64.55; ppl:  3.40; xent: 1.22; lr: 0.00010; 10992/4361 tok/s;   1984 sec
[2021-05-15 14:05:30,518 INFO] Step 10300/50000; acc:  63.81; ppl:  3.48; xent: 1.25; lr: 0.00010; 10171/3977 tok/s;   1994 sec
[2021-05-15 14:05:40,343 INFO] Step 10350/50000; acc:  64.55; ppl:  3.39; xent: 1.22; lr: 0.00010; 10050/4182 tok/s;   2004 sec
[2021-05-15 14:05:49,839 INFO] Step 10400/50000; acc:  63.99; ppl:  3.46; xent: 1.24; lr: 0.00010; 10956/4250 tok/s;   2014 sec
[2021-05-15 14:05:59,270 INFO] Step 10450/50000; acc:  64.43; ppl:  3.40; xent: 1.22; lr: 0.00010; 10593/4236 tok/s;   2023 sec
[2021-05-15 14:06:08,754 INFO] Step 10500/50000; acc:  64.55; ppl:  3.40; xent: 1.22; lr: 0.00010; 10676/4246 tok/s;   2033 sec
[2021-05-15 14:06:18,462 INFO] Step 10550/50000; acc:  64.33; ppl:  3.43; xent: 1.23; lr: 0.00010; 10685/4141 tok/s;   2042 sec
[2021-05-15 14:06:28,053 INFO] Step 10600/50000; acc:  64.40; ppl:  3.42; xent: 1.23; lr: 0.00010; 10648/4189 tok/s;   2052 sec
[2021-05-15 14:06:37,860 INFO] Step 10650/50000; acc:  64.37; ppl:  3.39; xent: 1.22; lr: 0.00010; 10358/4069 tok/s;   2062 sec
[2021-05-15 14:06:47,256 INFO] Step 10700/50000; acc:  64.43; ppl:  3.41; xent: 1.23; lr: 0.00010; 10791/4313 tok/s;   2071 sec
[2021-05-15 14:06:57,028 INFO] Step 10750/50000; acc:  64.18; ppl:  3.46; xent: 1.24; lr: 0.00010; 10503/4100 tok/s;   2081 sec
[2021-05-15 14:07:06,407 INFO] Step 10800/50000; acc:  64.85; ppl:  3.32; xent: 1.20; lr: 0.00010; 10662/4263 tok/s;   2090 sec
[2021-05-15 14:07:15,527 INFO] Step 10850/50000; acc:  64.43; ppl:  3.40; xent: 1.22; lr: 0.00010; 11446/4336 tok/s;   2099 sec
[2021-05-15 14:07:24,824 INFO] Step 10900/50000; acc:  64.89; ppl:  3.36; xent: 1.21; lr: 0.00010; 10598/4333 tok/s;   2109 sec
[2021-05-15 14:07:27,158 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 14:07:34,585 INFO] Step 10950/50000; acc:  64.57; ppl:  3.39; xent: 1.22; lr: 0.00010; 10539/4170 tok/s;   2118 sec
[2021-05-15 14:07:44,326 INFO] Step 11000/50000; acc:  64.67; ppl:  3.35; xent: 1.21; lr: 0.00010; 10549/4083 tok/s;   2128 sec
[2021-05-15 14:07:54,265 INFO] Step 11050/50000; acc:  64.37; ppl:  3.42; xent: 1.23; lr: 0.00010; 10103/4137 tok/s;   2138 sec
[2021-05-15 14:08:04,088 INFO] Step 11100/50000; acc:  64.75; ppl:  3.36; xent: 1.21; lr: 0.00010; 10411/4130 tok/s;   2148 sec
[2021-05-15 14:08:13,153 INFO] Step 11150/50000; acc:  64.84; ppl:  3.34; xent: 1.21; lr: 0.00010; 10934/4360 tok/s;   2157 sec
[2021-05-15 14:08:22,791 INFO] Step 11200/50000; acc:  64.46; ppl:  3.40; xent: 1.22; lr: 0.00010; 10751/4228 tok/s;   2167 sec
[2021-05-15 14:08:32,283 INFO] Step 11250/50000; acc:  64.76; ppl:  3.36; xent: 1.21; lr: 0.00010; 10647/4201 tok/s;   2176 sec
[2021-05-15 14:08:42,263 INFO] Step 11300/50000; acc:  64.56; ppl:  3.36; xent: 1.21; lr: 0.00010; 10323/3995 tok/s;   2186 sec
[2021-05-15 14:08:52,047 INFO] Step 11350/50000; acc:  64.58; ppl:  3.37; xent: 1.21; lr: 0.00010; 10378/4166 tok/s;   2196 sec
[2021-05-15 14:09:01,470 INFO] Step 11400/50000; acc:  64.98; ppl:  3.33; xent: 1.20; lr: 0.00010; 10882/4212 tok/s;   2205 sec
[2021-05-15 14:09:10,932 INFO] Step 11450/50000; acc:  64.77; ppl:  3.38; xent: 1.22; lr: 0.00010; 10692/4265 tok/s;   2215 sec
[2021-05-15 14:09:20,645 INFO] Step 11500/50000; acc:  64.76; ppl:  3.33; xent: 1.20; lr: 0.00010; 10443/4095 tok/s;   2225 sec
[2021-05-15 14:09:30,250 INFO] Step 11550/50000; acc:  64.85; ppl:  3.34; xent: 1.21; lr: 0.00010; 10804/4215 tok/s;   2234 sec
[2021-05-15 14:09:39,089 INFO] Step 11600/50000; acc:  65.48; ppl:  3.25; xent: 1.18; lr: 0.00010; 11120/4486 tok/s;   2243 sec
[2021-05-15 14:09:48,552 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 14:09:48,934 INFO] Step 11650/50000; acc:  64.89; ppl:  3.34; xent: 1.21; lr: 0.00010; 10645/4096 tok/s;   2253 sec
[2021-05-15 14:09:58,543 INFO] Step 11700/50000; acc:  65.31; ppl:  3.29; xent: 1.19; lr: 0.00010; 10279/4220 tok/s;   2262 sec
[2021-05-15 14:10:08,688 INFO] Step 11750/50000; acc:  64.81; ppl:  3.35; xent: 1.21; lr: 0.00010; 10135/3982 tok/s;   2273 sec
[2021-05-15 14:10:18,692 INFO] Step 11800/50000; acc:  64.77; ppl:  3.34; xent: 1.21; lr: 0.00010; 10141/4090 tok/s;   2283 sec
[2021-05-15 14:10:28,354 INFO] Step 11850/50000; acc:  64.95; ppl:  3.33; xent: 1.20; lr: 0.00010; 10488/4140 tok/s;   2292 sec
[2021-05-15 14:10:37,959 INFO] Step 11900/50000; acc:  65.04; ppl:  3.32; xent: 1.20; lr: 0.00010; 10674/4178 tok/s;   2302 sec
[2021-05-15 14:10:47,407 INFO] Step 11950/50000; acc:  65.15; ppl:  3.29; xent: 1.19; lr: 0.00010; 10457/4272 tok/s;   2311 sec
[2021-05-15 14:10:57,143 INFO] Step 12000/50000; acc:  64.90; ppl:  3.37; xent: 1.22; lr: 0.00010; 10847/4063 tok/s;   2321 sec
[2021-05-15 14:11:07,263 INFO] Step 12050/50000; acc:  64.87; ppl:  3.30; xent: 1.19; lr: 0.00010; 9978/4016 tok/s;   2331 sec
[2021-05-15 14:11:16,628 INFO] Step 12100/50000; acc:  65.18; ppl:  3.27; xent: 1.18; lr: 0.00010; 10772/4275 tok/s;   2340 sec
[2021-05-15 14:11:26,232 INFO] Step 12150/50000; acc:  64.95; ppl:  3.34; xent: 1.20; lr: 0.00010; 10662/4191 tok/s;   2350 sec
[2021-05-15 14:11:35,941 INFO] Step 12200/50000; acc:  64.86; ppl:  3.35; xent: 1.21; lr: 0.00010; 10524/4108 tok/s;   2360 sec
[2021-05-15 14:11:45,830 INFO] Step 12250/50000; acc:  65.29; ppl:  3.26; xent: 1.18; lr: 0.00010; 10311/4045 tok/s;   2370 sec
[2021-05-15 14:11:54,674 INFO] Step 12300/50000; acc:  65.48; ppl:  3.26; xent: 1.18; lr: 0.00010; 11400/4556 tok/s;   2379 sec
[2021-05-15 14:12:04,007 INFO] Step 12350/50000; acc:  65.29; ppl:  3.29; xent: 1.19; lr: 0.00010; 10974/4283 tok/s;   2388 sec
[2021-05-15 14:12:10,612 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 14:12:13,591 INFO] Step 12400/50000; acc:  65.33; ppl:  3.27; xent: 1.19; lr: 0.00010; 10392/4201 tok/s;   2397 sec
[2021-05-15 14:12:23,424 INFO] Step 12450/50000; acc:  65.38; ppl:  3.29; xent: 1.19; lr: 0.00010; 10658/4127 tok/s;   2407 sec
[2021-05-15 14:12:33,340 INFO] Step 12500/50000; acc:  65.06; ppl:  3.26; xent: 1.18; lr: 0.00010; 9946/4077 tok/s;   2417 sec
[2021-05-15 14:12:43,201 INFO] Step 12550/50000; acc:  64.95; ppl:  3.33; xent: 1.20; lr: 0.00010; 10343/4163 tok/s;   2427 sec
[2021-05-15 14:12:52,579 INFO] Step 12600/50000; acc:  65.50; ppl:  3.24; xent: 1.18; lr: 0.00010; 10892/4262 tok/s;   2436 sec
[2021-05-15 14:13:02,333 INFO] Step 12650/50000; acc:  65.23; ppl:  3.29; xent: 1.19; lr: 0.00010; 10367/4140 tok/s;   2446 sec
[2021-05-15 14:13:11,898 INFO] Step 12700/50000; acc:  65.02; ppl:  3.34; xent: 1.21; lr: 0.00010; 10795/4212 tok/s;   2456 sec
[2021-05-15 14:13:21,609 INFO] Step 12750/50000; acc:  65.59; ppl:  3.22; xent: 1.17; lr: 0.00010; 10319/4032 tok/s;   2465 sec
[2021-05-15 14:13:31,790 INFO] Step 12800/50000; acc:  64.79; ppl:  3.33; xent: 1.20; lr: 0.00010; 10292/4016 tok/s;   2476 sec
[2021-05-15 14:13:41,022 INFO] Step 12850/50000; acc:  65.61; ppl:  3.23; xent: 1.17; lr: 0.00010; 10819/4330 tok/s;   2485 sec
[2021-05-15 14:13:50,828 INFO] Step 12900/50000; acc:  65.37; ppl:  3.26; xent: 1.18; lr: 0.00010; 10356/4116 tok/s;   2495 sec
[2021-05-15 14:14:00,346 INFO] Step 12950/50000; acc:  65.15; ppl:  3.29; xent: 1.19; lr: 0.00010; 10717/4194 tok/s;   2504 sec
[2021-05-15 14:14:10,027 INFO] Step 13000/50000; acc:  65.60; ppl:  3.24; xent: 1.17; lr: 0.00010; 10600/4102 tok/s;   2514 sec
[2021-05-15 14:14:19,105 INFO] Step 13050/50000; acc:  65.48; ppl:  3.25; xent: 1.18; lr: 0.00010; 11129/4440 tok/s;   2523 sec
[2021-05-15 14:14:28,521 INFO] Step 13100/50000; acc:  65.70; ppl:  3.22; xent: 1.17; lr: 0.00010; 10697/4319 tok/s;   2532 sec
[2021-05-15 14:14:32,437 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 14:14:38,294 INFO] Step 13150/50000; acc:  65.67; ppl:  3.25; xent: 1.18; lr: 0.00010; 10562/4109 tok/s;   2542 sec
[2021-05-15 14:14:47,825 INFO] Step 13200/50000; acc:  66.01; ppl:  3.19; xent: 1.16; lr: 0.00010; 10453/4189 tok/s;   2552 sec
[2021-05-15 14:14:58,419 INFO] Step 13250/50000; acc:  65.03; ppl:  3.28; xent: 1.19; lr: 0.00010; 9842/3890 tok/s;   2562 sec
[2021-05-15 14:15:07,844 INFO] Step 13300/50000; acc:  65.96; ppl:  3.20; xent: 1.16; lr: 0.00010; 10441/4289 tok/s;   2572 sec
[2021-05-15 14:15:17,425 INFO] Step 13350/50000; acc:  65.32; ppl:  3.24; xent: 1.18; lr: 0.00010; 10665/4187 tok/s;   2581 sec
[2021-05-15 14:15:26,736 INFO] Step 13400/50000; acc:  65.70; ppl:  3.23; xent: 1.17; lr: 0.00010; 10944/4327 tok/s;   2591 sec
[2021-05-15 14:15:36,296 INFO] Step 13450/50000; acc:  65.31; ppl:  3.27; xent: 1.19; lr: 0.00010; 10761/4186 tok/s;   2600 sec
[2021-05-15 14:15:46,048 INFO] Step 13500/50000; acc:  65.73; ppl:  3.22; xent: 1.17; lr: 0.00010; 10569/4012 tok/s;   2610 sec
[2021-05-15 14:15:55,888 INFO] Step 13550/50000; acc:  65.74; ppl:  3.21; xent: 1.16; lr: 0.00010; 10079/4141 tok/s;   2620 sec
[2021-05-15 14:16:05,307 INFO] Step 13600/50000; acc:  65.41; ppl:  3.24; xent: 1.17; lr: 0.00010; 11118/4284 tok/s;   2629 sec
[2021-05-15 14:16:14,578 INFO] Step 13650/50000; acc:  65.82; ppl:  3.24; xent: 1.18; lr: 0.00010; 10797/4319 tok/s;   2638 sec
[2021-05-15 14:16:24,578 INFO] Step 13700/50000; acc:  65.50; ppl:  3.22; xent: 1.17; lr: 0.00010; 10161/4036 tok/s;   2648 sec
[2021-05-15 14:16:33,462 INFO] Step 13750/50000; acc:  66.18; ppl:  3.17; xent: 1.15; lr: 0.00010; 11478/4444 tok/s;   2657 sec
[2021-05-15 14:16:42,759 INFO] Step 13800/50000; acc:  65.63; ppl:  3.22; xent: 1.17; lr: 0.00010; 10954/4348 tok/s;   2667 sec
[2021-05-15 14:16:52,145 INFO] Step 13850/50000; acc:  65.89; ppl:  3.20; xent: 1.16; lr: 0.00010; 10816/4287 tok/s;   2676 sec
[2021-05-15 14:16:53,463 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 14:17:01,626 INFO] Step 13900/50000; acc:  66.08; ppl:  3.17; xent: 1.15; lr: 0.00010; 10695/4252 tok/s;   2685 sec
[2021-05-15 14:17:11,793 INFO] Step 13950/50000; acc:  65.46; ppl:  3.26; xent: 1.18; lr: 0.00010; 10100/3936 tok/s;   2696 sec
[2021-05-15 14:17:21,634 INFO] Step 14000/50000; acc:  66.08; ppl:  3.16; xent: 1.15; lr: 0.00010; 10021/4220 tok/s;   2705 sec
[2021-05-15 14:17:31,081 INFO] Step 14050/50000; acc:  65.45; ppl:  3.24; xent: 1.18; lr: 0.00010; 11076/4238 tok/s;   2715 sec
[2021-05-15 14:17:40,254 INFO] Step 14100/50000; acc:  66.09; ppl:  3.16; xent: 1.15; lr: 0.00010; 10726/4318 tok/s;   2724 sec
[2021-05-15 14:17:50,075 INFO] Step 14150/50000; acc:  66.08; ppl:  3.20; xent: 1.16; lr: 0.00010; 10428/4130 tok/s;   2734 sec
[2021-05-15 14:17:59,772 INFO] Step 14200/50000; acc:  65.51; ppl:  3.23; xent: 1.17; lr: 0.00010; 10764/4124 tok/s;   2744 sec
[2021-05-15 14:18:09,618 INFO] Step 14250/50000; acc:  66.00; ppl:  3.19; xent: 1.16; lr: 0.00010; 10290/4095 tok/s;   2753 sec
[2021-05-15 14:18:19,463 INFO] Step 14300/50000; acc:  65.94; ppl:  3.19; xent: 1.16; lr: 0.00010; 10428/4065 tok/s;   2763 sec
[2021-05-15 14:18:28,884 INFO] Step 14350/50000; acc:  66.10; ppl:  3.18; xent: 1.16; lr: 0.00010; 10559/4309 tok/s;   2773 sec
[2021-05-15 14:18:38,533 INFO] Step 14400/50000; acc:  65.58; ppl:  3.24; xent: 1.17; lr: 0.00010; 10775/4157 tok/s;   2782 sec
[2021-05-15 14:18:48,042 INFO] Step 14450/50000; acc:  66.23; ppl:  3.13; xent: 1.14; lr: 0.00010; 10637/4187 tok/s;   2792 sec
[2021-05-15 14:18:57,007 INFO] Step 14500/50000; acc:  66.22; ppl:  3.15; xent: 1.15; lr: 0.00010; 11254/4413 tok/s;   2801 sec
[2021-05-15 14:19:06,202 INFO] Step 14550/50000; acc:  65.98; ppl:  3.16; xent: 1.15; lr: 0.00010; 11032/4378 tok/s;   2810 sec
[2021-05-15 14:19:08,234 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 14:19:16,091 INFO] Step 14600/50000; acc:  66.09; ppl:  3.18; xent: 1.16; lr: 0.00010; 10383/4150 tok/s;   2820 sec
[2021-05-15 14:19:25,773 INFO] Step 14650/50000; acc:  66.35; ppl:  3.13; xent: 1.14; lr: 0.00010; 10500/4087 tok/s;   2830 sec
[2021-05-15 14:19:35,867 INFO] Step 14700/50000; acc:  65.92; ppl:  3.20; xent: 1.16; lr: 0.00010; 9929/4064 tok/s;   2840 sec
[2021-05-15 14:19:45,610 INFO] Step 14750/50000; acc:  66.11; ppl:  3.18; xent: 1.16; lr: 0.00010; 10562/4176 tok/s;   2849 sec
[2021-05-15 14:19:54,428 INFO] Step 14800/50000; acc:  66.36; ppl:  3.13; xent: 1.14; lr: 0.00010; 11223/4497 tok/s;   2858 sec
[2021-05-15 14:20:04,203 INFO] Step 14850/50000; acc:  65.54; ppl:  3.23; xent: 1.17; lr: 0.00010; 10635/4146 tok/s;   2868 sec
[2021-05-15 14:20:13,777 INFO] Step 14900/50000; acc:  66.57; ppl:  3.13; xent: 1.14; lr: 0.00010; 10432/4144 tok/s;   2878 sec
[2021-05-15 14:20:23,755 INFO] Step 14950/50000; acc:  66.03; ppl:  3.17; xent: 1.15; lr: 0.00010; 10405/4045 tok/s;   2888 sec
[2021-05-15 14:20:33,697 INFO] Step 15000/50000; acc:  66.00; ppl:  3.17; xent: 1.15; lr: 0.00010; 10291/4077 tok/s;   2898 sec
[2021-05-15 14:20:33,702 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/loose/valid.txt, align=None)...
[2021-05-15 14:20:41,611 INFO] Validation perplexity: 3.18088
[2021-05-15 14:20:41,611 INFO] Validation accuracy: 66.2558
[2021-05-15 14:20:41,614 INFO] Saving checkpoint ../models/group2_params/loose_ops/model_step_15000.pt
[2021-05-15 14:20:51,602 INFO] Step 15050/50000; acc:  66.60; ppl:  3.11; xent: 1.14; lr: 0.00010; 5673/2231 tok/s;   2915 sec
[2021-05-15 14:21:01,125 INFO] Step 15100/50000; acc:  65.79; ppl:  3.22; xent: 1.17; lr: 0.00010; 10762/4231 tok/s;   2925 sec
[2021-05-15 14:21:10,766 INFO] Step 15150/50000; acc:  66.59; ppl:  3.11; xent: 1.13; lr: 0.00010; 10316/4087 tok/s;   2935 sec
[2021-05-15 14:21:20,405 INFO] Step 15200/50000; acc:  66.25; ppl:  3.16; xent: 1.15; lr: 0.00010; 10903/4215 tok/s;   2944 sec
[2021-05-15 14:21:29,269 INFO] Step 15250/50000; acc:  66.56; ppl:  3.09; xent: 1.13; lr: 0.00010; 11199/4518 tok/s;   2953 sec
[2021-05-15 14:21:38,348 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 14:21:38,976 INFO] Step 15300/50000; acc:  66.36; ppl:  3.13; xent: 1.14; lr: 0.00010; 10458/4119 tok/s;   2963 sec
[2021-05-15 14:21:48,642 INFO] Step 15350/50000; acc:  66.13; ppl:  3.14; xent: 1.14; lr: 0.00010; 10524/4196 tok/s;   2973 sec
[2021-05-15 14:21:58,861 INFO] Step 15400/50000; acc:  66.22; ppl:  3.15; xent: 1.15; lr: 0.00010; 10048/3978 tok/s;   2983 sec
[2021-05-15 14:22:08,497 INFO] Step 15450/50000; acc:  66.35; ppl:  3.13; xent: 1.14; lr: 0.00010; 10412/4186 tok/s;   2992 sec
[2021-05-15 14:22:18,083 INFO] Step 15500/50000; acc:  66.23; ppl:  3.13; xent: 1.14; lr: 0.00010; 10538/4208 tok/s;   3002 sec
[2021-05-15 14:22:27,767 INFO] Step 15550/50000; acc:  66.22; ppl:  3.14; xent: 1.14; lr: 0.00010; 10637/4138 tok/s;   3012 sec
[2021-05-15 14:22:37,142 INFO] Step 15600/50000; acc:  66.46; ppl:  3.11; xent: 1.14; lr: 0.00010; 10559/4309 tok/s;   3021 sec
[2021-05-15 14:22:46,962 INFO] Step 15650/50000; acc:  66.20; ppl:  3.17; xent: 1.15; lr: 0.00010; 10778/4027 tok/s;   3031 sec
[2021-05-15 14:22:57,122 INFO] Step 15700/50000; acc:  66.39; ppl:  3.11; xent: 1.13; lr: 0.00010; 9819/3989 tok/s;   3041 sec
[2021-05-15 14:23:06,479 INFO] Step 15750/50000; acc:  66.74; ppl:  3.09; xent: 1.13; lr: 0.00010; 10870/4312 tok/s;   3050 sec
[2021-05-15 14:23:16,153 INFO] Step 15800/50000; acc:  66.17; ppl:  3.17; xent: 1.15; lr: 0.00010; 10658/4179 tok/s;   3060 sec
[2021-05-15 14:23:25,762 INFO] Step 15850/50000; acc:  66.41; ppl:  3.13; xent: 1.14; lr: 0.00010; 10528/4132 tok/s;   3070 sec
[2021-05-15 14:23:35,712 INFO] Step 15900/50000; acc:  66.57; ppl:  3.09; xent: 1.13; lr: 0.00010; 10359/4011 tok/s;   3080 sec
[2021-05-15 14:23:44,429 INFO] Step 15950/50000; acc:  66.75; ppl:  3.08; xent: 1.12; lr: 0.00010; 11372/4585 tok/s;   3088 sec
[2021-05-15 14:23:53,795 INFO] Step 16000/50000; acc:  66.42; ppl:  3.12; xent: 1.14; lr: 0.00010; 11065/4295 tok/s;   3098 sec
[2021-05-15 14:24:00,178 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 14:24:03,668 INFO] Step 16050/50000; acc:  66.66; ppl:  3.10; xent: 1.13; lr: 0.00010; 10199/4097 tok/s;   3108 sec
[2021-05-15 14:24:13,340 INFO] Step 16100/50000; acc:  66.94; ppl:  3.09; xent: 1.13; lr: 0.00010; 10507/4175 tok/s;   3117 sec
[2021-05-15 14:24:23,418 INFO] Step 16150/50000; acc:  66.01; ppl:  3.12; xent: 1.14; lr: 0.00010; 10064/4055 tok/s;   3127 sec
[2021-05-15 14:24:33,157 INFO] Step 16200/50000; acc:  66.53; ppl:  3.11; xent: 1.14; lr: 0.00010; 10456/4191 tok/s;   3137 sec
[2021-05-15 14:24:42,518 INFO] Step 16250/50000; acc:  66.84; ppl:  3.06; xent: 1.12; lr: 0.00010; 10797/4268 tok/s;   3146 sec
[2021-05-15 14:24:51,929 INFO] Step 16300/50000; acc:  66.60; ppl:  3.10; xent: 1.13; lr: 0.00010; 10700/4243 tok/s;   3156 sec
[2021-05-15 14:25:01,792 INFO] Step 16350/50000; acc:  66.16; ppl:  3.17; xent: 1.15; lr: 0.00010; 10528/4105 tok/s;   3166 sec
[2021-05-15 14:25:11,391 INFO] Step 16400/50000; acc:  66.97; ppl:  3.05; xent: 1.12; lr: 0.00010; 10459/4106 tok/s;   3175 sec
[2021-05-15 14:25:21,892 INFO] Step 16450/50000; acc:  66.14; ppl:  3.15; xent: 1.15; lr: 0.00010; 9992/3897 tok/s;   3186 sec
[2021-05-15 14:25:30,842 INFO] Step 16500/50000; acc:  67.22; ppl:  3.03; xent: 1.11; lr: 0.00010; 11019/4394 tok/s;   3195 sec
[2021-05-15 14:25:40,723 INFO] Step 16550/50000; acc:  66.78; ppl:  3.09; xent: 1.13; lr: 0.00010; 10379/4132 tok/s;   3205 sec
[2021-05-15 14:25:50,295 INFO] Step 16600/50000; acc:  66.60; ppl:  3.11; xent: 1.14; lr: 0.00010; 10728/4141 tok/s;   3214 sec
[2021-05-15 14:25:59,814 INFO] Step 16650/50000; acc:  66.78; ppl:  3.05; xent: 1.12; lr: 0.00010; 10681/4172 tok/s;   3224 sec
[2021-05-15 14:26:08,913 INFO] Step 16700/50000; acc:  66.54; ppl:  3.09; xent: 1.13; lr: 0.00010; 11239/4462 tok/s;   3233 sec
[2021-05-15 14:26:18,283 INFO] Step 16750/50000; acc:  66.93; ppl:  3.04; xent: 1.11; lr: 0.00010; 10540/4313 tok/s;   3242 sec
[2021-05-15 14:26:21,854 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 14:26:27,957 INFO] Step 16800/50000; acc:  66.63; ppl:  3.08; xent: 1.13; lr: 0.00010; 10820/4190 tok/s;   3252 sec
[2021-05-15 14:26:37,865 INFO] Step 16850/50000; acc:  66.76; ppl:  3.06; xent: 1.12; lr: 0.00010; 10136/4036 tok/s;   3262 sec
[2021-05-15 14:26:48,033 INFO] Step 16900/50000; acc:  66.52; ppl:  3.08; xent: 1.12; lr: 0.00010; 9938/4016 tok/s;   3272 sec
[2021-05-15 14:26:57,519 INFO] Step 16950/50000; acc:  67.04; ppl:  3.05; xent: 1.12; lr: 0.00010; 10671/4292 tok/s;   3281 sec
[2021-05-15 14:27:07,131 INFO] Step 17000/50000; acc:  66.58; ppl:  3.07; xent: 1.12; lr: 0.00010; 10623/4151 tok/s;   3291 sec
[2021-05-15 14:27:16,404 INFO] Step 17050/50000; acc:  66.75; ppl:  3.07; xent: 1.12; lr: 0.00010; 10874/4345 tok/s;   3300 sec
[2021-05-15 14:27:25,764 INFO] Step 17100/50000; acc:  66.65; ppl:  3.10; xent: 1.13; lr: 0.00010; 10973/4266 tok/s;   3310 sec
[2021-05-15 14:27:35,733 INFO] Step 17150/50000; acc:  66.60; ppl:  3.07; xent: 1.12; lr: 0.00010; 10380/3979 tok/s;   3320 sec
[2021-05-15 14:27:45,485 INFO] Step 17200/50000; acc:  67.27; ppl:  3.03; xent: 1.11; lr: 0.00010; 10167/4144 tok/s;   3329 sec
[2021-05-15 14:27:55,110 INFO] Step 17250/50000; acc:  66.81; ppl:  3.08; xent: 1.13; lr: 0.00010; 10920/4202 tok/s;   3339 sec
[2021-05-15 14:28:04,277 INFO] Step 17300/50000; acc:  67.07; ppl:  3.05; xent: 1.11; lr: 0.00010; 10764/4349 tok/s;   3348 sec
[2021-05-15 14:28:14,183 INFO] Step 17350/50000; acc:  66.82; ppl:  3.06; xent: 1.12; lr: 0.00010; 10369/4054 tok/s;   3358 sec
[2021-05-15 14:28:23,233 INFO] Step 17400/50000; acc:  67.21; ppl:  3.02; xent: 1.10; lr: 0.00010; 11331/4366 tok/s;   3367 sec
[2021-05-15 14:28:32,607 INFO] Step 17450/50000; acc:  66.69; ppl:  3.07; xent: 1.12; lr: 0.00010; 10778/4359 tok/s;   3376 sec
[2021-05-15 14:28:42,311 INFO] Step 17500/50000; acc:  67.04; ppl:  3.04; xent: 1.11; lr: 0.00010; 10585/4117 tok/s;   3386 sec
[2021-05-15 14:28:43,098 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 14:28:51,662 INFO] Step 17550/50000; acc:  67.53; ppl:  3.00; xent: 1.10; lr: 0.00010; 10649/4323 tok/s;   3396 sec
[2021-05-15 14:29:01,809 INFO] Step 17600/50000; acc:  66.61; ppl:  3.10; xent: 1.13; lr: 0.00010; 10235/3962 tok/s;   3406 sec
[2021-05-15 14:29:11,721 INFO] Step 17650/50000; acc:  67.30; ppl:  3.02; xent: 1.10; lr: 0.00010; 10064/4166 tok/s;   3416 sec
[2021-05-15 14:29:20,989 INFO] Step 17700/50000; acc:  66.84; ppl:  3.05; xent: 1.12; lr: 0.00010; 10924/4286 tok/s;   3425 sec
[2021-05-15 14:29:30,394 INFO] Step 17750/50000; acc:  67.00; ppl:  3.04; xent: 1.11; lr: 0.00010; 10768/4260 tok/s;   3434 sec
[2021-05-15 14:29:40,011 INFO] Step 17800/50000; acc:  67.38; ppl:  3.04; xent: 1.11; lr: 0.00010; 10652/4190 tok/s;   3444 sec
[2021-05-15 14:29:49,727 INFO] Step 17850/50000; acc:  66.81; ppl:  3.07; xent: 1.12; lr: 0.00010; 10624/4128 tok/s;   3454 sec
[2021-05-15 14:29:59,483 INFO] Step 17900/50000; acc:  66.95; ppl:  3.04; xent: 1.11; lr: 0.00010; 10352/4161 tok/s;   3463 sec
[2021-05-15 14:30:09,172 INFO] Step 17950/50000; acc:  67.13; ppl:  3.03; xent: 1.11; lr: 0.00010; 10636/4119 tok/s;   3473 sec
[2021-05-15 14:30:18,450 INFO] Step 18000/50000; acc:  67.29; ppl:  3.03; xent: 1.11; lr: 0.00010; 10723/4371 tok/s;   3482 sec
[2021-05-15 14:30:28,205 INFO] Step 18050/50000; acc:  66.79; ppl:  3.08; xent: 1.13; lr: 0.00010; 10704/4116 tok/s;   3492 sec
[2021-05-15 14:30:37,513 INFO] Step 18100/50000; acc:  67.56; ppl:  2.95; xent: 1.08; lr: 0.00010; 10725/4271 tok/s;   3501 sec
[2021-05-15 14:30:46,722 INFO] Step 18150/50000; acc:  67.29; ppl:  3.02; xent: 1.10; lr: 0.00010; 11073/4293 tok/s;   3511 sec
[2021-05-15 14:30:56,151 INFO] Step 18200/50000; acc:  66.96; ppl:  3.03; xent: 1.11; lr: 0.00010; 10829/4285 tok/s;   3520 sec
[2021-05-15 14:30:57,714 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 14:31:05,845 INFO] Step 18250/50000; acc:  67.16; ppl:  3.02; xent: 1.10; lr: 0.00010; 10484/4206 tok/s;   3530 sec
[2021-05-15 14:31:15,545 INFO] Step 18300/50000; acc:  67.57; ppl:  2.99; xent: 1.10; lr: 0.00010; 10613/4117 tok/s;   3539 sec
[2021-05-15 14:31:25,295 INFO] Step 18350/50000; acc:  67.28; ppl:  3.01; xent: 1.10; lr: 0.00010; 10073/4178 tok/s;   3549 sec
[2021-05-15 14:31:35,292 INFO] Step 18400/50000; acc:  66.85; ppl:  3.06; xent: 1.12; lr: 0.00010; 10430/4087 tok/s;   3559 sec
[2021-05-15 14:31:44,326 INFO] Step 18450/50000; acc:  67.46; ppl:  2.98; xent: 1.09; lr: 0.00010; 11063/4398 tok/s;   3568 sec
[2021-05-15 14:31:53,799 INFO] Step 18500/50000; acc:  67.18; ppl:  3.02; xent: 1.10; lr: 0.00010; 10639/4249 tok/s;   3578 sec
[2021-05-15 14:32:03,461 INFO] Step 18550/50000; acc:  67.10; ppl:  3.04; xent: 1.11; lr: 0.00010; 10639/4130 tok/s;   3587 sec
[2021-05-15 14:32:13,666 INFO] Step 18600/50000; acc:  67.13; ppl:  3.01; xent: 1.10; lr: 0.00010; 10133/3962 tok/s;   3598 sec
[2021-05-15 14:32:23,441 INFO] Step 18650/50000; acc:  67.19; ppl:  3.00; xent: 1.10; lr: 0.00010; 10371/4087 tok/s;   3607 sec
[2021-05-15 14:32:32,813 INFO] Step 18700/50000; acc:  67.56; ppl:  2.97; xent: 1.09; lr: 0.00010; 10797/4275 tok/s;   3617 sec
[2021-05-15 14:32:42,445 INFO] Step 18750/50000; acc:  67.06; ppl:  3.05; xent: 1.11; lr: 0.00010; 10708/4175 tok/s;   3626 sec
[2021-05-15 14:32:52,049 INFO] Step 18800/50000; acc:  67.69; ppl:  2.96; xent: 1.08; lr: 0.00010; 10364/4160 tok/s;   3636 sec
[2021-05-15 14:33:01,780 INFO] Step 18850/50000; acc:  67.10; ppl:  3.02; xent: 1.10; lr: 0.00010; 10829/4150 tok/s;   3646 sec
[2021-05-15 14:33:10,550 INFO] Step 18900/50000; acc:  67.89; ppl:  2.93; xent: 1.08; lr: 0.00010; 11158/4580 tok/s;   3654 sec
[2021-05-15 14:33:19,277 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 14:33:20,407 INFO] Step 18950/50000; acc:  67.42; ppl:  3.00; xent: 1.10; lr: 0.00010; 10407/4041 tok/s;   3664 sec
[2021-05-15 14:33:29,986 INFO] Step 19000/50000; acc:  67.32; ppl:  3.01; xent: 1.10; lr: 0.00010; 10677/4262 tok/s;   3674 sec
[2021-05-15 14:33:40,221 INFO] Step 19050/50000; acc:  67.16; ppl:  3.01; xent: 1.10; lr: 0.00010; 9939/3935 tok/s;   3684 sec
[2021-05-15 14:33:49,986 INFO] Step 19100/50000; acc:  67.37; ppl:  2.99; xent: 1.10; lr: 0.00010; 10423/4154 tok/s;   3694 sec
[2021-05-15 14:33:59,447 INFO] Step 19150/50000; acc:  67.63; ppl:  2.97; xent: 1.09; lr: 0.00010; 10468/4231 tok/s;   3703 sec
[2021-05-15 14:34:09,120 INFO] Step 19200/50000; acc:  67.10; ppl:  3.03; xent: 1.11; lr: 0.00010; 10768/4200 tok/s;   3713 sec
[2021-05-15 14:34:18,435 INFO] Step 19250/50000; acc:  67.47; ppl:  2.97; xent: 1.09; lr: 0.00010; 10766/4294 tok/s;   3722 sec
[2021-05-15 14:34:28,082 INFO] Step 19300/50000; acc:  67.66; ppl:  2.96; xent: 1.09; lr: 0.00010; 10617/4085 tok/s;   3732 sec
[2021-05-15 14:34:38,524 INFO] Step 19350/50000; acc:  66.95; ppl:  3.03; xent: 1.11; lr: 0.00010; 9833/3927 tok/s;   3742 sec
[2021-05-15 14:34:47,739 INFO] Step 19400/50000; acc:  67.90; ppl:  2.94; xent: 1.08; lr: 0.00010; 11021/4328 tok/s;   3752 sec
[2021-05-15 14:34:57,487 INFO] Step 19450/50000; acc:  67.48; ppl:  3.00; xent: 1.10; lr: 0.00010; 10460/4165 tok/s;   3761 sec
[2021-05-15 14:35:06,887 INFO] Step 19500/50000; acc:  67.36; ppl:  2.98; xent: 1.09; lr: 0.00010; 10738/4195 tok/s;   3771 sec
[2021-05-15 14:35:16,665 INFO] Step 19550/50000; acc:  67.58; ppl:  2.94; xent: 1.08; lr: 0.00010; 10583/4122 tok/s;   3781 sec
[2021-05-15 14:35:25,620 INFO] Step 19600/50000; acc:  67.93; ppl:  2.94; xent: 1.08; lr: 0.00010; 11070/4439 tok/s;   3789 sec
[2021-05-15 14:35:35,252 INFO] Step 19650/50000; acc:  67.26; ppl:  3.00; xent: 1.10; lr: 0.00010; 10801/4209 tok/s;   3799 sec
[2021-05-15 14:35:41,128 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 14:35:44,870 INFO] Step 19700/50000; acc:  67.80; ppl:  2.94; xent: 1.08; lr: 0.00010; 10322/4181 tok/s;   3809 sec
[2021-05-15 14:35:54,611 INFO] Step 19750/50000; acc:  67.83; ppl:  2.96; xent: 1.08; lr: 0.00010; 10564/4160 tok/s;   3818 sec
[2021-05-15 14:36:04,777 INFO] Step 19800/50000; acc:  67.08; ppl:  2.99; xent: 1.10; lr: 0.00010; 10027/4020 tok/s;   3829 sec
[2021-05-15 14:36:14,536 INFO] Step 19850/50000; acc:  67.64; ppl:  2.96; xent: 1.09; lr: 0.00010; 10328/4155 tok/s;   3838 sec
[2021-05-15 14:36:24,004 INFO] Step 19900/50000; acc:  67.79; ppl:  2.95; xent: 1.08; lr: 0.00010; 10822/4251 tok/s;   3848 sec
[2021-05-15 14:36:33,405 INFO] Step 19950/50000; acc:  67.90; ppl:  2.94; xent: 1.08; lr: 0.00010; 10504/4221 tok/s;   3857 sec
[2021-05-15 14:36:43,030 INFO] Step 20000/50000; acc:  67.24; ppl:  3.02; xent: 1.11; lr: 0.00010; 10914/4194 tok/s;   3867 sec
[2021-05-15 14:36:43,034 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/loose/valid.txt, align=None)...
[2021-05-15 14:36:50,961 INFO] Validation perplexity: 3.05731
[2021-05-15 14:36:50,961 INFO] Validation accuracy: 67.0119
[2021-05-15 14:36:50,963 INFO] Saving checkpoint ../models/group2_params/loose_ops/model_step_20000.pt
[2021-05-15 14:37:01,315 INFO] Step 20050/50000; acc:  67.68; ppl:  2.95; xent: 1.08; lr: 0.00010; 5554/2176 tok/s;   3885 sec
[2021-05-15 14:37:11,566 INFO] Step 20100/50000; acc:  67.64; ppl:  2.95; xent: 1.08; lr: 0.00010; 9907/3945 tok/s;   3895 sec
[2021-05-15 14:37:20,840 INFO] Step 20150/50000; acc:  67.97; ppl:  2.94; xent: 1.08; lr: 0.00010; 10957/4334 tok/s;   3905 sec
[2021-05-15 14:37:30,545 INFO] Step 20200/50000; acc:  67.77; ppl:  2.96; xent: 1.08; lr: 0.00010; 10558/4143 tok/s;   3914 sec
[2021-05-15 14:37:40,141 INFO] Step 20250/50000; acc:  67.44; ppl:  2.96; xent: 1.09; lr: 0.00010; 10580/4152 tok/s;   3924 sec
[2021-05-15 14:37:49,706 INFO] Step 20300/50000; acc:  68.02; ppl:  2.90; xent: 1.07; lr: 0.00010; 10593/4145 tok/s;   3934 sec
[2021-05-15 14:37:58,903 INFO] Step 20350/50000; acc:  67.57; ppl:  2.97; xent: 1.09; lr: 0.00010; 11156/4409 tok/s;   3943 sec
[2021-05-15 14:38:08,273 INFO] Step 20400/50000; acc:  67.91; ppl:  2.91; xent: 1.07; lr: 0.00010; 10564/4326 tok/s;   3952 sec
[2021-05-15 14:38:11,506 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 14:38:17,982 INFO] Step 20450/50000; acc:  67.58; ppl:  2.96; xent: 1.09; lr: 0.00010; 10800/4167 tok/s;   3962 sec
[2021-05-15 14:38:27,669 INFO] Step 20500/50000; acc:  68.03; ppl:  2.91; xent: 1.07; lr: 0.00010; 10227/4124 tok/s;   3972 sec
[2021-05-15 14:38:37,891 INFO] Step 20550/50000; acc:  67.45; ppl:  2.97; xent: 1.09; lr: 0.00010; 9990/3994 tok/s;   3982 sec
[2021-05-15 14:38:47,373 INFO] Step 20600/50000; acc:  67.75; ppl:  2.94; xent: 1.08; lr: 0.00010; 10759/4274 tok/s;   3991 sec
[2021-05-15 14:38:56,973 INFO] Step 20650/50000; acc:  67.82; ppl:  2.93; xent: 1.07; lr: 0.00010; 10530/4188 tok/s;   4001 sec
[2021-05-15 14:39:06,355 INFO] Step 20700/50000; acc:  67.68; ppl:  2.95; xent: 1.08; lr: 0.00010; 10887/4282 tok/s;   4010 sec
[2021-05-15 14:39:15,659 INFO] Step 20750/50000; acc:  67.89; ppl:  2.93; xent: 1.07; lr: 0.00010; 10843/4272 tok/s;   4020 sec
[2021-05-15 14:39:25,575 INFO] Step 20800/50000; acc:  67.83; ppl:  2.96; xent: 1.08; lr: 0.00010; 10532/4040 tok/s;   4029 sec
[2021-05-15 14:39:35,257 INFO] Step 20850/50000; acc:  67.88; ppl:  2.93; xent: 1.07; lr: 0.00010; 10378/4145 tok/s;   4039 sec
[2021-05-15 14:39:44,619 INFO] Step 20900/50000; acc:  68.20; ppl:  2.91; xent: 1.07; lr: 0.00010; 10866/4320 tok/s;   4048 sec
[2021-05-15 14:39:53,953 INFO] Step 20950/50000; acc:  67.85; ppl:  2.94; xent: 1.08; lr: 0.00010; 10875/4300 tok/s;   4058 sec
[2021-05-15 14:40:03,862 INFO] Step 21000/50000; acc:  68.08; ppl:  2.90; xent: 1.07; lr: 0.00010; 10373/4050 tok/s;   4068 sec
[2021-05-15 14:40:12,831 INFO] Step 21050/50000; acc:  68.17; ppl:  2.91; xent: 1.07; lr: 0.00010; 11299/4376 tok/s;   4077 sec
[2021-05-15 14:40:22,318 INFO] Step 21100/50000; acc:  67.91; ppl:  2.92; xent: 1.07; lr: 0.00010; 10607/4318 tok/s;   4086 sec
[2021-05-15 14:40:26,022 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 14:40:32,042 INFO] Step 21150/50000; acc:  67.99; ppl:  2.93; xent: 1.07; lr: 0.00010; 10632/4131 tok/s;   4096 sec
[2021-05-15 14:40:41,450 INFO] Step 21200/50000; acc:  68.60; ppl:  2.85; xent: 1.05; lr: 0.00010; 10580/4256 tok/s;   4105 sec
[2021-05-15 14:40:51,565 INFO] Step 21250/50000; acc:  67.23; ppl:  3.00; xent: 1.10; lr: 0.00010; 10287/4031 tok/s;   4115 sec
[2021-05-15 14:41:01,201 INFO] Step 21300/50000; acc:  68.26; ppl:  2.88; xent: 1.06; lr: 0.00010; 10219/4259 tok/s;   4125 sec
[2021-05-15 14:41:10,531 INFO] Step 21350/50000; acc:  67.84; ppl:  2.92; xent: 1.07; lr: 0.00010; 10968/4245 tok/s;   4134 sec
[2021-05-15 14:41:19,803 INFO] Step 21400/50000; acc:  68.14; ppl:  2.92; xent: 1.07; lr: 0.00010; 10993/4319 tok/s;   4144 sec
[2021-05-15 14:41:29,427 INFO] Step 21450/50000; acc:  67.99; ppl:  2.90; xent: 1.07; lr: 0.00010; 10556/4174 tok/s;   4153 sec
[2021-05-15 14:41:39,442 INFO] Step 21500/50000; acc:  67.57; ppl:  2.96; xent: 1.09; lr: 0.00010; 10438/4018 tok/s;   4163 sec
[2021-05-15 14:41:49,057 INFO] Step 21550/50000; acc:  68.16; ppl:  2.88; xent: 1.06; lr: 0.00010; 10295/4195 tok/s;   4173 sec
[2021-05-15 14:41:58,882 INFO] Step 21600/50000; acc:  68.04; ppl:  2.91; xent: 1.07; lr: 0.00010; 10615/4058 tok/s;   4183 sec
[2021-05-15 14:42:08,183 INFO] Step 21650/50000; acc:  68.05; ppl:  2.92; xent: 1.07; lr: 0.00010; 10800/4359 tok/s;   4192 sec
[2021-05-15 14:42:17,880 INFO] Step 21700/50000; acc:  68.14; ppl:  2.89; xent: 1.06; lr: 0.00010; 10457/4127 tok/s;   4202 sec
[2021-05-15 14:42:27,409 INFO] Step 21750/50000; acc:  68.54; ppl:  2.87; xent: 1.05; lr: 0.00010; 10756/4209 tok/s;   4211 sec
[2021-05-15 14:42:36,641 INFO] Step 21800/50000; acc:  68.13; ppl:  2.89; xent: 1.06; lr: 0.00010; 11031/4275 tok/s;   4221 sec
[2021-05-15 14:42:45,930 INFO] Step 21850/50000; acc:  68.06; ppl:  2.89; xent: 1.06; lr: 0.00010; 10868/4328 tok/s;   4230 sec
[2021-05-15 14:42:47,169 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 14:42:55,589 INFO] Step 21900/50000; acc:  68.24; ppl:  2.87; xent: 1.05; lr: 0.00010; 10482/4247 tok/s;   4239 sec
[2021-05-15 14:43:05,422 INFO] Step 21950/50000; acc:  68.00; ppl:  2.90; xent: 1.06; lr: 0.00010; 10528/4060 tok/s;   4249 sec
[2021-05-15 14:43:15,412 INFO] Step 22000/50000; acc:  68.09; ppl:  2.89; xent: 1.06; lr: 0.00010; 9831/4087 tok/s;   4259 sec
[2021-05-15 14:43:25,299 INFO] Step 22050/50000; acc:  67.62; ppl:  2.94; xent: 1.08; lr: 0.00010; 10585/4124 tok/s;   4269 sec
[2021-05-15 14:43:34,207 INFO] Step 22100/50000; acc:  68.69; ppl:  2.85; xent: 1.05; lr: 0.00010; 11071/4440 tok/s;   4278 sec
[2021-05-15 14:43:43,764 INFO] Step 22150/50000; acc:  68.15; ppl:  2.89; xent: 1.06; lr: 0.00010; 10649/4234 tok/s;   4288 sec
[2021-05-15 14:43:53,566 INFO] Step 22200/50000; acc:  67.98; ppl:  2.93; xent: 1.07; lr: 0.00010; 10589/4061 tok/s;   4297 sec
[2021-05-15 14:44:03,906 INFO] Step 22250/50000; acc:  68.11; ppl:  2.89; xent: 1.06; lr: 0.00010; 9893/3891 tok/s;   4308 sec
[2021-05-15 14:44:13,671 INFO] Step 22300/50000; acc:  68.09; ppl:  2.89; xent: 1.06; lr: 0.00010; 10516/4123 tok/s;   4318 sec
[2021-05-15 14:44:22,915 INFO] Step 22350/50000; acc:  68.76; ppl:  2.83; xent: 1.04; lr: 0.00010; 10730/4353 tok/s;   4327 sec
[2021-05-15 14:44:32,653 INFO] Step 22400/50000; acc:  67.78; ppl:  2.94; xent: 1.08; lr: 0.00010; 10703/4111 tok/s;   4337 sec
[2021-05-15 14:44:42,334 INFO] Step 22450/50000; acc:  68.49; ppl:  2.85; xent: 1.05; lr: 0.00010; 10415/4116 tok/s;   4346 sec
[2021-05-15 14:44:51,490 INFO] Step 22500/50000; acc:  68.47; ppl:  2.85; xent: 1.05; lr: 0.00010; 11132/4394 tok/s;   4355 sec
[2021-05-15 14:45:00,626 INFO] Step 22550/50000; acc:  68.48; ppl:  2.85; xent: 1.05; lr: 0.00010; 11037/4436 tok/s;   4364 sec
[2021-05-15 14:45:08,768 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 14:45:10,350 INFO] Step 22600/50000; acc:  68.32; ppl:  2.87; xent: 1.06; lr: 0.00010; 10540/4076 tok/s;   4374 sec
[2021-05-15 14:45:19,966 INFO] Step 22650/50000; acc:  68.18; ppl:  2.87; xent: 1.05; lr: 0.00010; 10531/4227 tok/s;   4384 sec
[2021-05-15 14:45:30,073 INFO] Step 22700/50000; acc:  68.20; ppl:  2.88; xent: 1.06; lr: 0.00010; 10007/4011 tok/s;   4394 sec
[2021-05-15 14:45:39,850 INFO] Step 22750/50000; acc:  68.22; ppl:  2.90; xent: 1.06; lr: 0.00010; 10467/4165 tok/s;   4404 sec
[2021-05-15 14:45:49,226 INFO] Step 22800/50000; acc:  68.77; ppl:  2.84; xent: 1.04; lr: 0.00010; 10571/4252 tok/s;   4413 sec
[2021-05-15 14:45:59,077 INFO] Step 22850/50000; acc:  68.27; ppl:  2.90; xent: 1.06; lr: 0.00010; 10599/4144 tok/s;   4423 sec
[2021-05-15 14:46:08,110 INFO] Step 22900/50000; acc:  68.71; ppl:  2.85; xent: 1.05; lr: 0.00010; 10972/4414 tok/s;   4432 sec
[2021-05-15 14:46:17,759 INFO] Step 22950/50000; acc:  68.66; ppl:  2.85; xent: 1.05; lr: 0.00010; 10723/4059 tok/s;   4442 sec
[2021-05-15 14:46:28,281 INFO] Step 23000/50000; acc:  67.87; ppl:  2.91; xent: 1.07; lr: 0.00010; 9806/3912 tok/s;   4452 sec
[2021-05-15 14:46:37,461 INFO] Step 23050/50000; acc:  68.95; ppl:  2.82; xent: 1.04; lr: 0.00010; 10980/4335 tok/s;   4461 sec
[2021-05-15 14:46:47,385 INFO] Step 23100/50000; acc:  68.17; ppl:  2.90; xent: 1.06; lr: 0.00010; 10398/4104 tok/s;   4471 sec
[2021-05-15 14:46:56,693 INFO] Step 23150/50000; acc:  68.94; ppl:  2.84; xent: 1.04; lr: 0.00010; 10638/4217 tok/s;   4481 sec
[2021-05-15 14:47:06,675 INFO] Step 23200/50000; acc:  68.36; ppl:  2.84; xent: 1.05; lr: 0.00010; 10501/4060 tok/s;   4491 sec
[2021-05-15 14:47:15,586 INFO] Step 23250/50000; acc:  68.64; ppl:  2.84; xent: 1.04; lr: 0.00010; 11233/4445 tok/s;   4499 sec
[2021-05-15 14:47:25,112 INFO] Step 23300/50000; acc:  68.75; ppl:  2.84; xent: 1.04; lr: 0.00010; 10575/4253 tok/s;   4509 sec
[2021-05-15 14:47:30,559 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 14:47:34,745 INFO] Step 23350/50000; acc:  68.80; ppl:  2.83; xent: 1.04; lr: 0.00010; 10603/4221 tok/s;   4519 sec
[2021-05-15 14:47:44,634 INFO] Step 23400/50000; acc:  68.56; ppl:  2.85; xent: 1.05; lr: 0.00010; 10403/4073 tok/s;   4528 sec
[2021-05-15 14:47:54,727 INFO] Step 23450/50000; acc:  68.19; ppl:  2.86; xent: 1.05; lr: 0.00010; 9981/4023 tok/s;   4539 sec
[2021-05-15 14:48:04,239 INFO] Step 23500/50000; acc:  68.54; ppl:  2.84; xent: 1.04; lr: 0.00010; 10570/4280 tok/s;   4548 sec
[2021-05-15 14:48:13,814 INFO] Step 23550/50000; acc:  68.41; ppl:  2.87; xent: 1.05; lr: 0.00010; 10749/4190 tok/s;   4558 sec
[2021-05-15 14:48:22,920 INFO] Step 23600/50000; acc:  69.14; ppl:  2.80; xent: 1.03; lr: 0.00010; 10832/4381 tok/s;   4567 sec
[2021-05-15 14:48:32,718 INFO] Step 23650/50000; acc:  67.95; ppl:  2.92; xent: 1.07; lr: 0.00010; 10787/4107 tok/s;   4577 sec
[2021-05-15 14:48:42,404 INFO] Step 23700/50000; acc:  68.75; ppl:  2.81; xent: 1.03; lr: 0.00010; 10327/4129 tok/s;   4586 sec
[2021-05-15 14:48:52,605 INFO] Step 23750/50000; acc:  68.47; ppl:  2.85; xent: 1.05; lr: 0.00010; 10050/3937 tok/s;   4596 sec
[2021-05-15 14:49:02,033 INFO] Step 23800/50000; acc:  68.68; ppl:  2.84; xent: 1.04; lr: 0.00010; 10863/4286 tok/s;   4606 sec
[2021-05-15 14:49:11,392 INFO] Step 23850/50000; acc:  68.54; ppl:  2.84; xent: 1.04; lr: 0.00010; 10839/4273 tok/s;   4615 sec
[2021-05-15 14:49:21,287 INFO] Step 23900/50000; acc:  68.75; ppl:  2.83; xent: 1.04; lr: 0.00010; 10375/4062 tok/s;   4625 sec
[2021-05-15 14:49:30,495 INFO] Step 23950/50000; acc:  69.12; ppl:  2.77; xent: 1.02; lr: 0.00010; 10816/4266 tok/s;   4634 sec
[2021-05-15 14:49:39,708 INFO] Step 24000/50000; acc:  68.34; ppl:  2.86; xent: 1.05; lr: 0.00010; 11256/4412 tok/s;   4644 sec
[2021-05-15 14:49:49,230 INFO] Step 24050/50000; acc:  68.61; ppl:  2.81; xent: 1.03; lr: 0.00010; 10526/4262 tok/s;   4653 sec
[2021-05-15 14:49:51,990 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 14:49:58,636 INFO] Step 24100/50000; acc:  69.05; ppl:  2.80; xent: 1.03; lr: 0.00010; 10805/4273 tok/s;   4662 sec
[2021-05-15 14:50:08,503 INFO] Step 24150/50000; acc:  68.52; ppl:  2.85; xent: 1.05; lr: 0.00010; 10316/4090 tok/s;   4672 sec
[2021-05-15 14:50:18,692 INFO] Step 24200/50000; acc:  68.48; ppl:  2.85; xent: 1.05; lr: 0.00010; 10006/4003 tok/s;   4683 sec
[2021-05-15 14:50:28,057 INFO] Step 24250/50000; acc:  69.03; ppl:  2.79; xent: 1.03; lr: 0.00010; 10772/4303 tok/s;   4692 sec
[2021-05-15 14:50:37,594 INFO] Step 24300/50000; acc:  68.90; ppl:  2.81; xent: 1.03; lr: 0.00010; 10572/4198 tok/s;   4701 sec
[2021-05-15 14:50:47,070 INFO] Step 24350/50000; acc:  68.82; ppl:  2.83; xent: 1.04; lr: 0.00010; 10833/4268 tok/s;   4711 sec
[2021-05-15 14:50:56,444 INFO] Step 24400/50000; acc:  69.00; ppl:  2.81; xent: 1.03; lr: 0.00010; 10785/4247 tok/s;   4720 sec
[2021-05-15 14:51:06,511 INFO] Step 24450/50000; acc:  68.55; ppl:  2.84; xent: 1.05; lr: 0.00010; 10399/3988 tok/s;   4730 sec
[2021-05-15 14:51:16,178 INFO] Step 24500/50000; acc:  68.88; ppl:  2.80; xent: 1.03; lr: 0.00010; 10249/4126 tok/s;   4740 sec
[2021-05-15 14:51:25,701 INFO] Step 24550/50000; acc:  68.95; ppl:  2.81; xent: 1.03; lr: 0.00010; 10790/4269 tok/s;   4750 sec
[2021-05-15 14:51:35,288 INFO] Step 24600/50000; acc:  68.55; ppl:  2.84; xent: 1.05; lr: 0.00010; 10652/4171 tok/s;   4759 sec
[2021-05-15 14:51:45,003 INFO] Step 24650/50000; acc:  69.04; ppl:  2.78; xent: 1.02; lr: 0.00010; 10486/4147 tok/s;   4769 sec
[2021-05-15 14:51:53,978 INFO] Step 24700/50000; acc:  68.63; ppl:  2.82; xent: 1.04; lr: 0.00010; 11438/4354 tok/s;   4778 sec
[2021-05-15 14:52:03,382 INFO] Step 24750/50000; acc:  68.92; ppl:  2.79; xent: 1.03; lr: 0.00010; 10491/4302 tok/s;   4787 sec
[2021-05-15 14:52:06,783 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 14:52:13,259 INFO] Step 24800/50000; acc:  68.67; ppl:  2.83; xent: 1.04; lr: 0.00010; 10606/4144 tok/s;   4797 sec
[2021-05-15 14:52:22,746 INFO] Step 24850/50000; acc:  69.20; ppl:  2.77; xent: 1.02; lr: 0.00010; 10591/4188 tok/s;   4807 sec
[2021-05-15 14:52:32,844 INFO] Step 24900/50000; acc:  68.56; ppl:  2.83; xent: 1.04; lr: 0.00010; 9997/4039 tok/s;   4817 sec
[2021-05-15 14:52:42,574 INFO] Step 24950/50000; acc:  68.86; ppl:  2.80; xent: 1.03; lr: 0.00010; 10396/4221 tok/s;   4826 sec
[2021-05-15 14:52:51,985 INFO] Step 25000/50000; acc:  68.80; ppl:  2.81; xent: 1.03; lr: 0.00010; 10877/4215 tok/s;   4836 sec
[2021-05-15 14:52:51,987 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/loose/valid.txt, align=None)...
[2021-05-15 14:52:59,931 INFO] Validation perplexity: 3.01598
[2021-05-15 14:52:59,931 INFO] Validation accuracy: 67.4096
[2021-05-15 14:52:59,934 INFO] Saving checkpoint ../models/group2_params/loose_ops/model_step_25000.pt
[2021-05-15 14:53:10,032 INFO] Step 25050/50000; acc:  68.99; ppl:  2.79; xent: 1.03; lr: 0.00010; 5576/2217 tok/s;   4854 sec
[2021-05-15 14:53:19,470 INFO] Step 25100/50000; acc:  69.03; ppl:  2.79; xent: 1.03; lr: 0.00010; 10748/4275 tok/s;   4863 sec
[2021-05-15 14:53:29,654 INFO] Step 25150/50000; acc:  68.52; ppl:  2.85; xent: 1.05; lr: 0.00010; 10299/3909 tok/s;   4874 sec
[2021-05-15 14:53:39,181 INFO] Step 25200/50000; acc:  69.23; ppl:  2.77; xent: 1.02; lr: 0.00010; 10411/4283 tok/s;   4883 sec
[2021-05-15 14:53:48,978 INFO] Step 25250/50000; acc:  68.85; ppl:  2.80; xent: 1.03; lr: 0.00010; 10687/4059 tok/s;   4893 sec
[2021-05-15 14:53:58,083 INFO] Step 25300/50000; acc:  69.25; ppl:  2.78; xent: 1.02; lr: 0.00010; 10862/4390 tok/s;   4902 sec
[2021-05-15 14:54:07,918 INFO] Step 25350/50000; acc:  68.93; ppl:  2.79; xent: 1.03; lr: 0.00010; 10421/4096 tok/s;   4912 sec
[2021-05-15 14:54:17,593 INFO] Step 25400/50000; acc:  69.21; ppl:  2.77; xent: 1.02; lr: 0.00010; 10666/4174 tok/s;   4921 sec
[2021-05-15 14:54:26,684 INFO] Step 25450/50000; acc:  69.18; ppl:  2.77; xent: 1.02; lr: 0.00010; 11084/4331 tok/s;   4931 sec
[2021-05-15 14:54:36,134 INFO] Step 25500/50000; acc:  69.15; ppl:  2.79; xent: 1.02; lr: 0.00010; 10851/4260 tok/s;   4940 sec
[2021-05-15 14:54:36,966 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 14:54:45,563 INFO] Step 25550/50000; acc:  69.28; ppl:  2.76; xent: 1.02; lr: 0.00010; 10518/4313 tok/s;   4949 sec
[2021-05-15 14:54:55,519 INFO] Step 25600/50000; acc:  68.77; ppl:  2.81; xent: 1.03; lr: 0.00010; 10522/4042 tok/s;   4959 sec
[2021-05-15 14:55:05,731 INFO] Step 25650/50000; acc:  69.04; ppl:  2.77; xent: 1.02; lr: 0.00010; 9714/4009 tok/s;   4970 sec
[2021-05-15 14:55:15,223 INFO] Step 25700/50000; acc:  68.96; ppl:  2.79; xent: 1.03; lr: 0.00010; 10681/4243 tok/s;   4979 sec
[2021-05-15 14:55:24,589 INFO] Step 25750/50000; acc:  69.22; ppl:  2.77; xent: 1.02; lr: 0.00010; 10844/4296 tok/s;   4988 sec
[2021-05-15 14:55:34,097 INFO] Step 25800/50000; acc:  69.16; ppl:  2.78; xent: 1.02; lr: 0.00010; 10694/4224 tok/s;   4998 sec
[2021-05-15 14:55:43,787 INFO] Step 25850/50000; acc:  69.20; ppl:  2.80; xent: 1.03; lr: 0.00010; 10591/4110 tok/s;   5008 sec
[2021-05-15 14:55:53,775 INFO] Step 25900/50000; acc:  69.08; ppl:  2.76; xent: 1.02; lr: 0.00010; 10206/4015 tok/s;   5018 sec
[2021-05-15 14:56:03,504 INFO] Step 25950/50000; acc:  68.87; ppl:  2.79; xent: 1.03; lr: 0.00010; 10590/4156 tok/s;   5027 sec
[2021-05-15 14:56:12,816 INFO] Step 26000/50000; acc:  69.73; ppl:  2.73; xent: 1.00; lr: 0.00010; 10680/4313 tok/s;   5037 sec
[2021-05-15 14:56:22,726 INFO] Step 26050/50000; acc:  68.59; ppl:  2.84; xent: 1.04; lr: 0.00010; 10555/4054 tok/s;   5047 sec
[2021-05-15 14:56:32,366 INFO] Step 26100/50000; acc:  69.55; ppl:  2.71; xent: 1.00; lr: 0.00010; 10309/4143 tok/s;   5056 sec
[2021-05-15 14:56:41,326 INFO] Step 26150/50000; acc:  69.30; ppl:  2.75; xent: 1.01; lr: 0.00010; 11467/4438 tok/s;   5065 sec
[2021-05-15 14:56:50,621 INFO] Step 26200/50000; acc:  69.30; ppl:  2.76; xent: 1.02; lr: 0.00010; 10926/4368 tok/s;   5074 sec
[2021-05-15 14:56:58,151 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 14:57:00,094 INFO] Step 26250/50000; acc:  69.30; ppl:  2.76; xent: 1.02; lr: 0.00010; 10734/4219 tok/s;   5084 sec
[2021-05-15 14:57:09,847 INFO] Step 26300/50000; acc:  69.36; ppl:  2.74; xent: 1.01; lr: 0.00010; 10514/4161 tok/s;   5094 sec
[2021-05-15 14:57:19,729 INFO] Step 26350/50000; acc:  69.05; ppl:  2.77; xent: 1.02; lr: 0.00010; 10028/4084 tok/s;   5104 sec
[2021-05-15 14:57:29,786 INFO] Step 26400/50000; acc:  69.02; ppl:  2.78; xent: 1.02; lr: 0.00010; 10302/4060 tok/s;   5114 sec
[2021-05-15 14:57:39,112 INFO] Step 26450/50000; acc:  69.55; ppl:  2.75; xent: 1.01; lr: 0.00010; 10758/4310 tok/s;   5123 sec
[2021-05-15 14:57:48,751 INFO] Step 26500/50000; acc:  69.20; ppl:  2.77; xent: 1.02; lr: 0.00010; 10482/4176 tok/s;   5133 sec
[2021-05-15 14:57:57,962 INFO] Step 26550/50000; acc:  69.00; ppl:  2.80; xent: 1.03; lr: 0.00010; 11083/4385 tok/s;   5142 sec
[2021-05-15 14:58:07,727 INFO] Step 26600/50000; acc:  69.47; ppl:  2.75; xent: 1.01; lr: 0.00010; 10606/4009 tok/s;   5152 sec
[2021-05-15 14:58:18,228 INFO] Step 26650/50000; acc:  68.98; ppl:  2.79; xent: 1.02; lr: 0.00010; 9690/3892 tok/s;   5162 sec
[2021-05-15 14:58:27,305 INFO] Step 26700/50000; acc:  69.60; ppl:  2.71; xent: 1.00; lr: 0.00010; 11088/4394 tok/s;   5171 sec
[2021-05-15 14:58:37,199 INFO] Step 26750/50000; acc:  69.51; ppl:  2.78; xent: 1.02; lr: 0.00010; 10471/4109 tok/s;   5181 sec
[2021-05-15 14:58:46,599 INFO] Step 26800/50000; acc:  69.72; ppl:  2.72; xent: 1.00; lr: 0.00010; 10544/4178 tok/s;   5190 sec
[2021-05-15 14:58:56,683 INFO] Step 26850/50000; acc:  69.02; ppl:  2.76; xent: 1.02; lr: 0.00010; 10413/4038 tok/s;   5201 sec
[2021-05-15 14:59:05,413 INFO] Step 26900/50000; acc:  69.52; ppl:  2.71; xent: 1.00; lr: 0.00010; 11306/4532 tok/s;   5209 sec
[2021-05-15 14:59:15,003 INFO] Step 26950/50000; acc:  69.07; ppl:  2.77; xent: 1.02; lr: 0.00010; 10624/4205 tok/s;   5219 sec
[2021-05-15 14:59:20,062 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 14:59:24,793 INFO] Step 27000/50000; acc:  69.34; ppl:  2.74; xent: 1.01; lr: 0.00010; 10493/4186 tok/s;   5229 sec
[2021-05-15 14:59:34,491 INFO] Step 27050/50000; acc:  69.36; ppl:  2.74; xent: 1.01; lr: 0.00010; 10529/4143 tok/s;   5238 sec
[2021-05-15 14:59:44,863 INFO] Step 27100/50000; acc:  68.84; ppl:  2.77; xent: 1.02; lr: 0.00010; 9824/3889 tok/s;   5249 sec
[2021-05-15 14:59:54,245 INFO] Step 27150/50000; acc:  69.84; ppl:  2.71; xent: 1.00; lr: 0.00010; 10507/4345 tok/s;   5258 sec
[2021-05-15 15:00:03,962 INFO] Step 27200/50000; acc:  69.04; ppl:  2.79; xent: 1.03; lr: 0.00010; 10714/4149 tok/s;   5268 sec
[2021-05-15 15:00:13,066 INFO] Step 27250/50000; acc:  69.87; ppl:  2.70; xent: 0.99; lr: 0.00010; 10951/4380 tok/s;   5277 sec
[2021-05-15 15:00:22,613 INFO] Step 27300/50000; acc:  69.15; ppl:  2.76; xent: 1.02; lr: 0.00010; 10750/4167 tok/s;   5286 sec
[2021-05-15 15:00:32,582 INFO] Step 27350/50000; acc:  69.33; ppl:  2.74; xent: 1.01; lr: 0.00010; 10311/4057 tok/s;   5296 sec
[2021-05-15 15:00:42,587 INFO] Step 27400/50000; acc:  69.19; ppl:  2.76; xent: 1.01; lr: 0.00010; 10237/4005 tok/s;   5306 sec
[2021-05-15 15:00:51,934 INFO] Step 27450/50000; acc:  69.94; ppl:  2.70; xent: 1.00; lr: 0.00010; 10836/4307 tok/s;   5316 sec
[2021-05-15 15:01:01,301 INFO] Step 27500/50000; acc:  69.49; ppl:  2.74; xent: 1.01; lr: 0.00010; 10788/4298 tok/s;   5325 sec
[2021-05-15 15:01:11,061 INFO] Step 27550/50000; acc:  69.21; ppl:  2.75; xent: 1.01; lr: 0.00010; 10574/4095 tok/s;   5335 sec
[2021-05-15 15:01:20,234 INFO] Step 27600/50000; acc:  70.07; ppl:  2.66; xent: 0.98; lr: 0.00010; 10865/4286 tok/s;   5344 sec
[2021-05-15 15:01:29,731 INFO] Step 27650/50000; acc:  69.51; ppl:  2.75; xent: 1.01; lr: 0.00010; 10945/4267 tok/s;   5354 sec
[2021-05-15 15:01:39,099 INFO] Step 27700/50000; acc:  69.74; ppl:  2.70; xent: 0.99; lr: 0.00010; 10576/4344 tok/s;   5363 sec
[2021-05-15 15:01:41,490 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 15:01:48,370 INFO] Step 27750/50000; acc:  69.92; ppl:  2.71; xent: 1.00; lr: 0.00010; 11070/4330 tok/s;   5372 sec
[2021-05-15 15:01:58,252 INFO] Step 27800/50000; acc:  69.29; ppl:  2.76; xent: 1.01; lr: 0.00010; 10356/4055 tok/s;   5382 sec
[2021-05-15 15:02:08,438 INFO] Step 27850/50000; acc:  69.51; ppl:  2.73; xent: 1.01; lr: 0.00010; 9905/4029 tok/s;   5392 sec
[2021-05-15 15:02:17,791 INFO] Step 27900/50000; acc:  69.74; ppl:  2.72; xent: 1.00; lr: 0.00010; 10945/4327 tok/s;   5402 sec
[2021-05-15 15:02:27,311 INFO] Step 27950/50000; acc:  69.89; ppl:  2.70; xent: 0.99; lr: 0.00010; 10378/4185 tok/s;   5411 sec
[2021-05-15 15:02:36,731 INFO] Step 28000/50000; acc:  69.57; ppl:  2.74; xent: 1.01; lr: 0.00010; 11033/4281 tok/s;   5421 sec
[2021-05-15 15:02:46,284 INFO] Step 28050/50000; acc:  69.71; ppl:  2.73; xent: 1.00; lr: 0.00010; 10701/4203 tok/s;   5430 sec
[2021-05-15 15:02:56,051 INFO] Step 28100/50000; acc:  69.90; ppl:  2.71; xent: 1.00; lr: 0.00010; 10386/4092 tok/s;   5440 sec
[2021-05-15 15:03:05,783 INFO] Step 28150/50000; acc:  69.30; ppl:  2.74; xent: 1.01; lr: 0.00010; 10473/4132 tok/s;   5450 sec
[2021-05-15 15:03:15,271 INFO] Step 28200/50000; acc:  69.72; ppl:  2.72; xent: 1.00; lr: 0.00010; 10821/4264 tok/s;   5459 sec
[2021-05-15 15:03:24,849 INFO] Step 28250/50000; acc:  69.85; ppl:  2.71; xent: 1.00; lr: 0.00010; 10535/4179 tok/s;   5469 sec
[2021-05-15 15:03:34,645 INFO] Step 28300/50000; acc:  69.98; ppl:  2.66; xent: 0.98; lr: 0.00010; 10380/4121 tok/s;   5479 sec
[2021-05-15 15:03:43,514 INFO] Step 28350/50000; acc:  69.88; ppl:  2.71; xent: 1.00; lr: 0.00010; 11620/4389 tok/s;   5487 sec
[2021-05-15 15:03:52,892 INFO] Step 28400/50000; acc:  69.78; ppl:  2.69; xent: 0.99; lr: 0.00010; 10515/4304 tok/s;   5497 sec
[2021-05-15 15:03:56,025 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 15:04:02,751 INFO] Step 28450/50000; acc:  69.48; ppl:  2.74; xent: 1.01; lr: 0.00010; 10673/4173 tok/s;   5507 sec
[2021-05-15 15:04:12,294 INFO] Step 28500/50000; acc:  70.06; ppl:  2.67; xent: 0.98; lr: 0.00010; 10379/4163 tok/s;   5516 sec
[2021-05-15 15:04:22,197 INFO] Step 28550/50000; acc:  69.44; ppl:  2.73; xent: 1.00; lr: 0.00010; 10278/4097 tok/s;   5526 sec
[2021-05-15 15:04:31,859 INFO] Step 28600/50000; acc:  69.86; ppl:  2.70; xent: 0.99; lr: 0.00010; 10555/4251 tok/s;   5536 sec
[2021-05-15 15:04:41,203 INFO] Step 28650/50000; acc:  69.96; ppl:  2.71; xent: 1.00; lr: 0.00010; 10853/4247 tok/s;   5545 sec
[2021-05-15 15:04:50,718 INFO] Step 28700/50000; acc:  69.85; ppl:  2.72; xent: 1.00; lr: 0.00010; 10709/4233 tok/s;   5555 sec
[2021-05-15 15:05:00,065 INFO] Step 28750/50000; acc:  70.16; ppl:  2.67; xent: 0.98; lr: 0.00010; 10667/4284 tok/s;   5564 sec
[2021-05-15 15:05:10,212 INFO] Step 28800/50000; acc:  69.06; ppl:  2.77; xent: 1.02; lr: 0.00010; 10453/3946 tok/s;   5574 sec
[2021-05-15 15:05:19,878 INFO] Step 28850/50000; acc:  69.89; ppl:  2.69; xent: 0.99; lr: 0.00010; 10362/4183 tok/s;   5584 sec
[2021-05-15 15:05:29,398 INFO] Step 28900/50000; acc:  70.31; ppl:  2.66; xent: 0.98; lr: 0.00010; 10661/4198 tok/s;   5593 sec
[2021-05-15 15:05:38,708 INFO] Step 28950/50000; acc:  69.86; ppl:  2.71; xent: 1.00; lr: 0.00010; 10926/4350 tok/s;   5603 sec
[2021-05-15 15:05:48,450 INFO] Step 29000/50000; acc:  69.83; ppl:  2.70; xent: 0.99; lr: 0.00010; 10528/4088 tok/s;   5612 sec
[2021-05-15 15:05:58,009 INFO] Step 29050/50000; acc:  70.17; ppl:  2.65; xent: 0.97; lr: 0.00010; 10673/4224 tok/s;   5622 sec
[2021-05-15 15:06:07,027 INFO] Step 29100/50000; acc:  70.16; ppl:  2.66; xent: 0.98; lr: 0.00010; 11107/4379 tok/s;   5631 sec
[2021-05-15 15:06:16,731 INFO] Step 29150/50000; acc:  69.64; ppl:  2.72; xent: 1.00; lr: 0.00010; 10640/4143 tok/s;   5641 sec
[2021-05-15 15:06:17,161 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 15:06:26,319 INFO] Step 29200/50000; acc:  70.05; ppl:  2.66; xent: 0.98; lr: 0.00010; 10345/4247 tok/s;   5650 sec
[2021-05-15 15:06:36,404 INFO] Step 29250/50000; acc:  69.71; ppl:  2.73; xent: 1.00; lr: 0.00010; 10422/3991 tok/s;   5660 sec
[2021-05-15 15:06:46,469 INFO] Step 29300/50000; acc:  70.21; ppl:  2.65; xent: 0.97; lr: 0.00010; 9713/4044 tok/s;   5670 sec
[2021-05-15 15:06:56,121 INFO] Step 29350/50000; acc:  69.72; ppl:  2.71; xent: 1.00; lr: 0.00010; 10622/4178 tok/s;   5680 sec
[2021-05-15 15:07:05,412 INFO] Step 29400/50000; acc:  70.19; ppl:  2.67; xent: 0.98; lr: 0.00010; 11004/4301 tok/s;   5689 sec
[2021-05-15 15:07:15,063 INFO] Step 29450/50000; acc:  69.92; ppl:  2.68; xent: 0.99; lr: 0.00010; 10447/4205 tok/s;   5699 sec
[2021-05-15 15:07:24,785 INFO] Step 29500/50000; acc:  69.68; ppl:  2.72; xent: 1.00; lr: 0.00010; 10689/4059 tok/s;   5709 sec
[2021-05-15 15:07:34,721 INFO] Step 29550/50000; acc:  69.98; ppl:  2.66; xent: 0.98; lr: 0.00010; 10058/4054 tok/s;   5719 sec
[2021-05-15 15:07:44,221 INFO] Step 29600/50000; acc:  69.86; ppl:  2.69; xent: 0.99; lr: 0.00010; 10940/4242 tok/s;   5728 sec
[2021-05-15 15:07:53,577 INFO] Step 29650/50000; acc:  70.39; ppl:  2.65; xent: 0.98; lr: 0.00010; 10769/4297 tok/s;   5737 sec
[2021-05-15 15:08:03,335 INFO] Step 29700/50000; acc:  69.99; ppl:  2.68; xent: 0.99; lr: 0.00010; 10383/4110 tok/s;   5747 sec
[2021-05-15 15:08:13,193 INFO] Step 29750/50000; acc:  70.21; ppl:  2.66; xent: 0.98; lr: 0.00010; 10382/4079 tok/s;   5757 sec
[2021-05-15 15:08:22,176 INFO] Step 29800/50000; acc:  70.12; ppl:  2.67; xent: 0.98; lr: 0.00010; 11401/4439 tok/s;   5766 sec
[2021-05-15 15:08:31,185 INFO] Step 29850/50000; acc:  70.12; ppl:  2.66; xent: 0.98; lr: 0.00010; 11153/4445 tok/s;   5775 sec
[2021-05-15 15:08:38,652 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 15:08:40,864 INFO] Step 29900/50000; acc:  70.04; ppl:  2.66; xent: 0.98; lr: 0.00010; 10489/4155 tok/s;   5785 sec
[2021-05-15 15:08:50,599 INFO] Step 29950/50000; acc:  70.17; ppl:  2.65; xent: 0.98; lr: 0.00010; 10590/4184 tok/s;   5794 sec
[2021-05-15 15:09:00,289 INFO] Step 30000/50000; acc:  69.93; ppl:  2.67; xent: 0.98; lr: 0.00010; 10218/4160 tok/s;   5804 sec
[2021-05-15 15:09:00,291 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/loose/valid.txt, align=None)...
[2021-05-15 15:09:08,198 INFO] Validation perplexity: 3.00209
[2021-05-15 15:09:08,198 INFO] Validation accuracy: 67.7938
[2021-05-15 15:09:08,200 INFO] Saving checkpoint ../models/group2_params/loose_ops/model_step_30000.pt
[2021-05-15 15:09:18,836 INFO] Step 30050/50000; acc:  69.82; ppl:  2.71; xent: 1.00; lr: 0.00010; 5610/2207 tok/s;   5823 sec
[2021-05-15 15:09:28,026 INFO] Step 30100/50000; acc:  70.50; ppl:  2.62; xent: 0.96; lr: 0.00010; 10743/4385 tok/s;   5832 sec
[2021-05-15 15:09:37,801 INFO] Step 30150/50000; acc:  69.89; ppl:  2.69; xent: 0.99; lr: 0.00010; 10450/4122 tok/s;   5842 sec
[2021-05-15 15:09:47,237 INFO] Step 30200/50000; acc:  69.81; ppl:  2.71; xent: 1.00; lr: 0.00010; 10911/4277 tok/s;   5851 sec
[2021-05-15 15:09:57,013 INFO] Step 30250/50000; acc:  70.44; ppl:  2.64; xent: 0.97; lr: 0.00010; 10501/3982 tok/s;   5861 sec
[2021-05-15 15:10:07,305 INFO] Step 30300/50000; acc:  69.76; ppl:  2.70; xent: 0.99; lr: 0.00010; 10006/3982 tok/s;   5871 sec
[2021-05-15 15:10:16,264 INFO] Step 30350/50000; acc:  70.74; ppl:  2.61; xent: 0.96; lr: 0.00010; 11009/4467 tok/s;   5880 sec
[2021-05-15 15:10:26,125 INFO] Step 30400/50000; acc:  70.02; ppl:  2.69; xent: 0.99; lr: 0.00010; 10617/4092 tok/s;   5890 sec
[2021-05-15 15:10:35,618 INFO] Step 30450/50000; acc:  70.59; ppl:  2.62; xent: 0.96; lr: 0.00010; 10566/4155 tok/s;   5899 sec
[2021-05-15 15:10:45,378 INFO] Step 30500/50000; acc:  70.24; ppl:  2.63; xent: 0.97; lr: 0.00010; 10431/4127 tok/s;   5909 sec
[2021-05-15 15:10:54,392 INFO] Step 30550/50000; acc:  70.10; ppl:  2.66; xent: 0.98; lr: 0.00010; 11269/4480 tok/s;   5918 sec
[2021-05-15 15:11:03,898 INFO] Step 30600/50000; acc:  70.02; ppl:  2.68; xent: 0.98; lr: 0.00010; 10716/4223 tok/s;   5928 sec
[2021-05-15 15:11:08,573 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 15:11:13,363 INFO] Step 30650/50000; acc:  70.52; ppl:  2.62; xent: 0.96; lr: 0.00010; 10719/4290 tok/s;   5937 sec
[2021-05-15 15:11:23,275 INFO] Step 30700/50000; acc:  70.11; ppl:  2.65; xent: 0.98; lr: 0.00010; 10258/4054 tok/s;   5947 sec
[2021-05-15 15:11:33,710 INFO] Step 30750/50000; acc:  69.83; ppl:  2.66; xent: 0.98; lr: 0.00010; 9835/3876 tok/s;   5958 sec
[2021-05-15 15:11:43,061 INFO] Step 30800/50000; acc:  70.38; ppl:  2.62; xent: 0.96; lr: 0.00010; 10536/4361 tok/s;   5967 sec
[2021-05-15 15:11:52,837 INFO] Step 30850/50000; acc:  70.31; ppl:  2.68; xent: 0.99; lr: 0.00010; 10693/4144 tok/s;   5977 sec
[2021-05-15 15:12:01,775 INFO] Step 30900/50000; acc:  70.83; ppl:  2.61; xent: 0.96; lr: 0.00010; 10998/4452 tok/s;   5986 sec
[2021-05-15 15:12:11,424 INFO] Step 30950/50000; acc:  70.26; ppl:  2.67; xent: 0.98; lr: 0.00010; 10752/4133 tok/s;   5995 sec
[2021-05-15 15:12:21,424 INFO] Step 31000/50000; acc:  70.06; ppl:  2.66; xent: 0.98; lr: 0.00010; 10334/3998 tok/s;   6005 sec
[2021-05-15 15:12:31,244 INFO] Step 31050/50000; acc:  70.11; ppl:  2.65; xent: 0.97; lr: 0.00010; 10314/4109 tok/s;   6015 sec
[2021-05-15 15:12:40,607 INFO] Step 31100/50000; acc:  70.72; ppl:  2.62; xent: 0.96; lr: 0.00010; 10977/4280 tok/s;   6024 sec
[2021-05-15 15:12:49,922 INFO] Step 31150/50000; acc:  70.49; ppl:  2.63; xent: 0.97; lr: 0.00010; 10647/4311 tok/s;   6034 sec
[2021-05-15 15:12:59,881 INFO] Step 31200/50000; acc:  70.23; ppl:  2.65; xent: 0.98; lr: 0.00010; 10473/4033 tok/s;   6044 sec
[2021-05-15 15:13:09,007 INFO] Step 31250/50000; acc:  70.78; ppl:  2.59; xent: 0.95; lr: 0.00010; 11040/4345 tok/s;   6053 sec
[2021-05-15 15:13:18,269 INFO] Step 31300/50000; acc:  70.67; ppl:  2.61; xent: 0.96; lr: 0.00010; 10882/4330 tok/s;   6062 sec
[2021-05-15 15:13:27,756 INFO] Step 31350/50000; acc:  70.27; ppl:  2.64; xent: 0.97; lr: 0.00010; 10748/4300 tok/s;   6072 sec
[2021-05-15 15:13:29,763 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 15:13:37,127 INFO] Step 31400/50000; acc:  70.53; ppl:  2.63; xent: 0.97; lr: 0.00010; 10950/4297 tok/s;   6081 sec
[2021-05-15 15:13:47,045 INFO] Step 31450/50000; acc:  70.00; ppl:  2.66; xent: 0.98; lr: 0.00010; 10185/4025 tok/s;   6091 sec
[2021-05-15 15:13:57,077 INFO] Step 31500/50000; acc:  70.43; ppl:  2.64; xent: 0.97; lr: 0.00010; 10029/4119 tok/s;   6101 sec
[2021-05-15 15:14:06,446 INFO] Step 31550/50000; acc:  70.65; ppl:  2.62; xent: 0.96; lr: 0.00010; 10985/4292 tok/s;   6110 sec
[2021-05-15 15:14:15,970 INFO] Step 31600/50000; acc:  70.70; ppl:  2.60; xent: 0.96; lr: 0.00010; 10370/4196 tok/s;   6120 sec
[2021-05-15 15:14:25,560 INFO] Step 31650/50000; acc:  70.17; ppl:  2.66; xent: 0.98; lr: 0.00010; 10893/4196 tok/s;   6129 sec
[2021-05-15 15:14:35,042 INFO] Step 31700/50000; acc:  70.60; ppl:  2.61; xent: 0.96; lr: 0.00010; 10633/4213 tok/s;   6139 sec
[2021-05-15 15:14:44,737 INFO] Step 31750/50000; acc:  70.44; ppl:  2.64; xent: 0.97; lr: 0.00010; 10549/4164 tok/s;   6149 sec
[2021-05-15 15:14:54,629 INFO] Step 31800/50000; acc:  70.32; ppl:  2.64; xent: 0.97; lr: 0.00010; 10384/4067 tok/s;   6158 sec
[2021-05-15 15:15:04,034 INFO] Step 31850/50000; acc:  70.41; ppl:  2.63; xent: 0.97; lr: 0.00010; 10811/4286 tok/s;   6168 sec
[2021-05-15 15:15:13,740 INFO] Step 31900/50000; acc:  70.38; ppl:  2.64; xent: 0.97; lr: 0.00010; 10529/4111 tok/s;   6178 sec
[2021-05-15 15:15:23,188 INFO] Step 31950/50000; acc:  71.21; ppl:  2.54; xent: 0.93; lr: 0.00010; 10572/4257 tok/s;   6187 sec
[2021-05-15 15:15:32,148 INFO] Step 32000/50000; acc:  70.31; ppl:  2.64; xent: 0.97; lr: 0.00010; 11622/4385 tok/s;   6196 sec
[2021-05-15 15:15:41,559 INFO] Step 32050/50000; acc:  70.80; ppl:  2.61; xent: 0.96; lr: 0.00010; 10609/4308 tok/s;   6205 sec
[2021-05-15 15:15:44,180 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 15:15:51,183 INFO] Step 32100/50000; acc:  70.74; ppl:  2.60; xent: 0.96; lr: 0.00010; 10585/4218 tok/s;   6215 sec
[2021-05-15 15:16:00,855 INFO] Step 32150/50000; acc:  70.66; ppl:  2.60; xent: 0.95; lr: 0.00010; 10548/4121 tok/s;   6225 sec
[2021-05-15 15:16:10,774 INFO] Step 32200/50000; acc:  70.03; ppl:  2.65; xent: 0.97; lr: 0.00010; 10228/4122 tok/s;   6235 sec
[2021-05-15 15:16:20,442 INFO] Step 32250/50000; acc:  70.91; ppl:  2.59; xent: 0.95; lr: 0.00010; 10447/4210 tok/s;   6244 sec
[2021-05-15 15:16:29,647 INFO] Step 32300/50000; acc:  70.53; ppl:  2.61; xent: 0.96; lr: 0.00010; 10969/4309 tok/s;   6254 sec
[2021-05-15 15:16:39,262 INFO] Step 32350/50000; acc:  70.43; ppl:  2.64; xent: 0.97; lr: 0.00010; 10655/4200 tok/s;   6263 sec
[2021-05-15 15:16:48,529 INFO] Step 32400/50000; acc:  70.84; ppl:  2.58; xent: 0.95; lr: 0.00010; 10780/4329 tok/s;   6272 sec
[2021-05-15 15:16:58,760 INFO] Step 32450/50000; acc:  69.89; ppl:  2.67; xent: 0.98; lr: 0.00010; 10391/3916 tok/s;   6283 sec
[2021-05-15 15:17:08,351 INFO] Step 32500/50000; acc:  70.88; ppl:  2.58; xent: 0.95; lr: 0.00010; 10287/4215 tok/s;   6292 sec
[2021-05-15 15:17:17,944 INFO] Step 32550/50000; acc:  70.71; ppl:  2.60; xent: 0.96; lr: 0.00010; 10697/4170 tok/s;   6302 sec
[2021-05-15 15:17:27,380 INFO] Step 32600/50000; acc:  70.40; ppl:  2.63; xent: 0.97; lr: 0.00010; 10851/4298 tok/s;   6311 sec
[2021-05-15 15:17:36,979 INFO] Step 32650/50000; acc:  70.67; ppl:  2.59; xent: 0.95; lr: 0.00010; 10595/4117 tok/s;   6321 sec
[2021-05-15 15:17:46,674 INFO] Step 32700/50000; acc:  70.70; ppl:  2.58; xent: 0.95; lr: 0.00010; 10638/4188 tok/s;   6331 sec
[2021-05-15 15:17:55,530 INFO] Step 32750/50000; acc:  71.13; ppl:  2.55; xent: 0.93; lr: 0.00010; 11112/4448 tok/s;   6339 sec
[2021-05-15 15:18:05,353 INFO] Step 32800/50000; acc:  70.50; ppl:  2.64; xent: 0.97; lr: 0.00010; 10629/4116 tok/s;   6349 sec
[2021-05-15 15:18:05,363 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 15:18:15,005 INFO] Step 32850/50000; acc:  71.03; ppl:  2.57; xent: 0.94; lr: 0.00010; 10381/4217 tok/s;   6359 sec
[2021-05-15 15:18:25,046 INFO] Step 32900/50000; acc:  70.61; ppl:  2.62; xent: 0.96; lr: 0.00010; 10143/3995 tok/s;   6369 sec
[2021-05-15 15:18:34,943 INFO] Step 32950/50000; acc:  70.56; ppl:  2.60; xent: 0.96; lr: 0.00010; 10177/4147 tok/s;   6379 sec
[2021-05-15 15:18:44,636 INFO] Step 33000/50000; acc:  70.66; ppl:  2.61; xent: 0.96; lr: 0.00010; 10550/4122 tok/s;   6388 sec
[2021-05-15 15:18:53,935 INFO] Step 33050/50000; acc:  71.10; ppl:  2.58; xent: 0.95; lr: 0.00010; 10881/4324 tok/s;   6398 sec
[2021-05-15 15:19:03,528 INFO] Step 33100/50000; acc:  70.72; ppl:  2.60; xent: 0.96; lr: 0.00010; 10506/4231 tok/s;   6407 sec
[2021-05-15 15:19:13,291 INFO] Step 33150/50000; acc:  70.50; ppl:  2.63; xent: 0.97; lr: 0.00010; 10690/4008 tok/s;   6417 sec
[2021-05-15 15:19:23,229 INFO] Step 33200/50000; acc:  70.66; ppl:  2.59; xent: 0.95; lr: 0.00010; 10052/4080 tok/s;   6427 sec
[2021-05-15 15:19:32,840 INFO] Step 33250/50000; acc:  70.62; ppl:  2.61; xent: 0.96; lr: 0.00010; 10835/4184 tok/s;   6437 sec
[2021-05-15 15:19:42,236 INFO] Step 33300/50000; acc:  71.37; ppl:  2.55; xent: 0.94; lr: 0.00010; 10589/4251 tok/s;   6446 sec
[2021-05-15 15:19:51,979 INFO] Step 33350/50000; acc:  70.85; ppl:  2.59; xent: 0.95; lr: 0.00010; 10499/4122 tok/s;   6456 sec
[2021-05-15 15:20:01,951 INFO] Step 33400/50000; acc:  71.09; ppl:  2.56; xent: 0.94; lr: 0.00010; 10342/4037 tok/s;   6466 sec
[2021-05-15 15:20:10,716 INFO] Step 33450/50000; acc:  71.05; ppl:  2.57; xent: 0.94; lr: 0.00010; 11553/4539 tok/s;   6475 sec
[2021-05-15 15:20:20,012 INFO] Step 33500/50000; acc:  71.12; ppl:  2.60; xent: 0.95; lr: 0.00010; 10952/4343 tok/s;   6484 sec
[2021-05-15 15:20:27,028 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 15:20:29,590 INFO] Step 33550/50000; acc:  71.04; ppl:  2.56; xent: 0.94; lr: 0.00010; 10396/4193 tok/s;   6493 sec
[2021-05-15 15:20:39,434 INFO] Step 33600/50000; acc:  70.67; ppl:  2.60; xent: 0.96; lr: 0.00010; 10606/4125 tok/s;   6503 sec
[2021-05-15 15:20:49,391 INFO] Step 33650/50000; acc:  70.76; ppl:  2.58; xent: 0.95; lr: 0.00010; 10046/4075 tok/s;   6513 sec
[2021-05-15 15:20:59,185 INFO] Step 33700/50000; acc:  70.83; ppl:  2.59; xent: 0.95; lr: 0.00010; 10310/4155 tok/s;   6523 sec
[2021-05-15 15:21:08,538 INFO] Step 33750/50000; acc:  71.26; ppl:  2.55; xent: 0.94; lr: 0.00010; 10848/4309 tok/s;   6532 sec
[2021-05-15 15:21:18,361 INFO] Step 33800/50000; acc:  70.80; ppl:  2.61; xent: 0.96; lr: 0.00010; 10389/4118 tok/s;   6542 sec
[2021-05-15 15:21:27,858 INFO] Step 33850/50000; acc:  70.91; ppl:  2.59; xent: 0.95; lr: 0.00010; 10730/4237 tok/s;   6552 sec
[2021-05-15 15:21:37,589 INFO] Step 33900/50000; acc:  71.22; ppl:  2.54; xent: 0.93; lr: 0.00010; 10502/4032 tok/s;   6561 sec
[2021-05-15 15:21:47,730 INFO] Step 33950/50000; acc:  70.59; ppl:  2.61; xent: 0.96; lr: 0.00010; 10213/4009 tok/s;   6572 sec
[2021-05-15 15:21:56,984 INFO] Step 34000/50000; acc:  71.45; ppl:  2.52; xent: 0.92; lr: 0.00010; 10667/4327 tok/s;   6581 sec
[2021-05-15 15:22:06,921 INFO] Step 34050/50000; acc:  70.79; ppl:  2.60; xent: 0.95; lr: 0.00010; 10555/4062 tok/s;   6591 sec
[2021-05-15 15:22:16,290 INFO] Step 34100/50000; acc:  71.44; ppl:  2.51; xent: 0.92; lr: 0.00010; 10575/4222 tok/s;   6600 sec
[2021-05-15 15:22:26,086 INFO] Step 34150/50000; acc:  70.87; ppl:  2.57; xent: 0.94; lr: 0.00010; 10488/4090 tok/s;   6610 sec
[2021-05-15 15:22:35,118 INFO] Step 34200/50000; acc:  70.94; ppl:  2.55; xent: 0.94; lr: 0.00010; 11318/4454 tok/s;   6619 sec
[2021-05-15 15:22:44,614 INFO] Step 34250/50000; acc:  70.94; ppl:  2.58; xent: 0.95; lr: 0.00010; 10635/4279 tok/s;   6628 sec
[2021-05-15 15:22:48,898 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 15:22:54,345 INFO] Step 34300/50000; acc:  71.31; ppl:  2.55; xent: 0.94; lr: 0.00010; 10560/4158 tok/s;   6638 sec
[2021-05-15 15:23:03,886 INFO] Step 34350/50000; acc:  71.39; ppl:  2.55; xent: 0.94; lr: 0.00010; 10442/4159 tok/s;   6648 sec
[2021-05-15 15:23:14,404 INFO] Step 34400/50000; acc:  70.72; ppl:  2.60; xent: 0.95; lr: 0.00010; 9878/3903 tok/s;   6658 sec
[2021-05-15 15:23:24,060 INFO] Step 34450/50000; acc:  71.30; ppl:  2.55; xent: 0.94; lr: 0.00010; 10323/4203 tok/s;   6668 sec
[2021-05-15 15:23:33,509 INFO] Step 34500/50000; acc:  71.35; ppl:  2.55; xent: 0.94; lr: 0.00010; 10712/4261 tok/s;   6677 sec
[2021-05-15 15:23:42,797 INFO] Step 34550/50000; acc:  71.18; ppl:  2.56; xent: 0.94; lr: 0.00010; 10902/4324 tok/s;   6687 sec
[2021-05-15 15:23:52,416 INFO] Step 34600/50000; acc:  70.80; ppl:  2.59; xent: 0.95; lr: 0.00010; 10783/4148 tok/s;   6696 sec
[2021-05-15 15:24:02,392 INFO] Step 34650/50000; acc:  71.15; ppl:  2.56; xent: 0.94; lr: 0.00010; 10218/3935 tok/s;   6706 sec
[2021-05-15 15:24:12,197 INFO] Step 34700/50000; acc:  71.09; ppl:  2.57; xent: 0.94; lr: 0.00010; 10311/4169 tok/s;   6716 sec
[2021-05-15 15:24:21,559 INFO] Step 34750/50000; acc:  71.13; ppl:  2.56; xent: 0.94; lr: 0.00010; 11052/4294 tok/s;   6725 sec
[2021-05-15 15:24:30,789 INFO] Step 34800/50000; acc:  71.52; ppl:  2.52; xent: 0.93; lr: 0.00010; 10722/4341 tok/s;   6735 sec
[2021-05-15 15:24:40,979 INFO] Step 34850/50000; acc:  70.86; ppl:  2.57; xent: 0.94; lr: 0.00010; 10283/3976 tok/s;   6745 sec
[2021-05-15 15:24:50,006 INFO] Step 34900/50000; acc:  71.86; ppl:  2.50; xent: 0.91; lr: 0.00010; 10989/4348 tok/s;   6754 sec
[2021-05-15 15:24:59,179 INFO] Step 34950/50000; acc:  71.24; ppl:  2.55; xent: 0.94; lr: 0.00010; 11117/4392 tok/s;   6763 sec
[2021-05-15 15:25:08,639 INFO] Step 35000/50000; acc:  71.10; ppl:  2.56; xent: 0.94; lr: 0.00010; 10850/4288 tok/s;   6773 sec
[2021-05-15 15:25:08,643 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/loose/valid.txt, align=None)...
[2021-05-15 15:25:16,567 INFO] Validation perplexity: 3.00896
[2021-05-15 15:25:16,568 INFO] Validation accuracy: 67.7829
[2021-05-15 15:25:16,570 INFO] Saving checkpoint ../models/group2_params/loose_ops/model_step_35000.pt
[2021-05-15 15:25:18,921 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 15:25:26,861 INFO] Step 35050/50000; acc:  71.37; ppl:  2.53; xent: 0.93; lr: 0.00010; 5579/2216 tok/s;   6791 sec
[2021-05-15 15:25:36,812 INFO] Step 35100/50000; acc:  70.89; ppl:  2.58; xent: 0.95; lr: 0.00010; 10274/4004 tok/s;   6801 sec
[2021-05-15 15:25:46,724 INFO] Step 35150/50000; acc:  71.54; ppl:  2.51; xent: 0.92; lr: 0.00010; 9946/4155 tok/s;   6811 sec
[2021-05-15 15:25:56,207 INFO] Step 35200/50000; acc:  70.88; ppl:  2.58; xent: 0.95; lr: 0.00010; 10994/4260 tok/s;   6820 sec
[2021-05-15 15:26:05,482 INFO] Step 35250/50000; acc:  71.67; ppl:  2.52; xent: 0.92; lr: 0.00010; 10759/4277 tok/s;   6829 sec
[2021-05-15 15:26:15,186 INFO] Step 35300/50000; acc:  71.32; ppl:  2.54; xent: 0.93; lr: 0.00010; 10441/4176 tok/s;   6839 sec
[2021-05-15 15:26:24,827 INFO] Step 35350/50000; acc:  71.07; ppl:  2.57; xent: 0.94; lr: 0.00010; 10758/4149 tok/s;   6849 sec
[2021-05-15 15:26:34,630 INFO] Step 35400/50000; acc:  71.15; ppl:  2.55; xent: 0.94; lr: 0.00010; 10428/4121 tok/s;   6858 sec
[2021-05-15 15:26:44,447 INFO] Step 35450/50000; acc:  71.41; ppl:  2.53; xent: 0.93; lr: 0.00010; 10332/4073 tok/s;   6868 sec
[2021-05-15 15:26:53,997 INFO] Step 35500/50000; acc:  71.47; ppl:  2.54; xent: 0.93; lr: 0.00010; 10627/4255 tok/s;   6878 sec
[2021-05-15 15:27:03,657 INFO] Step 35550/50000; acc:  71.29; ppl:  2.55; xent: 0.93; lr: 0.00010; 10630/4128 tok/s;   6888 sec
[2021-05-15 15:27:13,048 INFO] Step 35600/50000; acc:  72.11; ppl:  2.47; xent: 0.91; lr: 0.00010; 10652/4249 tok/s;   6897 sec
[2021-05-15 15:27:22,192 INFO] Step 35650/50000; acc:  71.02; ppl:  2.55; xent: 0.94; lr: 0.00010; 11392/4354 tok/s;   6906 sec
[2021-05-15 15:27:31,312 INFO] Step 35700/50000; acc:  71.80; ppl:  2.50; xent: 0.92; lr: 0.00010; 10804/4387 tok/s;   6915 sec
[2021-05-15 15:27:33,705 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 15:27:41,042 INFO] Step 35750/50000; acc:  71.45; ppl:  2.55; xent: 0.94; lr: 0.00010; 10574/4212 tok/s;   6925 sec
[2021-05-15 15:27:50,731 INFO] Step 35800/50000; acc:  71.24; ppl:  2.53; xent: 0.93; lr: 0.00010; 10603/4081 tok/s;   6935 sec
[2021-05-15 15:28:00,870 INFO] Step 35850/50000; acc:  70.99; ppl:  2.55; xent: 0.94; lr: 0.00010; 9914/4052 tok/s;   6945 sec
[2021-05-15 15:28:10,548 INFO] Step 35900/50000; acc:  71.50; ppl:  2.53; xent: 0.93; lr: 0.00010; 10580/4205 tok/s;   6954 sec
[2021-05-15 15:28:19,496 INFO] Step 35950/50000; acc:  71.95; ppl:  2.49; xent: 0.91; lr: 0.00010; 11062/4427 tok/s;   6963 sec
[2021-05-15 15:28:29,260 INFO] Step 36000/50000; acc:  70.87; ppl:  2.58; xent: 0.95; lr: 0.00010; 10602/4159 tok/s;   6973 sec
[2021-05-15 15:28:38,819 INFO] Step 36050/50000; acc:  71.51; ppl:  2.51; xent: 0.92; lr: 0.00010; 10593/4166 tok/s;   6983 sec
[2021-05-15 15:28:48,827 INFO] Step 36100/50000; acc:  71.20; ppl:  2.55; xent: 0.94; lr: 0.00010; 10275/4004 tok/s;   6993 sec
[2021-05-15 15:28:58,647 INFO] Step 36150/50000; acc:  71.27; ppl:  2.53; xent: 0.93; lr: 0.00010; 10350/4142 tok/s;   7003 sec
[2021-05-15 15:29:08,142 INFO] Step 36200/50000; acc:  71.65; ppl:  2.51; xent: 0.92; lr: 0.00010; 10797/4194 tok/s;   7012 sec
[2021-05-15 15:29:17,530 INFO] Step 36250/50000; acc:  71.45; ppl:  2.52; xent: 0.92; lr: 0.00010; 10787/4305 tok/s;   7021 sec
[2021-05-15 15:29:27,229 INFO] Step 36300/50000; acc:  71.55; ppl:  2.51; xent: 0.92; lr: 0.00010; 10452/4103 tok/s;   7031 sec
[2021-05-15 15:29:36,724 INFO] Step 36350/50000; acc:  71.55; ppl:  2.51; xent: 0.92; lr: 0.00010; 10936/4248 tok/s;   7041 sec
[2021-05-15 15:29:45,549 INFO] Step 36400/50000; acc:  72.12; ppl:  2.46; xent: 0.90; lr: 0.00010; 11126/4495 tok/s;   7049 sec
[2021-05-15 15:29:55,100 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 15:29:55,496 INFO] Step 36450/50000; acc:  71.04; ppl:  2.56; xent: 0.94; lr: 0.00010; 10526/4060 tok/s;   7059 sec
[2021-05-15 15:30:04,976 INFO] Step 36500/50000; acc:  71.75; ppl:  2.49; xent: 0.91; lr: 0.00010; 10434/4271 tok/s;   7069 sec
[2021-05-15 15:30:15,228 INFO] Step 36550/50000; acc:  71.22; ppl:  2.54; xent: 0.93; lr: 0.00010; 10036/3937 tok/s;   7079 sec
[2021-05-15 15:30:24,952 INFO] Step 36600/50000; acc:  71.39; ppl:  2.53; xent: 0.93; lr: 0.00010; 10434/4195 tok/s;   7089 sec
[2021-05-15 15:30:34,570 INFO] Step 36650/50000; acc:  71.68; ppl:  2.50; xent: 0.92; lr: 0.00010; 10531/4162 tok/s;   7098 sec
[2021-05-15 15:30:44,191 INFO] Step 36700/50000; acc:  71.72; ppl:  2.51; xent: 0.92; lr: 0.00010; 10647/4166 tok/s;   7108 sec
[2021-05-15 15:30:53,625 INFO] Step 36750/50000; acc:  71.84; ppl:  2.50; xent: 0.91; lr: 0.00010; 10486/4287 tok/s;   7117 sec
[2021-05-15 15:31:03,395 INFO] Step 36800/50000; acc:  71.17; ppl:  2.55; xent: 0.94; lr: 0.00010; 10801/4048 tok/s;   7127 sec
[2021-05-15 15:31:13,592 INFO] Step 36850/50000; acc:  71.51; ppl:  2.51; xent: 0.92; lr: 0.00010; 9917/3985 tok/s;   7137 sec
[2021-05-15 15:31:22,855 INFO] Step 36900/50000; acc:  72.02; ppl:  2.46; xent: 0.90; lr: 0.00010; 10868/4334 tok/s;   7147 sec
[2021-05-15 15:31:32,515 INFO] Step 36950/50000; acc:  71.54; ppl:  2.50; xent: 0.92; lr: 0.00010; 10605/4175 tok/s;   7156 sec
[2021-05-15 15:31:42,150 INFO] Step 37000/50000; acc:  71.56; ppl:  2.51; xent: 0.92; lr: 0.00010; 10596/4130 tok/s;   7166 sec
[2021-05-15 15:31:52,075 INFO] Step 37050/50000; acc:  72.44; ppl:  2.46; xent: 0.90; lr: 0.00010; 10258/4034 tok/s;   7176 sec
[2021-05-15 15:32:00,871 INFO] Step 37100/50000; acc:  71.84; ppl:  2.48; xent: 0.91; lr: 0.00010; 11493/4545 tok/s;   7185 sec
[2021-05-15 15:32:10,121 INFO] Step 37150/50000; acc:  71.42; ppl:  2.52; xent: 0.92; lr: 0.00010; 11075/4340 tok/s;   7194 sec
[2021-05-15 15:32:16,900 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 15:32:19,843 INFO] Step 37200/50000; acc:  71.80; ppl:  2.48; xent: 0.91; lr: 0.00010; 10250/4151 tok/s;   7204 sec
[2021-05-15 15:32:29,785 INFO] Step 37250/50000; acc:  71.81; ppl:  2.51; xent: 0.92; lr: 0.00010; 10548/4094 tok/s;   7214 sec
[2021-05-15 15:32:39,623 INFO] Step 37300/50000; acc:  71.58; ppl:  2.49; xent: 0.91; lr: 0.00010; 10010/4087 tok/s;   7223 sec
[2021-05-15 15:32:49,390 INFO] Step 37350/50000; acc:  71.85; ppl:  2.50; xent: 0.91; lr: 0.00010; 10440/4229 tok/s;   7233 sec
[2021-05-15 15:32:58,881 INFO] Step 37400/50000; acc:  71.89; ppl:  2.48; xent: 0.91; lr: 0.00010; 10767/4204 tok/s;   7243 sec
[2021-05-15 15:33:08,391 INFO] Step 37450/50000; acc:  71.54; ppl:  2.51; xent: 0.92; lr: 0.00010; 10622/4222 tok/s;   7252 sec
[2021-05-15 15:33:18,050 INFO] Step 37500/50000; acc:  71.45; ppl:  2.53; xent: 0.93; lr: 0.00010; 10695/4193 tok/s;   7262 sec
[2021-05-15 15:33:27,713 INFO] Step 37550/50000; acc:  72.16; ppl:  2.46; xent: 0.90; lr: 0.00010; 10391/4049 tok/s;   7272 sec
[2021-05-15 15:33:37,921 INFO] Step 37600/50000; acc:  71.34; ppl:  2.53; xent: 0.93; lr: 0.00010; 10250/3994 tok/s;   7282 sec
[2021-05-15 15:33:47,096 INFO] Step 37650/50000; acc:  72.34; ppl:  2.46; xent: 0.90; lr: 0.00010; 10889/4344 tok/s;   7291 sec
[2021-05-15 15:33:56,906 INFO] Step 37700/50000; acc:  72.20; ppl:  2.47; xent: 0.90; lr: 0.00010; 10352/4127 tok/s;   7301 sec
[2021-05-15 15:34:06,451 INFO] Step 37750/50000; acc:  71.75; ppl:  2.49; xent: 0.91; lr: 0.00010; 10687/4194 tok/s;   7310 sec
[2021-05-15 15:34:16,079 INFO] Step 37800/50000; acc:  72.22; ppl:  2.46; xent: 0.90; lr: 0.00010; 10659/4104 tok/s;   7320 sec
[2021-05-15 15:34:25,182 INFO] Step 37850/50000; acc:  72.08; ppl:  2.47; xent: 0.90; lr: 0.00010; 11095/4433 tok/s;   7329 sec
[2021-05-15 15:34:34,608 INFO] Step 37900/50000; acc:  71.89; ppl:  2.49; xent: 0.91; lr: 0.00010; 10674/4314 tok/s;   7338 sec
[2021-05-15 15:34:38,510 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 15:34:44,253 INFO] Step 37950/50000; acc:  71.84; ppl:  2.49; xent: 0.91; lr: 0.00010; 10736/4171 tok/s;   7348 sec
[2021-05-15 15:34:53,827 INFO] Step 38000/50000; acc:  72.01; ppl:  2.46; xent: 0.90; lr: 0.00010; 10368/4163 tok/s;   7358 sec
[2021-05-15 15:35:04,389 INFO] Step 38050/50000; acc:  71.29; ppl:  2.52; xent: 0.93; lr: 0.00010; 9876/3902 tok/s;   7368 sec
[2021-05-15 15:35:13,794 INFO] Step 38100/50000; acc:  72.16; ppl:  2.45; xent: 0.89; lr: 0.00010; 10460/4313 tok/s;   7378 sec
[2021-05-15 15:35:23,257 INFO] Step 38150/50000; acc:  71.96; ppl:  2.48; xent: 0.91; lr: 0.00010; 10803/4226 tok/s;   7387 sec
[2021-05-15 15:35:32,625 INFO] Step 38200/50000; acc:  72.09; ppl:  2.49; xent: 0.91; lr: 0.00010; 10882/4311 tok/s;   7396 sec
[2021-05-15 15:35:41,939 INFO] Step 38250/50000; acc:  71.78; ppl:  2.50; xent: 0.92; lr: 0.00010; 11048/4262 tok/s;   7406 sec
[2021-05-15 15:35:51,785 INFO] Step 38300/50000; acc:  71.88; ppl:  2.48; xent: 0.91; lr: 0.00010; 10474/4027 tok/s;   7416 sec
[2021-05-15 15:36:01,706 INFO] Step 38350/50000; acc:  71.88; ppl:  2.46; xent: 0.90; lr: 0.00010; 9990/4094 tok/s;   7426 sec
[2021-05-15 15:36:11,117 INFO] Step 38400/50000; acc:  71.88; ppl:  2.49; xent: 0.91; lr: 0.00010; 11134/4261 tok/s;   7435 sec
[2021-05-15 15:36:20,502 INFO] Step 38450/50000; acc:  72.28; ppl:  2.45; xent: 0.90; lr: 0.00010; 10658/4281 tok/s;   7444 sec
[2021-05-15 15:36:30,378 INFO] Step 38500/50000; acc:  72.33; ppl:  2.44; xent: 0.89; lr: 0.00010; 10290/4074 tok/s;   7454 sec
[2021-05-15 15:36:39,399 INFO] Step 38550/50000; acc:  72.21; ppl:  2.45; xent: 0.89; lr: 0.00010; 11301/4377 tok/s;   7463 sec
[2021-05-15 15:36:48,782 INFO] Step 38600/50000; acc:  72.11; ppl:  2.47; xent: 0.91; lr: 0.00010; 10859/4323 tok/s;   7473 sec
[2021-05-15 15:36:58,344 INFO] Step 38650/50000; acc:  72.24; ppl:  2.46; xent: 0.90; lr: 0.00010; 10612/4178 tok/s;   7482 sec
[2021-05-15 15:36:59,651 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 15:37:07,909 INFO] Step 38700/50000; acc:  72.38; ppl:  2.44; xent: 0.89; lr: 0.00010; 10610/4245 tok/s;   7492 sec
[2021-05-15 15:37:18,018 INFO] Step 38750/50000; acc:  71.66; ppl:  2.51; xent: 0.92; lr: 0.00010; 10160/3963 tok/s;   7502 sec
[2021-05-15 15:37:27,819 INFO] Step 38800/50000; acc:  72.19; ppl:  2.44; xent: 0.89; lr: 0.00010; 10059/4238 tok/s;   7512 sec
[2021-05-15 15:37:37,260 INFO] Step 38850/50000; acc:  71.83; ppl:  2.49; xent: 0.91; lr: 0.00010; 11066/4220 tok/s;   7521 sec
[2021-05-15 15:37:46,547 INFO] Step 38900/50000; acc:  72.76; ppl:  2.43; xent: 0.89; lr: 0.00010; 10604/4288 tok/s;   7530 sec
[2021-05-15 15:37:56,213 INFO] Step 38950/50000; acc:  71.98; ppl:  2.47; xent: 0.90; lr: 0.00010; 10600/4175 tok/s;   7540 sec
[2021-05-15 15:38:05,799 INFO] Step 39000/50000; acc:  71.80; ppl:  2.49; xent: 0.91; lr: 0.00010; 10882/4180 tok/s;   7550 sec
[2021-05-15 15:38:15,674 INFO] Step 39050/50000; acc:  71.99; ppl:  2.47; xent: 0.90; lr: 0.00010; 10266/4112 tok/s;   7560 sec
[2021-05-15 15:38:25,328 INFO] Step 39100/50000; acc:  72.46; ppl:  2.46; xent: 0.90; lr: 0.00010; 10625/4129 tok/s;   7569 sec
[2021-05-15 15:38:34,717 INFO] Step 39150/50000; acc:  72.35; ppl:  2.44; xent: 0.89; lr: 0.00010; 10592/4309 tok/s;   7579 sec
[2021-05-15 15:38:44,419 INFO] Step 39200/50000; acc:  71.85; ppl:  2.47; xent: 0.91; lr: 0.00010; 10721/4145 tok/s;   7588 sec
[2021-05-15 15:38:53,824 INFO] Step 39250/50000; acc:  72.61; ppl:  2.41; xent: 0.88; lr: 0.00010; 10757/4212 tok/s;   7598 sec
[2021-05-15 15:39:02,919 INFO] Step 39300/50000; acc:  72.31; ppl:  2.43; xent: 0.89; lr: 0.00010; 11112/4368 tok/s;   7607 sec
[2021-05-15 15:39:12,359 INFO] Step 39350/50000; acc:  72.18; ppl:  2.45; xent: 0.90; lr: 0.00010; 10736/4294 tok/s;   7616 sec
[2021-05-15 15:39:14,279 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 15:39:21,976 INFO] Step 39400/50000; acc:  72.19; ppl:  2.45; xent: 0.90; lr: 0.00010; 10670/4242 tok/s;   7626 sec
[2021-05-15 15:39:31,657 INFO] Step 39450/50000; acc:  72.53; ppl:  2.43; xent: 0.89; lr: 0.00010; 10495/4102 tok/s;   7636 sec
[2021-05-15 15:39:41,513 INFO] Step 39500/50000; acc:  71.99; ppl:  2.46; xent: 0.90; lr: 0.00010; 10160/4152 tok/s;   7645 sec
[2021-05-15 15:39:51,273 INFO] Step 39550/50000; acc:  72.01; ppl:  2.45; xent: 0.90; lr: 0.00010; 10566/4159 tok/s;   7655 sec
[2021-05-15 15:40:00,285 INFO] Step 39600/50000; acc:  72.77; ppl:  2.41; xent: 0.88; lr: 0.00010; 10971/4395 tok/s;   7664 sec
[2021-05-15 15:40:09,966 INFO] Step 39650/50000; acc:  72.00; ppl:  2.49; xent: 0.91; lr: 0.00010; 10740/4195 tok/s;   7674 sec
[2021-05-15 15:40:19,578 INFO] Step 39700/50000; acc:  72.76; ppl:  2.42; xent: 0.88; lr: 0.00010; 10394/4128 tok/s;   7683 sec
[2021-05-15 15:40:29,672 INFO] Step 39750/50000; acc:  72.01; ppl:  2.46; xent: 0.90; lr: 0.00010; 10263/4009 tok/s;   7694 sec
[2021-05-15 15:40:39,541 INFO] Step 39800/50000; acc:  72.21; ppl:  2.45; xent: 0.90; lr: 0.00010; 10389/4085 tok/s;   7703 sec
[2021-05-15 15:40:48,957 INFO] Step 39850/50000; acc:  72.71; ppl:  2.42; xent: 0.88; lr: 0.00010; 10766/4249 tok/s;   7713 sec
[2021-05-15 15:40:58,501 INFO] Step 39900/50000; acc:  72.11; ppl:  2.46; xent: 0.90; lr: 0.00010; 10759/4212 tok/s;   7722 sec
[2021-05-15 15:41:08,159 INFO] Step 39950/50000; acc:  72.75; ppl:  2.40; xent: 0.87; lr: 0.00010; 10301/4110 tok/s;   7732 sec
[2021-05-15 15:41:17,735 INFO] Step 40000/50000; acc:  72.54; ppl:  2.43; xent: 0.89; lr: 0.00010; 10981/4207 tok/s;   7742 sec
[2021-05-15 15:41:17,738 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/loose/valid.txt, align=None)...
[2021-05-15 15:41:25,647 INFO] Validation perplexity: 3.06572
[2021-05-15 15:41:25,647 INFO] Validation accuracy: 67.6173
[2021-05-15 15:41:25,650 INFO] Saving checkpoint ../models/group2_params/loose_ops/model_step_40000.pt
[2021-05-15 15:41:35,141 INFO] Step 40050/50000; acc:  72.64; ppl:  2.41; xent: 0.88; lr: 0.00010; 5696/2315 tok/s;   7759 sec
[2021-05-15 15:41:44,237 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 15:41:44,854 INFO] Step 40100/50000; acc:  72.43; ppl:  2.42; xent: 0.88; lr: 0.00010; 10456/4117 tok/s;   7769 sec
[2021-05-15 15:41:54,468 INFO] Step 40150/50000; acc:  72.20; ppl:  2.43; xent: 0.89; lr: 0.00010; 10578/4211 tok/s;   7778 sec
[2021-05-15 15:42:04,715 INFO] Step 40200/50000; acc:  72.12; ppl:  2.46; xent: 0.90; lr: 0.00010; 10014/3952 tok/s;   7789 sec
[2021-05-15 15:42:14,438 INFO] Step 40250/50000; acc:  72.51; ppl:  2.42; xent: 0.88; lr: 0.00010; 10335/4182 tok/s;   7798 sec
[2021-05-15 15:42:23,932 INFO] Step 40300/50000; acc:  72.60; ppl:  2.42; xent: 0.88; lr: 0.00010; 10627/4221 tok/s;   7808 sec
[2021-05-15 15:42:33,514 INFO] Step 40350/50000; acc:  72.72; ppl:  2.45; xent: 0.90; lr: 0.00010; 10749/4199 tok/s;   7817 sec
[2021-05-15 15:42:42,818 INFO] Step 40400/50000; acc:  72.45; ppl:  2.42; xent: 0.89; lr: 0.00010; 10656/4335 tok/s;   7827 sec
[2021-05-15 15:42:52,568 INFO] Step 40450/50000; acc:  72.29; ppl:  2.45; xent: 0.90; lr: 0.00010; 10841/4048 tok/s;   7836 sec
[2021-05-15 15:43:02,738 INFO] Step 40500/50000; acc:  72.29; ppl:  2.43; xent: 0.89; lr: 0.00010; 9807/3993 tok/s;   7847 sec
[2021-05-15 15:43:12,142 INFO] Step 40550/50000; acc:  72.71; ppl:  2.40; xent: 0.88; lr: 0.00010; 10819/4282 tok/s;   7856 sec
[2021-05-15 15:43:21,792 INFO] Step 40600/50000; acc:  72.56; ppl:  2.44; xent: 0.89; lr: 0.00010; 10684/4204 tok/s;   7866 sec
[2021-05-15 15:43:31,240 INFO] Step 40650/50000; acc:  72.71; ppl:  2.40; xent: 0.87; lr: 0.00010; 10716/4162 tok/s;   7875 sec
[2021-05-15 15:43:41,101 INFO] Step 40700/50000; acc:  72.88; ppl:  2.39; xent: 0.87; lr: 0.00010; 10439/4080 tok/s;   7885 sec
[2021-05-15 15:43:50,081 INFO] Step 40750/50000; acc:  72.85; ppl:  2.39; xent: 0.87; lr: 0.00010; 11044/4461 tok/s;   7894 sec
[2021-05-15 15:43:59,524 INFO] Step 40800/50000; acc:  72.27; ppl:  2.46; xent: 0.90; lr: 0.00010; 10969/4254 tok/s;   7903 sec
[2021-05-15 15:44:05,837 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 15:44:09,269 INFO] Step 40850/50000; acc:  72.44; ppl:  2.41; xent: 0.88; lr: 0.00010; 10336/4151 tok/s;   7913 sec
[2021-05-15 15:44:18,987 INFO] Step 40900/50000; acc:  72.81; ppl:  2.41; xent: 0.88; lr: 0.00010; 10483/4165 tok/s;   7923 sec
[2021-05-15 15:44:29,029 INFO] Step 40950/50000; acc:  72.20; ppl:  2.45; xent: 0.90; lr: 0.00010; 10082/4046 tok/s;   7933 sec
[2021-05-15 15:44:38,873 INFO] Step 41000/50000; acc:  72.55; ppl:  2.42; xent: 0.89; lr: 0.00010; 10337/4133 tok/s;   7943 sec
[2021-05-15 15:44:48,173 INFO] Step 41050/50000; acc:  73.17; ppl:  2.38; xent: 0.87; lr: 0.00010; 10879/4324 tok/s;   7952 sec
[2021-05-15 15:44:57,704 INFO] Step 41100/50000; acc:  72.43; ppl:  2.43; xent: 0.89; lr: 0.00010; 10563/4182 tok/s;   7962 sec
[2021-05-15 15:45:07,319 INFO] Step 41150/50000; acc:  72.45; ppl:  2.45; xent: 0.90; lr: 0.00010; 10791/4197 tok/s;   7971 sec
[2021-05-15 15:45:16,907 INFO] Step 41200/50000; acc:  72.58; ppl:  2.40; xent: 0.88; lr: 0.00010; 10491/4136 tok/s;   7981 sec
[2021-05-15 15:45:27,343 INFO] Step 41250/50000; acc:  72.38; ppl:  2.45; xent: 0.90; lr: 0.00010; 10039/3895 tok/s;   7991 sec
[2021-05-15 15:45:36,310 INFO] Step 41300/50000; acc:  73.55; ppl:  2.35; xent: 0.85; lr: 0.00010; 11006/4401 tok/s;   8000 sec
[2021-05-15 15:45:46,142 INFO] Step 41350/50000; acc:  72.75; ppl:  2.41; xent: 0.88; lr: 0.00010; 10430/4143 tok/s;   8010 sec
[2021-05-15 15:45:55,857 INFO] Step 41400/50000; acc:  72.67; ppl:  2.41; xent: 0.88; lr: 0.00010; 10576/4127 tok/s;   8020 sec
[2021-05-15 15:46:05,367 INFO] Step 41450/50000; acc:  73.19; ppl:  2.37; xent: 0.86; lr: 0.00010; 10683/4136 tok/s;   8029 sec
[2021-05-15 15:46:14,567 INFO] Step 41500/50000; acc:  72.76; ppl:  2.41; xent: 0.88; lr: 0.00010; 11101/4427 tok/s;   8038 sec
[2021-05-15 15:46:23,934 INFO] Step 41550/50000; acc:  72.78; ppl:  2.39; xent: 0.87; lr: 0.00010; 10555/4297 tok/s;   8048 sec
[2021-05-15 15:46:27,507 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 15:46:33,570 INFO] Step 41600/50000; acc:  72.88; ppl:  2.40; xent: 0.88; lr: 0.00010; 10842/4211 tok/s;   8057 sec
[2021-05-15 15:46:43,313 INFO] Step 41650/50000; acc:  72.65; ppl:  2.40; xent: 0.87; lr: 0.00010; 10325/4103 tok/s;   8067 sec
[2021-05-15 15:46:53,490 INFO] Step 41700/50000; acc:  72.46; ppl:  2.41; xent: 0.88; lr: 0.00010; 9931/4018 tok/s;   8077 sec
[2021-05-15 15:47:02,932 INFO] Step 41750/50000; acc:  72.69; ppl:  2.40; xent: 0.87; lr: 0.00010; 10726/4304 tok/s;   8087 sec
[2021-05-15 15:47:12,490 INFO] Step 41800/50000; acc:  72.92; ppl:  2.40; xent: 0.88; lr: 0.00010; 10686/4187 tok/s;   8096 sec
[2021-05-15 15:47:21,739 INFO] Step 41850/50000; acc:  72.86; ppl:  2.39; xent: 0.87; lr: 0.00010; 10895/4347 tok/s;   8106 sec
[2021-05-15 15:47:31,168 INFO] Step 41900/50000; acc:  72.80; ppl:  2.41; xent: 0.88; lr: 0.00010; 10899/4233 tok/s;   8115 sec
[2021-05-15 15:47:40,950 INFO] Step 41950/50000; acc:  72.65; ppl:  2.42; xent: 0.88; lr: 0.00010; 10559/4056 tok/s;   8125 sec
[2021-05-15 15:47:50,606 INFO] Step 42000/50000; acc:  72.76; ppl:  2.39; xent: 0.87; lr: 0.00010; 10293/4182 tok/s;   8134 sec
[2021-05-15 15:48:00,165 INFO] Step 42050/50000; acc:  72.50; ppl:  2.42; xent: 0.88; lr: 0.00010; 10979/4230 tok/s;   8144 sec
[2021-05-15 15:48:09,299 INFO] Step 42100/50000; acc:  73.29; ppl:  2.36; xent: 0.86; lr: 0.00010; 10799/4384 tok/s;   8153 sec
[2021-05-15 15:48:19,232 INFO] Step 42150/50000; acc:  73.07; ppl:  2.38; xent: 0.87; lr: 0.00010; 10353/4038 tok/s;   8163 sec
[2021-05-15 15:48:28,283 INFO] Step 42200/50000; acc:  73.01; ppl:  2.38; xent: 0.87; lr: 0.00010; 11334/4353 tok/s;   8172 sec
[2021-05-15 15:48:37,736 INFO] Step 42250/50000; acc:  72.94; ppl:  2.39; xent: 0.87; lr: 0.00010; 10675/4325 tok/s;   8182 sec
[2021-05-15 15:48:41,847 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 15:48:47,433 INFO] Step 42300/50000; acc:  72.94; ppl:  2.39; xent: 0.87; lr: 0.00010; 10606/4121 tok/s;   8191 sec
[2021-05-15 15:48:56,893 INFO] Step 42350/50000; acc:  73.58; ppl:  2.34; xent: 0.85; lr: 0.00010; 10513/4274 tok/s;   8201 sec
[2021-05-15 15:49:06,746 INFO] Step 42400/50000; acc:  72.43; ppl:  2.43; xent: 0.89; lr: 0.00010; 10538/4093 tok/s;   8211 sec
[2021-05-15 15:49:16,677 INFO] Step 42450/50000; acc:  73.04; ppl:  2.37; xent: 0.86; lr: 0.00010; 10046/4160 tok/s;   8221 sec
[2021-05-15 15:49:25,903 INFO] Step 42500/50000; acc:  73.07; ppl:  2.38; xent: 0.87; lr: 0.00010; 10982/4308 tok/s;   8230 sec
[2021-05-15 15:49:35,185 INFO] Step 42550/50000; acc:  73.03; ppl:  2.39; xent: 0.87; lr: 0.00010; 10910/4296 tok/s;   8239 sec
[2021-05-15 15:49:44,945 INFO] Step 42600/50000; acc:  72.84; ppl:  2.40; xent: 0.88; lr: 0.00010; 10491/4128 tok/s;   8249 sec
[2021-05-15 15:49:54,822 INFO] Step 42650/50000; acc:  72.60; ppl:  2.42; xent: 0.88; lr: 0.00010; 10458/4078 tok/s;   8259 sec
[2021-05-15 15:50:04,290 INFO] Step 42700/50000; acc:  72.95; ppl:  2.38; xent: 0.87; lr: 0.00010; 10663/4249 tok/s;   8268 sec
[2021-05-15 15:50:14,208 INFO] Step 42750/50000; acc:  73.13; ppl:  2.38; xent: 0.87; lr: 0.00010; 10400/4032 tok/s;   8278 sec
[2021-05-15 15:50:23,442 INFO] Step 42800/50000; acc:  73.12; ppl:  2.37; xent: 0.86; lr: 0.00010; 10761/4371 tok/s;   8287 sec
[2021-05-15 15:50:33,307 INFO] Step 42850/50000; acc:  72.48; ppl:  2.40; xent: 0.87; lr: 0.00010; 10607/4091 tok/s;   8297 sec
[2021-05-15 15:50:42,821 INFO] Step 42900/50000; acc:  73.61; ppl:  2.32; xent: 0.84; lr: 0.00010; 10465/4199 tok/s;   8307 sec
[2021-05-15 15:50:51,897 INFO] Step 42950/50000; acc:  73.25; ppl:  2.36; xent: 0.86; lr: 0.00010; 11240/4339 tok/s;   8316 sec
[2021-05-15 15:51:01,232 INFO] Step 43000/50000; acc:  72.99; ppl:  2.38; xent: 0.87; lr: 0.00010; 10941/4316 tok/s;   8325 sec
[2021-05-15 15:51:02,900 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 15:51:11,010 INFO] Step 43050/50000; acc:  73.05; ppl:  2.37; xent: 0.86; lr: 0.00010; 10380/4191 tok/s;   8335 sec
[2021-05-15 15:51:20,748 INFO] Step 43100/50000; acc:  72.99; ppl:  2.37; xent: 0.86; lr: 0.00010; 10568/4101 tok/s;   8345 sec
[2021-05-15 15:51:30,665 INFO] Step 43150/50000; acc:  73.22; ppl:  2.36; xent: 0.86; lr: 0.00010; 9915/4107 tok/s;   8355 sec
[2021-05-15 15:51:40,616 INFO] Step 43200/50000; acc:  72.83; ppl:  2.40; xent: 0.88; lr: 0.00010; 10476/4089 tok/s;   8364 sec
[2021-05-15 15:51:49,670 INFO] Step 43250/50000; acc:  73.32; ppl:  2.35; xent: 0.85; lr: 0.00010; 11048/4395 tok/s;   8374 sec
[2021-05-15 15:51:59,097 INFO] Step 43300/50000; acc:  73.11; ppl:  2.38; xent: 0.87; lr: 0.00010; 10678/4270 tok/s;   8383 sec
[2021-05-15 15:52:08,827 INFO] Step 43350/50000; acc:  72.88; ppl:  2.39; xent: 0.87; lr: 0.00010; 10598/4120 tok/s;   8393 sec
[2021-05-15 15:52:19,095 INFO] Step 43400/50000; acc:  72.95; ppl:  2.38; xent: 0.87; lr: 0.00010; 10048/3895 tok/s;   8403 sec
[2021-05-15 15:52:28,809 INFO] Step 43450/50000; acc:  73.26; ppl:  2.36; xent: 0.86; lr: 0.00010; 10448/4142 tok/s;   8413 sec
[2021-05-15 15:52:38,206 INFO] Step 43500/50000; acc:  73.50; ppl:  2.34; xent: 0.85; lr: 0.00010; 10764/4308 tok/s;   8422 sec
[2021-05-15 15:52:47,730 INFO] Step 43550/50000; acc:  73.07; ppl:  2.38; xent: 0.87; lr: 0.00010; 10813/4179 tok/s;   8432 sec
[2021-05-15 15:52:57,382 INFO] Step 43600/50000; acc:  73.59; ppl:  2.34; xent: 0.85; lr: 0.00010; 10328/4148 tok/s;   8441 sec
[2021-05-15 15:53:06,922 INFO] Step 43650/50000; acc:  73.17; ppl:  2.37; xent: 0.86; lr: 0.00010; 11031/4229 tok/s;   8451 sec
[2021-05-15 15:53:15,969 INFO] Step 43700/50000; acc:  73.74; ppl:  2.32; xent: 0.84; lr: 0.00010; 10823/4460 tok/s;   8460 sec
[2021-05-15 15:53:24,531 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 15:53:25,722 INFO] Step 43750/50000; acc:  73.05; ppl:  2.37; xent: 0.86; lr: 0.00010; 10522/4068 tok/s;   8470 sec
[2021-05-15 15:53:35,272 INFO] Step 43800/50000; acc:  73.02; ppl:  2.36; xent: 0.86; lr: 0.00010; 10722/4265 tok/s;   8479 sec
[2021-05-15 15:53:45,515 INFO] Step 43850/50000; acc:  73.02; ppl:  2.37; xent: 0.86; lr: 0.00010; 9913/3937 tok/s;   8489 sec
[2021-05-15 15:53:55,134 INFO] Step 43900/50000; acc:  73.20; ppl:  2.37; xent: 0.86; lr: 0.00010; 10580/4223 tok/s;   8499 sec
[2021-05-15 15:54:04,598 INFO] Step 43950/50000; acc:  73.42; ppl:  2.34; xent: 0.85; lr: 0.00010; 10471/4218 tok/s;   8508 sec
[2021-05-15 15:54:14,213 INFO] Step 44000/50000; acc:  73.05; ppl:  2.38; xent: 0.87; lr: 0.00010; 10826/4214 tok/s;   8518 sec
[2021-05-15 15:54:23,541 INFO] Step 44050/50000; acc:  73.42; ppl:  2.35; xent: 0.85; lr: 0.00010; 10759/4331 tok/s;   8527 sec
[2021-05-15 15:54:33,139 INFO] Step 44100/50000; acc:  73.47; ppl:  2.33; xent: 0.85; lr: 0.00010; 10678/4066 tok/s;   8537 sec
[2021-05-15 15:54:43,583 INFO] Step 44150/50000; acc:  72.80; ppl:  2.39; xent: 0.87; lr: 0.00010; 9814/3942 tok/s;   8547 sec
[2021-05-15 15:54:52,869 INFO] Step 44200/50000; acc:  73.59; ppl:  2.32; xent: 0.84; lr: 0.00010; 10961/4287 tok/s;   8557 sec
[2021-05-15 15:55:02,689 INFO] Step 44250/50000; acc:  73.56; ppl:  2.34; xent: 0.85; lr: 0.00010; 10376/4147 tok/s;   8567 sec
[2021-05-15 15:55:12,127 INFO] Step 44300/50000; acc:  73.61; ppl:  2.32; xent: 0.84; lr: 0.00010; 10697/4184 tok/s;   8576 sec
[2021-05-15 15:55:21,922 INFO] Step 44350/50000; acc:  73.66; ppl:  2.33; xent: 0.85; lr: 0.00010; 10573/4102 tok/s;   8586 sec
[2021-05-15 15:55:30,802 INFO] Step 44400/50000; acc:  73.60; ppl:  2.32; xent: 0.84; lr: 0.00010; 11150/4456 tok/s;   8595 sec
[2021-05-15 15:55:40,541 INFO] Step 44450/50000; acc:  72.95; ppl:  2.38; xent: 0.87; lr: 0.00010; 10682/4193 tok/s;   8604 sec
[2021-05-15 15:55:46,233 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 15:55:50,022 INFO] Step 44500/50000; acc:  73.62; ppl:  2.32; xent: 0.84; lr: 0.00010; 10465/4264 tok/s;   8614 sec
[2021-05-15 15:55:59,839 INFO] Step 44550/50000; acc:  73.58; ppl:  2.34; xent: 0.85; lr: 0.00010; 10502/4098 tok/s;   8624 sec
[2021-05-15 15:56:10,099 INFO] Step 44600/50000; acc:  72.91; ppl:  2.37; xent: 0.86; lr: 0.00010; 9921/3997 tok/s;   8634 sec
[2021-05-15 15:56:19,700 INFO] Step 44650/50000; acc:  73.42; ppl:  2.33; xent: 0.85; lr: 0.00010; 10502/4209 tok/s;   8644 sec
[2021-05-15 15:56:29,242 INFO] Step 44700/50000; acc:  73.89; ppl:  2.33; xent: 0.85; lr: 0.00010; 10737/4219 tok/s;   8653 sec
[2021-05-15 15:56:38,354 INFO] Step 44750/50000; acc:  73.53; ppl:  2.32; xent: 0.84; lr: 0.00010; 10817/4355 tok/s;   8662 sec
[2021-05-15 15:56:48,053 INFO] Step 44800/50000; acc:  72.85; ppl:  2.38; xent: 0.87; lr: 0.00010; 10859/4159 tok/s;   8672 sec
[2021-05-15 15:56:57,745 INFO] Step 44850/50000; acc:  73.17; ppl:  2.34; xent: 0.85; lr: 0.00010; 10472/4097 tok/s;   8682 sec
[2021-05-15 15:57:07,946 INFO] Step 44900/50000; acc:  73.24; ppl:  2.35; xent: 0.85; lr: 0.00010; 9949/3981 tok/s;   8692 sec
[2021-05-15 15:57:17,182 INFO] Step 44950/50000; acc:  73.65; ppl:  2.32; xent: 0.84; lr: 0.00010; 11005/4331 tok/s;   8701 sec
[2021-05-15 15:57:26,879 INFO] Step 45000/50000; acc:  73.65; ppl:  2.32; xent: 0.84; lr: 0.00010; 10561/4174 tok/s;   8711 sec
[2021-05-15 15:57:26,882 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/loose/valid.txt, align=None)...
[2021-05-15 15:57:34,789 INFO] Validation perplexity: 3.11649
[2021-05-15 15:57:34,789 INFO] Validation accuracy: 67.7761
[2021-05-15 15:57:34,791 INFO] Saving checkpoint ../models/group2_params/loose_ops/model_step_45000.pt
[2021-05-15 15:57:45,000 INFO] Step 45050/50000; acc:  73.74; ppl:  2.33; xent: 0.84; lr: 0.00010; 5593/2195 tok/s;   8729 sec
[2021-05-15 15:57:54,403 INFO] Step 45100/50000; acc:  73.82; ppl:  2.29; xent: 0.83; lr: 0.00010; 10797/4212 tok/s;   8738 sec
[2021-05-15 15:58:03,644 INFO] Step 45150/50000; acc:  73.61; ppl:  2.35; xent: 0.85; lr: 0.00010; 11108/4376 tok/s;   8748 sec
[2021-05-15 15:58:13,050 INFO] Step 45200/50000; acc:  73.58; ppl:  2.32; xent: 0.84; lr: 0.00010; 10527/4299 tok/s;   8757 sec
[2021-05-15 15:58:16,274 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 15:58:22,680 INFO] Step 45250/50000; acc:  73.24; ppl:  2.35; xent: 0.85; lr: 0.00010; 10887/4209 tok/s;   8767 sec
[2021-05-15 15:58:32,304 INFO] Step 45300/50000; acc:  73.81; ppl:  2.31; xent: 0.84; lr: 0.00010; 10285/4156 tok/s;   8776 sec
[2021-05-15 15:58:42,504 INFO] Step 45350/50000; acc:  73.15; ppl:  2.35; xent: 0.86; lr: 0.00010; 10005/4008 tok/s;   8786 sec
[2021-05-15 15:58:51,993 INFO] Step 45400/50000; acc:  73.51; ppl:  2.32; xent: 0.84; lr: 0.00010; 10746/4274 tok/s;   8796 sec
[2021-05-15 15:59:01,554 INFO] Step 45450/50000; acc:  73.67; ppl:  2.33; xent: 0.85; lr: 0.00010; 10589/4189 tok/s;   8805 sec
[2021-05-15 15:59:10,923 INFO] Step 45500/50000; acc:  73.49; ppl:  2.33; xent: 0.85; lr: 0.00010; 10891/4294 tok/s;   8815 sec
[2021-05-15 15:59:20,389 INFO] Step 45550/50000; acc:  73.66; ppl:  2.32; xent: 0.84; lr: 0.00010; 10676/4204 tok/s;   8824 sec
[2021-05-15 15:59:30,354 INFO] Step 45600/50000; acc:  73.32; ppl:  2.35; xent: 0.86; lr: 0.00010; 10478/4024 tok/s;   8834 sec
[2021-05-15 15:59:40,053 INFO] Step 45650/50000; acc:  73.64; ppl:  2.33; xent: 0.85; lr: 0.00010; 10354/4125 tok/s;   8844 sec
[2021-05-15 15:59:49,476 INFO] Step 45700/50000; acc:  73.89; ppl:  2.31; xent: 0.84; lr: 0.00010; 10801/4315 tok/s;   8853 sec
[2021-05-15 15:59:58,839 INFO] Step 45750/50000; acc:  73.68; ppl:  2.31; xent: 0.84; lr: 0.00010; 10833/4263 tok/s;   8863 sec
[2021-05-15 16:00:08,753 INFO] Step 45800/50000; acc:  73.86; ppl:  2.29; xent: 0.83; lr: 0.00010; 10366/4053 tok/s;   8873 sec
[2021-05-15 16:00:17,695 INFO] Step 45850/50000; acc:  73.89; ppl:  2.31; xent: 0.84; lr: 0.00010; 11339/4385 tok/s;   8882 sec
[2021-05-15 16:00:27,216 INFO] Step 45900/50000; acc:  73.88; ppl:  2.32; xent: 0.84; lr: 0.00010; 10555/4284 tok/s;   8891 sec
[2021-05-15 16:00:30,902 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 16:00:37,070 INFO] Step 45950/50000; acc:  73.48; ppl:  2.33; xent: 0.85; lr: 0.00010; 10510/4105 tok/s;   8901 sec
[2021-05-15 16:00:46,410 INFO] Step 46000/50000; acc:  74.37; ppl:  2.27; xent: 0.82; lr: 0.00010; 10640/4281 tok/s;   8910 sec
[2021-05-15 16:00:56,823 INFO] Step 46050/50000; acc:  73.08; ppl:  2.37; xent: 0.86; lr: 0.00010; 10011/3914 tok/s;   8921 sec
[2021-05-15 16:01:06,436 INFO] Step 46100/50000; acc:  74.00; ppl:  2.28; xent: 0.82; lr: 0.00010; 10225/4265 tok/s;   8930 sec
[2021-05-15 16:01:15,786 INFO] Step 46150/50000; acc:  73.71; ppl:  2.32; xent: 0.84; lr: 0.00010; 10962/4255 tok/s;   8940 sec
[2021-05-15 16:01:25,433 INFO] Step 46200/50000; acc:  74.03; ppl:  2.31; xent: 0.84; lr: 0.00010; 10553/4158 tok/s;   8949 sec
[2021-05-15 16:01:34,824 INFO] Step 46250/50000; acc:  73.73; ppl:  2.31; xent: 0.84; lr: 0.00010; 10822/4274 tok/s;   8959 sec
[2021-05-15 16:01:44,858 INFO] Step 46300/50000; acc:  73.41; ppl:  2.34; xent: 0.85; lr: 0.00010; 10396/3998 tok/s;   8969 sec
[2021-05-15 16:01:54,319 INFO] Step 46350/50000; acc:  74.02; ppl:  2.30; xent: 0.83; lr: 0.00010; 10488/4276 tok/s;   8978 sec
[2021-05-15 16:02:04,104 INFO] Step 46400/50000; acc:  73.75; ppl:  2.32; xent: 0.84; lr: 0.00010; 10669/4074 tok/s;   8988 sec
[2021-05-15 16:02:13,303 INFO] Step 46450/50000; acc:  73.96; ppl:  2.31; xent: 0.84; lr: 0.00010; 10902/4350 tok/s;   8997 sec
[2021-05-15 16:02:23,130 INFO] Step 46500/50000; acc:  74.00; ppl:  2.29; xent: 0.83; lr: 0.00010; 10324/4101 tok/s;   9007 sec
[2021-05-15 16:02:32,742 INFO] Step 46550/50000; acc:  74.16; ppl:  2.28; xent: 0.82; lr: 0.00010; 10662/4186 tok/s;   9017 sec
[2021-05-15 16:02:41,984 INFO] Step 46600/50000; acc:  74.05; ppl:  2.29; xent: 0.83; lr: 0.00010; 11011/4266 tok/s;   9026 sec
[2021-05-15 16:02:51,390 INFO] Step 46650/50000; acc:  74.15; ppl:  2.31; xent: 0.84; lr: 0.00010; 10755/4286 tok/s;   9035 sec
[2021-05-15 16:02:52,634 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 16:03:01,021 INFO] Step 46700/50000; acc:  73.90; ppl:  2.29; xent: 0.83; lr: 0.00010; 10503/4246 tok/s;   9045 sec
[2021-05-15 16:03:10,855 INFO] Step 46750/50000; acc:  73.80; ppl:  2.32; xent: 0.84; lr: 0.00010; 10532/4060 tok/s;   9055 sec
[2021-05-15 16:03:20,941 INFO] Step 46800/50000; acc:  73.74; ppl:  2.30; xent: 0.83; lr: 0.00010; 9724/4059 tok/s;   9065 sec
[2021-05-15 16:03:30,860 INFO] Step 46850/50000; acc:  73.66; ppl:  2.32; xent: 0.84; lr: 0.00010; 10550/4097 tok/s;   9075 sec
[2021-05-15 16:03:39,946 INFO] Step 46900/50000; acc:  74.26; ppl:  2.27; xent: 0.82; lr: 0.00010; 10863/4390 tok/s;   9084 sec
[2021-05-15 16:03:49,519 INFO] Step 46950/50000; acc:  74.07; ppl:  2.31; xent: 0.84; lr: 0.00010; 10626/4193 tok/s;   9093 sec
[2021-05-15 16:03:59,268 INFO] Step 47000/50000; acc:  73.97; ppl:  2.32; xent: 0.84; lr: 0.00010; 10653/4098 tok/s;   9103 sec
[2021-05-15 16:04:09,269 INFO] Step 47050/50000; acc:  73.71; ppl:  2.31; xent: 0.84; lr: 0.00010; 10215/3985 tok/s;   9113 sec
[2021-05-15 16:04:18,942 INFO] Step 47100/50000; acc:  74.02; ppl:  2.30; xent: 0.83; lr: 0.00010; 10606/4181 tok/s;   9123 sec
[2021-05-15 16:04:28,269 INFO] Step 47150/50000; acc:  74.61; ppl:  2.25; xent: 0.81; lr: 0.00010; 10657/4330 tok/s;   9132 sec
[2021-05-15 16:04:38,186 INFO] Step 47200/50000; acc:  73.59; ppl:  2.32; xent: 0.84; lr: 0.00010; 10506/4045 tok/s;   9142 sec
[2021-05-15 16:04:47,831 INFO] Step 47250/50000; acc:  74.27; ppl:  2.28; xent: 0.82; lr: 0.00010; 10455/4133 tok/s;   9152 sec
[2021-05-15 16:04:56,894 INFO] Step 47300/50000; acc:  74.60; ppl:  2.26; xent: 0.82; lr: 0.00010; 11223/4421 tok/s;   9161 sec
[2021-05-15 16:05:06,088 INFO] Step 47350/50000; acc:  73.82; ppl:  2.30; xent: 0.83; lr: 0.00010; 10972/4405 tok/s;   9170 sec
[2021-05-15 16:05:14,018 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 16:05:15,601 INFO] Step 47400/50000; acc:  74.13; ppl:  2.29; xent: 0.83; lr: 0.00010; 10786/4185 tok/s;   9179 sec
[2021-05-15 16:05:25,345 INFO] Step 47450/50000; acc:  73.99; ppl:  2.28; xent: 0.83; lr: 0.00010; 10400/4177 tok/s;   9189 sec
[2021-05-15 16:05:35,462 INFO] Step 47500/50000; acc:  73.98; ppl:  2.30; xent: 0.83; lr: 0.00010; 9981/4003 tok/s;   9199 sec
[2021-05-15 16:05:45,316 INFO] Step 47550/50000; acc:  73.77; ppl:  2.31; xent: 0.84; lr: 0.00010; 10401/4115 tok/s;   9209 sec
[2021-05-15 16:05:54,466 INFO] Step 47600/50000; acc:  74.49; ppl:  2.24; xent: 0.81; lr: 0.00010; 10831/4375 tok/s;   9218 sec
[2021-05-15 16:06:04,384 INFO] Step 47650/50000; acc:  73.68; ppl:  2.33; xent: 0.85; lr: 0.00010; 10523/4100 tok/s;   9228 sec
[2021-05-15 16:06:13,489 INFO] Step 47700/50000; acc:  74.35; ppl:  2.26; xent: 0.82; lr: 0.00010; 10886/4380 tok/s;   9237 sec
[2021-05-15 16:06:23,197 INFO] Step 47750/50000; acc:  74.15; ppl:  2.28; xent: 0.83; lr: 0.00010; 10685/4043 tok/s;   9247 sec
[2021-05-15 16:06:33,496 INFO] Step 47800/50000; acc:  73.45; ppl:  2.33; xent: 0.85; lr: 0.00010; 9990/3991 tok/s;   9257 sec
[2021-05-15 16:06:42,682 INFO] Step 47850/50000; acc:  74.69; ppl:  2.25; xent: 0.81; lr: 0.00010; 10990/4350 tok/s;   9267 sec
[2021-05-15 16:06:52,539 INFO] Step 47900/50000; acc:  74.29; ppl:  2.28; xent: 0.83; lr: 0.00010; 10451/4130 tok/s;   9276 sec
[2021-05-15 16:07:01,910 INFO] Step 47950/50000; acc:  74.57; ppl:  2.24; xent: 0.80; lr: 0.00010; 10574/4171 tok/s;   9286 sec
[2021-05-15 16:07:11,860 INFO] Step 48000/50000; acc:  74.03; ppl:  2.29; xent: 0.83; lr: 0.00010; 10522/4081 tok/s;   9296 sec
[2021-05-15 16:07:20,798 INFO] Step 48050/50000; acc:  74.54; ppl:  2.26; xent: 0.81; lr: 0.00010; 11209/4449 tok/s;   9305 sec
[2021-05-15 16:07:30,313 INFO] Step 48100/50000; acc:  74.30; ppl:  2.28; xent: 0.83; lr: 0.00010; 10595/4228 tok/s;   9314 sec
[2021-05-15 16:07:35,813 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 16:07:39,983 INFO] Step 48150/50000; acc:  74.10; ppl:  2.27; xent: 0.82; lr: 0.00010; 10553/4218 tok/s;   9324 sec
[2021-05-15 16:07:49,820 INFO] Step 48200/50000; acc:  74.01; ppl:  2.29; xent: 0.83; lr: 0.00010; 10480/4085 tok/s;   9334 sec
[2021-05-15 16:08:00,070 INFO] Step 48250/50000; acc:  73.96; ppl:  2.28; xent: 0.82; lr: 0.00010; 9815/3964 tok/s;   9344 sec
[2021-05-15 16:08:09,719 INFO] Step 48300/50000; acc:  74.16; ppl:  2.27; xent: 0.82; lr: 0.00010; 10410/4231 tok/s;   9354 sec
[2021-05-15 16:08:19,280 INFO] Step 48350/50000; acc:  74.46; ppl:  2.27; xent: 0.82; lr: 0.00010; 10771/4187 tok/s;   9363 sec
[2021-05-15 16:08:28,304 INFO] Step 48400/50000; acc:  74.61; ppl:  2.25; xent: 0.81; lr: 0.00010; 10919/4436 tok/s;   9372 sec
[2021-05-15 16:08:38,052 INFO] Step 48450/50000; acc:  73.91; ppl:  2.31; xent: 0.84; lr: 0.00010; 10850/4094 tok/s;   9382 sec
[2021-05-15 16:08:47,729 INFO] Step 48500/50000; acc:  74.59; ppl:  2.25; xent: 0.81; lr: 0.00010; 10338/4153 tok/s;   9392 sec
[2021-05-15 16:08:57,911 INFO] Step 48550/50000; acc:  73.98; ppl:  2.29; xent: 0.83; lr: 0.00010; 10077/3953 tok/s;   9402 sec
[2021-05-15 16:09:07,262 INFO] Step 48600/50000; acc:  74.81; ppl:  2.26; xent: 0.82; lr: 0.00010; 10944/4314 tok/s;   9411 sec
[2021-05-15 16:09:16,654 INFO] Step 48650/50000; acc:  74.64; ppl:  2.25; xent: 0.81; lr: 0.00010; 10797/4280 tok/s;   9421 sec
[2021-05-15 16:09:26,355 INFO] Step 48700/50000; acc:  74.29; ppl:  2.27; xent: 0.82; lr: 0.00010; 10587/4093 tok/s;   9430 sec
[2021-05-15 16:09:35,562 INFO] Step 48750/50000; acc:  75.05; ppl:  2.20; xent: 0.79; lr: 0.00010; 10813/4299 tok/s;   9439 sec
[2021-05-15 16:09:44,898 INFO] Step 48800/50000; acc:  74.10; ppl:  2.29; xent: 0.83; lr: 0.00010; 11097/4331 tok/s;   9449 sec
[2021-05-15 16:09:54,484 INFO] Step 48850/50000; acc:  74.56; ppl:  2.26; xent: 0.81; lr: 0.00010; 10478/4273 tok/s;   9458 sec
[2021-05-15 16:09:57,190 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 16:10:03,716 INFO] Step 48900/50000; acc:  74.57; ppl:  2.24; xent: 0.81; lr: 0.00010; 11012/4337 tok/s;   9468 sec
[2021-05-15 16:10:13,474 INFO] Step 48950/50000; acc:  74.19; ppl:  2.27; xent: 0.82; lr: 0.00010; 10420/4114 tok/s;   9477 sec
[2021-05-15 16:10:23,691 INFO] Step 49000/50000; acc:  74.05; ppl:  2.28; xent: 0.82; lr: 0.00010; 9972/4002 tok/s;   9488 sec
[2021-05-15 16:10:32,968 INFO] Step 49050/50000; acc:  74.75; ppl:  2.24; xent: 0.81; lr: 0.00010; 10886/4347 tok/s;   9497 sec
[2021-05-15 16:10:42,457 INFO] Step 49100/50000; acc:  74.55; ppl:  2.26; xent: 0.81; lr: 0.00010; 10619/4233 tok/s;   9506 sec
[2021-05-15 16:10:51,948 INFO] Step 49150/50000; acc:  74.41; ppl:  2.27; xent: 0.82; lr: 0.00010; 10823/4239 tok/s;   9516 sec
[2021-05-15 16:11:01,314 INFO] Step 49200/50000; acc:  74.65; ppl:  2.25; xent: 0.81; lr: 0.00010; 10789/4269 tok/s;   9525 sec
[2021-05-15 16:11:11,295 INFO] Step 49250/50000; acc:  74.14; ppl:  2.28; xent: 0.82; lr: 0.00010; 10490/4026 tok/s;   9535 sec
[2021-05-15 16:11:20,886 INFO] Step 49300/50000; acc:  74.66; ppl:  2.24; xent: 0.81; lr: 0.00010; 10334/4147 tok/s;   9545 sec
[2021-05-15 16:11:30,403 INFO] Step 49350/50000; acc:  74.56; ppl:  2.26; xent: 0.81; lr: 0.00010; 10803/4260 tok/s;   9554 sec
[2021-05-15 16:11:39,997 INFO] Step 49400/50000; acc:  74.49; ppl:  2.26; xent: 0.82; lr: 0.00010; 10630/4204 tok/s;   9564 sec
[2021-05-15 16:11:49,680 INFO] Step 49450/50000; acc:  75.07; ppl:  2.22; xent: 0.80; lr: 0.00010; 10527/4165 tok/s;   9574 sec
[2021-05-15 16:11:58,536 INFO] Step 49500/50000; acc:  74.62; ppl:  2.24; xent: 0.81; lr: 0.00010; 11596/4376 tok/s;   9582 sec
[2021-05-15 16:12:07,952 INFO] Step 49550/50000; acc:  74.72; ppl:  2.23; xent: 0.80; lr: 0.00010; 10459/4304 tok/s;   9592 sec
[2021-05-15 16:12:11,447 INFO] Loading ParallelCorpus(../data/small/abstract_methods/train_buggy.txt, ../data/small/edit_ops/typed/loose/train.txt, align=None)...
[2021-05-15 16:12:17,747 INFO] Step 49600/50000; acc:  74.10; ppl:  2.27; xent: 0.82; lr: 0.00010; 10715/4186 tok/s;   9602 sec
[2021-05-15 16:12:27,422 INFO] Step 49650/50000; acc:  74.99; ppl:  2.22; xent: 0.80; lr: 0.00010; 10376/4103 tok/s;   9611 sec
[2021-05-15 16:12:37,056 INFO] Step 49700/50000; acc:  74.50; ppl:  2.26; xent: 0.82; lr: 0.00010; 10456/4224 tok/s;   9621 sec
[2021-05-15 16:12:46,822 INFO] Step 49750/50000; acc:  74.48; ppl:  2.24; xent: 0.81; lr: 0.00010; 10377/4226 tok/s;   9631 sec
[2021-05-15 16:12:56,137 INFO] Step 49800/50000; acc:  74.28; ppl:  2.26; xent: 0.81; lr: 0.00010; 10990/4248 tok/s;   9640 sec
[2021-05-15 16:13:05,467 INFO] Step 49850/50000; acc:  74.58; ppl:  2.24; xent: 0.81; lr: 0.00010; 10786/4279 tok/s;   9649 sec
[2021-05-15 16:13:15,008 INFO] Step 49900/50000; acc:  74.55; ppl:  2.24; xent: 0.81; lr: 0.00010; 10642/4235 tok/s;   9659 sec
[2021-05-15 16:13:25,196 INFO] Step 49950/50000; acc:  73.95; ppl:  2.29; xent: 0.83; lr: 0.00010; 10290/3926 tok/s;   9669 sec
[2021-05-15 16:13:34,618 INFO] Step 50000/50000; acc:  74.85; ppl:  2.22; xent: 0.80; lr: 0.00005; 10521/4276 tok/s;   9678 sec
[2021-05-15 16:13:34,624 INFO] Loading ParallelCorpus(../data/small/abstract_methods/valid_buggy.txt, ../data/small/edit_ops/typed/loose/valid.txt, align=None)...
[2021-05-15 16:13:42,545 INFO] Validation perplexity: 3.17754
[2021-05-15 16:13:42,545 INFO] Validation accuracy: 67.5142
[2021-05-15 16:13:42,547 INFO] Saving checkpoint ../models/group2_params/loose_ops/model_step_50000.pt