
add detailed documents for beam search implementation #535

Closed
shawnthu opened this issue Feb 27, 2019 · 9 comments

@shawnthu

The SequenceGenerator class is very hard to understand. Can someone provide detailed documentation?
e.g.

        # get the top beam_size active hypotheses, which are just the hypos
        # with the smallest values in active_mask
        active_hypos, _ignore = buffer('active_hypos'), buffer('_ignore')  # [b, k]
        torch.topk(
            active_mask, k=beam_size, dim=1, largest=False,
            out=(_ignore, active_hypos)
        )

        active_bbsz_idx = buffer('active_bbsz_idx')
        torch.gather(
            cand_bbsz_idx, dim=1, index=active_hypos,
            out=active_bbsz_idx,
        )
        active_scores = torch.gather(
            cand_scores, dim=1, index=active_hypos,
            out=scores[:, step].view(bsz, beam_size),
        )

        active_bbsz_idx = active_bbsz_idx.view(-1)
        active_scores = active_scores.view(-1)

        # copy tokens and scores for active hypotheses
        torch.index_select(
            tokens[:, :step + 1], dim=0, index=active_bbsz_idx,
            out=tokens_buf[:, :step + 1],
        )
        torch.gather(
            cand_indices, dim=1, index=active_hypos,
            out=tokens_buf.view(bsz, beam_size, -1)[:, :, step + 1],
        )
        if step > 0:
            torch.index_select(
                scores[:, :step], dim=0, index=active_bbsz_idx,
                out=scores_buf[:, :step],
            )
        torch.gather(
            cand_scores, dim=1, index=active_hypos,
            out=scores_buf.view(bsz, beam_size, -1)[:, :, step],
        )

The above vectorized code almost drives me crazy. I know it speeds up computation, but at the cost of readability.
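For readers hitting the same wall, here is a hedged toy reproduction of the selection step quoted above. The tensors and the simplified `active_mask` construction are hypothetical (fairseq builds `active_mask` from candidate offsets and an eos mask in a slightly different way), but the `topk`/`gather` mechanics are the same:

```python
import torch

# Toy setup: bsz=2 sentences, beam_size=2, so 2*beam_size candidates each.
bsz, beam_size = 2, 2
cand_scores = torch.tensor([[0.9, 0.8, 0.7, 0.6],
                            [0.5, 0.4, 0.3, 0.2]])
# which candidates ended in eos (hypothetical mask for illustration)
eos_mask = torch.tensor([[True, False, False, False],
                         [False, False, True, False]])

# Simplified active_mask: the candidate's rank, plus a large penalty for
# eos-finished candidates so that topk(..., largest=False) skips them.
active_mask = torch.arange(2 * beam_size).repeat(bsz, 1) \
    + eos_mask.long() * (2 * beam_size)

# The beam_size candidates with the smallest mask values are exactly the
# highest-ranked candidates that did NOT end in eos.
_, active_hypos = torch.topk(active_mask, k=beam_size, dim=1, largest=False)
active_scores = torch.gather(cand_scores, dim=1, index=active_hypos)

print(active_hypos.tolist())   # [[1, 2], [0, 1]]
```

In the first sentence, candidate 0 ended in eos, so candidates 1 and 2 are selected; in the second sentence, candidate 2 ended in eos, so candidates 0 and 1 survive.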

@myleott

myleott commented Feb 28, 2019

We plan to release a simpler (albeit much slower) implementation as an alternative soon. If you want to modify the search code, the simplified version may be a bit easier to work with.

Or, are you trying to understand the fast version? If so, I highly recommend putting a break point (pdb) and stepping through it line by line with a batch of data to see what’s happening. The code you referenced is simply selecting the topk hypotheses that don’t end in eos, and adding the selected tokens/scores to the final tokens/scores tensors.
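The unvectorized logic described here can be sketched as a plain per-step loop. This is a hedged, minimal sketch (the `next_log_probs(prefix)` callable is a hypothetical interface, not fairseq's API), showing how the top-k non-eos hypotheses stay in the beam while eos-terminated ones move to a finished pool:

```python
def beam_search(next_log_probs, beam_size, eos, max_len):
    """Minimal unbatched beam search sketch. next_log_probs(prefix)
    returns a dict {token: log_prob} for the next token (hypothetical
    interface, not fairseq's)."""
    beams = [((), 0.0)]          # (token prefix, cumulative log-prob)
    finished = []
    for _ in range(max_len):
        # expand every live hypothesis by every possible next token
        candidates = [(prefix + (tok,), score + lp)
                      for prefix, score in beams
                      for tok, lp in next_log_probs(prefix).items()]
        candidates.sort(key=lambda c: c[1], reverse=True)
        # keep the top beam_size hypotheses that don't end in eos;
        # eos-terminated candidates go to the finished pool instead
        beams = []
        for prefix, score in candidates:
            if prefix[-1] == eos:
                finished.append((prefix, score))
            elif len(beams) < beam_size:
                beams.append((prefix, score))
        if not beams:
            break
    return max(finished + beams, key=lambda c: c[1])

def toy(prefix):
    # deterministic toy "model": token 1 is likely first, then eos
    if not prefix:
        return {1: -0.1, 2: -2.0, 0: -3.0}
    return {0: -0.1, 1: -1.0, 2: -1.0}

tokens, score = beam_search(toy, beam_size=2, eos=0, max_len=3)
# tokens == (1, 0): emit token 1, then eos, with score -0.2
```

The vectorized fairseq code does the same selection, but across the whole batch at once with `topk`/`gather` instead of a Python loop.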

@shawnthu

shawnthu commented Mar 5, 2019

> We plan to release a simpler (albeit much slower) implementation as an alternative soon. If you want to modify the search code, the simplified version may be a bit easier to work with.
>
> Or, are you trying to understand the fast version? If so, I highly recommend putting a break point (pdb) and stepping through it line by line with a batch of data to see what’s happening. The code you referenced is simply selecting the topk hypotheses that don’t end in eos, and adding the selected tokens/scores to the final tokens/scores tensors.

thx O(∩_∩)O

@ashutoshsaboo

Hi, has there been any further update on the more understandable beam search than the one from SequenceGenerator?
Also, what is SequenceScorer doing exactly? From the code, if I'm not wrong, is it just a normal forward-pass inference, equivalent to beam_size=1?
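For what it's worth, "scoring" a fixed target sequence generally means a single teacher-forced forward pass that sums the log-probability the model assigns to each reference token, rather than a search. A minimal sketch (the `score_sequence` helper is hypothetical, not fairseq's actual SequenceScorer interface):

```python
import math
import torch
import torch.nn.functional as F

def score_sequence(logits, target):
    """Sum of the model's log-probabilities for the reference tokens.
    logits: [seq_len, vocab] from one teacher-forced pass;
    target: [seq_len] reference token ids.
    Hypothetical helper, not fairseq's SequenceScorer API."""
    lprobs = F.log_softmax(logits, dim=-1)
    token_lprobs = lprobs.gather(1, target.unsqueeze(1)).squeeze(1)
    return token_lprobs.sum()

# uniform logits over a 4-token vocab -> each token scores log(1/4)
logits = torch.zeros(3, 4)
target = torch.tensor([0, 1, 2])
total = score_sequence(logits, target)   # ≈ 3 * log(1/4)
```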

@myleott

myleott commented Apr 23, 2019

Someone wrote a really nice tutorial about the beam search implementation in fairseq: http://www.telesens.co/2019/04/21/understanding-incremental-decoding-in-fairseq/

We have a slightly simpler beam search implementation, but we'd like to simplify it even further (by removing all batching) before releasing it. In any case this will be much slower than the current vectorized implementation.

@villmow

villmow commented May 29, 2019

Any updates on the release of the simpler beam search implementation?

@myleott

myleott commented May 30, 2019

@villmow, take a look here: 20bbbdc

It should be a drop-in replacement for SequenceGenerator. It's still batched, but removes a lot of the other complexity. Happy to take a PR if you want to take a stab at integrating this more cleanly or simplifying it further.

@hokkaido

I echo @shawnthu's sentiment: the current SequenceGenerator class, and especially its generate method, is very hard to dissect for an outsider (like me).

@stale

stale bot commented Apr 17, 2022

This issue has been automatically marked as stale. If this issue is still affecting you, please leave any comment (for example, "bump"), and we'll keep it open. We are sorry that we haven't been able to prioritize it yet. If you have any new additional information, please include it with your comment!

@stale stale bot added the stale label Apr 17, 2022
@stale

stale bot commented Apr 27, 2022

Closing this issue after a prolonged period of inactivity. If this issue is still present in the latest release, please create a new issue with up-to-date information. Thank you!

@stale stale bot closed this as completed Apr 27, 2022