Does this project have this function ? #3162

SeekPoint · 2020-03-06T17:33:11Z

🚀 Feature request

Can we use this project to calculate the probability that an input text is a real/reasonable sentence, based on the corpus we trained on?

frankniujc · 2020-03-19T20:05:13Z

#2311

SeekPoint · 2020-03-20T03:27:36Z

@frankniujc That is helpful,
but maybe a better way is to take all the tokens as a whole, rather than predicting the next token one at a time.

frankniujc · 2020-03-20T16:09:33Z

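Scoring the tokens as a whole and predicting the next token are the same thing: by the chain rule, the probability of a sentence is P(s0 s1 s2 ... sn) = P(s1|s0) * P(s2|s0 s1) * P(s3|s0 s1 s2) * ... * P(sn|s0 s1 s2 ... sn-1), where s0 is the beginning-of-sequence token, so P(s0) = 1.

So you can do something like this (it sums log-probabilities, so it returns the log of the sentence probability; exponentiate the result if you need the probability itself):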

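import torch
import torch.nn.functional as F
from transformers import GPT2LMHeadModel, GPT2Tokenizer

# Setup the function relies on; the 'gpt2' checkpoint is an assumption.
device = 'cuda' if torch.cuda.is_available() else 'cpu'
tokenizer = GPT2Tokenizer.from_pretrained('gpt2')
model = GPT2LMHeadModel.from_pretrained('gpt2').to(device).eval()

def sentence_probability(sent):
    # Returns the log-probability of `sent` (sum of per-token log-probs).
    bos = tokenizer.encode('<|endoftext|>')
    tokens = bos + tokenizer.encode(sent)
    input_ids = torch.tensor(tokens).unsqueeze(0).to(device)

    sent_log_probs = []
    with torch.no_grad():
        for i, next_word in enumerate(tokens[1:]):
            # Logits for the token that follows the prefix tokens[:i+1].
            next_word_logits = model(input_ids[:, :i+1])[0][0, -1]
            next_word_log_prob = F.log_softmax(next_word_logits, dim=0)[next_word].item()
            sent_log_probs.append(next_word_log_prob)

    return sum(sent_log_probs)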
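
To make the arithmetic concrete without loading a model, here is a minimal sketch of the same chain-rule sum on made-up numbers. The helper names `log_softmax` and `sequence_log_prob`, the three-token vocabulary, and the logits are all hypothetical and only illustrate how per-token log-probabilities add up to a sentence log-probability.

```python
import math

def log_softmax(logits):
    """Turn raw scores into log-probabilities (numerically stable)."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]

def sequence_log_prob(step_logits, target_ids):
    """Sum log P(s_i | s_0 ... s_{i-1}) over all positions.

    step_logits[i] are the scores over the vocabulary for the token that
    follows the i-th prefix; target_ids[i] is the token actually observed.
    """
    if len(step_logits) != len(target_ids):
        raise ValueError("need one row of logits per target token")
    return sum(log_softmax(row)[tok] for row, tok in zip(step_logits, target_ids))

# Toy data (assumed): vocabulary of 3 tokens, 2 tokens after the BOS token.
step_logits = [
    [0.0, 0.0, 0.0],                                # uniform: each token has P = 1/3
    [math.log(1.0), math.log(3.0), math.log(4.0)],  # P = 1/8, 3/8, 4/8
]
target_ids = [2, 1]

log_p = sequence_log_prob(step_logits, target_ids)
prob = math.exp(log_p)  # (1/3) * (3/8) = 0.125
print(log_p, prob)
```

Working in log space avoids underflow for long sentences, since multiplying many small probabilities directly quickly rounds to zero.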

simonepri · 2020-04-06T23:19:20Z

@lovejasmine Have a look at lm-scorer.

It is a tiny wrapper around transformers that I wrote. It lets you get sentence probabilities from models that support it (only GPT2 models are implemented at the time of writing).

stale · 2020-06-06T00:11:08Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale bot added the wontfix label Jun 6, 2020

stale bot closed this as completed Jun 13, 2020
