-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
data #43
Comments
Hi- the format description of these files are given here: https://github.com/Georgetown-IR-Lab/cedr#getting-started In short, training pairs are sampled from lines like |
Does the .run and .pair files need to be built manually or automatically by running some program? |
There is also an integration plugin for CEDR using PyTerrier - see |
@wangxinzhe123 -- ultimately how you construct these files depends on your experimental setup. The main questions are:
|
Excuse me, can you provide the index file containing the indexbuildindex parameter? |
That again depends on what experiment you're running -- especially since you mention that you're running it with different datasets. Since you brought up Indri, here's documentation on it: https://sourceforge.net/p/lemur/wiki/IndriBuildIndex%20Parameters/ I'm not very familiar with Indri, however. I'm happy to help out using PyTerrier though -- especially if you provide some details on what you're trying to do. Here's the documentation on indexing: https://pyterrier.readthedocs.io/en/latest/terrier-indexing.html |
Because I want to run this code with other data sets, how can I get .run and .pair files similar to those in /data?
The text was updated successfully, but these errors were encountered: