Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Exclude dirs depending on dir full name (relative to root) #561

Merged
merged 3 commits into from
Jul 31, 2018

Conversation

dadoonet
Copy link
Owner

@dadoonet dadoonet commented Jul 23, 2018

For example if I have a directory /mypath/folder with subdirectories

/mypath/folder
├── folderA
│   ├── subfolderA
│   ├── subfolderB
│   └── subfolderC
├── folderB
│   ├── subfolderA
│   ├── subfolderB
│   └── subfolderC
└── folderC
    ├── subfolderA
    ├── subfolderB
    └── subfolderC

I would like to be able to start crawling at /mypath/folder and crawl everything except /folderB/subfolderB for example.

I would like to be able to put "excludes": ["/folderB/subfolderB"] or even a wildcard like "excludes": ["/folderB/subfolder*"].

Closes #553.

@dadoonet dadoonet added update When updating an existing feature breaking For breaking changes labels Jul 23, 2018
@dadoonet dadoonet added this to the 2.5 milestone Jul 23, 2018
@dadoonet dadoonet self-assigned this Jul 23, 2018
dadoonet added 2 commits July 28, 2018 10:18
For example if I have a directory /mypath/folder with subdirectories

```
/mypath/folder
├── folderA
│   ├── subfolderA
│   ├── subfolderB
│   └── subfolderC
├── folderB
│   ├── subfolderA
│   ├── subfolderB
│   └── subfolderC
└── folderC
    ├── subfolderA
    ├── subfolderB
    └── subfolderC
```

I would like to be able to start crawling at `/mypath/folder` and crawl everything except `/folderB/subfolderB` for example.

I would like to be able to put `"excludes": ["/folderB/subfolderB"]` or even a wildcard like `"excludes": ["/folderB/subfolder*"]`.

Closes #553.
@dadoonet dadoonet force-pushed the fix/553-exclude-dirs branch from 2fd37fb to 812fe7d Compare July 28, 2018 08:19
# Conflicts:
#	core/src/main/java/fr/pilato/elasticsearch/crawler/fs/FsParser.java
@dadoonet dadoonet merged commit dad9572 into master Jul 31, 2018
@dadoonet dadoonet deleted the fix/553-exclude-dirs branch July 31, 2018 09:06
@dadoonet dadoonet mentioned this pull request Aug 2, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
breaking For breaking changes update When updating an existing feature
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant