
Given a large text corpus, find the longest common substring that is repeated most often.


reisepass/K-Longest-Repeating-Substrings



The sample corpus I worked on in the R file consists of Swiss Supreme Court rulings, which frequently copy exact paragraphs from past decisions. The goal was to understand whether the law is getting more complex over time.

The open data can be found here: https://www.bger.ch/ext/eurospider/live/de/php/clir/http/index_atf.php?lang=de
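
To make the task concrete, here is a minimal sketch of one way to find the K longest repeated substrings in a text. It is written in Python for illustration and is not the R code in this repository; the function name `longest_repeated_substrings` and the parameters `k` and `min_len` are hypothetical. It sorts all suffixes naively and compares neighbors, which is fine for small inputs but would need a proper suffix array or suffix tree for a corpus the size of the court rulings.

```python
def longest_repeated_substrings(text, k=3, min_len=2):
    """Return up to k (substring, count) pairs, longest first.

    Illustrative sketch only: the suffix sort below is naive and
    copies every suffix, so it does not scale to a large corpus.
    """
    n = len(text)
    # Naive suffix array: start positions sorted by the suffix they begin.
    suffixes = sorted(range(n), key=lambda i: text[i:])

    def lcp(a, b):
        # Length of the longest common prefix of the suffixes at a and b.
        length = 0
        while (a + length < n and b + length < n
               and text[a + length] == text[b + length]):
            length += 1
        return length

    # Any prefix shared by two adjacent sorted suffixes occurs at least twice.
    candidates = set()
    for a, b in zip(suffixes, suffixes[1:]):
        length = lcp(a, b)
        if length >= min_len:
            candidates.add(text[a:a + length])

    def count_overlapping(sub):
        # Count occurrences of sub in text, allowing overlaps.
        count, start = 0, text.find(sub)
        while start != -1:
            count += 1
            start = text.find(sub, start + 1)
        return count

    # Longest first; ties broken by how often the substring repeats.
    ranked = sorted(candidates,
                    key=lambda s: (-len(s), -count_overlapping(s), s))
    return [(s, count_overlapping(s)) for s in ranked[:k]]


if __name__ == "__main__":
    print(longest_repeated_substrings("banana", k=2))
    # [('ana', 2), ('na', 2)]
```

For rulings that copy whole paragraphs, `min_len` would presumably be set much higher (for example, a few hundred characters) so that only substantial copied passages are reported.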
