Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Translation: Library Carpentry OpenRefine Episode 5 #160

Open
joelnitta opened this issue Aug 5, 2021 · 0 comments
Open

Translation: Library Carpentry OpenRefine Episode 5 #160

joelnitta opened this issue Aug 5, 2021 · 0 comments

Comments

@joelnitta
Copy link
Member

Translation: Library Carpentry OpenRefine Episode 5

Original lesson: https://librarycarpentry.org/lc-open-refine/05-clustering/index.html


  • PO fileの編集についてはREADMEをご参照ください
  • あなたが翻訳し始めたと分かるように、翻訳を始める前にこのイシューを自分に割り振って(assign)、あるいはコメントを書いて下さい。
  • 翻訳が終わったら、プルリクエストを開いて、このイシューを参照・リンクして下さい。

i18n/po/lc-open-refine.ja.po

Lines 1227 to 1285 in bb59a56

#: lc-open-refine/_episodes/05-clustering.md:1
# Front Matter
msgid "---\n"
"title: \"Clustering\"\n"
"teaching: 10\n"
"exercises: 10\n"
"questions:\n"
"- \"What is Clustering in OpenRefine and when would you use it?\"\n"
"- \"How does clustering work in OpenRefine?\"\n"
"objectives:\n"
"- \"Explain what clustering is in OpenRefine\"\n"
"- \"Use clustering to identify and fix replace varying forms of the same data with a single consistent value\"\n"
"keypoints:\n"
"- \"Clustering is a way of finding variant forms of the same piece of data within a dataset (e.g. different spellings of a name)\"\n"
"- \"There are a number of different Clustering algorithms that work in different ways and will produce different results\"\n"
"- \"The best clustering algorithm to use will depend on the data\"\n"
"- \"Using clustering you can replace varying forms of the same data with a single consistent value\"\n"
"---"
msgstr ""
#: lc-open-refine/_episodes/05-clustering.md:18
#: lc-open-refine/files/draft-instructor-notes.md:94
# header
msgid "## Clustering"
msgstr ""
#: lc-open-refine/_episodes/05-clustering.md:19
msgid "The Cluster function groups together similar, but inconsistent values in a given column and lets you merge these inconsistent values into a single value you choose."
msgstr ""
#: lc-open-refine/_episodes/05-clustering.md:21
msgid "This is very effective where you have data with minor variations in data values, e.g. names of people, organisations, places, classification terms."
msgstr ""
#: lc-open-refine/_episodes/05-clustering.md:23
msgid "To use the 'Cluster' function, click on the `Edit Cells` menu option in the relevant column and choose `Cluster and edit...`"
msgstr ""
#: lc-open-refine/_episodes/05-clustering.md:25
msgid "The 'Clusters' are created automatically according to an algorithm. OpenRefine supports a number of different clustering algorithms - some experimentation may be required to see which clustering algorithm works best with any particular set of data, and you may find that using different algorithms highlights different clusters."
msgstr ""
#: lc-open-refine/_episodes/05-clustering.md:27
msgid "For more information on the methods used to create Clusters, see [https://github.com/OpenRefine/OpenRefine/wiki/Clustering-In-Depth](https://github.com/OpenRefine/OpenRefine/wiki/Clustering-In-Depth)"
msgstr ""
#: lc-open-refine/_episodes/05-clustering.md:29
msgid "For each cluster, you have the option of 'merging' the values together - that is, replace the various inconsistent values with a single consistent value. By default, OpenRefine uses the most common value in the cluster as the new value, but you can select another value by clicking the value itself, or you can simply type the desired value into the 'New Cell Value' box."
msgstr ""
#: lc-open-refine/_episodes/05-clustering.md:31
msgid ">## Use Clustering to clean up author data\n"
">\n"
">1. Split out the author names into individual cells using `Edit cells -> Split multi-valued cells`, using the pipe ( \\| ) character as the separator\n"
">2. Choose `Edit cells -> Cluster and edit` from the 'author' column.\n"
">3. Using the `key collision` method with the `fingerprint` Keying Function, work through the clusters of values, merging them to a single value where appropriate\n"
">4. Try changing the clustering method being used - which ones work well?"
msgstr ""

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant