Skip to content

Commit

Permalink
Merge pull request ClickHouse#71730 from rschu1ze/docs-cleanup
Browse files Browse the repository at this point in the history
Fix trash in the docs, pt. II
  • Loading branch information
rschu1ze authored Nov 11, 2024
2 parents 2748e76 + 5aa9e64 commit d040513
Show file tree
Hide file tree
Showing 10 changed files with 17 additions and 245 deletions.
2 changes: 1 addition & 1 deletion docs/en/getting-started/example-datasets/tpch.md
Original file line number Diff line number Diff line change
Expand Up @@ -46,7 +46,7 @@ Detailed table sizes with scale factor 100:
| orders | 150.000.000 | 6.15 GB |
| lineitem | 600.00.00 | 26.69 GB |

(The table sizes in ClickHouse are taken from `system.tables.total_bytes` and based on below table definitions.
(Compressed sizes in ClickHouse are taken from `system.tables.total_bytes` and based on below table definitions.)

Now create tables in ClickHouse.

Expand Down
113 changes: 1 addition & 112 deletions docs/en/sql-reference/aggregate-functions/reference/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -7,115 +7,4 @@ toc_hidden: true

# List of Aggregate Functions

ClickHouse supports all standard SQL functions (sum, avg, min, max, count) and a wide range of aggregate functions for various applications:

- [aggThrow](../reference/aggthrow.md)
- [analysisOfVariance](../reference/analysis_of_variance.md)
- [anyHeavy](../reference/anyheavy.md)
- [anyLast](../reference/anylast.md)
- [any](../reference/any.md)
- [argMax](../reference/argmax.md)
- [argMin](../reference/argmin.md)
- [avgWeighted](../reference/avgweighted.md)
- [avg](../reference/avg.md)
- [boundingRatio](../reference/boundrat.md)
- [categoricalInformationValue](../reference/categoricalinformationvalue.md)
- [contingency](../reference/contingency.md)
- [corrMatrix](../reference/corrmatrix.md)
- [corr](../reference/corr.md)
- [corr](../reference/corrstable.md)
- [count](../reference/count.md)
- [covarPopMatrix](../reference/covarpopmatrix.md)
- [covarPop](../reference/covarpop.md)
- [covarSampMatrix](../reference/covarsampmatrix.md)
- [covarSampStable](../reference/covarsampstable.md)
- [covarSamp](../reference/covarsamp.md)
- [covarStable](../reference/covarpopstable.md)
- [cramersVBiasCorrected](../reference/cramersvbiascorrected.md)
- [cramersV](../reference/cramersv.md)
- [deltaSumTimestamp](../reference/deltasumtimestamp.md)
- [deltaSum](../reference/deltasum.md)
- [entropy](../reference/entropy.md)
- [exponentialMovingAverage](../reference/exponentialmovingaverage.md)
- [first_value](../reference/first_value.md)
- [flameGraph](../reference/flame_graph.md)
- [groupArrayInsertAt](../reference/grouparrayinsertat.md)
- [groupArrayIntersect](../reference/grouparrayintersect.md)
- [groupArrayLast](../reference/grouparraylast.md)
- [groupArrayMovingAvg](../reference/grouparraymovingavg.md)
- [groupArrayMovingSum](../reference/grouparraymovingsum.md)
- [groupArraySample](../reference/grouparraysample.md)
- [groupArraySorted](../reference/grouparraysorted.md)
- [groupArray](../reference/grouparray.md)
- [groupBitAnd](../reference/groupbitand.md)
- [groupBitOr](../reference/groupbitor.md)
- [groupBitXor](../reference/groupbitxor.md)
- [groupBitmapAnd](../reference/groupbitmapand.md)
- [groupBitmapOr](../reference/groupbitmapor.md)
- [groupBitmapXor](../reference/groupbitmapxor.md)
- [groupBitmap](../reference/groupbitmap.md)
- [groupUniqArray](../reference/groupuniqarray.md)
- [intervalLengthSum](../reference/intervalLengthSum.md)
- [kolmogorovSmirnovTest](../reference/kolmogorovsmirnovtest.md)
- [kurtPop](../reference/kurtpop.md)
- [kurtSamp](../reference/kurtsamp.md)
- [largestTriangleThreeBuckets](../reference/largestTriangleThreeBuckets.md)
- [last_value](../reference/last_value.md)
- [mannwhitneyutest](../reference/mannwhitneyutest.md)
- [maxIntersectionsPosition](../reference/maxintersectionsposition.md)
- [maxIntersections](../reference/maxintersections.md)
- [maxMap](../reference/maxmap.md)
- [max](../reference/max.md)
- [meanZTest](../reference/meanztest.md)
- [median](../reference/median.md)
- [minMap](../reference/minmap.md)
- [min](../reference/min.md)
- [quantileBFloat16Weighted](../reference/quantilebfloat16.md#quantilebfloat16weighted)
- [quantileBFloat16](../reference/quantilebfloat16.md#quantilebfloat16)
- [quantileDD](../reference/quantileddsketch.md#quantileddsketch)
- [quantileDeterministic](../reference/quantiledeterministic.md)
- [quantileExactHigh](../reference/quantileexact.md#quantileexacthigh)
- [quantileExactLow](../reference/quantileexact.md#quantileexactlow)
- [quantileExactWeighted](../reference/quantileexactweighted.md)
- [quantileExact](../reference/quantileexact.md)
- [quantileGK](../reference/quantileGK.md)
- [quantileInterpolatedWeighted](../reference/quantileinterpolatedweighted.md)
- [quantileTDigestWeighted](../reference/quantiletdigestweighted.md)
- [quantileTDigest](../reference/quantiletdigest.md)
- [quantileTimingWeighted](../reference/quantiletimingweighted.md)
- [quantileTiming](../reference/quantiletiming.md)
- [quantile](../reference/quantile.md)
- [quantiles](../reference/quantiles.md)
- [rankCorr](../reference/rankCorr.md)
- [simpleLinearRegression](../reference/simplelinearregression.md)
- [singleValueOrNull](../reference/singlevalueornull.md)
- [skewPop](../reference/skewpop.md)
- [skewSamp](../reference/skewsamp.md)
- [sparkBar](../reference/sparkbar.md)
- [stddevPopStable](../reference/stddevpopstable.md)
- [stddevPop](../reference/stddevpop.md)
- [stddevSampStable](../reference/stddevsampstable.md)
- [stddevSamp](../reference/stddevsamp.md)
- [stochasticLinearRegression](../reference/stochasticlinearregression.md)
- [stochasticLogisticRegression](../reference/stochasticlogisticregression.md)
- [studentTTest](../reference/studentttest.md)
- [sumCount](../reference/sumcount.md)
- [sumKahan](../reference/sumkahan.md)
- [sumMapFilteredWithOverflow](../parametric-functions.md/#summapfilteredwithoverflow)
- [sumMapFiltered](../parametric-functions.md/#summapfiltered)
- [sumMapWithOverflow](../reference/summapwithoverflow.md)
- [sumMap](../reference/summap.md)
- [sumWithOverflow](../reference/sumwithoverflow.md)
- [sum](../reference/sum.md)
- [theilsU](../reference/theilsu.md)
- [topKWeighted](../reference/topkweighted.md)
- [topK](../reference/topk.md)
- [uniqCombined64](../reference/uniqcombined64.md)
- [uniqCombined](../reference/uniqcombined.md)
- [uniqExact](../reference/uniqexact.md)
- [uniqHLL12](../reference/uniqhll12.md)
- [uniqTheta](../reference/uniqthetasketch.md)
- [uniq](../reference/uniq.md)
- [varPop](../reference/varpop.md)
- [varSamp](../reference/varsamp.md)
- [welchTTest](../reference/welchttest.md)
ClickHouse supports all standard SQL aggregate functions ([sum](../reference/sum.md), [avg](../reference/avg.md), [min](../reference/min.md), [max](../reference/max.md), [count](../reference/count.md)), as well as a wide range of other aggregate functions.
4 changes: 3 additions & 1 deletion docs/en/sql-reference/data-types/aggregatefunction.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,9 @@ sidebar_label: AggregateFunction

# AggregateFunction

Aggregate functions can have an implementation-defined intermediate state that can be serialized to an `AggregateFunction(...)` data type and stored in a table, usually, by means of [a materialized view](../../sql-reference/statements/create/view.md). The common way to produce an aggregate function state is by calling the aggregate function with the `-State` suffix. To get the final result of aggregation in the future, you must use the same aggregate function with the `-Merge`suffix.
Aggregate functions have an implementation-defined intermediate state that can be serialized to an `AggregateFunction(...)` data type and stored in a table, usually, by means of [a materialized view](../../sql-reference/statements/create/view.md).
The common way to produce an aggregate function state is by calling the aggregate function with the `-State` suffix.
To get the final result of aggregation in the future, you must use the same aggregate function with the `-Merge`suffix.

`AggregateFunction(name, types_of_arguments...)` — parametric data type.

Expand Down
29 changes: 4 additions & 25 deletions docs/en/sql-reference/data-types/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,29 +6,8 @@ sidebar_position: 1

# Data Types in ClickHouse

ClickHouse can store various kinds of data in table cells. This section describes the supported data types and special considerations for using and/or implementing them if any.
This section describes the data types supported by ClickHouse, for example [integers](int-uint.md), [floats](float.md) and [strings](string.md).

:::note
You can check whether a data type name is case-sensitive in the [system.data_type_families](../../operations/system-tables/data_type_families.md#system_tables-data_type_families) table.
:::

ClickHouse data types include:

- **Integer types**: [signed and unsigned integers](./int-uint.md) (`UInt8`, `UInt16`, `UInt32`, `UInt64`, `UInt128`, `UInt256`, `Int8`, `Int16`, `Int32`, `Int64`, `Int128`, `Int256`)
- **Floating-point numbers**: [floats](./float.md)(`Float32` and `Float64`) and [`Decimal` values](./decimal.md)
- **Boolean**: ClickHouse has a [`Boolean` type](./boolean.md)
- **Strings**: [`String`](./string.md) and [`FixedString`](./fixedstring.md)
- **Dates**: use [`Date`](./date.md) and [`Date32`](./date32.md) for days, and [`DateTime`](./datetime.md) and [`DateTime64`](./datetime64.md) for instances in time
- **Object**: the [`Object`](./json.md) stores a JSON document in a single column (deprecated)
- **JSON**: the [`JSON` object](./newjson.md) stores a JSON document in a single column
- **UUID**: a performant option for storing [`UUID` values](./uuid.md)
- **Low cardinality types**: use an [`Enum`](./enum.md) when you have a handful of unique values, or use [`LowCardinality`](./lowcardinality.md) when you have up to 10,000 unique values of a column
- **Arrays**: any column can be defined as an [`Array` of values](./array.md)
- **Maps**: use [`Map`](./map.md) for storing key/value pairs
- **Aggregation function types**: use [`SimpleAggregateFunction`](./simpleaggregatefunction.md) and [`AggregateFunction`](./aggregatefunction.md) for storing the intermediate status of aggregate function results
- **Nested data structures**: A [`Nested` data structure](./nested-data-structures/index.md) is like a table inside a cell
- **Tuples**: A [`Tuple` of elements](./tuple.md), each having an individual type.
- **Nullable**: [`Nullable`](./nullable.md) allows you to store a value as `NULL` when a value is "missing" (instead of the column settings its default value for the data type)
- **IP addresses**: use [`IPv4`](./ipv4.md) and [`IPv6`](./ipv6.md) to efficiently store IP addresses
- **Geo types**: for [geographical data](./geo.md), including `Point`, `Ring`, `Polygon` and `MultiPolygon`
- **Special data types**: including [`Expression`](./special-data-types/expression.md), [`Set`](./special-data-types/set.md), [`Nothing`](./special-data-types/nothing.md) and [`Interval`](./special-data-types/interval.md)
System table [system.data_type_families](../../operations/system-tables/data_type_families.md#system_tables-data_type_families) provides an
overview of all available data types.
It also shows whether a data type is an alias to another data type and its name is case-sensitive (e.g. `bool` vs. `BOOL`).
2 changes: 1 addition & 1 deletion docs/en/sql-reference/data-types/json.md
Original file line number Diff line number Diff line change
Expand Up @@ -7,7 +7,7 @@ keywords: [object, data type]

# Object Data Type (deprecated)

**This feature is not production-ready and is now deprecated.** If you need to work with JSON documents, consider using [this guide](/docs/en/integrations/data-formats/json/overview) instead. A new implementation to support JSON object is in progress and can be tracked [here](https://github.com/ClickHouse/ClickHouse/issues/54864).
**This feature is not production-ready and deprecated.** If you need to work with JSON documents, consider using [this guide](/docs/en/integrations/data-formats/json/overview) instead. A new implementation to support JSON object is in progress and can be tracked [here](https://github.com/ClickHouse/ClickHouse/issues/54864).

<hr />

Expand Down
4 changes: 3 additions & 1 deletion docs/en/sql-reference/data-types/simpleaggregatefunction.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,9 @@ sidebar_label: SimpleAggregateFunction
---
# SimpleAggregateFunction

`SimpleAggregateFunction(name, types_of_arguments...)` data type stores current value of the aggregate function, and does not store its full state as [`AggregateFunction`](../../sql-reference/data-types/aggregatefunction.md) does. This optimization can be applied to functions for which the following property holds: the result of applying a function `f` to a row set `S1 UNION ALL S2` can be obtained by applying `f` to parts of the row set separately, and then again applying `f` to the results: `f(S1 UNION ALL S2) = f(f(S1) UNION ALL f(S2))`. This property guarantees that partial aggregation results are enough to compute the combined one, so we do not have to store and process any extra data.
`SimpleAggregateFunction(name, types_of_arguments...)` data type stores current value (intermediate state) of the aggregate function, but not its full state as [`AggregateFunction`](../../sql-reference/data-types/aggregatefunction.md) does.
This optimization can be applied to functions for which the following property holds: the result of applying a function `f` to a row set `S1 UNION ALL S2` can be obtained by applying `f` to parts of the row set separately, and then again applying `f` to the results: `f(S1 UNION ALL S2) = f(f(S1) UNION ALL f(S2))`.
This property guarantees that partial aggregation results are enough to compute the combined one, so we do not have to store and process any extra data.

The common way to produce an aggregate function value is by calling the aggregate function with the [-SimpleState](../../sql-reference/aggregate-functions/combinators.md#agg-functions-combinator-simplestate) suffix.

Expand Down
68 changes: 1 addition & 67 deletions docs/en/sql-reference/functions/geo/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,70 +5,4 @@ sidebar_position: 62
title: "Geo Functions"
---


## Geographical Coordinates Functions

- [greatCircleDistance](./coordinates.md#greatcircledistance)
- [geoDistance](./coordinates.md#geodistance)
- [greatCircleAngle](./coordinates.md#greatcircleangle)
- [pointInEllipses](./coordinates.md#pointinellipses)
- [pointInPolygon](./coordinates.md#pointinpolygon)

## Geohash Functions
- [geohashEncode](./geohash.md#geohashencode)
- [geohashDecode](./geohash.md#geohashdecode)
- [geohashesInBox](./geohash.md#geohashesinbox)

## H3 Indexes Functions

- [h3IsValid](./h3.md#h3isvalid)
- [h3GetResolution](./h3.md#h3getresolution)
- [h3EdgeAngle](./h3.md#h3edgeangle)
- [h3EdgeLengthM](./h3.md#h3edgelengthm)
- [h3EdgeLengthKm](./h3.md#h3edgelengthkm)
- [geoToH3](./h3.md#geotoh3)
- [h3ToGeo](./h3.md#h3togeo)
- [h3ToGeoBoundary](./h3.md#h3togeoboundary)
- [h3kRing](./h3.md#h3kring)
- [h3GetBaseCell](./h3.md#h3getbasecell)
- [h3HexAreaM2](./h3.md#h3hexaream2)
- [h3HexAreaKm2](./h3.md#h3hexareakm2)
- [h3IndexesAreNeighbors](./h3.md#h3indexesareneighbors)
- [h3ToChildren](./h3.md#h3tochildren)
- [h3ToParent](./h3.md#h3toparent)
- [h3ToString](./h3.md#h3tostring)
- [stringToH3](./h3.md#stringtoh3)
- [h3GetResolution](./h3.md#h3getresolution)
- [h3IsResClassIII](./h3.md#h3isresclassiii)
- [h3IsPentagon](./h3.md#h3ispentagon)
- [h3GetFaces](./h3.md#h3getfaces)
- [h3CellAreaM2](./h3.md#h3cellaream2)
- [h3CellAreaRads2](./h3.md#h3cellarearads2)
- [h3ToCenterChild](./h3.md#h3tocenterchild)
- [h3ExactEdgeLengthM](./h3.md#h3exactedgelengthm)
- [h3ExactEdgeLengthKm](./h3.md#h3exactedgelengthkm)
- [h3ExactEdgeLengthRads](./h3.md#h3exactedgelengthrads)
- [h3NumHexagons](./h3.md#h3numhexagons)
- [h3Line](./h3.md#h3line)
- [h3Distance](./h3.md#h3distance)
- [h3HexRing](./h3.md#h3hexring)
- [h3GetUnidirectionalEdge](./h3.md#h3getunidirectionaledge)
- [h3UnidirectionalEdgeIsValid](./h3.md#h3unidirectionaledgeisvalid)
- [h3GetOriginIndexFromUnidirectionalEdge](./h3.md#h3getoriginindexfromunidirectionaledge)
- [h3GetDestinationIndexFromUnidirectionalEdge](./h3.md#h3getdestinationindexfromunidirectionaledge)
- [h3GetIndexesFromUnidirectionalEdge](./h3.md#h3getindexesfromunidirectionaledge)
- [h3GetUnidirectionalEdgesFromHexagon](./h3.md#h3getunidirectionaledgesfromhexagon)
- [h3GetUnidirectionalEdgeBoundary](./h3.md#h3getunidirectionaledgeboundary)

## S2 Index Functions

- [geoToS2](./s2.md#geotos2)
- [s2ToGeo](./s2.md#s2togeo)
- [s2GetNeighbors](./s2.md#s2getneighbors)
- [s2CellsIntersect](./s2.md#s2cellsintersect)
- [s2CapContains](./s2.md#s2capcontains)
- [s2CapUnion](./s2.md#s2capunion)
- [s2RectAdd](./s2.md#s2rectadd)
- [s2RectContains](./s2.md#s2rectcontains)
- [s2RectUnion](./s2.md#s2rectunion)
- [s2RectIntersection](./s2.md#s2rectintersection)
Functions for working with geometric objects, for example [to calculate distances between points on a sphere](./coordinates.md), [compute geohashes](./geohash.md), and work with [h3 indexes](./h3.md).
14 changes: 1 addition & 13 deletions docs/en/sql-reference/statements/create/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,16 +6,4 @@ sidebar_label: CREATE

# CREATE Queries

Create queries make a new entity of one of the following kinds:

- [DATABASE](/docs/en/sql-reference/statements/create/database.md)
- [TABLE](/docs/en/sql-reference/statements/create/table.md)
- [VIEW](/docs/en/sql-reference/statements/create/view.md)
- [DICTIONARY](/docs/en/sql-reference/statements/create/dictionary.md)
- [FUNCTION](/docs/en/sql-reference/statements/create/function.md)
- [USER](/docs/en/sql-reference/statements/create/user.md)
- [ROLE](/docs/en/sql-reference/statements/create/role.md)
- [ROW POLICY](/docs/en/sql-reference/statements/create/row-policy.md)
- [QUOTA](/docs/en/sql-reference/statements/create/quota.md)
- [SETTINGS PROFILE](/docs/en/sql-reference/statements/create/settings-profile.md)
- [NAMED COLLECTION](/docs/en/sql-reference/statements/create/named-collection.md)
CREATE queries create (for example) new [databases](/docs/en/sql-reference/statements/create/database.md), [tables](/docs/en/sql-reference/statements/create/table.md) and [views](/docs/en/sql-reference/statements/create/view.md).
25 changes: 1 addition & 24 deletions docs/en/sql-reference/statements/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,27 +6,4 @@ sidebar_label: List of statements

# ClickHouse SQL Statements

Statements represent various kinds of action you can perform using SQL queries. Each kind of statement has it’s own syntax and usage details that are described separately:

- [SELECT](/docs/en/sql-reference/statements/select/index.md)
- [INSERT INTO](/docs/en/sql-reference/statements/insert-into.md)
- [CREATE](/docs/en/sql-reference/statements/create/index.md)
- [ALTER](/docs/en/sql-reference/statements/alter/index.md)
- [SYSTEM](/docs/en/sql-reference/statements/system.md)
- [SHOW](/docs/en/sql-reference/statements/show.md)
- [GRANT](/docs/en/sql-reference/statements/grant.md)
- [REVOKE](/docs/en/sql-reference/statements/revoke.md)
- [ATTACH](/docs/en/sql-reference/statements/attach.md)
- [CHECK TABLE](/docs/en/sql-reference/statements/check-table.md)
- [DESCRIBE TABLE](/docs/en/sql-reference/statements/describe-table.md)
- [DETACH](/docs/en/sql-reference/statements/detach.md)
- [DROP](/docs/en/sql-reference/statements/drop.md)
- [EXISTS](/docs/en/sql-reference/statements/exists.md)
- [KILL](/docs/en/sql-reference/statements/kill.md)
- [OPTIMIZE](/docs/en/sql-reference/statements/optimize.md)
- [RENAME](/docs/en/sql-reference/statements/rename.md)
- [SET](/docs/en/sql-reference/statements/set.md)
- [SET ROLE](/docs/en/sql-reference/statements/set-role.md)
- [TRUNCATE](/docs/en/sql-reference/statements/truncate.md)
- [USE](/docs/en/sql-reference/statements/use.md)
- [EXPLAIN](/docs/en/sql-reference/statements/explain.md)
Users interact with ClickHouse using SQL statements. ClickHouse supports common SQL statements like [SELECT](select/index.md) and [CREATE](create/index.md), but it also provides specialized statements like [KILL](kill.md) and [OPTIMIZE](optimize.md).
1 change: 1 addition & 0 deletions utils/check-style/aspell-ignore/en/aspell-dict.txt
Original file line number Diff line number Diff line change
Expand Up @@ -1811,6 +1811,7 @@ geocode
geohash
geohashDecode
geohashEncode
geohashes
geohashesInBox
geoip
geospatial
Expand Down

0 comments on commit d040513

Please sign in to comment.