-
Notifications
You must be signed in to change notification settings - Fork 3.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Release Notes 0.14.0 #5374
Labels
Comments
good |
nice |
the last commit is 577b62b |
1 task
#4756 has been reverted. |
10 tasks
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Since the release of 0.13, Apache Doris (incubating) contains around 390 new features, bug fixes, performance enhancements, documentation improvements, code refactors from 60+ contributors. We are ready to release Apache Doris (incubating) 0.14.
New Feature
Import and delete
Support to delete multiple pieces of data at one time through the import method to avoid performance degradation caused by multiple deletions. For tables of the UniqueKey model, support to specify the Sequence column when importing. Doris will judge the sequence of the data according to the value of the Sequence column to ensure that the data is imported Time order
[#4310] [#4256]
Support database backup
The support in the backup stmt specifies the backup content (metadata and data).
Support exclude backup and restore some tables in stmt. When backing up the entire database, you can exclude some very large and unimportant tables.
Supports backing up and restoring the entire database instead of declaring each table name in the backup and restore statement.
[#5314]
ODBC external table support
Support access to external tables such as MySQL, postgresql, Oracle, etc. through ODBC protocol
[#4798] [#4438] [#4559] [#4699]
Support SQL level and Partition level result Cache
Support for caching query results to improve the efficiency of repeated queries, support SQL-level and Partition-level results Cache [#4330]
Built-in functions
FE interface and HTTP interface
The new FE UI interface can be enabled by setting the FE configuration item enable_http_server_v2 [[UI Part 5] Enable HTTP Server 2 by FE config #4684]
BE adds an http interface to show the distribution of all tablets in a partition among different disks in a BE [[Feature] Add a http interface to acquire the tablets distribution between different disks #5096]
BE adds an http interface to manually migrate a tablet to other disks on the same node [[Feature] Add a http interface for single tablet migration between different disks #5101]
Support to modify the configuration items of FE and BE through http, and persist these modifications [[Feature][Config] Support persistence of configuration items modified at runtime #4704]
Compatibility with MySQL
Monitoring
BE added tablet-level monitoring indicators, including scanned data volume and row number, written data volume and row number, to help locate hot tablets [[Metrics] Support tablet level metrics #4428]
BE added metrics to view the usage of various LRU caches [[LRUCache] Expose LRU Cache status to metrics #4688]
Table building related
Support backup,restore,load,export directly connect to s3 [#5399]
Other
Support adding Optimizer Hints of type SET_VAR in the Select statement to set session variables [[Feature]Support SELECT Optimizer Hints SET_VAR #4504]
Support to repair damaged tablets by filling in empty tablets [[Tablet][Recovery] Support using empty tablet to repair the damaged or missing tablet #4255]
Support Bucket Shuffle Join function (when the Join condition column is a subset of the table bucket column, the right table will be shuffled to the node where the data in the left table is located, which can significantly reduce the network overhead caused by Shuffle Join and improve query speed) [# 4677]
Support batch cancel import tasks through cancel load statement [[Feature] Support cancel load jobs in batch #4515]
Add a Session variable to set whether to allow the partition column to be NULL [[Repair] Add an option whether to allow the partition column to be NULL #5013]
Support TopN aggregation function [[Feature] Add Topn udaf #4803]
Support a new data balancing logic based on the number of partitions and buckets [[Rebalancer] support partition rebalancer #5010]
Support creating indexes on the value column of unique table [Support create index on unique value column #5305]
Enhancement
Performance improvement
Functional improvements
Show Proc
stmt #4446]-Support cast (expr as signed/unsigned int) syntax to be compatible with MySQL ecology
-Add more columns to the information_schema.columns table to be compatible with the MySQL ecosystem
@
leading column name #5006]Other
BugFix
Fix the bug that may be caused during playback of Erase Table metadata operations [Fix fe restart failed bug when replay erase table log #5221]
Fix the problem that the BE process crashes due to the orc::TimezoneError not being caught when importing ORC format files [[Bug]Fix bug that BE crash when load ORC file #4350]
Fix the problem that the result of the Except operator is incorrect [[BUG] Fix except wrong answer bug #4369]
Fix the problem that the query always route to the same BE node when querying ES data [[Doris On ES][Bug-Fix] ES queries always route at same 3 BE nodes (#4351) #4352]
Fix the problem that the operation is not correctly persisted when setting the Global Variable [[Bug] Fix bug that modification of global variable can not be persisted. #4324]
Fixed the problem that the MemTracker was not constructed correctly in PushHandler which caused the BE process to crash [[Bug][MemTracker] Cleanup the mem tracker's constructor to avoid wrong usage #4345]
Fix the problem of importing blank lines when importing Json data format [[JsonLoad] Fix bug that row num stat is not correct when loading json #4379]
Fix the problem that the SQL rewriting rules failed to correctly handle count distinct [Modify mv rewrite rule on 'Count distinct' #4382]
Fix the problem that the data model type of the materialized view is not set correctly when creating the materialized view [Fix errors when alter materialized view which based on dup table #4375]
Fix the problem of wrong query result of left semi/anti join [[BUG] Remove the deduplication of LEFT SEMI/ANTI JOIN with not equal … #4417]
Prioritize the join method specified by the user [Fix explicit broadcast join bug #4424]
Fix the problem of incorrect results when Inline view is included in the Left join operation [FixTupleIsNull miss in SelectStmt resultExpr #4279]
[[MySQL Compatibility 2/4][Bug] Fix bug and improve compatibility with mysql protocol #4362]
select database() no longer returns the cluster qualified name, and fix the problem that select user() does not display the user ip
Fix the problem that the number of table copies displayed by show create table is incorrect for tables that use the dynamic partition function [FIX: fix dynamic partition replicationNum error #4393]
Fix the inconsistent precision of decimal, char and varchar columns in the base table and the materialized view in the materialized view [Keep the scale and precision of type when creating mv #4436]
Fix the problem of wild pointer in PlanFragmentExecutor, fix the problem of null pointer when importing in json format [Fix core issue of 4447 and change declare order for compatibility #4448]
Fixed the problem that some remaining tablet directories on BE were not cleared [[Bug-Fix] Some deleted tablets are not recycled on BE #4401]
Fix some issues with Spark Load [[Spark load] Fix dpp and submit push task bugs #4464]
Fix the problem that the balance of the colocation table cannot be completed [[Colocation] Fix Colocation balance endless loop bug #4471]
Fix MemIndex::load_segment possible memory copy exception problem [[Bug] Fix bug that memory copy may overflow in MemIndex::load_segment #4458]
Fix the problem of BE crashing when using Load Error Hub function when WITH_MYSQL compilation option is not added [[Bug] Fix bug of load error hub and schema change #4486]
Fix the problem of execution error when using @@sql_mode environment variable in SQL [[Bug] Fix bug of select @@sql_mode #4484]
Fix the problem of splitting the same column in Spark Load and Broker Load, and the splitting behavior is inconsistent [[Spark load][Bug] Fix column terminator for spark load #4491]
Fix the problem of BE downtime caused by querying the information_schema.columns table [[Bug] Fix bug that BE will crash when querying information_schema.columns #4511]
Fix some issues in the persistence of rowset metadata in historical versions [[BUG] Fix recover persistent stale rowsets bug from multi-single version rowsets in stale rowsets #4513]
Fix the problem of inconsistent behavior of str_do_date() function on FE side and BE side [[Bug] function str_to_date()'s behavior on BE and FE is inconsistent #4495]
Fixed the issue where BE was down due to some historical data conversion when performing linked schema change [[BUG] Fix segment group add zone map bug when schema change. #4526]
Fix the problem that Spark Load stays in the ETL stage after FE restart [[Spark Load] [Bug] Load job's state will stay in ETL state all the time after FE restart #4528]
Fixed an issue that caused unreadable data when the delete condition contained "\n" [[BUG] Tablet is not readable and delete handler report -1903 error, when condition value contains \n #4531]
Fix the problem that Spark Load job in PENDING state cannot be cancelled [[Spark load][Bug] fix that cancelling a spark load in the
PENDING
phase will not succeed #4536]Fix the problem of inconsistent behavior when splitting columns between Spark Load and other import methods [[Spark load][Bug] fix that cancelling a spark load in the
PENDING
phase will not succeed #4536]Fix the problem that net.sourceforge.czt.dev cannot be found when compiling the FE module [[Compile] Add pluginRepository for java-cup-plugins #4636]
Fix the problem that the statement parsing fails when the cast function exists in the case when statement [[Bug] Fix analysis error when there are different types in case-when-then-else with group by clause #4646]
Fix the problem that all queries will fail when there is a problem with the RPC of a certain BE [Fix all queries failed when one BE network or disk has issue #4651]
Fixed the issue that related import transactions were not cleaned up after the BE node went down [[BUG] Fix transaction not be cleared after BE down. #4661]
Fix the problem that the column types of the columns table of information_schema are not compatible with MySQL [Fix DATA_TYPE in information_schema.columns is not compatible to mysql meta #4648]
Fix the problem of SQL Cache access out of bounds [[Docs] Supply BE config docs of setting and examples #4641]
Fix the problem that import throws a null pointer exception when there is no partition in the table [[Bug] Fix that the partitions of a dynamic-partitioned table has not been created at the time of load or insert #4658]
Fix an error when tools/show_segment_status access external tables [[Bug]External engines(e.g. ES) don't have segments, ignore those tables #4671]
Fix the issue that delete on clause may not take effect in Routine Load [[Bug]Fix information_schema.columns table column_comment does not show #4676]
Fix the problem that the columns of information_schema do not display comments [Fix delete on clause may not work in routineLoad #4683]
Fix the problem that hidden columns (delete flag column, etc.) may be lost after schema change [Fix hidden cloumn may disappeared #4686]
Fix the problem that the window function lag()/lead() reports an error when matching the decimal type [Fix Windows function lag()/lead() function throw AnalysisException. #4666]
Fix the problem that the client is stuck in high concurrency scenarios when using MySQL NIO Server [Fix mysqlslap hang under high concurrent #4680]
Fix the problem of always reporting out of date in tablet report [[Bug] Fix bug that tablet report always out of date #4695]
Fix the problem of duplicate columns in case when statement after query planning [[Bug] Fix duplicate columns in case when statement #4693]
Fix the problem that the rand() function generates the same random value every time [Fix rand() function return same value #4709]
Fix the problem of query error caused by incorrect column cardinality statistics [[Bug] Fix hard cardinality check which makes queries fail #4678]
Fix the problem of BE downtime caused by function error of split_part function [[Bug] Fix the core problem of function
split_part
and add the UT of core case #4721]Fix the problem of query execution error when SQL statement contains constant subquery [[Bug] Add regular column when materialized slot is empty in tuple #4719]
Fix the problem of join query error when the table contains the delete tag column [[BUG] Fix join error when the table has enbale batch delete #4734]
Fix the problem of syntax parsing errors when the CTE statement contains nested subqueries [[Bug]Fix bug CTE statement with nested select #4731]
Fix the problem of lead/lag type matching error in window function [[BUG] Ensure that the correct lead/lag function is selected #4732]
Fix the problem that tablet cannot be selected correctly when selecting tablet for compaction [[Bug] Fix bug of cumulative compaction and deletion of stale version #4593]
Fix the problem that limit conditions are incorrectly pushed down to the odbc external table and Es external table [[Bug] Do not push down limit operation when ODBC table do not push all conjunct as filter. #4764] [[Doris On ES][Bug-Fix] Can not pushdown limit when some predicate not processed by ES #4768]
Fix the problem that the compaction thread stops working [[Bug][Compaction] Fix bug that compaction may be blocked #4750]
Fix the problem that the timeout idle connection is not automatically killed in some cases [[Bug] Fix Bug that fe's connection which is timed out can't be released #4774]
Fix the problem of error when querying tables with delete flag column when SQL contains join [Fix delete_sign predicate assing to join node #4770]
Fix the calculation results of some time functions in FE to keep the results consistent with BE calculations [[Bug] Fix some date functions to make their result same as MySQL #4786]
Fix the issue that BE crashes when displaying tablet information on BE web page [[Bug] Fix bug and optimize implementation logic of tablets web page #4775]
Fix the type conversion problem of the time type filter condition, so that it can be correctly converted to the corresponding event type [[BUG] Cast int type to date type #4806]
Fixed the problem of repeatedly creating hidden columns when creating Rollup [Fix create rollup may duplicate hidden column #4816]
Fix the problem of hidden sequence column not displaying [[Bug] Sequence column should be visible when show_hidden_columns = true #4818]
Fix the problem of incorrect query results of some union statements [[Bug] Fix union bug (#4772) #4807]
Fixed an issue where offline node tasks could not be completed in some cases [[TabletScheduler] Fix some bug where decommission operations cannot be completed #4804]
Intelligently identify illegal date constants during SQL parsing to avoid query scanning all partitions [[FEATURE]Check date type to avoid scan all partitions #4756]
Fix the problem that BE crashes when the BE side selects the tablet for compaction without locking [[Compaction][Bug-Fix] Fix bug that meta lock need to be held when calculating compaction score #4829]
Fix some front-end display issues and back-end cookie processing logic issues in the new version of the UI [[FE UI] Fix some bugs about new FE UI #4830]
Fixed the problem that the tablet could not be found when querying errors when UNION and Colocation Join are included in SQL [[Bug][SQL] Fix bug that query failed when SQL contains Union and Colocate join #4842]
When submitting import tasks, the submission failed due to the full task queue, but the failure exception was not captured correctly [[BUG] Catch retry submit exception #4796]
Fix the problem of Broker Load job scheduling. Avoid the problem that some jobs cannot be scheduled after submission [[Bug] Fix some bugs of load job scheduler #4869]
Just before Master FE is started, avoid forwarding commands to Master FE [【Improvement】Avoid null host when forward to master #4844]
Ignore Parquet and ORC format empty files when importing to avoid reading errors [[Broker Load] Ignore empty file when file format is parquet or orc. #4810]
Fix the problem that the materialized view name conflict is not checked when renaming the OLAP table [[Bug] Rename table logic error #4870]
Fix the problem that the creation fails when using complex SQL to create a logical view [[Bug] Fix bug that failed to create view with complex select stmt #4840]
Fixed an issue where Routine Load could not end the task correctly due to reading empty messages when consuming Kafka data [[Bug] Fix bug that routine load blocked with TOO_MANY_TASKS error #4861]
Fix the problem that some column names are not recognized when using CTE syntax [[Bug] Fix bug #4886 and #4586 by refactoring code of method 'getDbs' #4887]
Fix the problem that the content of the columns table of the Information_schema library is incorrect [[BUG] Fix field error in information_schema.columns #4858]
Fix the problem that BitmapValue serialization fails when only 32-bit integers are included in the implementation of BitmapValue on the FE side [(#4883) Java Version BitmapValue deserialized failed #4884]
Fix that when calculating BE disk usage, all disk space not used by Doris in the node is incorrectly included. This will cause calculation errors during the Decommission operation [[BUG] modify isDecommissioned be capacity calculate rule #4889]
Fix the problem that an additional column may be added incorrectly when only constant expressions are included in the SELECT list [Avoid duplicate column when adding slot in empty tuple #4901]
Fix the problem that the Thrift Server type on the FE side and the BE side are inconsistent and cause communication failure [[Bug] Fix bug that be thrift client cannot connect to fe thrift server when fe thrift server use
TThreadedSelectorServer
model #4908]When partition cutting, ignore the filter conditions on non-columns [[Bug] Fix partition prune (#4833) #4921]
Fix the problem that the log directory is created incorrectly in the start_fe.sh startup script [fix the FE logs dir create issue #4929]
Fix the problem that some NULL values are not displayed when using CTE syntax [[Bug] Fix the bug of NULL do not show in CTE statement. #4932]
Fix the problem that Colocation Group is always in unstable state when some BE nodes are down [[BUG] Fix Colocate table balance bug #4936]
It is forbidden to create a table in Segment V1 format [disable the creation of segment v1 table #4913]
Fix the problem that Bool type condition processing error when Doris queries ES data [[Doris On ES][Bug-fix] fix boolean predicate pushdown manner #4990]
Fix a problem of Tablet Shard lock on BE side [[Bug] Fix concurrent access of _tablets_under_clone in TabletManager #5000]
Fix the problem of ConcurrentModificationException that may appear on the FE side when deleting a table that is being imported [(#5002) ConcurrentModificationException when finish transaction #5003]
Fix the problem of incorrect return type of str_to_date function [[Function] Let "str_to_date" return correct type #5004]
Fix the problem that the precision of some floating point types is lost when importing Json format data [[Bug] Fix the bug of Largetint and Decimal json load failed. #4983]
Fix the problem of incomplete query results when using Union to connect multiple external tables to query [[Bug] Fix bug that query multi mysql external table with union will get incomplete result #5067]
Fix the problem that the query result is incorrect when the SQL contains multiple in conditions [[Bug] Fix the bug of where condition a in ('A', 'B', 'V') and a in ('A') return error result #5072]
Fix a problem that the order of Profile destruction caused BE downtime [[Bug] Fix a core dump of counter in BE #5078]
Fix the problem of memory leakage when importing Json format data [[Bug] Fix Memory Leak in Json Load #5073]
Fix the problem that Colocation balance logic occupies 100% CPU when there is no BE node [[BUG] Fix colocate balance bug when no available BE #5079]
Fix the issue that creating a new tablet may cause BE downtime [[Bug] Fix coredump bug when create new tablets #5089]
Fixed the problem that the shared pointer circular reference caused the tablet to be unable to be cleared and occupied disk space [[Bug] Fix tablet shared ptr circular reference causing the tablet not to be cleared #5100]
Fix the issue that the BE will crash when the is null condition is included in the delete condition [[Bug] Fix bug when delete condition is null but zonemap is not null #5109]
Fix a problem with Partition Cache hit strategy [[Bug-Fix] Fix partition cache match bug #5060]
Optimize the strategy of Spark Load to read Hive tables to avoid full scanning of Hive tables [avoid to read whole hive table when spark load from hive table #5047]
Added support for Ninjia build system to speed up the compilation speed of BE [support ninja build system #5076]
Optimize the efficiency of importing data in Json format [[enhancement]improve performance of json load #5055]
Support FE to directly use thrift protocol to transmit heartbeat information to avoid heartbeat blocking failure that may be caused by http communication model [ Support fe heartbeat use thrift protocol to get stable response #5027]
Simplify the opening logic of the dynamic partition function, and prohibit hourly partitioning for date type columns [Forbidden creating table with dynamic partition when FE.config dynamic_partition_enable=false #5043]
Support to view Broker Load Profile through FE Web page [[Enhance] Add profile for load job #5052]
When viewing Resource information, clear text password is no longer displayed [[ODBC] ODBC Catalog do not show password in 'show resource' #5088]
The BE side adds trace information for tablet creation to help locate the problem of slow tablet creation [[Trace] Add trace for create tablet tasks #5091]
Fix the issue that may cause data loss when Routine Load consumes Kafka data in some cases [[Bug] Fix bug that routine load may lost some data #5093]
Fix the problem that desc statement to view all materialized views may return Malformed packet [[Bug-Fix] Fix 'Malformed packet' error when desc OlapTable with Rollup #5115]
Fix the issue that may cause BE to crash when BE starts loading the data directory [[Bug] Fix old tablet inserting bug #5113]
Fix the problem that non-Master FE repeatedly sends non-query requests to Master FE [[BUG] Follower shouldn't forward non-query statement to master repeatedly #5160]
Fix the problem of partition cache hit logic error [[Bug] Hit none partition cache, but hit range is still right #5065]
Fixed an error when bucket join was executed on an empty table [[Bug-Fix] Bucket shuffle join executes failed when two tables have no data #5145]
Fix the problem that the percentile_approx function returns the wrong result [[Bug-Fix] Fix the bug of
PERCENTILE_APPROX
return error resultnan
and addPERCENTILE_APPROX
UT #5172]Fix the problem of the calling sequence of Olap Scanner thread ending [5111]
Fixed an error when creating the colocation attribute for an empty partitioned table [Fix create colocate table bug #5139]
Fixed an error when querying materialized views in CTE statement [Fix MaterializedView select with CTE bug #5165]
Fix the problem that the min max function does not handle the null value of string type column correctly [[Bug] Fix bug that the min/max function has an error in handling string null values #5189]
Modify the string encoding in Spark-Doris-Connector to utf8 [[Spark on Doris] fix the encode of varchar when convertArrowToRowBatch #5202]
Fix the problem that delete column may be added repeatedly in routine load [Fix duplicated add delete condition when run routine load #5222]
Fix bucket shuffle join bug [[Bug] Fix bucket shuffle join bug of query failed #5228]
Fix the issue that the ALTER ROUTINE LOAD operation is invalid for some parameters [[BUG] fix alter routine load not work #5257]
Fixed an issue where metadata signatures of different tables may be the same during backup and recovery operations [[Bug] Remove schema hash and fix bug of calculating table signature #5254]
Fix the problem that Colocate Join and Buckets shuffle join may cause data to be scanned repeatedly [[Bug] Colocate Join and Bucket shuffle join may scan some tablet twice time. #5256]
Fix the issue of metadata errors caused by unchecked log id when FE pushes metadata [Add some consistency check in image put api #5219]
Fix the problem of error in aggregate query processing -0.0 [[Bug] Fix row_number and group by have inconsistent partition results for (0.0, -0.0) #5226]
Fix outer join query error [[Bug] Fix bug of outer join cause error result #5285]
Other
Add some non-Apache protocol code protocol declarations to the NOTICE file [Add other license declare in NOTICE #4831]
Reformatted the code of BE using clang-format [Clang-format cpp sources #4965]
Added clang-format checking and formatting scripts to unify the C++ code style of BE before submission [Add clang-format script #4934]
The third-party library adds the AWS S3 SDK, which can be used to directly read the data in the object storage through the SDK [add aws sdk to thirdparty #5234]
Fixed some issues related to License: [[License] Organize and modify the license of the code #4371]
The dependencies of the two third-party libraries, MySQL client and LZO, will no longer be enabled in the default compilation options. If users need MySQL external table function, they need to turn it on
Removed the js and css code in the code library and introduced it in the form of a third-party library dependency
Updated the Docker development environment image build-env-1.2
Updated the compilation method of the UnixODBC tripartite library, so that the BE process no longer depends on the libltdl.so dynamic library of the system when it is running
Added third-party UDF to support more efficient set calculation of orthogonal bitmap data [Add bitmap longitudinal cutting udaf #4198]
Added UnixODBC third-party library dependency to support ODBC external table function [[ODBC SCAN NODE] 1/4 Add unix odbc library. #4377]
API Change
streaming_load_max_batch_size_mb
tostreaming_load_json_max_mb
[Change config name 'streaming_load_max_batch_size_mb' to 'streaming_load_json_max_mb' #4791]Credits
924060929
acelyc111
Astralidea
benbiti
blueChild
caiconghui
caoyang10
ccoffline
coalchan
Dam1029
e0c9
EmmyMiao87
gengjun-git
HangyuanLiu
HappenLee
hffariel
jollykingCN
kangkaisen
killxdcj
lihuigang
liutang123
luozenglin
marising
mengqinghuan
morningman
nimuyuhan
Nivane
pengxiangyu
px-l
qidaye
sduzh
Skysheepwang
songchuangyuan
stalary
stdpain
Sunt-ing
vagetablechicken
vergilchiu
wangbo
wangxiaobaidu11
weizuo93
WingsGo
wutiangan
wuyunfeng
xinghuayu007
xinyiZzz
Xpray
xy720
yangzhg
Youngwb
yxqweasd
zh0122
ZhangYu0123
zhaojintaozhao
xxiao2018
bookeezhou
JNSimba
killxdcj
yuliangwan
The text was updated successfully, but these errors were encountered: