This is the official PyTorch code for "RCRank: Multimodal Ranking of Root Causes of Slow Queries in Cloud Database Systems".
We propose RCRank, the first method to use a multimodal approach to identify and rank the root causes of slow queries by estimating their impact. We employ a pre-training method to align the multimodal information of queries, improving both the performance and the training speed of root-cause impact estimation. Building on the aligned pre-trained embedding module, we apply cross-modal fusion of the feature modalities to estimate the impact of root causes, thereby identifying and ranking the root causes of slow queries.
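For intuition, here is a minimal, hypothetical PyTorch sketch of cross-modal fusion over the four modality embeddings. The module name, dimensions, and mean-pooling are our illustration only, not the exact RCRank architecture (see model/modules/FuseModel for the real implementation):

```python
import torch
import torch.nn as nn

class CrossModalFusionSketch(nn.Module):
    """Hedged sketch: fuses per-modality embeddings (query, plan, log, KPI)
    with multi-head attention. All names and dimensions are illustrative."""
    def __init__(self, dim=128, heads=4):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.head = nn.Linear(dim, 1)  # illustrative impact-score head

    def forward(self, query_emb, plan_emb, log_emb, kpi_emb):
        # Stack the modality embeddings as a length-4 "token" sequence: [B, 4, dim]
        x = torch.stack([query_emb, plan_emb, log_emb, kpi_emb], dim=1)
        fused, _ = self.attn(x, x, x)   # attention across modalities
        pooled = fused.mean(dim=1)      # pool the fused representation
        return self.head(pooled)        # e.g., one impact estimate per sample

# Usage with random embeddings
B, D = 8, 128
m = CrossModalFusionSketch(D)
score = m(torch.randn(B, D), torch.randn(B, D), torch.randn(B, D), torch.randn(B, D))
```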
First clone the repository and then enter the project directory. The location and description of all files in this project are provided in the final "Complete Directory Structure" section.
We use the BERT model and parameters to initialize the query encoder. We provide a script to download the BERT model and parameters; the downloaded bert-base-uncased directory should be placed in the project's root directory.
git lfs install
git clone https://huggingface.co/google-bert/bert-base-uncased
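To verify that the weights downloaded completely, you can optionally load the local copy with the transformers library (this check is a suggestion of ours, not part of the pipeline):

```python
from transformers import BertModel, BertTokenizer

# Load the locally cloned BERT weights to confirm the download succeeded
tokenizer = BertTokenizer.from_pretrained("./bert-base-uncased")
model = BertModel.from_pretrained("./bert-base-uncased")
print(model.config.hidden_size)  # 768 for bert-base-uncased
```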
The project dependencies can be installed by executing the following script in the root of the repository:
conda create --name RCRank python=3.9
conda activate RCRank
pip install -r requirements.txt
Our method is divided into two stages. The pre-training stage pre-trains the encoders of query statements, execution plans, execution logs, and key performance indicators. Pre-training requires significant time and GPU resources, so for convenience we provide a link to download the pre-trained checkpoint. After downloading it, run the following script to place the checkpoint in the ./pretrain/ directory.
mv pretrain.pth ./pretrain/
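To confirm the checkpoint file is intact before training, it can be inspected with torch.load (a quick sanity check of our own; the exact contents depend on how the checkpoint was saved):

```python
import torch

# Peek inside the pre-trained checkpoint; structure depends on how it was saved
state = torch.load("./pretrain/pretrain.pth", map_location="cpu")
if isinstance(state, dict):
    print(list(state.keys())[:10])  # first few parameter/metadata keys
```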
The second stage is the training and testing stage. The datasets can be downloaded from this link. Then run the following script to place the data files into the data directory.
mkdir data
unzip -d ./data train_data.zip
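Assuming the archive unpacks to the CSV files shown in the "Complete Directory Structure" section at the end of this README, a quick pandas check confirms the data is in place:

```python
import pandas as pd

# File names follow the "Complete Directory Structure" section below
for name in ["tpc_c", "tpc_h", "tpc_ds"]:
    df = pd.read_csv(f"data/{name}.csv")
    print(name, df.shape)
```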
Run the following script to obtain the results for the TPC-H dataset.
python main.py --device cuda:0 --dataset tpc_h
The execution results of this script correspond to the TPC-H results in Table 3 in the paper.
Run the following script to obtain the results for the TPC-C dataset.
python main.py --device cuda:0 --dataset tpc_c
The execution results of this script correspond to the TPC-C results in Table 3 in the paper.
Run the following script to obtain the results for the TPC-DS dataset.
python main.py --device cuda:0 --dataset tpc_ds
The execution results of this script correspond to the TPC-DS results in Table 4 in the paper.
The final results will be saved in the res directory.
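Each run writes its output to its own subdirectory of res (for example, the directory shown in the structure below). A small helper such as this convenience sketch of ours prints every result file:

```python
from pathlib import Path

# Print every res.txt written under the res/ directory, one run at a time
for path in sorted(Path("res").rglob("res.txt")):
    print(f"== {path.parent.name} ==")
    print(path.read_text())
```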
The Hologres dataset uses the same code as the public datasets. However, the Hologres dataset involves business-related private information; therefore, we only provide the open-source datasets: TPC-C, TPC-H, and TPC-DS.
If you need to pre-train from scratch, proceed as follows. Please download our pre-training data from this link, then run the following script to place the data file into the ./pretrain/ directory.
mv pretrain_data.pkl ./pretrain/
Execute the following script and the checkpoint will be saved in the ./pretrain/save directory.
python pretrain/pretrain.py
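For intuition about what the alignment objective does, here is an illustrative InfoNCE-style contrastive loss between two modalities. The actual objective is defined in pretrain/pretrain.py, so treat this purely as a sketch:

```python
import torch
import torch.nn.functional as F

def alignment_loss_sketch(emb_a, emb_b, temperature=0.07):
    """Illustrative contrastive alignment between two modality embeddings.
    Row i of emb_a and emb_b form a positive pair; other rows are negatives."""
    a = F.normalize(emb_a, dim=-1)
    b = F.normalize(emb_b, dim=-1)
    logits = a @ b.t() / temperature     # [B, B] cosine-similarity logits
    targets = torch.arange(a.size(0))    # diagonal entries are the true pairs
    return F.cross_entropy(logits, targets)

loss = alignment_loss_sketch(torch.randn(16, 128), torch.randn(16, 128))
```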
Configuration File: config.py - Training Parameters (a usage sketch follows this list)
- batch_size: Number of samples per batch during training.
- lr: Learning rate.
- epoch: Number of training epochs.
- device: Device for computation, either GPU or CPU.
- opt_threshold: Threshold for valid root causes.
- model_name: Name of the model.
- use_fuse_model: Whether to use a fused model.
- use_threshold_loss: Whether to use validity and orderliness loss.
- model_path: Path to save the model.
- margin_loss_type: Margin loss type: "MarginLoss", "ListnetLoss", or "ListMleLoss".
- use_margin_loss: Whether to use margin loss.
- use_label_loss: Whether to predict root cause type.
- use_weight_loss: Whether to assign sample weights.
- use_log: Whether to use log information.
- use_metrics: Whether to use metrics information.
- embed_size: "emb_size" in QueryFormer.
- pred_hid: "pred_hid" in QueryFormer.
- ffn_dim: "ffn_dim" in QueryFormer.
- head_size: "head_size" in QueryFormer.
- n_layers: "n_layers" in QueryFormer.
- dropout: "dropout" in QueryFormer.
- input_emb: "input_emb" in QueryFormer.
- use_sample: "use_sample" in QueryFormer.
- ts_weight: Weight for validity and orderliness.
- mul_label_weight: Weight for type of root cause.
- pred_type: Type of task.
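As a sketch of how such options are typically grouped and consumed, the dataclass below mirrors a few of the field names above; the default values are placeholders of ours, not the project's defaults (see utils/config.py for those):

```python
from dataclasses import dataclass

@dataclass
class TrainConfigSketch:
    # Field names mirror the options described above; defaults are placeholders
    batch_size: int = 64
    lr: float = 1e-4
    epoch: int = 50
    device: str = "cuda:0"
    use_margin_loss: bool = True
    margin_loss_type: str = "MarginLoss"  # or "ListnetLoss", "ListMleLoss"

cfg = TrainConfigSketch(device="cuda:0", lr=5e-5)
print(cfg)
```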
Complete Directory Structure
RCRank
├─ data # TPC-C, TPC-H, TPC-DS datasets
│ ├─ tpc_c.csv # TPC-C dataset
│ ├─ tpc_ds.csv # TPC-DS dataset
│ └─ tpc_h.csv # TPC-H dataset
├─ bert-base-uncased # BERT model and parameters directory
├─ model # model, loss function, train and test
│ ├─ __init__.py
│ ├─ loss
│ │ └─ loss.py
│ ├─ modules
│ │ ├─ FuseModel
│ │ │ ├─ Attention.py
│ │ │ └─ CrossTransformer.py
│ │ ├─ LogModel
│ │ │ ├─ __init__.py
│ │ │ └─ log_model.py
│ │ ├─ QueryFormer
│ │ │ ├─ QueryFormer.py
│ │ │ ├─ datasetQF.py
│ │ │ └─ utils.py
│ │ ├─ TSModel
│ │ │ └─ ts_model.py
│ │ ├─ __init__.py
│ │ └─ rcrank_model.py
│ └─ train_test.py # train and test
├─ pretrain # pretrain function and pretrain parameters
│ ├─ pretrain.py
│ ├─ pretrain.pth
│ └─ pretrain_data.pkl
├─ res # Test result save directory
│ └─ GateComDiffPretrainModel tpc_c confidence eta0.07
│    └─ res.txt
├─ utils
│ ├─ config.py
│ ├─ data_tensor.py
│ ├─ evaluate.py
│ ├─ load_data.py
│ └─ plan_encoding.py
├─ main.py # Root cause rank
├─ README.md
└─ requirements.txt
If you find the code useful, please cite our paper (citation entry pending).