Description

ElectroMon is the app that monitors high-voltage parameters. It uses data uploaded from devices that make minute-by-minute measurements in power plants, transformers and other electrical appliances.

Implementation

Backend is written in Python using pandas and numpy. Graphs, plots and presentations are provided by Jupyter Notebook using matplotlib and reportlab.

Requirements and peculiarities

Requires some Python additional classic libs, all of which can be downloaded via pip install:

Check requirements.txt in root directory for a list of required libraries

The actual devices are applied with usage of cyrillic characters. Encoding and translation features should be considered.

Installation

The app is stored @ GitHub.

If you are familiar with Git choose a directory on your local machine and clone git repo there with this command:

git clone https://github.com/sadzax/ElectroMon.git

If you don't have Git installed download a project in ZIP-Archive by a direct link from GitHub, unzip it on your local machine and proceed to the points 3 and 4 of the installation documentation

Put files uploaded from devices files to internal directory of the app /upload/ depending on the type of device

/upload/kiv # Input isolation monitoring device 
/upload/mon # Monitoring of devices for continuous monitoring and protection of high-voltage inputs
/upload/nkvv # Device for continuous monitoring and protection of high-voltage inputs

Example

Copy UPLOAD001.I file in c:\users\user\downloads\ElectroMon-master\upload\mon\ for a device "Monitoring of devices for continuous monitoring and protection of high-voltage inputs"

For using via Python

Install Python v.3.7.+
Download required libraries from requirements.txt via pip install as described in Requirements

Example

run pip install pandas from command-line

You can run the application according to Runner via Python instructions

For using via Docker

Install Docker. Python interpreter and additional libs are not required.

Build the image and launch it according to Runner via Docker instructions

App Structure

Device initialisation (devices.py)

Measurers

ElectroMon bases on actual physical devices-measurers

There is a list of them:

nkvv - Device for continuous monitoring and protection of high-voltage inputs
cyrillic
Устройство непрерывного контроля и защиты высоковольтных вводов
kiv - Input isolation monitoring device
cyrillic
Устройство контроля изоляции вводов
monitoring - Monitoring of devices for continuous monitoring and protection of high-voltage inputs
cyrillic
Мониторинг устройств непрерывного контроля и защиты высоковольтных вводов

Attributes

Devices mostly store and return attributes for further processing. Module consist of methods setting properties and statuses of device, such as:

name of the device
default folder and files for uploads of data
type of data source, used delimiters, splitters etc.
columns used for setting values of measurements and their decoding map
typical errors of the devices measurements
warning- and accident- values for a measurement-types
misc. info

It's recommended to use dev or device_type variable for setting a device code (f.e. 'mon' or 'kiv')

Example of device initialisation

self.name = 'nkvv'
self.full_name = 'Устройство непрерывного контроля и защиты высоковольтных вводов'
self.monitoring_params = {'input': 220000, 'output': 110000}
self.log_types = {'measure': 'csv', 'event': 'csv'}
self.file_folder = 'upload/' + name + '/'
self.file_name_starts = {'measure': 'DB_i'}
self.file_sep = ';'
self.file_default_encoding = 'WINDOWS-1251'
self.file_parse_dates = ['Дата создания записи', 'Дата сохранения в БД']
self.default_dict_for_replacement_to_nan = {'power': [-300.0, 0.0],
                                            'tg': -10.0,
                                            '∆tg': -10.0,
                                            'c_delta': -10.0,
                                            'c_deviation': 0.0,
                                            'voltage_difference': 0.0}
self.warning_map = {'∆tg': [1.0, 1.5], '∆C': [3.0, 5.0]}

Sources

Data source located in /upload/ and should refer to the requirements of the chosen device described in devices.py file in class Device with init-properties such as self.file_name_starts or self.file_name_ends of device-objects In case of adding another file-type/patterns these properties should be updated.

Example of upload files

'upload/kiv/MeasJ a284 #572665920 30-12-2022.xlsx' 
'upload/mon/22_06/21217004.I' 
'upload/mon/22_07/21217004.I' 
'upload/mon/22_08/21217004.I' 
'upload/mon/22_09/21217004.I' 
'upload/mon/22_10/21217004.I' 
'upload/mon/22_11/21217004.I' 
'upload/mon/22_12/21217004.I' 
'upload/mon/23_01/21217004.I' 
'upload/nkvv/DB_i.csv;

Links

links() method contains main properties of a devices returned as a list. The method can be enlarged with new properties, but the order of the list must never be rearranged because its indexes are used in analytical functions.

Current links-list and indexes of the properties

        self.name,  # 0
        self.file,  # 1
        self.file_sep,  # 2
        self.file_default_encoding,  # 3
        self.file_parse_dates,  # 4
        self.file_list,  # 5
        self.default_dict_for_replacement_to_nan,  # 6
        self.file_parse_dates_basis,  # 7
        self.default_dict_for_dtypes,  # 8
        self.full_name,  # 9
        self.warning_map  # 10

Quick save/load

Saved cache of data for a device can be stored in a .pkl format in save/ directory for further fast upload.

Use class Pkl and save() or load() method in devices

Example of fast save & load

You can save the processing data with

data = devices.Pkl.save(device_type='kiv', data=data)

And then upload it with

data = devices.Pkl.load(device_type='kiv')

Columns processing (columns.py)

Main columns analyzer

Based on devices attributes columns_analyzer() processes data columns into Python dictionary with enumerated keys (can be used as indexes too) and values as list of parameters:

Original name of the column passed as string
Measurement used
Code of sensor and a phase
Voltage parameter
Short search name
Full search name
Concatenation of short search name and voltage parameter

Columns are set to cover column's parameters across as many viewpoints as possible in order to increase efficiency of analytical functions.

Besides of that columns_analyzer() creates some new columns sometimes, f.e. it creates column with concatinating of measures date and time columns for 'mon' device

Example of columns dictionary

0: ['Дата создания записи', 'datetime', 'overall', 'no_voltage', 'time', 'time_of_measure', 'time_no_voltage'],
1: ['Дата сохранения в БД', 'datetime', 'overall', 'no_voltage', 'save', 'time_of_saving', 'save_no_voltage'],
2: ['U_A1,кВ', 'voltage', 'A1', 'HV', 'U', 'voltage_difference', 'U_HV'],
3: ['Ia_A1,мА', 'power', 'A1', 'HV', 'Ia', 'power_active', 'Ia_HV'],
4: ['Ir_A1,мА', 'power', 'A1', 'HV', 'Ir', 'power_reactive', 'Ir_HV'],
5: ['Tg_A1,%', 'percentage', 'A1', 'HV', 'tg', 'tangent', 'tg_HV'],
6: ['C_A1,пФ', 'other', 'A1', 'HV', 'C', 'c_deviation', 'C_HV'],
7: ['DeltaTg_A1,%', 'percentage', 'A1', 'HV', '∆tg', 'tangent_delta', '∆tg_HV'],
8: ['DeltaC_A1,%', 'percentage', 'A1', 'HV', '∆C', 'c_delta', '∆C_HV'],
9: ['U_B1,кВ', 'voltage', 'B1', 'HV', 'U', 'voltage_difference', 'U_HV'],
10: ['Ia_B1,мА', 'power', 'B1', 'HV', 'Ia', 'power_active', 'Ia_HV'],
11: ['Ir_B1,мА', 'power', 'B1', 'HV', 'Ir', 'power_reactive', 'Ir_HV'],
12: ['Tg_B1,%', 'percentage', 'B1', 'HV', 'tg', 'tangent', 'tg_HV'],
13: ['C_B1,пФ', 'other', 'B1', 'HV', 'C', 'c_deviation', 'C_HV'],
14: ['DeltaTg_B1,%', 'percentage', 'B1', 'HV', '∆tg', 'tangent_delta', '∆tg_HV'],
15: ['DeltaC_B1,%', 'percentage', 'B1', 'HV', '∆C', 'c_delta', '∆C_HV'],
16: ['U_C1,кВ', 'voltage', 'C1', 'HV', 'U', 'voltage_difference', 'U_HV'],
17: ['Ia_C1,мА', 'power', 'C1', 'HV', 'Ia', 'power_active', 'Ia_HV'],
18: ['Ir_C1,мА', 'power', 'C1', 'HV', 'Ir', 'power_reactive', 'Ir_HV'],
19: ['Tg_C1,%', 'percentage', 'C1', 'HV', 'tg', 'tangent', 'tg_HV'],
20: ['C_C1,пФ', 'other', 'C1', 'HV', 'C', 'c_deviation', 'C_HV'],
21: ['DeltaTg_C1,%', 'percentage', 'C1', 'HV', '∆tg', 'tangent_delta', '∆tg_HV'],
22: ['DeltaC_C1,%', 'percentage', 'C1', 'HV', '∆C', 'c_delta', '∆C_HV'],
23: ['U_A2,кВ', 'voltage', 'A2', 'MV', 'U', 'voltage_difference', 'U_MV'],
24: ['Ia_A2,мА', 'power', 'A2', 'MV', 'Ia', 'power_active', 'Ia_MV'],
25: ['Ir_A2,мА', 'power', 'A2', 'MV', 'Ir', 'power_reactive', 'Ir_MV'],
26: ['Tg_A2,%', 'percentage', 'A2', 'MV', 'tg', 'tangent', 'tg_MV'],
27: ['C_A2,пФ', 'other', 'A2', 'MV', 'C', 'c_deviation', 'C_MV'],
28: ['DeltaTg_A2,%', 'percentage', 'A2', 'MV', '∆tg', 'tangent_delta', '∆tg_MV'],
29: ['DeltaC_A2,%', 'percentage', 'A2', 'MV', '∆C', 'c_delta', '∆C_MV'],
30: ['U_B2,кВ', 'voltage', 'B2', 'MV', 'U', 'voltage_difference', 'U_MV'],
31: ['Ia_B2,мА', 'power', 'B2', 'MV', 'Ia', 'power_active', 'Ia_MV'],
32: ['Ir_B2,мА', 'power', 'B2', 'MV', 'Ir', 'power_reactive', 'Ir_MV'],
33: ['Tg_B2,%', 'percentage', 'B2', 'MV', 'tg', 'tangent', 'tg_MV'],
34: ['C_B2,пФ', 'other', 'B2', 'MV', 'C', 'c_deviation', 'C_MV'],
35: ['DeltaTg_B2,%', 'percentage', 'B2', 'MV', '∆tg', 'tangent_delta', '∆tg_MV'],
36: ['DeltaC_B2,%', 'percentage', 'B2', 'MV', '∆C', 'c_delta', '∆C_MV'],
37: ['U_C2,кВ', 'voltage', 'C2', 'MV', 'U', 'voltage_difference', 'U_MV'],
38: ['Ia_C2,мА', 'power', 'C2', 'MV', 'Ia', 'power_active', 'Ia_MV'],
39: ['Ir_C2,мА', 'power', 'C2', 'MV', 'Ir', 'power_reactive', 'Ir_MV'],
40: ['Tg_C2,%', 'percentage', 'C2', 'MV', 'tg', 'tangent', 'tg_MV'],
41: ['C_C2,пФ', 'other', 'C2', 'MV', 'C', 'c_deviation', 'C_MV'],
42: ['DeltaTg_C2,%', 'percentage', 'C2', 'MV', '∆tg', 'tangent_delta', '∆tg_MV'],
43: ['DeltaC_C2,%', 'percentage', 'C2', 'MV', '∆C', 'c_delta', '∆C_MV'],
44: ['Tair,°С', 'temperature', 'overall', 'no_voltage', 'tair', 'temperature_of_air', 'tair_no_voltage'],
45: ['Tdevice,°С', 'temperature', 'overall', 'no_voltage', 'tdev', 'temperature_of_device', 'tdev_no_voltage'],
46: ['Tcpu,°С', 'temperature', 'overall', 'no_voltage', 'tcpu', 'temperature_of_cpu', 'tcpu_no_voltage'],
47: ['Freq,Гц', 'frequency', 'overall', 'no_voltage', 'freq', 'frequency', 'freq_no_voltage'],
48: ['Unnamed: 48', 'other', 'overall', 'no_voltage', 'no_name', 'no_name', 'no_name_no_voltage']

Defining the time column

time_column() returns full name of a timestamp-column in dats based on device property self.file_parse_dates. This property stores a list of columns which contain time-type data and the main time column (with the fixed time of the measurement) must be first in this list.

Analyzers and processors (analyzer.py)

Data import

The import is based on work-files (described in Sources) with a get_data() function. In case of using a multiple files for creating an uninterrupted dataflow stack_data() function should be used. Stacking data from different files also automatically sorted by ascending timestamps of the measurement. Data import functions returns pandas DataFrame. It is recommended to set a data as a variable to store this result.

Replace data

There are some measurements that actually mean "NaN" but due to the technical issues of the devices are actually written as values in upload-files. pass_the_nan() replaces specified values in a DataFrame with NaN values.

Example of NaN-measures

{
    'power': [-300.0, 0.0],
    'tg': [-10.0],
    '∆tg': [-10.0],
    'c_delta': [-10.0],
    'c_deviation': [0.0],
    'voltage_difference': [0.0]
}

set_dtypes sets the data types for specified columns in a DataFrame.

The dictionaries are stored in device:

the values to be replaced with NaN values is a dictionary self.default_dict_for_replacement_to_nan property
the data types for each column is a dictionary self.default_dict_for_dtypes property

It's recommended to pass variable data and return it as a data variable as well for these functions

Counter

total_counter() is a simple function that takes a device type and a pandas DataFrame as input and returns the total number of logs in the DataFrame. If no DataFrame is provided, the function calls get_data() to obtain one.

total_periods() check for received data to return the start and the end of the measures. You can pass format of the data by format= argument.

Timestamp analyzing

values_time_analyzer() analyzes the time gaps between consecutive rows in a given time column of a Pandas DataFrame and returns a DataFrame with information about any gaps that exceed a specified duration or fall outside a certain range of durations. You can pass args time_sequence_min= and inaccuracy_sec= to adjust the correct interval of measures (default is 1 minute) and allowed inaccuracy to be ignored for every next measure (default is 3 seconds).

values_time_slicer() slices the data based on time intervals. Returns a dictionary containing information about sliced data, every dictionary value contains separate pandas DataFrame. Data is separated by the big time gaps, f.e. if there are measures in quite different periods of time. Time-gap parameter for slicing set by argument minutes_slice_mode=. Default is 1440 (a day) Default values counter for forming a slice set by argument min_values_required=. Default is 150.

Periods

time_period_choose() is designed to enable a user to select a time period of interest within a given dataset. It then calculates the start and end dates of the dataset, prints them, and prompts the user to enter a specific time period of interest. The function sorts the dataset by the time of measurement and resets the indexes.

Nans

total_nan_counter() analyzes the percentage of NaN values in each row of the input data and returns a pandas DataFrame that shows the time periods where the percentage of NaN values exceeds a specified threshold. Default argument false_data_percentage= set to 33.0 means that measurements in the particular moment is too uncertain if more than 1/3 of measures are errors Map of the typical errors of the devices measurements can be found in devices.py as a self.default_dict_for_replacement_to_nan property of the device objec. They are slightly described in Attributes

If there are too many NaN values in each row in a database total_nan_counter_ease() can be used to stack continuous periods of NaN values in measure database.

Filtering

data_filter() returns a new DataFrame that only contains the columns that match the filter list. Filter a Pandas DataFrame by a list of column names but also can be given codes of measurers or shortnames and practically anything from columns.py / Main columns analyzer

Averages

data_average_finder() filters and calculates the average value of a specified list of columns in a given dataset. It returns a dictionary with column names as keys and their corresponding average value as values.

Distributions

data_distribution_finder() analyzes the distribution of values in the specified columns of the data.

Correlations

data_correlation() calculates the correlation between columns of two filtered dataframes. Returns a dictionary with the correlation parameters as keys and the corresponding sequence of correlation values. If you plot this data, f.e. 100% strict correlation would like "x=y" type of graph with a 45° angle.

Warnings

warning_finder() is the basic function for analyzing warnings. It finds time intervals when certain parameters exceed specified warning or accident threshold values set by arguments warning_param_war= and warning_param_acc=. Default warning or accident threshold values set in devices.py for each device in a self.warning_map property. They are slightly described in Attributes

The next step for analyzing of warnings would be using warning_finder_merge() that merges the dataframes in the input dictionary log and adds columns for warning parameters based on the given warn_type= and threshold values. It returns the resulting merged dataframe.

If there are too many warnings warning_finder_ease() can be used to stack continuous periods of warnings in database. It takes in a dictionary log and several optional arguments to filter and process the data in order to identify warning or accident events.

Other Modules

prints.py contains scripts for printing messages to user and launching functions from other modules. It frames the execution of functions with descriptions and textual information.

plots.py draws graphs, histograms and plots.

frontend.py consists of scripts setting fonts, styles, margins, page-numbering, templates, capturing outputs, texts, graphs, plots and tables.

services.py contains technical scripts for processing inputs, questioning user, transforming of dates and strings, checking for correct formats, declensing by cases in Russian and so on.

Runner

Run ElectroMon depending on the way it is installed

Run via Python

Perform installation as described in installation guide steps 1-2 and steps 3-4

Launch file run.py with a Python interpreter from a command-line from the app directory.

Example

python c:\users\user\downloads\ElectroMon-master\\run.py

For full information display including graphs and plots Jupyter Notebook is preferable. Here's installation guide for it.

Print a PDF

Running output.py does the same as described above but in the end of the processing data it asks you for the name of the file that will be saved in reports directory as a document with a .pdf extension.

It doesn't require 'Jupyter Notebook' for printings graphs.

Example

python c:\users\user\downloads\ElectroMon-master\\output.py

Run via Docker

Perform installation as described in installation guide steps 1-2 and step 3

Launch Docker in your OS

Build the image from the app directory (f.e., c:\users\user\downloads\ElectroMon-master\\) with following command-line:

docker build -t em .

The image of the application will be built with a em name. You don't need to build the image for every further launch of the application, the build should be stored at your local machine.

To launch it simply enter:

docker run -it -v C:\Users\User\Desktop:/usr/src/reports em

where C:\Users\User\Desktop is the directory of your local machine for saving report-file in PDF

Run via EXE-file

In Windows launch em.exe

Name		Name	Last commit message	Last commit date
Latest commit History 266 Commits
.idea		.idea
debug		debug
misc		misc
reports		reports
upload		upload
.gitignore		.gitignore
Dockerfile		Dockerfile
Readme.MD		Readme.MD
Readme_rus.MD		Readme_rus.MD
analyzer.py		analyzer.py
columns.py		columns.py
devices.py		devices.py
em.exe		em.exe
frontend.py		frontend.py
output.py		output.py
plots.py		plots.py
prints.py		prints.py
requirements.txt		requirements.txt
run.py		run.py
services.py		services.py
tst.ipynb		tst.ipynb

sadzax/ElectroMon

Folders and files

Latest commit

History

Repository files navigation

Contents

Description

Implementation

Requirements and peculiarities

Installation

For using via Python

For using via Docker

App Structure

Device initialisation (devices.py)

Measurers

Attributes

Sources

Links

Quick save/load

Columns processing (columns.py)

Main columns analyzer

Defining the time column

Analyzers and processors (analyzer.py)

Data import

Replace data

Counter

Timestamp analyzing

Periods

Nans

Filtering

Averages

Distributions

Correlations

Warnings

Other Modules

Runner

Run via Python

Print a PDF

Run via Docker

Run via EXE-file

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages