Node Application Metrics monitoring and profiling agent
Node Application Metrics instruments the Node.js runtime for performance monitoring, providing the monitoring data via an API. Additionally the data can be visualized in an Eclipse IDE using the IBM Monitoring and Diagnostics Tools - Health Center client.
See https://www.ibm.com/developerworks/java/jdk/tools/healthcenter/ for more details.
Node Application Metrics provides the following built-in data collection sources:
Source | Description |
---|---|
Environment | Machine and runtime environment information |
CPU | Process and system CPU |
Memory | Process and system memory usage |
GC | Node/V8 garbage collection statistics |
Event Loop | Event loop latency information |
Express | Express 4.x Web Framework application request monitoring |
Loop | Event loop timing metrics |
Function profiling | Node/V8 function profiling (disabled by default) |
HTTP | HTTP request calls made of the application |
HTTP Outbound | HTTP requests made by the application |
socket.io | WebSocket data sent and received by the application |
LevelDB | LevelDB queries made by the application |
MySQL | MySQL queries made by the application |
MongoDB | MongoDB queries made by the application |
PostgreSQL | PostgreSQL queries made by the application |
MQTT | MQTT messages sent and received by the application |
MQLight | MQLight messages sent and received by the application |
Memcached | Data that is stored or manipulated in Memcached |
OracleDB | OracleDB queries made by the application |
Oracle | Oracle queries made by the application |
StrongOracle | StrongOracle database queries made by the application |
Redis | Redis commands issued by the application |
Riak | Riak methods called by the application |
Request tracking | A tree of application requests, events and optionally trace (disabled by default) |
Function trace | Tracing of application function calls that occur during a request (disabled by default) |
The Node Application Metrics agent supports the following runtime environments:
- Node.js v7 on:
- 64-bit or 32-bit runtime on Windows (x64 or x86)
- 64-bit or 32-bit runtime on Linux (x64, x86, PPC64LE)
- 64-bit runtime on Mac OS X (x64)
- Node.js v6 on:
- 64-bit or 32-bit runtime on Windows (x64 or x86)
- 64-bit or 32-bit runtime on Linux (x64, x86, PPC64LE)
- 64-bit runtime on Mac OS X (x64)
- Node.js v4 on:
- 64-bit or 32-bit runtime on Windows (x64 or x86)
- 64-bit or 32-bit runtime on Linux (x64, x86, PPC64LE)
- 64-bit runtime on Mac OS X (x64)
- IBM SDK for Node.js 4 on:
- 64-bit or 32-bit runtime on Windows (x64 or x86)
- 64-bit or 32-bit runtime on Linux (x64, x86, PPC32, PPC64, PPC64LE, z31, z64)
- 64-bit or 32-bit runtime on AIX (PPC32, PPC64)
- 64-bit runtime on Mac OS X (x64)
Node Application Metrics can be installed using npm either locally or globally.
When installed locally you can access monitoring data via both the API and the Health Center client by modifying your application to use appmetrics (see Modifying your application to use the local installation).
To perform a local install:
$ npm install appmetrics
A local install will put the module inside "./node_modules
of the current package root" (see the npm documentation for more information); usually this is the current directory and in that case the module installation directory will be ./node_modules/appmetrics
.
When installed globally you can access monitoring data via the Health Center client (but not the API) by using the node-hc
command-line utility (see The node-hc
command).
To perform a global install:
$ npm install -g appmetrics
A global install will put the module inside a directory tied to your Node.js SDK.
- On Windows, either:
<UserDirectory>\AppData\Roaming\npm\node_modules
- or:
<NodeInstallDirectory>\node_modules
- On other platforms:
<node_install_directory>/lib/node_modules
It also adds the node-hc
command to another directory tied to your Node.js SDK, one that was added to your executable search path by the Node.js SDK installer.
- On Windows, either:
<UserDirectory>\AppData\Roaming\npm
- or:
<NodeInstallDirectory>
- On other platforms:
<node_install_directory>/bin
Node Application Metrics can be configured in two ways, by using the configuration file described below or via a call to configure(options).
Node Application Metrics comes with a configuration file inside the module installation directory (.../node_modules/appmetrics/appmetrics.properties
). This can be used to configure connection options, logging and data source options.
Node Application Metrics will attempt to load appmetrics.properties
from one of the following locations (in order):
- the application directory
- the current working directory
- the appmetrics module installation directory
The default configuration has minimal logging enabled, will attempt to send data to a local MQTT server on the default port and has method profiling disabled.
Many of the options provide configuration of the Health Center core agent library and are documented in the Health Center documentation: Health Center configuration properties.
The following options are specific to appmetrics:
com.ibm.diagnostics.healthcenter.data.profiling=[off|on]
Specifies whether method profiling data will be captured. The default value isoff
. This specifies the value at start-up; it can be enabled and disabled dynamically as the application runs, either by a monitoring client or the API.
If you globally installed this module with npm, you can use the node-hc
command to run your application instead of the node
command. This will run your application as it would normally under node (including any node options) but additionally load and start appmetrics
.
$ node-hc app.js
The purpose of this mode of operation is to provide monitoring of the application without requiring any changes to the application code. The data is sent to the Health Center Eclipse IDE client.
If you locally install this module with npm then you will additionally have access to the monitoring data via the appmetrics
API (see API Documentation).
To load appmetrics
and get the monitoring API object, add the following to the start-up code for your application:
var appmetrics = require('appmetrics');
var monitoring = appmetrics.monitor();
The call to appmetrics.monitor()
starts the data collection agent, making the data available via the API and to the Heath Center client via MQTT.
You should start your application using the node
command as usual (not node-hc
).
You must call require('appmetrics');
before the require statements for any npm modules you want to monitor. Appmetrics must be initialized first so that it can instrument modules for monitoring as they are loaded. If this is a problem due to the structure of your application you can require the module on the node command line with -r to make sure it is pre-loaded:
> node -r appmetrics myapp.js
Once you have loaded appmetrics you can then use the monitoring object to register callbacks and request information about the application:
monitoring.on('initialized', function (env) {
env = monitoring.getEnvironment();
for (var entry in env) {
console.log(entry + ':' + env[entry]);
};
});
monitoring.on('cpu', function (cpu) {
console.log('[' + new Date(cpu.time) + '] CPU: ' + cpu.process);
});
Connecting to the Health Center client requires the additional installation of a MQTT broker. The Node Application Metrics agent sends data to the MQTT broker specified in the appmetrics.properties
file or set via a call to configure(options). Installation and configuration documentation for the Health Center client is available from the Health Center documentation in IBM Knowledge Center.
Note that both the API and the Health Center client can be used at the same time and will receive the same data. Use of the API requires a local install and application modification (see Modifying your application to use the local installation).
Further information regarding the use of the Health Center client with Node Application Metrics can be found on the appmetrics wiki: Using Node Application Metrics with the Health Center client.
Sets various properties on the appmetrics monitoring agent. If the agent has already been started, this function does nothing.
options
(Object) key value pairs of properties and values to be set on the monitoring agent.
Property name | Property value | Effect |
---|---|---|
applicationID |
string |
Specifies a unique identifier for the mqtt connection |
mqtt |
`[off | on]` |
mqttHost |
host name |
Specifies the host name of the mqtt broker |
mqttPort |
port number |
Specifies the port number of the mqtt broker |
profiling |
`[off | on]` |
Starts the appmetrics monitoring agent. If the agent is already running this function does nothing.
Stops the appmetrics monitoring agent. If the agent is not running this function does nothing.
Enable data generation of the specified data type.
type
(String) the type of event to start generating data for. Values ofeventloop
,express
,profiling
,http
,http-outbound
,mongo
,socketio
,mqlight
,postgresql
,mqtt
,mysql
,redis
,riak
,memcached
,oracledb
,oracle
,strong-oracle
,requests
andtrace
are currently supported. Astrace
is added to request data, bothrequests
andtrace
must be enabled in order to receive trace data.config
(Object) (optional) configuration map to be added for the data type being enabled. (see setConfig) for more information.
The following data types are disabled by default: profiling
, requests
, trace
Disable data generation of the specified data type.
type
(String) the type of event to stop generating data for. Values ofeventloop
,express
,profiling
,http
,mongo
,socketio
,mqlight
,postgresql
,mqtt
,mysql
,redis
,riak
,memcached
,oracledb
,oracle
,strong-oracle
,requests
andtrace
are currently supported.
Set the configuration to be applied to a specific data type. The configuration available is specific to the data type.
type
(String) the type of event to apply the configuration to.config
(Object) key value pairs of configurations to be applied to the specified event. The available configuration options are as follows:
Source | Configuration | Effect |
---|---|---|
http |
filters |
(Array) of URL filter Objects consisting of: |
| | `pattern` (String) a regular expression pattern to match HTTP method and URL against, eg. 'GET /favicon.ico$'
| | `to` (String) a conversion for the URL to allow grouping. A value of `''` causes the URL to be ignored.
requests
| excludeModules
| (Array) of String names of modules to exclude from request tracking.
trace
| includeModules
| (Array) of String names for modules to include in function tracing. By default only non-module functions are traced when trace is enabled.
advancedProfiling
| threshold
| (Number) millisecond run time of an event loop cycle that will trigger profiling
Allows custom monitoring events to be added into the Node Application Metrics agent.
type
(String) the name you wish to use for the data. A subsequent event of that type will be raised, allowing callbacks to be registered for it.data
(Object) the data to be made available with the event. The object must not contain circular references, and by convention should contain atime
value representing the milliseconds when the event occurred.
Dumps the v8 heap via heapdump
.
For more information, see https://github.com/bnoordhuis/node-heapdump/blob/master/README.md
Creates a Node Application Metrics agent client instance. This can subsequently be used to get environment data and subscribe to data events. This function will start the appmetrics monitoring agent if it is not already running.
Requests an object containing all of the available environment information for the running application. This will not contain all possible environment information until an 'initialized' event has been received.
Emitted when a CPU monitoring sample is taken.
data
(Object) the data from the CPU sample:time
(Number) the milliseconds when the sample was taken. This can be converted to a Date usingnew Date(data.time)
.process
(Number) the percentage of CPU used by the Node.js application itself. This is a value between 0.0 and 1.0.system
(Number) the percentage of CPU used by the system as a whole. This is a value between 0.0 and 1.0.
Emitted when a memory monitoring sample is taken.
data
(Object) the data from the memory sample:time
(Number) the milliseconds when the sample was taken. This can be converted to a Date usingnew Date(data.time)
.physical_total
(Number) the total amount of RAM available on the system in bytes.physical_used
(Number) the total amount of RAM in use on the system in bytes.physical_free
(Number) the total amount of free RAM available on the system in bytes.virtual
(Number) the memory address space used by the Node.js application in bytes.private
(Number) the amount of memory used by the Node.js application that cannot be shared with other processes, in bytes.physical
(Number) the amount of RAM used by the Node.js application in bytes.
Emitted when all possible environment variables have been collected. Use appmetrics.monitor.getEnvironment()
to access the available environment variables.
Emitted when a garbage collection (GC) cycle occurs in the underlying V8 runtime.
data
(Object) the data from the GC sample:time
(Number) the milliseconds when the sample was taken. This can be converted to a Date usingnew Date(data.time)
.type
(String) the type of GC cycle, either 'M' or 'S'.size
(Number) the size of the JavaScript heap in bytes.used
(Number) the amount of memory used on the JavaScript heap in bytes.duration
(Number) the duration of the GC cycle in milliseconds.
Emitted every 5 seconds, summarising sample based information of the event loop latency
data
(Object) the data from the event loop sample:time
(Number) the milliseconds when the event was emitted. This can be converted to a Date usingnew Date(data.time)
.latency.min
(Number) the shortest sampled latency, in milliseconds.latency.max
(Number) the longest sampled latency, in milliseconds.latency.avg
(Number) the average sampled latency, in milliseconds.
Emitted every 60 seconds, summarising event tick information in time interval
data
(Object) the data from the event loop sample:count
(Number) the number of event loop ticks in the last interval.minimum
(Number) the shortest (i.e. fastest) tick in milliseconds.maximum
(Number) the longest (slowest) tick in milliseconds.average
(Number) the average tick time in milliseconds.
Emitted when a profiling sample is available from the underlying V8 runtime.
data
(Object) the data from the profiling sample:time
(Number) the milliseconds when the sample was taken. This can be converted to a Date usingnew Date(data.time)
.functions
(Array) an array of functions that ran during the sample. Each array entry consists of:self
(Number) the ID for this function.parent
(Number) the ID for this function's caller.name
(String) the name of this function.file
(String) the file in which this function is defined.line
(Number) the line number in the file.count
(Number) the number of samples for this function.
Emitted when a HTTP request is made of the application.
data
(Object) the data from the HTTP request:time
(Number) the milliseconds when the request was made. This can be converted to a Date usingnew Date(data.time)
.method
(String) the HTTP method used for the request.url
(String) the URL on which the request was made.duration
(Number) the time taken for the HTTP request to be responded to in ms.
Emitted when the application makes an outbound HTTP request.
data
(Object) the data from the HTTP request:time
(Number) the milliseconds when the request was made. This can be converted to a Date usingnew Date(data.time)
.method
(String) the HTTP method used for the request.url
(String) the URL on which the request was made.contentType
(String) the HTTP response content-type.statusCode
(String) the HTTP response status code.duration
(Number) the time taken for the HTTP request to be responded to in ms.
Emitted when WebSocket data is sent or received by the application using socketio.
data
(Object) the data from the socket.io request:time
(Number) the milliseconds when the event occurred. This can be converted to a Date usingnew Date(data.time)
.method
(String) whether the event is abroadcast
oremit
from the application, or areceive
from a client .event
(String) the name used for the event.duration
(Number) the time taken for event to be sent or for a received event to be handled.
Emitted when a MySQL query is made using the mysql
module.
data
(Object) the data from the MySQL query:time
(Number) the milliseconds when the MySQL query was made. This can be converted to a Date usingnew Date(data.time)
.query
(String) the query made of the MySQL database.duration
(Number) the time taken for the MySQL query to be responded to in ms.
Emitted when a MongoDB query is made using the mongodb
module.
data
(Object) the data from the MongoDB request:time
(Number) the milliseconds when the MongoDB query was made. This can be converted to a Date usingnew Date(data.time)
query
(String) the query made of the MongoDB database.duration
(Number) the time taken for the MongoDB query to be responded to in ms.method
(String) the executed method for the query, such as find, update.collection
(String) the MongoDB collection name.
Emitted when a MQTT message is sent or received.
data
(Object) the data from the MQTT event:time
(Number) the time in milliseconds when the MQTT event occurred. This can be converted to a Date using new Date(data.time).method
(String) the name of the call or event (will be one of 'publish' or 'message').topic
(String) the topic on which a message is published or received.qos
(Number) the QoS level for the message.duration
(Number) the time taken in milliseconds.
Emitted when a MQLight message is sent or received.
data
(Object) the data from the MQLight event:time
(Number) the time in milliseconds when the MQLight event occurred. This can be converted to a Date using new Date(data.time).clientid
(String) the id of the client.data
(String) the data sent if a 'send' or 'message', undefined for other calls. Truncated if longer than 25 characters.method
(String) the name of the call or event (will be one of 'send' or 'message').topic
(String) the topic on which a message is sent/received.qos
(Number) the QoS level for a 'send' call, undefined if not set.duration
(Number) the time taken in milliseconds.
Emitted when a LevelDB query is made using the leveldown
module.
data
(Object) the data from the LevelDB query:time
(Number) the time in milliseconds when the LevelDB query was made. This can be converted to a Date usingnew Date(data.time)
.method
(String) The leveldown method being used.key
(Object) The key being used for a call toget
,put
ordel
(Undefined for other methods)value
(Object) The value being added to the LevelDB database using theput
method (Undefined for other methods)opCount
(Number) The number of operations carried out by abatch
method (Undefined for other methods)duration
(Number) the time taken for the LevelDB query to be responded to in ms.
Emitted when a Redis command is sent.
data
(Object) the data from the Redis event:time
(Number) the time in milliseconds when the redis event occurred. This can be converted to a Date using new Date(data.time).cmd
(String) the Redis command sent to the server or 'batch.exec'/'multi.exec' for groups of command sent using batch/multi calls.duration
(Number) the time taken in milliseconds.
Emitted when a Riak method is called using the basho-riak-client
module.
data
(Object) the data from the Riak event:time
(Number) the time in milliseconds when the riak event occurred. This can be converted to a Date using new Date(data.time).method
(String) the Riak method called.options
(Object) the options parameter passed to Riak.command
(Object) the command parameter used in theexecute
method.query
(String) the query parameter used in themapReduce
method.duration
(Number) the time taken in milliseconds.
Emitted when a data is stored, retrieved or modified in Memcached using the memcached
module.
data
(Object) the data from the memcached event:time
(Number) the milliseconds when the memcached event occurred. This can be converted to a Date usingnew Date(data.time)
method
(String) the method used in the memcached client, egset
,get
,append
,delete
, etc.key
(String) the key associated with the data.duration
(Number) the time taken for the operation on the memcached data to occur.
Emitted when a query is executed using the oracledb
module.
data
(Object) the data from the OracleDB query:time
(Number) the milliseconds when the OracleDB query was made. This can be converted to a Date usingnew Date(data.time)
.query
(String) the query made of the OracleDB database.duration
(Number) the time taken for the OracleDB query to be responded to in ms.
Emitted when a query is executed using the oracle
module.
data
(Object) the data from the Oracle query:time
(Number) the milliseconds when the Oracle query was made. This can be converted to a Date usingnew Date(data.time)
.query
(String) the query made of the Oracle database.duration
(Number) the time taken for the Oracle query to be responded to in ms.
Emitted when a query is executed using the strong-oracle
module.
data
(Object) the data from the Strong Oracle query:time
(Number) the milliseconds when the Strong Oracle query was made. This can be converted to a Date usingnew Date(data.time)
.query
(String) the query made of the database.duration
(Number) the time taken for the Strong Oracle query to be responded to in ms.
Emitted when a request is made of the application that involves one or more monitored application level events. Request events are disabled by default.
data
(Object) the data from the request:time
(Number) the milliseconds when the request occurred. This can be converted to a Date usingnew Date(data.time)
.type
(String) The type of the request event. This can currently be set to 'HTTP' or 'DB'.name
(String) The name of the request event. This is the request task, eg. the url, or the method being used.request
(Object) the detailed data for the root request event:type
(String) The type of the request event. This can currently be set to 'HTTP' or 'DB'.name
(String) The name of the request event. This is the request task, eg. the url, or the method being used.context
(Object) A map of any additional context information for the request event.stack
(String) An optional stack trace for the event call.children
(Array) An array of child request events that occurred as part of the overall request event. Child request events may include function trace entries, which will have atype
of null.duration
(Number) the time taken for the request to complete in ms.
duration
(Number) the time taken for the overall request to complete in ms.
Emitted when a PostgreSQL query is made to the pg
module.
data
(Object) the data from the PostgreSQL query:time
(Number) the milliseconds when the PostgreSQL query was made. This can be converted to a Date usingnew Date(data.time)
.query
(String) the query made of the PostgreSQL database.duration
(Number) the time taken for the PostgreSQL query to be responded to in ms.
Emitted when an express request finishes its response. Note. appmetrics has only been tested with express 4.x, support is not guaranteed for lower versions.
data
(Object) the data from the Express request/response.method
(String) The HTTP method for this request.url
(String) The target URL for this request.statusCode
(Number) The HTTP status code of the response.duration
(Number) The time in ms between receiving the request and sending the response.
Find below some possible problem scenarios and corresponding diagnostic steps. Updates to troubleshooting information will be made available on the appmetrics wiki: Troubleshooting. If these resources do not help you resolve the issue, you can open an issue on the Node Application Metrics appmetrics issue tracker.
By default, a message similar to the following will be written to console output when Node Application Metrics starts:
[Fri Aug 21 09:36:58 2015] com.ibm.diagnostics.healthcenter.loader INFO: Node Application Metrics 1.0.1-201508210934 (Agent Core 3.0.5.201508210934)
Error "Conflicting appmetrics module was already loaded by node-hc. Try running with node instead." when using node-hc
This error indicates you are using node-hc
to run an application that uses the Node Application Metrics monitoring API (see Modifying your application to use the local installation). Resolve this by using node
to run the application instead. Alternatively, you could remove (or disable temporarily) the use of the Node Application Metrics monitoring API in your application.
This error was added to prevent the scenario where 2 instances of the agent can be accidentally created and started in parallel -- the globally installed one created by node-hc
and the locally installed one created by the require('appmetrics');
call in an application modified to use the Node Application Metrics monitoring API.
This error indicates there was a problem while loading the native part of the module or one of its dependent libraries. On Windows, appmetrics.node
depends on a particular version of the C runtime library and if it cannot be found this error is the likely result.
Check:
- Does the
appmetrics.node
file exist in the indicated location? If not, try reinstalling the module. - For version
1.0.0
on Windows: aremsvcr100.dll
andmsvcp100.dll
installed on your Windows system, and do they match the bitness (32-bit or 64-bit) of your Node.js runtime environment? If not, you may be able to install them with the Visual C++ Redistributable Packages for Visual Studio 2010 package from the Microsoft website. - For version
1.0.1
on Windows: doesmsvcr120.dll
andmsvcp120.dll
exist in the module installation directory (see Installation) and does it match the bitness of your Node.js runtime environment? If not, try reinstalling the module.
Note: On Windows, the global module installation directory might be shared between multiple Node.js runtime environments. This can cause problems with globally installed modules with native components, particularly if some of the Node.js runtime environments are 32-bit and others are 64-bit because the native components will only work with those with matching bitness.
Error "Failed to open library .../libagentcore.so: /usr/lib64/libstdc++.so.6: version `GLIBCXX_3.4.15' not found"
This error indicates there was a problem while loading the native part of the module or one of its dependent libraries. On non-Windows platforms, libagentcore.so
depends on a particular (minimum) version of the C runtime library and if it cannot be found this error is the result.
Check:
- Your system has the required version of
libstdc++
installed. You may need to install or update a package in your package manager. If your OS does not supply a package at this version, you may have to install standalone software - consult the documentation or support forums for your OS. - If you have an appropriate version of
libstdc++
installed, ensure it is on the system library path, or use a method (such as settingLD_LIBRARY_PATH
environment variable on Linux, or LIBPATH environment variable on AIX) to add the library to the search path.
Method profiling data is not collected by default, check Configuring Node Application Metrics for information on how to enable it.
If collection is enabled, an absence of method profiling data from a Node.js application could be caused by the type of tasks that are being run by your application -- it may be running long, synchronous tasks that prevent collection events from being scheduled on the event loop.
If a task uses the Node.js thread exclusively then shuts down the Node.js runtime environment, the Health Center agent may not get the opportunity to obtain any profiling data. An example of such an application is the Octane JavaScript benchmark suite, which loads the CPU continuously rather than dividing the load across multiple units of work.
The source code for Node Application Metrics is available in the appmetrics project. Information on working with the source code -- installing from source, developing, contributing -- is available on the appmetrics wiki.
This project is released under an Apache 2.0 open source license.
The npm package for this project uses a semver-parsable X.0.Z version number for releases, where X is incremented for breaking changes to the public API described in this document and Z is incremented for bug fixes and for non-breaking changes to the public API that provide new function.
Non-release versions of this project (for example on github.com/RuntimeTools/appmetrics) will use semver-parsable X.0.Z-dev.B version numbers, where X.0.Z is the last release with Z incremented and B is an integer. For further information on the development process go to the appmetrics wiki: Developing.
2.0.0 development
2.0.0
- Remove support for Node.js 0.10, 0.12, 5. Add heapdump api call
1.2.0
- Add file data collection capability and option configuration via api.
1.1.2
- Update agent core to 3.0.10, support Node.js v7.
1.1.1
- Fix node-gyp rebuild failure and don't force MQTT broker to on
1.1.0
- Bug fixes, improved MongoDB data, updated dependencies, CPU watchdog feature
1.0.13
- Express probe, strong-supervisor integration
1.0.12
- Appmetrics now fully open sourced under Apache 2.0 license
1.0.11
- Bug fixes
1.0.10
- Bug fixes
1.0.9
- Loopback and Riak support, bug fixes and update to agent core 3.0.9.
1.0.8
- Oracle support, bug fixes and api tests runnable using 'npm test'.
1.0.7
- StrongOracle support, support for installing with a proxy, expose MongoDB, MQLight and MySQL events to connectors.
1.0.6
- OracleDB support and bug fixes.
1.0.5
- Expose HTTP events to connectors (including MQTT).
1.0.4
- Redis, Leveldown, Postgresql, Memcached, MQLight and MQTT support, higher precision timings, and improved performance.
1.0.3
- Node.js v4 support.
1.0.2
- HTTP, MySQL, MongoDB, request tracking and function tracing support.
1.0.1
- Mac OS X support, io.js v2 support.
1.0.0
- First release.