CONTRIBUTING: how to build a package #330

Merged Apr 9, 2020 (20 commits)
CONTRIBUTING.md

This page is intended for contributors to the registry and packages.

### Package

A package contains the dashboards, visualisations, and configurations to monitor the logs and metrics of a particular technology or group of related services, such as “MySQL” or “System”.

The package consists of:

* Name
* Zero or more dashboards, visualisations, and Canvas workpads
* Zero or more ML job definitions
* Zero or more dataset templates

The package is versioned.

### Integration

An integration is a specific type of _package_ that defines datasets used to observe the same product (logs and metrics).

### Dataset Template

A dataset template is part of a package and contains all the assets needed to create a dataset. Examples of assets are: an ingest pipeline, an agent config template, an ES index template, ...

Dataset templates are inside the package directory under `dataset`.

The dataset template consists of:

* An alias template (or the `fields.yml` to create it)
* Zero or more ingest pipelines
* An Elastic Agent config template
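
For orientation, the on-disk layout of a single dataset template might look like the sketch below. Only the `dataset/<dataset-name>/manifest.yml` path is taken from this guide; the other file and directory names are assumptions based on the reference packages:

```
dataset/
  access/                 # one directory per dataset
    manifest.yml          # dataset manifest (title, vars, streams, ...)
    fields/
      fields.yml          # field definitions used to create the alias template
    elasticsearch/
      ingest-pipeline/    # zero or more ingest pipelines
    agent/
      stream/             # Elastic Agent config template(s)
```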

### Migration from Beats

A defined import procedure used to transform Filebeat and Metricbeat modules related to the same observed product into a single integration. The integration contains the extracted dataset configuration of the Beats modules, hence the original modules no longer need to exist.

## Package structure

### Elements

Link: https://github.com/elastic/package-registry/blob/master/ASSETS.md

### Reference packages

The following packages can be considered as reference points for all integrations.

#### Integration: reference

Link: https://github.com/elastic/package-registry/tree/master/dev/packages/example/reference-1.0.0

The directory contains mandatory manifest files defining the integration and its datasets. All manifest fields are annotated with comments to better explain their purpose.

_Keep in mind that this package doesn't contain all file resources (images, screenshots, icons) referenced in manifests. Let's assume that they're also there._

#### Integration: mysql

Link: https://github.com/mtojek/package-registry/tree/package-mysql-0.0.2/dev/packages/alpha/mysql-0.0.2

The MySQL integration was the first integration built using the [import-beats](https://github.com/elastic/package-registry/tree/master/dev/import-beats) script.
The script imported filesets and metricsets from both MySQL modules, and converted them to a package.

The MySQL integration contains all parts that should be present (or are required) in the integration package.

After running the _import-beats_ script, the integration was manually adjusted and extended with dedicated docs.

## Create a new integration

This section describes the steps required to build a new integration. If you plan to prepare an integration for a product not supported by [Beats](https://github.com/elastic/beats), feel free to skip the section about importing existing modules.

### Import from existing modules

The import procedure relies heavily on the _import-beats_ script. If you are interested in how it works internally, feel free to review the script's [README](https://github.com/elastic/package-registry/blob/master/dev/import-beats/README.md).

1. Focus on the particular product (e.g. MySQL, ActiveMQ) you would like to integrate with.
2. Prepare the developer environment:
1. Clone/refresh the following repositories:
* https://github.com/elastic/beats
* https://github.com/elastic/ecs
* https://github.com/elastic/eui
* https://github.com/elastic/kibana

Make sure you don't have any manual changes applied, as they will be reflected in the integration.
2. Clone/refresh the Elastic Package Registry (EPR) to always use the latest version of the script:
* https://github.com/elastic/package-registry
3. Make sure you have the `mage` tool installed (one possible one-liner, assuming a Go environment: `go get -u github.com/magefile/mage`).
3. Boot up required dependencies:
1. Elasticsearch instance:
* Kibana's dependency
2. Kibana instance:
* used to migrate dashboards; if not available, you can skip the generation (`SKIP_KIBANA=true`)

_Hint_: There is a dockerized environment in Beats (`cd testing/environments`). Boot it up with the following command:
`docker-compose -f snapshot.yml -f local.yml up --force-recreate elasticsearch kibana`.
4. Create a new branch for the integration in the EPR project (diverge from master).
5. Run the command `mage ImportBeats` to start the import process.

The outcome of running the `import-beats` script is a directory with refreshed and updated integrations.

It will take a while to finish, but the console output is updated frequently so you can track the progress.
The command must end with exit code 0. Please open an issue if it doesn't.

Generated packages are stored by default in the `dev/packages/beats` directory. Generally, the import process
updates all of the integrations, so don't be surprised if you notice updates to multiple integrations, including
the one you're currently working on (e.g. `dev/packages/beats/foobarbaz-0.0.1`). You can either commit these changes
or leave them for later.

6. Copy the package output for your integration (e.g. `dev/packages/beats/foobarbaz-0.0.1`) to the _alpha_ directory and
raise the version manually: `dev/packages/alpha/foobarbaz-0.0.2` (see the sketch after this step).
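
For example, using the hypothetical `foobarbaz` package from above, the copy and version bump could look like this:

```bash
$ cp -r dev/packages/beats/foobarbaz-0.0.1 dev/packages/alpha/foobarbaz-0.0.2
# Then edit dev/packages/alpha/foobarbaz-0.0.2/manifest.yml and bump the version to 0.0.2.
```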

### Fine-tune the integration

#### Motivation

Most of the migration work has been done by the `import-beats` script, but there are tasks that require the developer's
interaction.

It may happen that your integration is missing a screenshot or an icon; this is a good moment to add the missing resources to
the Beats/Kibana repositories and re-import the integration (the import process is idempotent).

#### Checklist

The order of action items on the checklist is advised, to prevent the contributor from repeating some actions (fixing
what's already been fixed, as the script has overridden part of it).

1. Add an icon if missing.

The integration icons are presented in different places in Kibana, hence it's better to define custom icons to make
the UI easier to navigate.

As the `import-beats` script looks for icons in the Kibana and EUI repositories, add an icon to the former the same
way as for tutorial resources (Kibana directory: `src/legacy/core_plugins/kibana/public/home/tutorial_resources/logos/`); see the sketch below.
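
For example, copying a hypothetical `foobarbaz` icon into the Kibana repository might look like this (`$KIBANA_HOME` and the file name are assumptions):

```bash
$ cp foobarbaz_logo.svg $KIBANA_HOME/src/legacy/core_plugins/kibana/public/home/tutorial_resources/logos/
```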

2. Add a screenshot if missing.

The Kibana Integration Manager shows screenshots related to the integration. Screenshots present Kibana
dashboards visualizing the metric/log data.

The `import-beats` script finds references to screenshots mentioned in `_meta/docs.asciidoc` and copies image files
from the Beats directories:
* `metricbeat/docs/images`
* `filebeat/docs/images`

3. Write a README template file for the integration.

The README template is used to render the final README file, including exported fields. The template should be placed
in `dev/beats/import-beats-resources/docs/<integration-name>/docs/README.md`.

Review the MySQL docs template to see how to use template functions (e.g. `{{fields "dataset-name"}}`); a minimal sketch follows below.
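
As a sketch only, a minimal template might look like the following; the integration title and the dataset name `status` are assumed for illustration:

```
# MySQL Integration

This integration collects logs and metrics from MySQL servers.

## Metrics

### status

{{fields "status"}}
```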

4. Review the fields files and exported fields in the docs.

The fields files (`package-fields.yml`, `fields.yml` and `ecs.yml`) in the package were created from the original
`fields.yml` files and the ECS schema. It may happen that the original sources have a typo, a bad description, or miss
a field definition. The goal of this action item is to verify that the produced artifacts are correct. Note that the sum
of all fields files for a dataset should contain only the fields that are really used by the dataset - not, for example,
every ECS field that exists. A sketch of a single field definition follows below.
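
For illustration, a single field definition in one of these files generally looks like this (the field name and description are assumptions, not taken from a real module):

```yaml
- name: mysql.status.threads.running
  type: long
  description: The number of threads that are not sleeping.
```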

5. Metricbeat: add missing configuration options.

The `import-beats` script extracts configuration options from the Metricbeat module's `_meta` directory. It analyzes
the configuration files and selects options based on enabled metricsets (not commented out). If you notice that some
configuration options are missing in your package's manifest files, simply create the `config.epr.yml` file with all
required options, as sketched below.

Sample PR: https://github.com/elastic/beats/pull/17323
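
A `config.epr.yml` presumably mirrors the module's regular configuration file; a sketch with assumed option names:

```yaml
- module: foobarbaz
  metricsets: ["status"]
  period: 10s
  hosts: ["localhost:12345"]
  # Options missing from the generated manifests go here:
  username: admin
  password: changeme
```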

6. Review _titles_ and _descriptions_ in manifest files.

Titles and descriptions are fields visualized in the Kibana UI. Most users will use them to see how to configure
the integration with their installation of a product, or how to use advanced configuration options.

7. Compact configuration options (vars).

Currently, all configuration options are set by the `import-beats` script on the stream level
(path: `dataset/<dataset-name>/manifest.yml`).

It may happen that some options in different datasets are simply duplicates, or concern the same setting whose value
will always be equal (e.g. MySQL username, password). Keep in mind that two datasets may have the same configuration
option but different values (e.g. `period`, `paths`), hence they can't be compacted.

To sum up, compacting saves the user from having to set up the same configuration option several times (once
per dataset). See the sketch after this paragraph.
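
A sketch of the idea (the variable name and target location are assumptions):

```yaml
# Before: the same variable repeated in every dataset manifest
# (e.g. dataset/status/manifest.yml, dataset/galera_status/manifest.yml)
vars:
  - name: username
    type: text
    default: root
---
# After: defined once at a higher level (e.g. the package manifest)
# and removed from the individual dataset manifests
vars:
  - name: username
    type: text
    default: root
```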

8. Define all variable properties.

The variable properties customize visualization of configuration options in the Kibana UI. Make sure they're
defined in all manifest files.

```yaml
vars:
  - name: paths
    required: true
    show_user: true
    title: Access log paths
    description: Paths to the nginx access log file.
    type: text
    multi: true
    default:
      - /var/log/nginx/access.log*
```

**required** - option is required

**show_user** - don't hide the configuration option (collapsed menu)

**title** - human-readable variable name

**description** - variable description (may contain some details)

**type** - field type (according to the reference: text, password, bool, integer)

**multi** - the field has multiple values.

9. Update the docs template with sample events.

The events collected by the agent differ slightly from the original Metricbeat and Filebeat ones. Adjust the event
content manually based on already migrated integrations (e.g. the [MySQL integration](https://github.com/elastic/package-registry/tree/master/dev/import-beats-resources/mysql/docs)),
or copy the events once you manage to run the whole setup with a real agent. A hypothetical trimmed event is sketched below.
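
A trimmed sample event for the docs might look like this (all field names and values are invented for illustration):

```json
{
  "@timestamp": "2020-04-09T10:21:23.001Z",
  "dataset": {
    "name": "foobarbaz.status"
  },
  "foobarbaz": {
    "status": {
      "uptime": 4563
    }
  }
}
```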

## Testing and validation

### Run the whole setup

1. Build the Docker image with the EPR and run it:

```bash
$ docker build --rm -t integrations_registry:latest .
$ docker run -i -t -p 8080:8080 $(docker images -q integrations_registry:latest)
```

2. Verify that your integration is available (in the right version), e.g. MySQL: http://localhost:8080/search?package=mysql

```json
[
  {
    "description": "MySQL Integration",
    "download": "/epr/mysql/mysql-0.0.1.tar.gz",
    "icons": [
      {
        "src": "/package/mysql/0.0.1/img/logo_mysql.svg",
        "title": "logo mysql",
        "size": "32x32",
        "type": "image/svg+xml"
      }
    ],
    "name": "mysql",
    "path": "/package/mysql/0.0.1",
    "title": "MySQL",
    "type": "integration",
    "version": "0.0.1"
  }
]
```

3. Start an Elasticsearch instance (the `yarn es` command assumes you're in the Kibana source directory):

```bash
$ yarn es snapshot -E xpack.security.authc.api_key.enabled=true
```

4. Start Kibana with the Ingest Manager enabled (on a fresh checkout, remember to run `yarn kbn bootstrap` first):

```bash
$ yarn start --xpack.ingestManager.enabled=true --xpack.ingestManager.epm.enabled=true --xpack.ingestManager.fleet.enabled=true --xpack.ingestManager.epm.registryUrl=http://localhost:8080/
```

5. Build the agent code:
```bash
$ cd $GOPATH/src/github.com/elastic/beats/x-pack/elastic-agent
$ PLATFORMS=darwin mage package
```

Unpack the distribution you'd like to use (e.g. tar.gz):
```bash
$ cd build/distributions/
$ tar xzf elastic-agent-8.0.0-darwin-x86_64.tar.gz
$ cd elastic-agent-8.0.0-darwin-x86_64/
```

6. Enroll the agent and start it:

Use the "Enroll new agent" option in the Kibana UI and run a similar command:

```bash
$ ./elastic-agent enroll http://localhost:5601/rel cFhNVlZIRUIxYjhmbFhqNTBoS2o6OUhMWkF4SFJRZmFNZTh3QmtvR1cxZw==
$ ./elastic-agent run
```

The `elastic-agent` will start two other processes - `metricbeat` and `filebeat`.

7. Run the product you're integrating with (e.g. a Docker image with MySQL).

8. Install the package.

Click through the configuration in the Kibana UI, deploy it, and wait for the agent to pick up the updated configuration.

9. Navigate in the Kibana UI to the freshly installed dashboards and verify the metrics/logs flow.
`dev/import-beats/README.md`:

https://github.com/elastic/beats
https://github.com/elastic/ecs
https://github.com/elastic/eui
https://github.com/elastic/kibana
2. Make sure you have the `mage` tool installed.
3. Start the Kibana server (make sure the endpoint is accessible: http://localhost:5601/).
4. Run the import procedure with the following command:

```bash
$ mage ImportBeats
```

## How does the import procedure work

TODO

## Troubleshooting

*Importing process takes too long.*