Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[@azure/monitor-opentelemetry] - Requests with status code 304 are shown as failed in application insights #29398

Closed
3 tasks
despgiat opened this issue Apr 23, 2024 · 6 comments · Fixed by #29957
Assignees
Labels
Client This issue points to a problem in the data-plane of the library. customer-reported Issues that are reported by GitHub users external to the Azure organization. Monitor - Exporter Monitor OpenTelemetry Exporter needs-team-attention Workflow: This issue needs attention from Azure service team or SDK team question The issue doesn't require a change to the product in order to be resolved. Most issues start as that Service Attention Workflow: This issue is responsible by Azure service team.

Comments

@despgiat
Copy link

  • Package Name: @azure/monitor-opentelemetry
  • Package Version: 1.4.0
  • Operating system:
  • nodejs
    • version: 18.16.0
  • browser
    • name/version: macOS
  • typescript
    • version: 5.4.5

Describe the bug
Currently some requests are treated and shown as errors, e.g. 304 which is a redirect / cache-hit. The majority of the errors that we get are 304 requests. Moreover, we don't get any information about these requests in application insights (in the first screenshot below you can see how the the dashboard looks like when the 304 is clicked to reveal more information about the errors). Also, these requests are not shown as errors in transaction search, and when they are filtered (ex. by response code), they do not show in the failed requests graph anymore.

To Reproduce
Our setup is similar to this example:

import { AzureMonitorOpenTelemetryOptions, useAzureMonitor } from '@azure/monitor-opentelemetry';
import { metrics, trace } from '@opentelemetry/api';
import { registerInstrumentations } from '@opentelemetry/instrumentation';
import { ExpressInstrumentation } from '@opentelemetry/instrumentation-express';
import { HttpInstrumentationConfig } from '@opentelemetry/instrumentation-http';
import { MongoDBInstrumentationConfig } from '@opentelemetry/instrumentation-mongodb';
import { Resource } from '@opentelemetry/resources';
import { SEMRESATTRS_SERVICE_NAME } from '@opentelemetry/semantic-conventions';
import { IncomingMessage } from 'http';
import { env } from '../env';

const httpInstrumentationConfig: HttpInstrumentationConfig = {
  enabled: true,
  ignoreIncomingRequestHook: (request: IncomingMessage) => {
    // Ignore OPTIONS incoming requests
    if (request.method === 'OPTIONS') {
      return true;
    }
    return false;
  },
};

const mongoInstrumentationConfig: MongoDBInstrumentationConfig = {
  enabled: true,
  enhancedDatabaseReporting: true,
};

export const initializeTelemetry = () => {
  const customResource = Resource.EMPTY;
  customResource.attributes[SEMRESATTRS_SERVICE_NAME] = 'Backend';

  const options: AzureMonitorOpenTelemetryOptions = {
    resource: customResource,
    azureMonitorExporterOptions: {
      connectionString: env.APPLICATION_INSIGHTS_CONNECTION_STRING,
    },
    instrumentationOptions: {
      http: httpInstrumentationConfig,
      mongoDb: mongoInstrumentationConfig,
    },
  };

  useAzureMonitor(options);
  addOpenTelemetryInstrumentation();
};

/**
 * Add additional OpenTelemetry instrumentation that is not bundled with Azure OpenTelemetry Distro
 */
const addOpenTelemetryInstrumentation = () => {
  const instrumentations = [new ExpressInstrumentation()];
  registerInstrumentations({
    tracerProvider: trace.getTracerProvider(),
    meterProvider: metrics.getMeterProvider(),
    instrumentations: instrumentations,
  });
};

Expected behavior
The requests with status code 304 are not shown as errors in application insights.

Screenshots
Screenshot 2024-04-23 at 11 11 26 AM

Screenshot 2024-04-23 at 11 38 07 AM

Screenshot 2024-04-23 at 11 38 57 AM

Additional context
We played around a bit and tried to set custom attributes to the http requests using a SpanEnrichingProcessor but it seems that no custom attributes could be added to these failed requests.

@github-actions github-actions bot added Client This issue points to a problem in the data-plane of the library. customer-reported Issues that are reported by GitHub users external to the Azure organization. Monitor - Exporter Monitor OpenTelemetry Exporter needs-team-attention Workflow: This issue needs attention from Azure service team or SDK team question The issue doesn't require a change to the product in order to be resolved. Most issues start as that Service Attention Workflow: This issue is responsible by Azure service team. labels Apr 23, 2024
Copy link

Thanks for the feedback! We are routing this to the appropriate team for follow-up. cc @hectorhdzg @JacksonWeber @ramthi.

@JacksonWeber JacksonWeber self-assigned this Apr 24, 2024
@JacksonWeber
Copy link
Member

@despgiat I'm not able to reproduce this situation of 304 status codes being shown as errors. Requests marked as response code 304 are being shown as I expect in the transaction blade when using your example code with an express server that returns a 304 response.

Can you provide more information on where these 304 response codes are being returned from or more details for reproducing your scenario? Thanks!

@p-hlp
Copy link

p-hlp commented May 4, 2024

@JacksonWeber Guess I can chime in here. You can use this repo for repro. It was initially used for a different problem with application insights js sdk/azure otel distro, but has the same issue with 304's showing up under failures. You can find detailed steps for reproducing the scenario in the readme. For the setup - personally I used two ai resources (web client/api) in same workspace, however a shared one also works.

@JacksonWeber
Copy link
Member

@p-hlp Thank you for doing this work to repro. I'll recreate this scenario and work on debug as soon as I can.

@p-hlp
Copy link

p-hlp commented Jun 6, 2024

@JacksonWeber Are there any updates by chance? It's a bit of an annoying problem. Otherwise anything I could do on my side to support with getting this going?

@JacksonWeber
Copy link
Member

@p-hlp Apologies I haven't had much time to look into this earlier. Investigating this afternoon and I'll provide an update ASAP.

JacksonWeber added a commit that referenced this issue Jun 13, 2024
…ard Metrics (#29957)

### Packages impacted by this PR
@azure/monitor-opentelemetry

### Issues associated with this PR
Fixes #29398

### Describe the problem that is addressed by this PR
Success/failure on standard metrics collection for dependencies and
requests should follow the spec:
< 400 is success

### What are the possible designs available to address the problem? If
there are more than one possible design, why was the one in this PR
chosen?


### Are there test cases added in this PR? _(If not, why?)_
Updated test cases that already use the dividing line between
success/failure (status code 400)

### Checklists
- [x] Added impacted package name to the issue description
- [ ] Does this PR needs any fixes in the SDK Generator?** _(If so,
create an Issue in the
[Autorest/typescript](https://github.com/Azure/autorest.typescript)
repository and link it here)_
- [x] Added a changelog (if necessary)
@github-actions github-actions bot locked and limited conversation to collaborators Sep 11, 2024
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Client This issue points to a problem in the data-plane of the library. customer-reported Issues that are reported by GitHub users external to the Azure organization. Monitor - Exporter Monitor OpenTelemetry Exporter needs-team-attention Workflow: This issue needs attention from Azure service team or SDK team question The issue doesn't require a change to the product in order to be resolved. Most issues start as that Service Attention Workflow: This issue is responsible by Azure service team.
Projects
None yet
3 participants