nodejs/diagnostics

[async_hooks] stable API - tracking issue

AndreasMadsen opened this issue · 32 comments

I posted a requirements list for getting async_hooks to stable a while ago (nodejs/node#14717 (comment)). This is a very similar list.

Internal Technical

Internal Non-Technical

  • Specify whether a triggerAsyncId change will be semver-major, semver-minor, or semver-patch.
  • Identify the "tolerance level" of the perf impact of async_hooks, especially regarding promises.
  • Is Async Hooks exposing the right logical model? (issue: #143)

External Non-Technical

After Stable

/cc @nodejs/async_hooks @nodejs/diagnostics

/cc @jasnell who sometimes asks me about this

@AndreasMadsen how about adding some external criteria, like the ones TC39 or N-API use (nodejs/node#14532)?

  1. a major APM vendor that uses async_hooks (N/Solid)
  2. npm modules with at least 5000 downloads/month based on async_hooks (https://www.npmjs.com/package/trace & https://www.npmjs.com/package/cls-hooked)

@refack added your 1. and 2. to the list. I'm not sure what you mean by "TC39 uses". Regarding N-API, I don't see why that should be a requirement: async_hooks has its own native embedder API outside N-API, so we can mark that as stable without N-API. In fact, N-API shouldn't be marked stable before the async_hooks native embedder API is, since N-API depends on it. But async_hooks doesn't depend on N-API.

Thanks. I just referenced TC39 and N-API as processes that use exit criteria that are independent of the development process, but take into account ecosystem adoption. So I agree that stability of N-API and async_hooks are independent (except for the embedder API)

FWIW, we plan to start using Async Hooks in the Sqreen Agent in the upcoming weeks.

kjin commented

PR to add async_hooks in Stackdriver Trace: googleapis/cloud-trace-nodejs#538

Updated "Deprecate setTriggerId"

Added:

Based on the discussion in the benchmarking WG meeting yesterday, I set up test cases to run the promise-heavy Bluebird and Wikipedia benchmarks (used by the V8 project) with and without async_hooks to answer the first question in this thread. The slowdown is pretty significant even with just an empty init hook (sketched below).
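For reference, "an empty init hook" here means roughly the following, a minimal sketch rather than the exact benchmark harness:

const async_hooks = require('async_hooks');

// Register a hook with only an empty init callback; enabling it is enough
// to turn on the per-async-resource bookkeeping whose cost we measure.
const hook = async_hooks.createHook({
  init(asyncId, type, triggerAsyncId, resource) {
    // intentionally empty
  }
});
hook.enable();

// ...then run the Bluebird / Wikipedia promise benchmarks as usual...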

@mhdawson suggested kicking off some discussion on the performance via nodejs/benchmarking#188 and bubbling the issue up to make sure we consider the performance aspect before async_hooks comes out of EXPERIMENTAL.

FYI, I added an item to the list above to identify the perf impact we're willing to tolerate from async_hooks. Current benchmark data cited in nodejs/benchmarking#181 shows a ~2x-3x slowdown.

bnb commented

Any thoughts on whether this will come out of Experimental before v10 launches? Looking at the remaining items, I can't quite tell if there are still significant barriers, because I'm not familiar enough with the subject 🤔

We haven't run into any blockers in the New Relic agent.

The primary concern we've come up against is that extending the lifetime of a promise leads to drastically increased memory usage. This only seems to be an issue with immediately resolved promises (e.g. Promise.resolve()), since those don't emit an after event and have to be cleared on destroy. It only causes a problem when there is an existing promise leak, and it just requires us to be more diligent about finding and eliminating leaks in the libraries we interact with. I don't think this issue affects the shape of the API, so it's definitely not a blocker.
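To illustrate the pattern (a hypothetical sketch, not our actual agent code): context is keyed by asyncId and normally released in the after hook, but an immediately resolved promise may only ever reach destroy, which fires when it is garbage collected, so its entry lives that much longer.

const async_hooks = require('async_hooks');

// Hypothetical sketch of the cleanup pattern described above.
const contexts = new Map();

async_hooks.createHook({
  init(asyncId, type, triggerAsyncId) {
    if (type === 'PROMISE') {
      contexts.set(asyncId, { triggerAsyncId });
    }
  },
  after(asyncId) {
    // Normal cleanup path; never fires for a promise with no continuation,
    // e.g. a bare Promise.resolve().
    contexts.delete(asyncId);
  },
  destroy(asyncId) {
    // Fallback cleanup; runs only once the promise is garbage collected,
    // which is what extends the effective lifetime of the entry.
    contexts.delete(asyncId);
  }
}).enable();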

We currently only instrument core promises using async hooks, though the data exposed through the hooks should be enough to move over the rest of the core instrumentation when the feature is fully released.

@lykkin do you have any data on real-world performance impact async-hooks adds to applications? Performance seems to be the main blocker at this point, and it would be good to learn what kind of impact you observe.

Unfortunately, it's really difficult for us to get exact numbers from users for the overhead caused by each of our components. As a proxy, we have a couple of benchmarks we run. (Note: all tests were run against 9.11.1.)

One holistic benchmark we run uses acmeair, where we saturate the app for ~30 seconds to measure the throughput delta our agent incurs. Running this test with a set of no-op hooks (i.e. all hooks are defined but don't do anything; see the sketch below) shows a small but non-negligible amount of overhead: 1854 requests/sec -> 1750 requests/sec.
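A set of no-op hooks amounts to something like this (illustrative only, not our agent code):

const async_hooks = require('async_hooks');

// Every callback is defined but does nothing, so any overhead measured is
// purely the cost of having the hooks enabled.
async_hooks.createHook({
  init() {},
  before() {},
  after() {},
  destroy() {},
  promiseResolve() {}
}).enable();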

One thing to note is that the above application doesn't use many promises, so to evaluate the overhead of our monkey-patched promise instrumentation against the async hooks instrumentation, we built a few promise-based microbenchmarks. Running these tests with the no-op hooks, we observe the following baseline overhead:

Without hooks:

forkedTest x 7,197 ops/sec ±0.82% (74 runs sampled)
longTest x 14,377 ops/sec ±0.50% (74 runs sampled)
longTestWithCatches x 14,481 ops/sec ±0.54% (75 runs sampled)
longThrowToEnd x 16,735 ops/sec ±0.68% (77 runs sampled)
promiseConstructor x 28,361 ops/sec ±1.39% (74 runs sampled)
promiseReturningPromise x 6,755 ops/sec ±0.89% (74 runs sampled)
thenReturningPromise x 2,034 ops/sec ±0.63% (76 runs sampled)
promiseConstructorThrow x 344 ops/sec ±3.37% (62 runs sampled)

With no-op hooks:

forkedTest x 1,189 ops/sec ±0.91% (74 runs sampled)
longTest x 2,378 ops/sec ±1.00% (71 runs sampled)
longTestWithCatches x 2,387 ops/sec ±1.11% (71 runs sampled)
longThrowToEnd x 2,424 ops/sec ±0.88% (70 runs sampled)
promiseConstructor x 4,159 ops/sec ±1.11% (61 runs sampled)
promiseReturningPromise x 1,173 ops/sec ±1.98% (74 runs sampled)
thenReturningPromise x 779 ops/sec ±2.08% (70 runs sampled)
promiseConstructorThrow x 315 ops/sec ±3.16% (61 runs sampled)

These microbenchmarks show drastically degraded performance around promise usage. That said, for our use case it still ends up being more performant than our monkey-patched version of promise instrumentation. In general, instrumenting native promises is going to be relatively heavy, since the promise operations themselves are computationally light and any per-promise hook work is proportionally large. I don't think this should be a blocker, but it's a definite pain point.

Hi!

First of all, I want to say many thanks for making all of this happen!
It's a really nice API, and it works well enough to build some really cool stuff!

I'm sorry to ask here, but I've run into an issue that I unfortunately can't solve by myself. Although everything with async contexts seems to be working well, there is a problem with sync execution instead.

More description here: Issue #249

@ofrobots are you still able to champion this?

Unfortunately, no. It would be great if someone else has the drive to move async_hooks forward.

Should this remain open? [I am trying to chase dormant issues to closure.]

Is anyone still working on this?

Since async_hooks is still unstable, I would keep this open until it's considered stable.

Hey there! 👋 I work for AppSignal and we're currently working on a Node.js integration. While I haven't used it in our integration just yet, async_hooks is something I personally would like to see move forward. How can we best help with the effort to progress this?

@xadamy - thanks. One thing I would suggest is joining our bi-weekly diagnostics working group meeting and giving a briefing on goals, engagement model, your availability, etc. We will have one the Wednesday after next. Makes sense?

@gireeshpunathil Sounds good, I'll make sure I'm there next Wednesday! Thank you!

@gireeshpunathil Apologies, did you mean today or next Wednesday for this?

Hey @xadamy, sorry for jumping in here. The next Diagnostics WG meeting is currently planned for Wednesday, 20 Nov, 19:30 - 20:30 CEST :)

This issue is stale because it has been open many days with no activity. It will be closed soon unless the stale label is removed or a comment is made.

I figure this shouldn't go stale since it's a long-lived tracking issue?

Hey team, sorry it has taken me so long to follow up. Last time we spoke, I believe you asked me to experiment with the async_hooks API more. We now have an async_hooks implementation in production in our Node.js integration, although it is not much different (if at all) from Cloud Trace's implementation.

I have some thoughts and feedback on the API that it would be great to sync with you all on at some point (if you have time), but it would be good to know if there's anything further you'd like me to do to move this forward?

@xadamy maybe you can plan to come to the next diagnostics meeting and give the team a run-through of your thoughts.

Closing in favor of #437.