NewsLab
Jun 28 15:48 UTC

DLL that was not present in memory despite not being formally unloaded (devblogs.microsoft.com)

75 points|by ibobev||30 comments|Read full story on devblogs.microsoft.com

Comments (30)

30 shown
  1. 1. defrost||context
    That's some doggedly determined back tracing to uncover an unexpected heisenbug (loose meaning).

      So a total of 46% of the crashes were due to this rogue force-unload of a DLL. This is a case of bucket spray, where a single underlying cause generates a large number of different types of crashes.
  2. 2. chrisjj||context
    We've not yet seen sufficient evidence this is any type of heisenbug.
  3. 3. brookst||context
    Looking more closely would resolve it one way or the other.
  4. 4. defrost||context
    My hat.
  5. 5. defrost||context
    It's not, by the article, in a strict taxonomy.

    In a wider sloppier sense some use the term for bugs that are hard to pin down and exhibit wide behaviours.

  6. 6. zabzonk||context
    > The good news for the shell32 team is that they are off the hook; they are the victim. The bad news is that we don’t know who the culprit is.

    The story of software development through the ages.

  7. 7. brookst||context
    When you’ve eliminated all possible explanation, it’s time to pack it in.
  8. 8. taneq||context
    Oh man, my journey from idealistic “there is always an explanation” youth to “some days it do be like that, and we may never know why” in a nutshell.
  9. 9. zabzonk||context
    Or, as the original article suggests, blame someone else.
  10. 10. masfuerte||context
  11. 11. Someone1234||context
    Part 1 was interesting; it isn't clear why he split that into a Part 2 since it adds little to the story and is a paragraph long.
  12. 12. taneq||context
    Might have been an “I need to look into this” segueing into “ never mind”?
  13. 13. londons_explore||context
    I assume the fact it is a third party application means debugging gets harder, and the business case for doing so is weaker/none.

    But I would hope that some kind of reverse debugger triggered on one of these crashes would make it pretty simple to say "who wrote this 01".

  14. 14. microgpt||context
    You could also look at modules loaded into all of those processes that crashed this way.
  15. 15. kumarvvr||context
    I see posts like this, this deep dive into the call stacks and am always humbled and reminded of the limits of my knowledge about computers and programs.
  16. 16. Panzer04||context
    Not a programmer?
  17. 17. kumarvvr||context
    I am, for 20 years now. I do embedded stuff too. Still.
  18. 18. Panzer04||context
    I'm a bit surprised you don't run into things like this then :). Do you use GDB and the like at all?

    Or do you mean all the windows specific stuff etc, I guess I was more imaging the call stack etc.

    No insult was intended XD

  19. 19. kumarvvr||context
    I have done everything from desktop apps to web apps and a bunch in between. Regular debugging is good enough for me. Never had the need to go down into call stack level.

    Even with embedded programming, regular C debugger has always been enough.

  20. 20. FartyMcFarter||context
    As someone who has debugged his fair share of tricky low-level issues, the parts that I find impressive in his blog posts are things such as "then we look at the bytes in memory and oh yeah, this looks like an exception record". I would usually not think to do that (or be able to recognise it as easily as I presume he did).
  21. 21. dist-epoch||context
    Goes both ways, author probably knows little about FPGA programming, React or PyTorch.
  22. 22. rwmj||context
    What MSFT support policy do you need to have the legendary Raymond Chen take a look at it?

    I say this because we've reported a bunch of Windows bugs (mainly running Windows under virtualization) and getting them to pay attention at all is an up-hill battle.

  23. 23. hackyhacky||context
    > What MSFT support policy do you need to have the legendary Raymond Chen take a look at it?

    If you have to ask, you can't afford it.

  24. 24. nopurpose||context
    How big and important third-party vendor must be for Raymond Chen to dissect its coredumps?
  25. 25. FartyMcFarter||context
    Given his seniority, it could also be that he picks whatever bugs he wants to work on. Whether that is from personal interest, frequency of crashes or any other criteria.

    When you're at that level in a company, it's rare that someone would be micromanaging what you work on at all times.

  26. 26. 1970-01-01||context
    >I asked for the 100 most recent crashes in that third party program and put them into a pivot table so I could see the distribution.

    Always wondered if crash reporting is some kind of shady business. It's good to know it does, at minimum, do what it promises and give valuable crash data to MS.

  27. 27. hackrmn||context
    The fact that Raymond Chen is debugging these kind of issues, tells me Microsoft is short on staff that has his particular set of skills, handing him the hairiest issues from the annals of Windows. The new hires are probably all about .NET and JavaScript and what have you -- whatever Microsoft is about these days. I doubt it's C/C++. Chen is probably on standby and is paid handsomely as a de-facto VIP consultant. He is a legend, but he's becoming somewhat of a vintage developer.
  28. 28. IChooseY0u||context
    Windows COM is super weird and way over engineered.
  29. 29. rpeden||context
    I actually think COM is an amazing bit of engineering considering its intended use case.

    It still feels like a much more advanced way of sharing compiled libraries between different languages than the current default of "export a C ABI and communicate across the barrier via primitive sticks and stones."

    COM isn't perfect but I still find it impressive especially since COM/OLE are 40 years old at this point.

  30. 30. microgpt||context
    It basically is that. It's a standardized sticks and stones. Plus objects for some reason. But I don't think the objects are a bad thing - it allows multiple implementations of sometimes to co-exist - consider using two different GPUs from different vendors at the same time. It took a really long time to make OpenGL support that, but DirectX could always do it (at least at the API level) by just creating two different ID3DDevice objects backed by different code from different DLLs both loaded at the same time.