Towards better troubleshooting

From FlightGear wiki
Jump to navigation Jump to search

These are some ideas towards better troubleshooting that we usually come up with after each release to improve the quality of end-user bug reports. Most of these are fairly trivial to implement, it just seems that we don't typically have a need for them until someone shows up having a real problem who is not able to build from source... It mostly boils down to showing additional information in the about dialog, or including it in the startup log file created in $FG_HOME, or maybe even making it available in the new crash reporting tool eventually.

You may also want to check out this post on the forum: Re:Helping users in the long term

Background

Cquote1.png Whatever release plan we come up with, it will never be able to solve our issues with respect to bug reporting/fixing. We have all the required infrastructure to do better here: the bugtracker, a forum, a mailing list and we even send crash reports - at least on windows. What we do not have are the people dealing with that. We need: Cquote2.png
Cquote1.png whether we do frequent automated releases or single hand-picked releases, the Achilles heel for the whole process is going to be getting enough test data and processing it properly.
— Thorsten Renk (Nov 19th, 2015). Re: [Flightgear-devel] Some thoughts about the release process.
(powered by Instant-Cquotes)
Cquote2.png
Cquote1.png Usually reports range from 'FG sucks' all the way to suggested C++ patches to fix an issue (with a strong bias towards the former). Ironically, we get at the same time too little and too much information - too little, because often people just write a frustrated post that FG version X crashes all the time and they reverted to the previous version - so we know there is an issue
— Thorsten Renk (Nov 19th, 2015). Re: [Flightgear-devel] Some thoughts about the release process.
(powered by Instant-Cquotes)
Cquote2.png
Cquote1.png I feel that getting the information better organied and brought to the attention of the relevant maintainers would be the key to make better stable releases - but I am at a loss how to achieve that.
— Thorsten Renk (Nov 19th, 2015). Re: [Flightgear-devel] Some thoughts about the release process.
(powered by Instant-Cquotes)
Cquote2.png
Cquote1.png however we do it in the future - this is what I see as the most pressing issue to solve - how to improve the flow of information from users to developers without making everyone unhappy in the process.
— Thorsten Renk (Nov 19th, 2015). Re: [Flightgear-devel] Some thoughts about the release process.
(powered by Instant-Cquotes)
Cquote2.png
Cquote1.png Might sth like a bug-report (or maybe crash-report?) feature be of help? We got a lot of information in the property-tree, like GPU-drivers etc. For instance, do we have OS and stuff like that too? Might be fairly easy to put this important info into a tmp-file, and in the case we crash, add it to some kind of http-request to flightgear.org. An experience-report might be of help, what frequently asked questions do you pose all the time? + the red box...
Cquote2.png
Cquote1.png we should absolutely stop telling anyone to edit preferences.xml in FG_ROOT; any documentation or advice which says to should be changes ASAP.
Cquote2.png
Cquote1.png I'm interested in going through the entire fleet to create a new base level that all aircraft will be flyable. I'm not talking about fixing clunky FDMs or incomplete parts, but really basic maintenance - aircraft which start with tonnes of errors, and especially aircraft that cannot be started. If you had a list or could point out a few, that'd be awesome! I know there is tonnes of information in old forum posts about this, but for me it's quicker just to start each up one by one and test. If you do have a good list, maybe it would be good to start a new topic.
— bugman (Sun Apr 26). Looking for aircraft with 'paper cut' bugs.
(powered by Instant-Cquotes)
Cquote2.png

Trivial

These are already available in the property tree, i.e. just requires editing about.xml:

  • Expose threading mode Not done Not done
  • Expose renderer mode (classic vs. rembrandt / osgEarth) Not done Not done
  • Expose availability of Built-in Profiler Not done Not done
  • show location of $FG_HOME in help/about dialog as a button to open it via file explorer ?
  • show checksum of preferences.xml using the md5() Nasal API in the About dialog to ensure that people didn't tamper with it
  • consider adding the checksum of preferenfces.xml to the binary, so that startup code can identify if files were modified that should not be changed
  • should show if fgfsrc is available/used
  • use intel detection heuristics and adapt those to detect AMD/ATI cards and show a startup warning to inform the user that PUI/styling may interfere with effects/shaders
  • we keep seeing people who install mismatching versions of aircraft/fgfs - introduce a version-check property so that -set.xml files can encode version requirements, which are checked by a script in $FG_ROOT/Nasal [1]

More logging

Cquote1.png Would it be possible to print the command line options FlightGear has been started with into the log file? At least from log-level "info" on upward. This would make helping users with issues a tad more easy. They would not have paste two files, the log plus the command line options. And the people trying to help would not need to explain how to gather the command line options. Which is not easy with with a bunch of launchers out there. It also would reduce the need for each of those said launchers to implement a feature to print the command line options.
— Alex D-HUND (Jan 12th, 2016). [Flightgear-devel] Feature request - Logging / debugging.
(powered by Instant-Cquotes)
Cquote2.png
Cquote1.png This is a valid request, but there is complication in how various files and options are combined. There are two ways to do this:
  • show the input data: .fgfsrc and other config files which people have largely forgotten exist, and the arguments. There files are applied in ‘most general’ to ‘most specific’ order, so arguments are the most specific and applied after everything else, so you can always override config state with arguments.
  • show the combined data inside the options system, as it’s processed. This would still show us when options are being over-riden, but would be less clear where options come from (we no longer know the file paths of config files). This is much easier to implement however.
The second approach is a little simpler to code, and has the advantage that nothing can be accidentally missed from it, but it’s a little harder to understand where a particular option came from. Although, there are probably ways we could track this too (more work of course).
— James Turner (Jan 12th, 2016). Re: [Flightgear-devel] Feature request - Logging / debugging.
(powered by Instant-Cquotes)
Cquote2.png

Core changes

Cleanup.png This article may require cleanup to meet the quality standards of the wiki. Please improve this article if you can.
Screen shot showing the property browser running on Netbook in a patched version of FlightGear with resource tracking support using the cross platform SIGAR library.
Screen shot showing the property browser running on Netbook in a patched version of FlightGear with resource tracking support using the cross platform SIGAR library.
CPU information exposed to the property tree using the SIGAR library


These will typically require changes on the C++ side, but would still be good to have whenever someone makes a bug report:

  • provide a startup option/property to disable aicraft-set.xml Nasal code for troubleshooting purposes
  • log frequency of Nasal callback invocation to the console to detect fdm coupled code ?
  • dump node-specific osg camera stats to the console/log file (or even splash screen) during startup for the scenery/aircraft ROOT nodes[2]
  • separate OpenGL/GLSL logging/log file for troubleshooting graphics related startup errors rendering FG useless (e.g. faulty drivers) [3]
  • case-sensitive path checking [4]
  • write the current pid (process ID) to the property tree, so that this can be used for debugging purposes (or even just for the suffix of log files) Not done Not done
  • add startup options to override/ignore startup files and trigger a navdb cache rebuild [5] Not done Not done
  • expose number of active timers/listeners[6] Not done Not done
  • introduce a tag for FDM properties to optionally notice listeners being tied to FDM rate, as per [7]
  • provide support for an optional TRACECB (callback) mode where multiple identical callbacks are reported ? Not done Not done
  • we could also provide a property and use it as a "soft limit" for the number of listeners per property before a warning is shown, e.g. ~20-50 ? Not done Not done
Cquote1.png I used two places to check how many Uniforms are created. EDXH (Isle of Helgoland in the North Sea) - barely any scenery; And KSFO with AI Traffic enabled, you know the scenery complexity.At EDXH, the Effect system attaches approx. 1500 Listeners to the property tree but only to 17 unique properties. At KSFO, the Effect system attaches some 10000 Listeners to the property tree - but also only to 17 unique properties. That's with shader quality set to zero (all off).

On my old Laptop, (1.6GHz dual core, GeForce Go 7400, 3Gb RAM) FlightGear has becom barely usable over the last few years. Framerates of max 20 at EDXH and single digits at KSFO.


Cquote2.png
  • Subsystem-level Memory Tracking for FlightGear (RAM/VRAM, CPU/GPU utilization) Not done Not done
  • For better diagnostics, and better end-user bug reports, we could consider exposing a cross-platform process and system utilities module via Nasal/CppBind, such as e.g. psutil (Windows, MacOS & BSD/Unix) Not done Not done
  • Nasal callback tracking (timers/listeners) [8]
  • Nasal callback tracing, i.e. identical callbacks being called more than once per frame ?
  • Expose scenery version info [9] [10] Not done Not done
  • Expose all startup arguments (would just be copied to a single property) and log them to the log file [11] Not done Not done
  • Expose complete content of fgfsrc (could be read into the property tree) Not done Not done
  • Expose cmake/compiler flags Not done Not done
  • Expose release/debug build info (osg,simgear & fg) Not done Not done
  • Expose architecture (32bit/64 bit) e.g. via __i386__ and __x86_64__ on Intel or via cmake [12] [13] Not done Not done
  • Expose amount of physical RAM availableNot done Not done
  • Expose amount of used RAMNot done Not done
  • Expose degree of swappingNot done Not done
  • Provide some means to show GLSL errors in a dedicated place, i.e. the loglist ?Not done Not done
  • Maybe provide an option to attach the startup log file to a flight recorder "tape" so that people can more easily share reproducible flights ?
  • Keep track of Nasal callbacks running via timers/listeners and maintain a list of origin (FILE/LINE, handle/identifier) so that we can inspect a list of "Nasal processes" [14]
  • Investigate providing support for Nasal backtraces across timers/listeners as per Philosopher's posting [15]
  • Extend Initializing Nasal early to provide a startup mode where the Nasal engine will be fully shut-down at run-time for better troubleshooting

Version checking

Cquote1.png Is there a way to bypass, fool or add multiple version #'s to VERSION file in the data folder so one can start any version of FG without having to edit that file every time?
— wlbragg (Mar 11th, 2014).  folder .
(powered by Instant-Cquotes)
Cquote2.png
Cquote1.png I have just removed the checked from the branch I am usually working with. But I admit that it would be better to provide a --prop: option to disable the check optionally - it would be useful for people juggling many different branches of SG/FG/FGDATA (while still knowing what they're doing obviously)
— Hooray (Mar 11th, 2014). Re: .
(powered by Instant-Cquotes)
Cquote2.png
Cquote1.png It works for people who know what they're doing - but providing it as an option could be problematic, because most people are unlikely to know what this entails, so it may be more trouble supporting this than requiring people to edit that file every time
— Hooray (Mar 12th, 2014). Re: .
(powered by Instant-Cquotes)
Cquote2.png
Cquote1.png I was just wondering if there might be any other existing mechanism (other than changing the code or the VERSION file) by which you could circumvent or satisfy that check.

I think if it were up to me, because I do bounce back and forth between two data sets, (cutting edge and current stable) and at least two FG program versions, (again cutting edge and current stable), I would or maybe will, change the code at the version check to look for the correct code, then if not satisfied, read to see if there is a next line in file [VERSION]. If so and it is say, TRUE, it would let it go. Then only people that "know what they are doing" can edit that file one time and not be bothered with it again. Simple fix, but absolutely necessary, no.

At any rate it is no big deal, just an annoyance that I now have the ability to change, at least in my version.
— wlbragg (Mar 13th, 2014). Re: .
(powered by Instant-Cquotes)
Cquote2.png
Cquote1.png My main issue is when I want to run 3.0 binary with 3.1 data and then also switch back to 3.1 binary on the same 3.1 data. I don't know that I ever need to use 3.0 data with 3.1 binary, but maybe I do on occasion. I find myself doing it on a regular enough basis that I got annoyed enough to ask the question. It's a simple version check, no mystery, I for one bounce back and forth with different combinations. As for the "why" I guess I do things in 3.1 data that I want to check out using 3.0 binary.
— wlbragg (Mar 13th, 2014). Re: .
(powered by Instant-Cquotes)
Cquote2.png

Canvas

  • make canvas optional [16]
  • load Canvas/cppbind bindings on demand [17]
  • add draw masks for Canvas placements [18]

Runtime stuff

  • provide a switch to enable/disable updating of canvas/groups on/off (or placements?), to help with troubleshooting
  • Investigate extending Canvas::Element to optionally keep track of memory required for each element (use/sub-class osg::ref_ptr), so that we can provide some runtime info Not done Not done
  • Provide a means to list all running Nasal callbacks invoked via timers/listeners[19] Not done Not done
  • Extend FGStatsHandler to expose Nasal GC stats via osg::StatsHandler Not done Not done
Cquote1.png The built-in osgviewer stats can be extended with custom stats, that works by subclassing osg::StatsHandler, this is already done in $FG_SRC/Viewer/FGEventHandler.?xx

The class can be extended to add your own stats via osgViewer::StatsHandler::addUserStatsLine()

You can even register totally custom stats via osg::Stats

The main suspects in this case are probably 1) scenery, 2) cockpit, 3) Nasal (GC - bottleneck() function in simgear/nasal )
For the Nasal stats, you'll probably want to register them in FGNasalSys::FGNasalSys() by accessing globals->viewer->
That should give you a better idea about what's going on there, and it's been suggested by core developers, too:

http://www.mail-archive.com/flightgear- ... 37823.html

Cquote1.png Another goal is to add more node bits (and a GUI dialog to control them) so

various categories of objects can be disabled during the update pass. This will
mean the direct hit of, say, AI models vs particles vs random trees can be
measured. Of course it won't account for resources (memory, textures) burned by
such things, but would still help different people identify slowness on their
setups.

Cquote2.png

— Hooray (Sun Feb 23). Re: New Aircraft: the Extra500.
(powered by Instant-Cquotes)
Cquote2.png


  • Canvas::Element/Canvas::Group should be extended to gather SGTimeStamp-based stats for each group/element, so that people can better tell what's going on behind the scenes [20] Not done Not done
  • Use a separate draw mask for cockpit panels (e.g. via a corresponding XML attribute that serves as a marker), so that the impact of heavy cockpits can be better evaluated Not done Not done
  • Whenever there's an exception while booting, we should try to re-init the sim and increase the log level to provide crashrpt with better info [21]