diffoscope
diffoscope plans for the meeting
- Reviewing these Post-Its:
Improve Platform Support
- Portability to other platforms should improve. Some tools or abstractions that diffoscope relies on (like /dev/fd) might not be available everywhere.
- Support for distro-specific or uncommon file formats can be improved.
- The test suite does not work everywhere (e.g. newer versions of external software can break tests, as currently happens for Pascal). It should be more reliable.
Integrate debdiff & diffoscope
- Should we implement a flag in debdiff to call diffoscope?
- Should we replace debdiff with diffoscope completely? What features of debdiff would diffoscope still need?
- It’s not really clear what the post-it author wants.
Parallel diffoscope (#842837)
- Execution time is a serious issue; diffoscope should get faster.
- It is not clear how well parallel Python is going to work for diffoscope due to the global interpreter lock.
- Prior work by nomeata might exist somewhere.
- We should have a hack session on parallel diffoscope.
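As a starting point for such a session, here is a minimal sketch of one way around the GIL concern. It assumes that a large share of the run time is spent waiting on external commands, which was not measured in this meeting. A thread that is blocked waiting for a child process releases the GIL, so plain threads can overlap such calls. The names `run_tool` and `run_tools_in_parallel` are hypothetical and are not part of diffoscope.

```python
import subprocess
import sys
from concurrent.futures import ThreadPoolExecutor


def run_tool(args):
    """Run one external command and return its standard output as text."""
    result = subprocess.run(args, capture_output=True, text=True, check=False)
    return result.stdout


def run_tools_in_parallel(commands, workers=4):
    """Run several external commands concurrently.

    Each worker thread spends its time blocked on a child process, during
    which the GIL is released, so the commands overlap. Results come back
    in the same order as the input commands.
    """
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(run_tool, commands))


if __name__ == "__main__":
    # Stand-in commands; a real run would invoke format-specific tools.
    demo = [[sys.executable, "-c", "print('hello from job %d')" % i] for i in range(3)]
    for output in run_tools_in_parallel(demo):
        print(output, end="")
```

Work that is CPU-bound inside Python itself would not benefit from this approach and would need separate processes instead.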
Marketing/Docs/Undebianization
- https://try.diffoscope.org should be better known; it helpfully has a number of optional dependencies for uncommon formats installed already.
- Get the word out for non-reproducibility use cases like comparing across versions for updates or due diligence before deployment.
- Bug tracking happens in the Debian bug tracker. That should be more visible. The website should have a “how to report bugs” section for people not familiar with the Debian bug tracker.
diffoscope Plugins
- Should diffoscope have a plugin mechanism to support other file formats that authors may not want to upstream?
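To make the question concrete, a plugin mechanism could be as small as a registry that maps file suffixes to comparison functions. The sketch below is purely illustrative: `COMPARATORS`, `register`, `compare_text`, `compare_binary` and `compare` are hypothetical names and do not reflect how diffoscope selects comparators.

```python
import difflib
import os

# Hypothetical registry: file suffix -> comparison function.
COMPARATORS = {}


def register(*suffixes):
    """Decorator that registers a comparison function for the given suffixes."""
    def decorator(func):
        for suffix in suffixes:
            COMPARATORS[suffix.lower()] = func
        return func
    return decorator


@register(".txt", ".md")
def compare_text(old, new):
    """Return a unified diff of two byte strings decoded as text."""
    return list(difflib.unified_diff(
        old.decode().splitlines(), new.decode().splitlines(),
        "a", "b", lineterm=""))


def compare_binary(old, new):
    """Fallback for formats that no plugin has claimed."""
    return [] if old == new else ["Binary contents differ"]


def compare(name, old, new):
    """Dispatch on the file suffix, falling back to a binary comparison."""
    suffix = os.path.splitext(name)[1].lower()
    return COMPARATORS.get(suffix, compare_binary)(old, new)
```

An out-of-tree plugin would then only need to import `register` and decorate its own function. How such plugins would be discovered and loaded is left open here.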
Output Format
- Should diffoscope output markdown?
- The output should be more accessible, e.g., for screen readers. Possibly to be implemented as a new output format.
What are Usability Issues with diffoscope?
- Output limits could lead to spending a lot of processing time and then still not getting usable output. Should all arbitrary limits be removed?
- Short command line options
- Write documentation on how to implement support for a file format
- Should diffoscope support excluding specific paths in archives to cut down runtime and ignore parts that are already known to differ?
- Should diffoscope support disabling support for specific file formats?
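For the path exclusion question, a minimal sketch of the idea is to filter archive member names against shell-style patterns before any comparison starts. The function name `filter_members` and the example paths are assumptions made for illustration, not an existing diffoscope interface.

```python
from fnmatch import fnmatchcase


def filter_members(members, exclude_patterns):
    """Drop archive member names that match any exclusion pattern.

    Patterns use shell-style wildcards and matching is case-sensitive.
    Note that "*" in fnmatch also matches "/" characters.
    """
    return [
        name for name in members
        if not any(fnmatchcase(name, pattern) for pattern in exclude_patterns)
    ]


if __name__ == "__main__":
    names = ["usr/bin/tool", "usr/share/doc/changelog.gz", "DEBIAN/control"]
    print(filter_members(names, ["usr/share/doc/*"]))
```

Skipping members this early would save the processing time for those paths, at the cost of hiding any unexpected differences inside them.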
Automatic Classification of Reproducibility Issues in diffoscope
- Is this in scope for diffoscope? Should this be in a separate tool?
- This would require knowledge of the format and lead to much more complex file format support.
- This could be helpful in the output, e.g. if an offset changes in an ELF binary you’d get lots of related changes that could otherwise be ignored.
Action Items
- lamby to open a bug about output format accessibility
- Bapt to submit his FreeBSD patches upstream
- everyone to file tickets about portability problems
Session Proposals
- Hacking parallel diffoscope
- Porting diffoscope
- diffoscope usability
- Documenting, marketing and undebianizing diffoscope