Indeed. How about an analogous tool to the OP that uses binary search to deduce which updated gem(s) break the build, and then uses binary search on the intermediate versions of those gems to tell you exactly which version is the first bad one? Then spits out the changelog for that version ;)