GCC 10.1 Released | HN Mirror

rwmj 2275 days ago

Really? I tried to use it but I got a lot of false positives - in fact, every one of the errors in my medium-sized codebase was I believe a false positive. I spent a few hours last night looking at all of them and while I found a missing free, it was not one picked up by this analysis, but it happened to be in the same code that the analyzer flagged. Also the error messages are enormous in some cases.

It does show potential, and I hope it improves in future. Be nice to have a libre version of coverity one day.

Edit: Output from compiling nbdkit upstream with -fanalyzer: https://paste.centos.org/view/8381f926

asveikau 2275 days ago

Looking at the categories of warnings it produces, I'm not surprised. When I've used similar tools trying to detect the same things, I also saw false positives. It's probably not an easy set of problems, otherwise we'd have it built-in to more tools and enabled by default.

rwmj 2275 days ago

We routinely run nbdkit through Coverity and it finds bugs, although it too has false positives. Also the reports produced by Coverity are really nice - long enough to tell you where the bug is, but not too long to be overwhelming.

I've been meaning to formally prove one of our internal "mini libraries" using Frama-C. If we did that then no one would be able to complain about bugs in it :-)

google234123 2275 days ago

Sadly, that's typical with static analyzers todays...

flukus 2275 days ago

I like valgrind for this, combined with automated tests it gets decent coverage and finds actual (or at least probable) errors.

unmole 2275 days ago

Valgrind is also prone to throwing up false positives. I find ASan much better.

qalmakka 2275 days ago

Isn't it very similar to something Clang has had for years?

legends2k 2275 days ago

Tangential: I think many platforms that GCC supports and the kind of optimisation GCC provides might make it a superior choice for projects. For those developers its exciting. Also if an existing project is using it for a long-time and has not intentions of switching to Clang, for them too this would be interesting.

CoolGuySteve 2275 days ago

My project uses g++ but I’ve been using a clang static-analysis step in my CI pipeline for a while now.

Compiling with clang is much, much slower but it finds interesting errors sometimes. I only had to ifdef out one code section that it couldn’t understand (something about indexing an array with a constexpr function returning an enum class from a template parameter).

It depends on the project, but this one and another project I made work are both 100k lines of C++ and took half a day to setup. It was worth it imo.

tele_ski 2275 days ago

Interesting that your build times are slower with clang. I've found on most of my projects clang is roughly 30% faster to compile in debug, but release I haven't seen a huge difference between the two.

moonchild 2275 days ago

My build times are identical between GCC and Clang (debug and release).

But TCC is 10x faster than both of them.

petters 2275 days ago

Perhaps they are slower because Clang is doing static analysis in this case.

cheez 2275 days ago

Nope, also slower for me.

Clang has it although I've personally never tried it due to the horrible rigamarole of setting it up.

In GCC it's a compiler flag. In LLVM it's a convoluted process automated by either an irritating perl/python script (The python one isn't included by default for some reason despite being far more common these days) or AN ENTIRE WEB SERVER TO OUTPUT TEXT (Which thankfully isn't included by default).

I'm sure you could set it up in make but since my projects aren't large scale enough to really need it I can't be bothered, even if it's one of those things you'll probably only need to write once and can just copy and paste ad-infinitum.

ndesaulniers 2275 days ago

Lest other people get the wrong impression from someone who's admitted not even trying it, running Clang's static analyzer is as simple as switching cc with scan-build. You can even drive it from clang-tidy. No Perl or Python setup in my experience.

You appear to have it installed somewhere. Open up scan-build in your favorite text editor if you don't believe me.

0x09 2275 days ago

Though scan-build is usually the simpler option, clang itself does have an --analyze flag which writes analysis results in various formats, including the same html reports that scan-build would generate. But to see this on standard out

   clang --analyze --analyzer-output text ...

Will print the entire analysis tree in the same format as regular diagnostics.

The only problem is the CTU mess if you're analyzing more than one file, thus the aforementioned tools' necessity.

Hopefully in a future version the kinks are ironed out and we can just use the flag without any hassle. It's like if we needed to still manually link our files with ld before compiling them instead of clang auto doing it.

Freaky 2275 days ago

> Clang has it although I've personally never tried it due to the horrible rigamarole of setting it up.

    -% scan-build10 make
    scan-build: Using '/usr/local/llvm10/bin/clang-10' for static analysis
    ...
    scan-build: No bugs found.

What could ever have driven them to such a long and convoluted setup process?

Yep! Thanks for pointing out the perl script I mentioned. You appear to have completely ignored the rest of my post on them leaving important functionality to third party scripting languages.

Freaky 2275 days ago

Yes, to highlight that this "horrible rigmarole" and "irritation" you describe is entirely on the basis of there being a Perl script involved. Something I suspect the majority would overlook without even really noticing.

ufo 2276 days ago

What do you love the most about the new analyzer pass? I haven't had a chance to try it out for myself yet but I'm looking forward to it.

DyslexicAtheist 2275 days ago

I like not having to run another commercial tool[1] that will likely not be in use whenever I move on to the next project because no one has heard of it.

Biggest advantage I see is it's integrated into the compiler and so sees the same things the compiler does.

Having gcc do this out of the box helps people port their experience/skills with static analysis to other companies.

We already have clang-tidy and I like it too but it's nice to have a fall-back to compare when one produces a strange result. And a bit of competition is always good to have between such projects. And on most big projects it's not like you can just change the build system to use another compiler.

Also I found some interesting cases which valgrind didn't see because it was in an unreachable branch.

[1] https://news.ycombinator.com/item?id=22712338

htfy96 2276 days ago

An example to detect use-after-free: https://godbolt.org/z/zhiNLW

Basically this replicates what clang-tidy did

cornstalks 2276 days ago

Too bad it doesn't detect the mismatch between new and free.

peapicker 2275 days ago

And if you fix it to use delete instead of free, you get "can't delete void *" errors. Perhaps not the best example code.

gpderetta 2275 days ago

and if you fix the void->int, the warning goes away.

cozzyd 2274 days ago

Excited for this to come to gcc-arm-eabi-none

glouwbug 2276 days ago

Imagine if -fanalyze was like rusts borrow checker

simias 2275 days ago

You Rust borrow checker requires special annotations and restrictions put on the code to do its job. I don't think you could something like that automatically on a C or C++ full codebase without having to manually annotate and refactor it somewhat. There are many common (and safe) C and C++ patterns that would be outright rejected by Rust's borrow checker, for instance initializing a structure or array partially if you're sure that nobody is going to use the initialized portion. Or having multiple mutable pointers/reference to the same object.

You could do something like that at runtime though, but then you have Valgrind, basically.

zozbot234 2275 days ago

> for instance initializing a structure or array partially if you're sure that nobody is going to use the initialized portion. Or having multiple mutable pointers/reference to the same object.

Rust supports MaybeUninit<> for the former example, and unsafe raw pointers for the latter. It needs unsafe because these patterns are not safe in the general case and absent an actual proof of correctness embedded in the source code, a static analysis pass can only deal with the general case.

simias 2275 days ago

That's my point though, in both cases the developer needs to add additional syntax to make the intent clear. "Naive" Rust code that tries to do that stuff is rejected by the compiler.

I've expressed myself poorly in my original comment and apparently it looks like I was criticizing Rust but I wasn't. I was just pointing out that safety didn't come "for free" by toggling a compiler flag, you have to change the way you code some things. If C and C++ were to become safe languages, code would need to be rewritten using things like MaybeUninit, split_at_mut, RefCell etc...

estebank 2275 days ago

I would dispute that those common patterns are indeed safe, even if they could be argued they are when first written because code changes and can suddenly break your preconditions if they aren't enforced in the code itself.

C codebases then follow certain defensive programming customs to avoid reading uninitialized or out of bounds memory, at the cost of some performance. This is the right trade-off in C but, funnily enough, the more restrictive borrow checker has the opposite effect as you can give out inmutable and mutable references with wanton abandon because they get checked for unsafe behavior. It's the same difference as a a gun where the best practice is to keep it unchambered at all times to avoid the risk of a misfire, and a more modern gun with a safety: it's one more thing to think about but it actually smoothes the operation.

MaxBarraclough 2275 days ago

I'd describe the pattern in slightly different terms: when done right, restrictions in a programming language (or library/framework) are liberating for the programmer.

The restriction of immutability spares the programmer from worrying about whether unknown parts of the codebase are going to decide to mutate an object.

JavaScript's single-thread restriction (not counting web-workers) closes the door on all manner of nasty concurrent-programming problems that can arise in languages that promote overuse of threads. (Last I checked, NetBeans uses over 20 threads.)

Back to the example at hand, C has no restrictions, but that hobbles the programmer when it comes to reasoning about the way memory is handled in a program. It's completely free-form. Rust takes a more restrictive approach, and even enables automated reasoning. (Disclaimer: I don't know much about Rust.)

> Rust borrow checker requires special annotations and restrictions put on the code to do its job.

This is a good thing, because it makes lifetimes and ownership explicit and visible in the code. It serves the similar purpose as type annotations in function signatures.

> Or having multiple mutable pointers/reference to the same object

Sure you can have that with `unsafe`. And this is a good thing, because multiple mutable pointers to the same object is at best bad coding practice that leads to unsafe code, and you should avoid that in any language, including the ones with GC. Working with codebases where there are multiple distant things mutating shared stuff is a terrible experience.

If a C/C++ version of "borrow checker" could mark such (anti)patterns at least as warnings, that would bring a lot of value.

traes 2275 days ago

He wasn't criticizing Rust, he was just stating facts.

I read it differently, because he started with "You(r) Rust borrow checker", making his point automatically in oposition. But now after reading without this You at the beginning, I agree it was neutral.

verdagon 2275 days ago

Can you explain why multiple mutable pointers is bad practice?

I understand the benefits and the risks of them, and understand how Rust prevents both, but I dont yet understand why it's bad practice, and am interested to learn why.

cormacrelf 2275 days ago

The one that affects you as a programmer most is Iterator invalidation. Iter borrows from the vector, you mutate the vector, iter blows up. Simple really. But a lot of code is like this. Borrow from hashmap, insert into hashmap, the slot gets moved around and your pointer is now invalid. That’s just vectors and hashmaps; imagine the possibilities in a much more complex data structure.

There are compiler optimisations you can do if compiler knows about aliasing, but that’s not so much a software authorship problem. There are some curly problems with passing aliased mutable pointers to a function written for non-aliased inputs, like memcpy and I imagine quite a lot of other code.

But common to all of these things is that it’s pretty hard to figure out if the programmer is the one who has to track the aliasing. In hashmap pointer invalidation, your code might work perfectly for years until some input comes along and finally triggers a shift or a reallocation at the exact right time. (I know this — I recently had to write a lot of C and these are the kinds of issues you get even after you implement some of Rust’s std APIs in C.)

Because it leads to hard to understand code.

If N unrelated (or loosely related) things can mutate the same object, then you get a O(x^N) explosion of potential mutation orders and in order to understand that, you need to understand all the (sometimes complex) time-relationships between these N objects. This gets even much worse when some of these objects are also pointed from M other objects...

On the flip side, in case of using a simple unique_ptr (or a similar concept), this trivially reduces to a single sequence of modifications.

simias 2275 days ago

> Sure you can have that with `unsafe`

The parent was talking about the borrow checker so I only was talking about safe Rust code. Obviously if you consider that the entire C/C++ codebase is in a big unsafe {} block it'll work... because it won't do anything at all.

From Rust docs:

It's important to understand that unsafe doesn't turn off the borrow checker or disable any other of Rust's safety checks: if you use a reference in unsafe code, it will still be checked. The unsafe keyword only gives you access to these four features that are then not checked by the compiler for memory safety.

jfkebwjsbx 2275 days ago

> This is a good thing, because it makes lifetimes and ownership explicit and visible in the code.

No, it is additional burden. If it was possible to do it without annotations, you bet we would do it!

Rusky 2275 days ago

It's only a burden to the extent that type annotations are a burden. It's definitely possible (including in Rust) to do away with both, but that has downsides of its own.

https://www.youtube.com/watch?v=EeEjgT4OJ3E

pjmlp 2275 days ago

While it isn't at Rust level, that doesn't stop Google and Microsoft from trying.

"Update on C++ Core Guidelines Lifetime Analysis. Gábor Horváth. CoreHard Spring 2019"

coldtea 2275 days ago

>You Rust borrow checker requires special annotations and restrictions put on the code to do its job. I don't think you could something like that automatically on a C or C++ full codebase without having to manually annotate and refactor it somewhat.

What about with a constrained (not necessarily general purpose) AI with the expertise of Scott Meyers, Andrei Alexandrescu, Herb Sutter and Alexander Stepanov?

https://herbsutter.com/2018/09/20/lifetime-profile-v1-0-post...

pjmlp 2275 days ago

That is what Microsoft and Google are trying to do with C++ Lifetime Profile.

"Update on C++ Core Guidelines Lifetime Analysis. Gábor Horváth. CoreHard Spring 2019"

https://www.youtube.com/watch?v=EeEjgT4OJ3E

While it might never be Rust like due to language semantics, it is way better than not having anything.

mhh__ 2275 days ago

I think Rice's theorem means that you can't really do that without restricting/annotating semantics like Rust does.

The_rationalist 2275 days ago

No need to imagine, It's becoming reality -> https://internals.rust-lang.org/t/c-lifetime-profile-1-0-a-k...

MauranKilom 2275 days ago

> Extended characters in identifiers may now be specified directly in the input encoding (UTF-8, by default), in addition to the UCN syntax (\uNNNN or \UNNNNNNNN) that is already supported:

    static const int π = 3;
    int get_naïve_pi() {
      return π;
    }

Lovely!

hinkley 2275 days ago

The next obfuscated code competition is sure gonna be interesting.

signa11 2275 days ago

yup zero-width-space is going to a do a number on everyone.

asjw 2275 days ago

For what is worth being able to enforce conventions like

    present?(p) // return bool
    get!(g) // throw if not found

a-la ruby/elixir could be good

q92z8oeif 2275 days ago

i don't know if you are being ironic or not, but the non-english speaking world will love it.

alufers 2275 days ago

As a non-English programmer who has seen a lot of code not written using English please, god, NO. The mixture of English and other languages identifiers in the external libraries makes me cry.

wiz21c 2275 days ago

Yep, writing non english keywords in C++ is like having prolog code inside c++ :-)

(but as a non english person, I find it very helpful to be able to have unicode in strings)

malkia 2275 days ago

I wish Math used normal english, not greek letters - english is my second language too..

zorked 2275 days ago

They are latin letters, not english. Unless you mean futhorc. Which would be awesome.

throwaway894345 2275 days ago

He didn't say "english letters", he said "english" which is a language. The implication is that we should use conventions (including potentially whole words) that are familiar to the English-literate world and not Greek glyphs.

https://en.m.wikipedia.org/wiki/Greek_alphabet

notRobot 2275 days ago

Many of these characters are hard to type on regular keyboards, which is probably what the parent commenter was referring to.

lenkite 2275 days ago

From a non-english speaking country: I prefer US-ASCII for code. Anything else just obfuscates.

throwaway894345 2275 days ago

There are lots of other languages that support full UTF-8 in identifiers (e.g, Go) and the non-English-speaking world doesn't take advantage.

rurban 2274 days ago

Every such such language does it insecurely. The only exceptions are Java, rust and cperl.

I rather have no unicode identifiers than insecure identifiers, which don't follow the unicode security guidelines for identifiers.

erik_seaberg 2275 days ago

Some do: https://news.ycombinator.com/item?id=14276891

throwaway894345 2275 days ago

I've done a similar thing when I was writing a language (with generics) that compiled to Go and needed to implement name mangling. But in any case, this isn't an example of a non-native speaker using utf-8 to write in his native language. :)

MauranKilom 2275 days ago

Tbh, I mainly shared the example code because I was entertained by the contrast of sophisticated naming and crude approximation. Still, it illustrates that this can come in very handy if used properly.

Of course you can also cause all sorts of mayhem. Even disregarding fun such as GREEK QUESTION MARK looking like a semicolon and zero-width spaces, I probably would not use this feature in enterprise code. Too likely that some tool somewhere (e.g. an alternate compiler?) is KO'd by characters outside of basic ASCII range (which has served us well and will keep doing so).

greendave 2276 days ago

> Several C++20 features have been implemented: > P0912R5, Coroutines (requires -fcoroutines)

Nice to see a bunch of C++20 features making it in. Coroutines seems like a big one!

vmchale 2275 days ago

Lovely! Time to see my code go even faster, for free :)

FullyFunctional 2275 days ago

memory.c: In function ‘mk_entry’: memory.c:116:12: internal compiler error: in saved_diagnostic, at analyzer/diagnostic-manager.cc:84 116 | return (struct entry) {safe_calloc(end - start, 1), start, end}; | ^ Please submit a full bug report,

Goes to look at README.Bugs. Holy cow, I don't have time to to check all those places to see if it has been reported already.

canarypilot 2275 days ago

The GCC community is normally very good (if blunt) at picking up duplicate bugs and linking them to the right place. https://gcc.gnu.org/bugzilla/ Is all you need. Just don’t feel bad if your bug is closed!

FullyFunctional 2274 days ago

Thanks, it was fixed already :)

userbinator 2275 days ago

Every time I look at GCC's bugtracker, I feel a mix of disgust and astonishment at the state of such a foundational piece of software. I'm amazed it works as well as it does.

wiz21c 2275 days ago

(side note for RMS, I still have a "RUNGCC" sticker on my car (it's been 5 years !!!))

liquidify 2276 days ago

wish gcc 10 was built into ubuntu 20.04

https://fedoramagazine.org/announcing-fedora-32/

RMPR 2276 days ago

GCC 10 is in Fedora 32 though

Twirrim 2275 days ago

You want a bleeding edge release in your non-bleeding edge LTS distribution?

jcelerier 2275 days ago

it does not make sense to pair development toolchain versions with operating system versions.

klodolph 2275 days ago

What do you think LTS entails, exactly?

One of the core ideas of working with LTS is that you can build your software on an LTS release and ship it to somebody else on the same LTS release, either as a source or as a binary.

If you want the latest GCC, that's fine, you're not forced to use the default compiler distributed with your OS. But it doesn't make sense to update the default compiler used in an LTS release. If you want that, then you don't want LTS.

fnord123 2275 days ago

eh, no. the OS package manager is for sysadmins. LTS is for sysadmins to not have to worry about versions changing under their feet rapidly when they apply security updates.

If you want to develop an application, you use your own toolchain. But yes I know most C++ people don't d this because C++ tools don't easily support it. But that's on C++ for not having pyenv, rustup, multiruby, etc equivalent.

MaxBarraclough 2275 days ago

C++ lets you statically link the standard library though, right? Or am I missing your point?

Florin_Andrei 2275 days ago

> it doesn't make sense to update the default compiler used in an LTS release. If you want that, then you don't want LTS

This needs to be emphasized.

jcelerier 2275 days ago

> One of the core ideas of working with LTS is that you can build your software on an LTS release and ship it to somebody else on the same LTS release, either as a source or as a binary.

Yes, and updating compilers don't prevent that at all. You can use GCC 10 to ship code that will build and run on Ubuntu 12.04 without issues. Xcode 11 can ship code that works back to macOS 10.6 and Visual Studio 2019 can still optionally target windows fucking XP !

klodolph 2275 days ago

> [...] and updating compilers don't prevent that at all.

This is incorrect. In practice, for larger code bases, upgrading to a newer version of GCC or Clang is something that must be done purposefully, and you must test.

Sometimes it turns out that your code relies on some compiler behavior which has changed. Sometimes newer compilers are stricter than older compilers. There are plenty of real-world cases of these problems!

> Xcode 11 can ship code that works back to macOS 10.6 [...]

There are a number of features that are specific to the macOS toolchain which make this possible. Take a look at the "-mmacosx-version-min" flag on the macOS compiler. This selectively enables and disables various APIs. These features don't solve all the compatibility problems, either.

> Visual Studio 2019 can still optionally target windows fucking XP !

We're talking about Linux here. The Windows toolchain is radically different.

https://gcc.gnu.org/onlinedocs/libstdc++/manual/abi.html

rumanator 2275 days ago

> Yes, and updating compilers don't prevent that at all.

You do understand that ABI backward compabitility is not ensured, don't you?

Some software packages even break between distro releases.

The primary value of a distro is to provide a fixed platform that application developers and users can safely target. Risking ABI breakups just because a rare number of users wish to be on the bleeding edge without wanting to do any of the work to install their own software is something that's very hard to justify.

[1]: https://launchpad.net/~ubuntu-toolchain-r/+archive/ubuntu/pp...

woadwarrior01 2275 days ago

There's a PPA[1] for that.

throwawayanss21 2275 days ago

It is available in the repos it just isn't the default

cozzyd 2275 days ago

Compiling gcc is actually not so bad (and sometimes necessary if you want to e.g. use drd to debug openmp, so you can make libgomp use pthreads primitives that drd knows how to do deal with).

https://packages.ubuntu.com/focal/gcc-10

pantalaimon 2275 days ago

wyldfire 2275 days ago

It's nice when they're built in, sure. But gcc is pretty easy to build on its own.

loeg 2275 days ago

If you want the newest version of software, Ubuntu does not cater to that.

simion314 2275 days ago

If you are a dev then you can prefer using an LTS flavor of Ubuntu with a PPA for whatever you need to be newer. For important stuff this PPA are provided by Canonical so I run newer kernel and nvidia drivers on an older LTS.

e12e 2275 days ago

Well, you might want to run a newer/non-lts release via lxd/lxc. It's probably a much better idea than pulling in willy-nilly ppa's.

purerandomness 2275 days ago

But at this point, why not simply run Arch Linux?

simion314 2275 days ago

You might want LTS and upgrade some packages when needed/forced and not play the "update" lottery. New updates not only bring you cool new feature and fixes , they bring new bugs and sometimes features are removed or GUIs are moved around. At least with my LTS I worked around for existing bugs , upgraded from PPA the things I needed to, browsers are latest versions and my IDE is auto-updating too.

yjftsjthsd-h 2275 days ago

Stable base. I'm pretty fond of Ubuntu LTS as the OS running the bare metal, then [docker] containers on top of that to run applications, which means I can have as new of apps as I want while keeping a nice boring stable kernel/bootloader/sshd/whatever.

e12e 2275 days ago

I'm not sure I understand. You want a stable host system without the need for forced, sometimes breaking, upgrades - so an lts release "on the outside".

You want to develop with new tooling, so a newer release under lxd/lxc. But you probably want to deploy on an lts release - maybe the one comming in a year?

You could of course develop under arch in lxd/lxc - then validate for an lts release once your code is "done".

But I don't think you'd generally would want to deploy to arch - as you'd have to play catch-up in order to keep up with security patches (or backport yourself)?

umvi 2275 days ago

Just use docker and you can have any toolchain you want any time you want

brutt 2275 days ago

Just use Fedora, which targets developers.

SomeoneFromCA 2275 days ago

They still may have a snap for it....