| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by l24ztj 2648 days ago
	>A newline character is no longer automatically added at end of buffer. yesssssssssss

3 comments

lexicality 2648 days ago

Genuinely curious, why do you see this as a good change?

link

saagarjha 2648 days ago

There are programs that violate POSIX and are picky about it.

link

dfox 2648 days ago

POSIX definition of what is text file actually requires the file to end with \n, so such programs are technically correct.

The reason why this is (and also why the lines should not be longer than LINE_MAX or contain \0) is that when you simply call fgets() in a loop with LINE_MAX sized buffer, all of these things cause perfectly logical, but wrong/surprising behavior. This is the reason why almost any modern small unix tool contains something called myfgets(), getline() or whatever which wraps fgets() in loop with realloc() and correctly distinguishes eof from other errors.

link

saagarjha 2648 days ago

> POSIX definition of what is text file actually requires the file to actually end with \n, so such programs are technically correct.

No, they’re not correct, since they make files without the newline.

link

lexicality 2648 days ago

Which ones?

link

l24ztj 2648 days ago

Because if I type "hello" in a document, I expect the editor to save "hello", not "hello\n".

link

lexicality 2648 days ago

Where did you get that expectation? Every editor I've used up until Visual Studio Code did that by default.

(I was actually very surprised when vscode _didn't_ do this!)

link

Sniffnoy 2648 days ago

I mean, it ought to be possible to use a text editor to specify exactly what I want in a text file. If a newline is silently added at the end, then, well, that isn't happening.

Edit: Which is to say, I don't think l24ztj is using "expect" in the sense of "anticipate" but rather, y'know, the other sense. (Using Wiktionary's definitions: "To consider obligatory or required" or "To consider reasonably due".)

link

nemetroid 2648 days ago

It should be possible, sure, just like it should be possible to forgo utf-8 and save your file with the iso-8859-1 encoding. But it's the less useful case. Many (most) programs do not care whether there is a final newline or not, but among those that do care, missing endlines will run you into more problems than the opposite (e.g.: Git, GCC, wc and cron will either complain or not work as expected).

link

lexicality 2648 days ago

What do you mean by "exactly" though? What is the "exact" value of "e"? 0x65? How about "é"? You expect your editor to take what you've written and provide a valid text file with that information in it, which can then be read by something else. Applying the correct text encoding (UTF-8?) and adding a 0x0A as the last byte are part of making it valid.

If you want to specify the exact bytes on disk, you should use a hex/bin editor.

link

Sniffnoy 2648 days ago

The correct text encoding is the one I specify. UTF-8 is a decent default. Where did that bit about 0x0A come from? That's just an extra byte. There's no text encoding that says "to decode this you should first remove the 0x0A at the end, which is not part of the text".

I should only need a hex editor if I want to make a file that can't be decoded as text under a known encoding, and no encoding I know (and certainly none that I use) has the encoding/decoding rule I mentioned above. I certainly shouldn't need one just to make a validly-encoded UTF-8 file that happens to end in a character other than a newline.

Basically, either it's part of the encoding or it's part of the text, and if it's part of the text, I ought to be able to control it. And it isn't part of the encoding.

link

enriquto 2648 days ago

> I mean, it ought to be possible to use a text editor to specify exactly what I want in a text file.

I agree. But for me a text file that does not end in newline is badly formed. It is not a text file, it is a binary file that happens to contain some printable characters.

link

nikeee 2648 days ago

I think it's uncommon in the Windows world.

link

bpye 2648 days ago

Just so you know, VSCode can do this too. You just have to set the user preference files.insertFinalNewline=true

link

l24ztj 2648 days ago

I have the opposite experience: notepad, notepad++, vscode, kwrite/kate, sublime, gedit, and many more I've used... none of them had this unwanted behaviour.

link

paavoova 2648 days ago

Yes, but now if you do $ wc -l hello.txt, it will say 0. If a program is reading line by line, it won't get a full line if the last line is not terminated - this can lead to unwanted behavior, such as when you're matching end of line with regex.

link

badsectoracula 2648 days ago

This is a Unix thing, on Unix lines in text files are terminated and the terminator is supposed to always be there to mark the end of the line, as opposed to Windows where they are separated and a separator at the end of the file means that the text has an empty line at the end.

This is also why tools that have a Unix heritage tend to complain if there is no terminator/separator at the end of the file (even in Windows), like most C compilers.

link

emilfihlman 2648 days ago

This has been a configurable option since forever.

link

marcosdumay 2648 days ago

If I would bother configuring an editor I would not be using nano.

link

tyingq 2648 days ago

Sane defaults still have value.

link

nemetroid 2648 days ago

The old behaviour is the sane default, at least for editing text files. POSIX defines a text file as a file consisting of a number of lines, each terminated by an LF character.

link

coldtea 2648 days ago

sane is what the user expects, not what POSIX demands.

link

lexicality 2648 days ago

One could reasonably expect that a user expects a POSIX system to behave according to the POSIX standards?

link

williamdclt 2648 days ago

Not really, most users (even tech people) don't know that it is a POSIX standard, or what's a POSIX standard, or what's POSIX really

link

coldtea 2648 days ago

A sysadmin maybe. A user hardly.

Especially a 2019 user, 20+ years removed from the systems, decisions, and rationales, behind POSIX.

Just try to get someone (even a seasoned Linux user) to use a POSIX-only userland (as opposed to GNU), as see how fast they'll be pulling their hair out...

link

unveres 2648 days ago

I would say it's not just a POSIX demand. Most unix tools are POSIX-complaint, and the user should expect just it and nothing more. Non-proper text files without LF at the end may not be processed properly, so to be assured that everything works fine that LF is mandatory.

link

OJFord 2648 days ago

Many would say the old behaviour was the sane choice - apart from anything else it's consistent with vi(m) and git.

link

tyingq 2648 days ago

I suppose. Though you might expect a GNU editor to follow Emacs for this behavior.

link

mruts 2648 days ago

Such an insane default. Currently programming a bittorrent client and was edited files to test some functionality. Was confused for a long time until I realized nano was turning my "echo 'this is a test' > test" into "echo 'this is a test\n' > test"

link

nemetroid 2648 days ago

Running "echo 'this is a test' > test" produces a file with a newline at the end.

link

lexicality 2648 days ago

Pretty sure nano was changing your code to

    echo 'this is a test' > test\n

and then echo was adding the newline because you didn't specify -n.

link

pushpop 2648 days ago

...and also

  echo 'this is a test\n'

doesn't produce an additional new line because -e wasn't specified.

link

symlinkk 2648 days ago

Funny you mention echo, which prints a newline by default as well.

link