[[!meta copyright="Copyright © 2013 Free Software Foundation, Inc."]] [[!meta license="""[[!toggle id="license" text="GFDL 1.2+"]][[!toggleable id="license" text="Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. A copy of the license is included in the section entitled [[GNU Free Documentation License|/fdl]]."]]"""]] [[!toc]] # IRC, freenode, #hurd, 2013-06-29 so, how is your golang port going? I just started working on it. I had been reading documentation so far. Maybe over reading as people told me when I asked for their feedback but I will report on what I have done (technically tomorrow, and post it in the mailing list too. Hey guys, what could possibly cause the following error message when executing a program in the Hurd? "./dumper: Could not open note: (system server) error with unknown subsystem" My program is one that opens a file and dumps it into stdout pinotree: the code I am using is the one present here http://www.gnu.org/software/hurd/hacking-guide/hhg.html under paragraph 6.1 I investigated it a bit but can not find a lead. I seem to have all the rights to open the file that I want to dump to stdout what if you reset errno to 0 just after all the declarations in main, before the instructions? will check this out and get back to you. sure :) pinotree: Now it suggests that it can't get the number of readable files, which the source suggests that is normal behavior. Thanks for your assistance. # IRC, freenode, #hurd, 2013-07-01 youpi: from my part I can report that I have started working with the code, and doing as Thomas suggested. I was about to write my report yesterday, but I am facing some build errors on the HURD, which I would like to investigate further before I write my report. that's why I decided to write it later in the day. I don't think you have to wait you can simply write in your report that you are having build errors ok. I will have it written and delivered later in the day. braunr: that's cool. I think my reading has paid for itself. And you may be pleased to know that I have gotten my hands dirty with the code. I was about to write report yesterday, but some build errors with the gcc (that I am investigating atm) are holding me off. Will have that written later in the day. don't hesitate to ask help about build errors don't wait too much you need to progress on what matters, and not be blocked by secondary problems I will see myself asking for help rather sooner than later, but I would like to investigate it myself, and attempt to solve the issues that occur to me before resort to bugging you guys. sure just not too long too long being a day or so these were my build_results on the hurd they were linker errors https://gist.github.com/NlightNFotis/5896188#file-build_results I am trying to build gcc on a linux 32 bit environment. It also has some issues but not linker errors will resolve them to see if the linker errors are reproducible on linux oh, lex stuff should be easy enough # IRC, freenode, #hurd, 2013-07-05 I have not made much progress, but I see myself working with it. I have managed to build gcc go on Linux but Hurd seems to have some issues it seems to randomly crash the build process? not quite randomly it seems to be though yeah I have noticed that there is a pattern it does crash after some time ^^ but it doesn't crash at specific files define crash at some times it may crash during compiling insn-emit.c (hello guys) hi braunr :) braunr: hey there! It does seem to keep on compiling this file for a very long time (I have let it do so for 10, 20, 30 minutes) but the result is the same and it does so for different files for different build options ok so it doesn't crash it just doesn't complete is the virtual machine eating 100% cpu during that time ? I can still type at the terminal, but I can't send a term signal I can report that QEMU does hold 100% of one core at that time, (like it keeps processing) but there is no output on the terminal ok of course I can type at the terminal but nothing happens any idea of the size of the files involved ? I am checking it out right now before this goes any further, let me report on my investigation i expect that to be our classic writeback thread storm issue initially, I thought it might be that it run out of memory even though I know that compilation is not memory intensive, rather, cpu intensive anyway I increased the size of ram available to the vm from 1024 mb to 1536 that didn't seem to have any effect. The "crash" still happens at the same time, at the same files use freeze not crash crash is very misleading here freeze it is then. anyway then it striked me that it might be that the hard disk size (3gb) might be too small (considering the gcc git repo is 1gb+) so I resized the qemu image to 8gb of hdd size the new size is acknowledged by the vm for gcc in debug mode? might still not be enough but still it has no effect - it seems to follow its freezing patterns giving your work, i'd have not less than 15-20 i'd use 32 *given but that's because i like power of twos pinotree: thanks for the advice. Right now I was gonna increase the swap size according to vmstat in the hurd swap size is 173 mb don't know if it does have an impact it may but before rushing if you need swap, you're doomed anyway consider swap highly unreliable on the hurd please show the output of df -h on the file system you're using to build ideally, i'd recommend using separate / and /home file systems it really improves reliability I don't think it swaps to be honest; however that's something that my mentor thomas had suggested (increasing swap size) so I am gonna try it at some time. or have a separate file system in a subdi and work on it yes, /home or whatever suits you just not / braunr: pinotree: thanks both for your advice. Will do now, and report on the results. that's not all 11:17 < braunr> please show the output of df -h on the file system you're using to build braunr: I am on it. Oh and btw, everytime I am forced to close the vm (due to the freezes) when I restart it ext2 reports that the file system was not cleanly unmounted and does some repair to some files. I am trying to find an explanation for that, but I can think of many things well obviously ext2 has no journaling the file system was not cleanly unmounted since you restarted it with a cold reset braunr: df -h comes out with this: "df: cannot read table of mounted file systems" also, even if you manage to always shut down correctly, when fsck runs because of the maximum mount count it'd find errors anyway (so we have some bug) nlightnfotis: df -h /path/to/build/dir pinotree: not really bugs but it could be cleaned up filesystem: - Size 2.8G Used 2.8G Avail 0 Use% 100% Mounted on / wow nlightnfotis: see that seems to explain many things ^^ thanks for that braunr! you resized the disk, but not the partition and the file system braunr: well, if something in ext2 (or its libs) leaves issues in the fs, i'd call that a bug :> yeah, that was utterly stupid of me pinotree: they're not issues nlightnfotis: be careful, mach needs a reboot every time you change a partition table nlightnfotis: important thing is that you found the issue :) then only, you can use resize2fs braunr: weird, I thought mach nowadays can reload the partition tables? braunr: doesn't d-i need that? maybe a recent change i forgot or maybe fdisk still reports the error although it's fine in doubt, rebooting is still safe :p or maybe youpi hacked it into d-is gnumach i doubt it would be there for the installer only :) if it's there, it's there i just don't know it braunr: teythoon: and everyone else that helped me. Thanks you all guys. This was something that was driving me crazy. Will do all that you suggested and report back on my status # IRC, freenode, #hurd, 2013-07-08 tschwinge, I have managed to overcome most of the obstacles I had initially faced with my project but I still had some build errors, that's why I have not reported yet. Wanna try to see if I can resolve them today, and write my report in the afternoon. nlightnfotis: So, from a quick look into the IRC backlog, it was a "simple" out of disk space problem? %-) That happens. nlightnfotis: And yes, GCC needs a lot of disk space. nlightnfotis: What kind of build errors are you seeing now? tschwinge, yeah I felt stupid at the time, but it didn't actually strike me that the file system didn't see the extra space. Also it took me some time to figure out that in order to mount the new partition, I only had to edit /etc/fstab always tried to mount it with the ext2 translator and the translator kept dying but it's all figured out now the latest build errors I am seeing are these nlightnfotis: o_O you used fstab and it worked? yeah nlightnfotis: that's unexpected from my perspective... I only had to add the new partition into fstab teythoon: I can pastebin my fstab if you wanna take a look at it tschwinge: these were my latest build errors https://www.dropbox.com/s/b0pssdnfa22ajbp/build_results nlightnfotis: I'm pretty sure that mount -a isn't done on hurd w/o pinos runsystem.sysv weird tschwinge: I have also tried to build gcc with "make -w" which from what I know supresses the errors that stopped compilation but the weird thing is that gcc nearly took forever to build nlightnfotis: could you do a showtrans /your/mountpoint? teythoon: /hurd/ext2fs /dev/hd0s3 nlightnfotis: ok, so you've set a passive translator and an active is started on demand it must be a passive translator nlightnfotis: this is the hurd way of doing things, fstab is unrelated it seems to persist during reboots yes, exactly teythoon: my fstab if you wanna take a look http://pastebin.com/ef94JPhG after I added /dev/hd0s3 to fstab along with its mountpoint, and restarting the hurd, only then I did manage to use that partition before doing so I tried pretty much anything involving mounting the partition and setting the ext2fs translator for it, but it kept dying of course it was a ext2 filesystem err, perhaps adding to fstab simply triggered an fsck at reboot? nlightnfotis: might have been that you needed to reboot mach so that it picks up the new partition table youpi: I thought this was fixed, the partition reloading I mean? that is needed, yes let me check youpi: it could be, though, to be honest, my hurd system does an fsck all the time at boot how do you manage to do that w/o rebooting for d-i? (I don't remember whether device busy is detected) teythoon: by making all translators go away, iirc nlightnfotis: btw, you have ~/gcc_new as mountpoint in your fstab, pretty sure that this cannot work, the path has to be absolute and no ~ expansion is done tbh it does work, and it's weird nlightnfotis: it works b/c of the passive translator you set, not b/c of the fstab entry teythoon: should I change it? probably, yes Well, that is probably not used anywhere. tschwinge: not yet but soon ;) Isn't /etc/fstab only consulted for fsck. atm yes Anyway, it is definitely a very good idea to have a partition separate from the rootfs for doing actual work. I think I described that in one of the first GSoC coodridation emails. In the long one. teythoon: Oh it struck me now! Is it because tilde expansion is only happening in bash, but /etc/fstab is read before bash is initialized? nlightnfotis: Instead of fumbling around with partitioning of disk images, it may be easier in your KVM/QEMU setup to simply add a new disk using -hdb [file] (or similar). nlightnfotis: Basically, yes. nlightnfotis: fstab is not related with bash in any way anyway, it shouldn't matter now, it seems to be working, and I wouldn't like fiddling around with it and messing it up now. I will continue with resolving the gcc issues. But /etc/fstab has its very own "language" (layout), so tilde expansion will never be done there. nlightnfotis: df -h ~/gcc_new/ tschwinge: size 24G Used: 4.2G Avail 18G OK, that's fine. As you can see on , GCC will easily need some GiB. tschwinge: I have some questions about GCC: out of curiosity how much time does it take to compile it on your machine? Because yesterday I tried a -w (suppress warnings) build and it seemed to take forever mind you the vm has 1536 ram available (I have read somewhere that it can utilise such an amount) and the vm is KVM enabled without disabling g++, it can easily take hours nlightnfotis: The build error is unexpected, because I had addressed that issue in a recent patch. :-) nlightnfotis: This is wrong: »checking whether setcontext clobbers TLS variables... [...] yes«. Please check your sources, that they correspond to the current version of the upstream tschwinge/t/hurd/go branch. nlightnfotis: Quoting from that wiki page: »This takes up around 3.5 GiB, and needs roughly 3.5 h on kepler.SCHWINGE and 15 h on coulomb.SCHWINGE.« The latter is my Hurd machine. That's however with Java and Ada enabled, and a full three-stages bootstrap. ah, right, there's java & ada too tschwinge: git branch (in the repo): master, *tschwinge/t/hurd/go in debian they are built separately What I asked you to do is configure »--disable-bootstrap --enable-languages=go«. So that should be a lot quicker. tschwinge: oh yes, everytime I have tried to compile gcc I have done with these configurations But still a few hours perhaps. that's what I did yesterday too. OK, good. :-) A bootstrap build is a good way to check the just-built GCC for sanity, but we expect that it is fine, as we concentrate on the GCC Go port. the only "extra" configuration yesterday was my "-w" flag to make, because those errors were actually triggered by -Werror Let me read up what make -w does. ;-) ah, yes, d/w I have read and understood what the bootstrap build is. Seems like we don't need it atm afaik it suppresses all warnings youpi: gcj no more the way gcc builds, it does convert (some) warnings to errors Hmm. -w, --print-directory Print a message containing the working directory before and after other processing. youpi: doko folded gcj and gdc into gcc-4.8 to "workaround" Built-Using nlightnfotis: Ah, that'S configure --enable-werror or something like that. pinotree: right yep, and -w suppresses it (from what I have understood) nlightnfotis: Are you thinking about make -k? Yeah, I guess. let me see what -k does youpi: (just to make builds even more lightweight, eh) yeah, -k should do too, I shall try it But: if gcc -Werror fails, even with make -k, the build will not be able to come to a successful end, because that one complation artefact that failed will be missing. so I shall try again with -w (supressed warnings) Configureing with --disable-werror (or similar) will "help" if -Werror is the default, and the build fails due to that. from what I have understood these "errors" are not something critical: it's only that function prototypes for these functions are missing I have seen the code there, and even "default" gcc generated prototypes (from the first usage of the function) should do, so I can't understand why it might be a serious problem if I tell gcc to skip that point nlightnfotis: Ah, now I see. You don't mean make -w, but rather gcc -w: »-w Inhibit all warning messages.« But really, there shouldn't be such warnings/errors that make the build fail. yeah nlightnfotis: In your GCC sources directory, what does this tell: git rev-parse HEAD And, is the checkout clean: git status The latter will take some time. git status takes an awful amount of time last I checked but git rev-parse HEAD produces this result: 91840dfb3942a8d241cc4f0e573e5a9956011532 OK, that's correct. So probably some of the checked out files are not in a pristine state? I shall run a git clean and see. If that doesn't work too, maybe I shall reclone the repository? there's nothing foreign to the repo that I have added, only lib gmp, lib mpc and lib mpfr (and they are in their own folders inside my gcc working directory) nlightnfotis: You shouldn't need to do the latter if you instead run: apt-get build-dep gcc-4.8 I remember having done that inside the Hurd, but it always resulted in an error from what I can recall let me check this out yes nlightnfotis: Whenever you use Git on Hurd, pass the --quiet flag, to avoid the rare but possible corruption issue described on and . tschwinge: Forgive me for that. I will set up an alias immediately. nlightnfotis: I don't know if an alias is possible, because -- I think -- you'll need to do things like: git fetch --quiet So pass --quiet to subcommands. oh. ok. nlightnfotis: What you can also do, is shut down your Hurd VM, and mount the disk image on GNU/Linux (mount with offset to get the right partition), and then run a diff -ru against a Git clone done on GNU/Linux, and see whether there are any unexpected differences outside of the .git/ directory. sounds like a plan. I will check this out today then :) tschwinge: if all else fails, then recloning the repo with --quiet passed should work, right? Yes, that's probably the most straight-forward check to do. Heh, yes to both these questions. :-) nlightnfotis: Oh, you don't even have to re-clone, but rather re-check-out the branch. I was thinking of recloning just to bring the whole repository to a pristine state So something like (inside the source directory): rm -rf ./* (remove any files, but leave .* in place, in particular the .git/ directory), followd by git checkout -f HEAD --quiet nlightnfotis: But before doing that, please do the diff first, so that we know (hopefully) where the erroneous build results were coming from. # IRC, freenode, #hurd, 2013-07-10 tschwinge: I have run the diff of the GCC repo on the Hurd against the one on my host linux os, and there was nothing relevant to fixcontext and initcontext that are the ones that fail the compilation. In any case I did recheck out the branch, and I have attempted a build with it. It fails at the same point. Now I am attempting a build with the -w (inhibit warnings) flag enabled nlightnfotis: Have there been any differences in the diff? There should be none at all. tschwinge: there were some small changes due to the repo's being checked out at different times. It was a large diff however. I inspected it and didn't find anythign that was of much use. Here it is in case you might want to see it: https://www.dropbox.com/s/ilgc3skmhst7lpv/diffs_in_git.txt nlightnfotis: Well, the idea of this exercise precisely was to use the same Git revisions on both sides of the diff -- to show that there are no spurious differences -- which can't be shown from your 124486 lines diff. (Even though indeed there is no difference in libgo/configure that would explain the mis-match, but who knows what else might be relevant for that. Would you please repeat that? tschwinge: I will do so. It was wrong from me to not diff against the same revisions, but going through the diff results grepping for the problematic code didn't yield any results, so I thought that might not be the issue. I will perform the diff again tomorrow morning and report on the results. nlightnfotis: Anyway, if you checked out again, the latest revision, and it still fails in exactly the same way, there is something wrong. nlightnfotis: And -w won't help, as there is a hard error involved. nlightnfotis: Are yous till working on GSoC things today? tschwinge: yeah I am here. I decided to do the diff today instead of tomorrow. It finished now btw let me tell you ah and this time, the gits were checked out at the same time from the same source and are at the same branch nlightnfotis: Coulod you upload the gccbuild/i686-unknown-gnu0.3/libgo/config.log of the build that failed? tschwinge: sure. give me a minute tschwinge: there is something strange going on. The two repos are at the exact same state (or at least should be, and the logs indicate them to be) but still the diff output is 4.4 mb but no presence of initcontext of fixcontext tschwinge: the config.log file --> http://pastebin.com/bSCW1JfF wow! I can see several errors in the config.log file but I am not so sure about their fatality. Config returns 0 at the end of the log nlightnfotis: As the configure scripts probe for all kings of features on all kings of strange systems, it's to be expected that some of these fail on GNU/Hurd. What is not expected, however, is: configure:15046: checking whether setcontext clobbers TLS variables [...] configure:15172: ./conftest /root/gcc_new/gcc/libgo/configure: line 1740: 1015 Aborted ./conftest$ac_exeext Hmm. apt-cache policy libc0.3 nlightnfotis: ^ tschwinge: Installed 2.13-39+hurd.3 Candidate: 2.1-6 *2.17 Bummer. nlightnfotis: As indicated in and thereabouts, you need 2.17-3+hurd.4 or later... Well. At least that now explains what is going on. tschwinge: i see. I am in the process of updating my hurd vm. I saw that libc has also been updated to 2.17 I will confirm when updating is done nlightnfotis: Anyway, is the diff between the two repositories empty now or are there still differences? there are differences and they were checked out at the same time from the same source (the official git mirror) and they are both at the same branch and still diff output is 4.4 MB but quick grepping into it and there is not mention of initcontext or fixcontext That's... unexpected. may be a mistake I am making but considering that diff run for some time before completing In both Git repositories, »git rev-parse HEAD« shows the same thing? Could you please upload the diff again? tschwinge: confirmed. libc is now version 2.17-1 tschwinge: http://pastebin.com/bSCW1JfF for the rev-parse give me a second nlightnfotis: Where is libc0.3 2.17-1 coming from? You need 2.17-3+hurd.4 or later. it is 2.17-7+hurd.1 OK, good. The URL you just have is the config.log file, not the diff. s%have%gave oh my mistake wait a minute the two repos have different output to rev-parse Phew. That explains. So the Git branches are at different revisions. that confused me... when I run git pull -a the branches that were changed were all updated to the same revision unless... there were some automatic merges in the *host* GCC repo required during some pulls but that was some time ago would it have messed my local history that much? that's the only thing that may be different between the two repos they checkout from the same source nlightnfotis: At which revisions are the two repositories/branches? I have never used »put pull -a«. What does that do? tschwinge: from what I know it does an automatic git fetch followed by git merge. The -a flag must signal to pull all branches (I think it's possible to pull only one branch) That's the --all option. -a is something different (that I don't understand off-hand). Well, --all means to pull all remotes. But you just want the GCC upstream, I guess. I always use git fetch and git merge manually. oh my god! You are write. -a is equivallent to --append https://www.kernel.org/pub/software/scm/git/docs/git-pull.html git pull must be safe though http://stackoverflow.com/questions/292357/whats-the-difference-between-git-pull-and-git-fetch without the -a *right why did I even write "right" as "write" above I don't even... what did I write in the sentence above oh my god... tschwinge: they are indeed on different revisions: The host repo's last commit was made by me apparently, to merge master into tschwinge/t/hurd/go, whereas the last commit of the Hurd repo was by you and it reverted commit 2eb51ea and that should also explain the large diff file with master merged into the tschwinge/t/hurd/go branch I will purge the debian repo and redownload it *reclone it that should bring it to a safe state I suppose. # IRC, freenode, #hurd, 2013-07-11 nlightnfotis: how's your build going? I tried one earlier and it seemed to build without any issues, something that was...strange. I am repeating the build now, but I am saving the compilation output this time to study it. it was strange that the build succeeded? that sounds sad :/ teythoon: considering that 3 weeks now I failed to build it without errors, it sure seems weird that it builds without errors now :) what did you change ? braunr: not many things apparently. To be honest the change that seemed to do the trick was (under thomas' guidance) update of libc from 2.13 to 2.17 well that can explain tschwinge: Big update! GCC-go not compiles without errors under the Hurd. I have done 2 compilations so far, none of which had issues. Time needed for full build (without bootstrap) is 45 minutes +- 1 minute. I also run the test suite, and I can confirm your results s/not/now/, perhaps? pinotree yeah. I don't know how it came up with not there. I meant now tschwinge: link for the go.sum is here --> https://www.dropbox.com/s/7qze9znhv96t1wj/go.sum # IRC, freenode, #hurd, 2013-07-12 nlightnfotis: Great! So you finally reproduced my results. :-) tschwinge: Yep! I am now building a blog, so that I can move my reports there, so that they are more detailed, to allow for greater transparency of my actions nlightnfotis: Did you recently (in email, I think?) indicate that there is another Go testsuite, for libgo? nlightnfotis: As you prefer. tschwinge: there seemed to be one, at least in linux. I think I saw one in the Hurd too. Oh indeed there is a libgo testsuite, too. as a matter of fact, make check-go did check for the lib but lib was failing yeah So please have a look at that testsuite's results, too, and compare to the GNU/Linux ones. sure. I can do that now. And for the go.sum you posted, please have a look at the tests that do not pass (»grep -v ^PASS: < go.sum«), assuming they do pass on GNU/Linux. I suggest you add a list of the differences between GNU/Linux and GNU/Hurd testresults to the wiki page, , at the end of the Part I section. I'm on it. For now, please ignore any failing tests that have »select« in their name -- that is, do file them, but do not spend a lot of time figuring out what might be wrong there. The Hurd's select implementation is a bit of a beast, and I don't want you -- at this time -- spend a lot of time on that. We already know there are some deficiencies, so we should postpone that to later. tschwinge: noted. So what I would like at the moment, is a list of the testresult differences to GNU/Linux, then from the go.log file any useful information about the failing test (which perhaps already explains) what's going wrong, and then a analysis of the failure. nlightnfotis: I assume you must be really happy that you finally got it build fine, and reproduced my results. :-) tschwinge: yeah! I can not hide from you the fact that failing all those builds made me really nervous about me missing my schedule. Having finally built that and revisiting my application I can see I am on schedule, but I have to intensify my work to compensate for any potential unforeseen obstacles , in the futute *future # IRC, freenode, #hurd, 2013-07-15 nlightnfotis: btw, do you have a weekly progress report? youpi: not yet. Will write it shortly and post it here. I made a new blog to keep track of my progress. Will report much more frequently now via my blog did you add your blog url to the hurd iwki? currently I am running gcc tests on both gcc go and libgo to see what the differences are with Linux I believe I have done so, let me see youpi: gccgo passes most of its tests (it fails a small number, and I am looking into those tests) but libgo fails 130/131 tests (on the Hurd that is)