[[!meta copyright="Copyright © 2011, 2012, 2013, 2014 Free Software Foundation, Inc."]] [[!meta license="""[[!toggle id="license" text="GFDL 1.2+"]][[!toggleable id="license" text="Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. A copy of the license is included in the section entitled [[GNU Free Documentation License|/fdl]]."]]"""]] [[!tag open_issue_mig]] [[!toc]] # 32-Bit vs. 64-Bit Interfaces ## IRC, freenode, #hurd, 2011-10-16 <braunr> i guess it wouldn't be too hard to have a special mach kernel for 64 bits processors, but 32 bits userland only <youpi> well, it means tinkering with mig <braunr> like old sparc systems :p <youpi> to build the 32bit interface, not the 64bit one <braunr> ah yes <braunr> hm <braunr> i'm not sure <braunr> mig would assume a 32 bits kernel, like now <youpi> and you'll have all kinds of discrepancies in vm_size_t & such <braunr> yes <braunr> the 64 bits type should be completely internal <braunr> types* <braunr> but it would be far less work than changing all the userspace bits for 64 bit (ofc we'll do that some day but in the meanwhile ..) <youpi> yes <youpi> and it'd boost userland addrespace to 4GiB <braunr> yes <youpi> leaving time for a 64bit userland :) ## IRC, freenode, #hurd, 2011-11-14 <braunr> also, what's the best way to deal with types such as <braunr> type cache_info_t = struct[23] of integer_t; <braunr> whereas cache_info_t contains longs, which are obviously not integer-wide on 64-bits processors <braunr> ? <youpi> you mean, to port mach to 64bit? <braunr> no, to make the RPC declaration portable <braunr> just in case :) <youpi> refine integer_t into something more precise <youpi> such as size_t, off_t, etc. <braunr> i can't use a single line then <braunr> struct cache_info contains ints, vm_size_t, longs <braunr> should i just use the maximum size it can get ? <braunr> or declare two sizes depending on the word size ? <youpi> well, I'd say three <braunr> youpi: three ? <youpi> the ints, the vm_size_ts, and the longs <braunr> youpi: i don't get it <braunr> youpi: how would i write it in mig language ? <youpi> I don't know the mig language <braunr> me neither :) <youpi> but I'd say don't lie <braunr> i just see struct[23] of smething <braunr> the original zone_info struct includes both integer_t and vm_size_t, and declares it as <braunr> type zone_info_t = struct[9] of integer_t; <braunr> in its mig defs file <braunr> i don't have a good example to reuse <youpi> which is lying <braunr> yes <braunr> which is why i was wondering if mach architects themselves actually solved that problem :) <braunr> "There is no way to specify the fields of a <braunr> C structure to MIG. The size and type-desc are just used to give the size of <braunr> the structure. <braunr> " <braunr> well, this sucks :/ <braunr> well, i'll do what the rest of the code seems to do, and let it rot until a viable solution is available <antrik> braunr: we discussed the problem of expressing structs with MIG in the libburn thread <antrik> (which I still need to follow up on... [sigh]) ## IRC, freenode, #hurd, 2012-12-12 In context of [[microkernel/mach/gnumach/memory_management]]. <tschwinge> Or with a 64-bit one? ;-P <braunr> tschwinge: i think we all had that idea in mind :) <pinotree> tschwinge: patches welcome :P <youpi> tschwinge: sure, please help us settle down with the mig stuff <youpi> what was blocking me was just deciding how to do it <braunr> hum, what's blocking x86_64, except time to work on it ? <youpi> deciding the mig types & such things <youpi> i.e. the RPC ABI <braunr> ok <braunr> easy answer: keep it the same <youpi> sorry, let me rephrase <youpi> decide what ABI is supposed to be on a 64bit system, so as to know which way to rewrite the types of the kernel MIG part to support 64/32 conversion <braunr> can't this be done in two steps ? <youpi> well, it'd mean revamping the whole kernel twice <youpi> as the types at stake are referenced in the whole RPC code <braunr> the first step i imagine would simply imply having an x86_64 kernel for 32-bits userspace, without any type change (unless restricting to 32-bits when a type is automatically enlarged on 64-bits) <youpi> it's not so simple <youpi> the RPC code is tricky <youpi> and there are alignments things that RPC code uses <youpi> which become different when build with a 64bit compiler <pinotree> there are also things like int[N] for io_stat_struct and so on <braunr> i see <youpi> making the code wrong for 32 <youpi> thus having to change the types <youpi> pinotree: yes <pinotree> (doesn't mig support structs, or it is too clumsy to be used in practice?) <braunr> pinotree: what's the problem with that (i explcitely said changing int to e.g. int32_t) <youpi> that won't fly for some of the calls <youpi> e.g. getting a thread state <braunr> pinotree: no it doesn't support struct <pinotree> braunr: that some types in struct stat are long, for instance <braunr> pinotree: same thing with longs <braunr> youpi: why wouldn't it ? <youpi> that wouldn't work on a 64bit system <youpi> so we can't make it int32_t in the interface definition <braunr> i understand the alignment issues and that the mig code adjusts the generated code, but not the content of what is transfered <braunr> well of course <braunr> i'm talking about the first step here <braunr> which targets a 32-bits userspace only <youpi> ok, so we agree <youpi> the second step would have to revamp the whole RPC code again <braunr> i imagine the first to be less costly <braunr> well, actually no <braunr> you're right, the mig stuff would be easy on the application side, but more complicated on the kernel side, since it would really mean dealing with 64-bits values there <braunr> (unless we keep a 3/1 split instead of giving the full 4g to applications) See also [[microkernel/mach/gnumach/memory_management]]. <youpi> (I don't see what that changes) <braunr> if the kernel still runs with 32-bits addresses, everything it recevies from or sends through mig can be stored with the user side 32-bits types <youpi> err, ok, but what's the point of the 64bit kernel then ? :) <braunr> and it simply uses 64-bits addresses to deal with physical memory <youpi> ok <youpi> that could even be a 3.5/0.5 split then <braunr> but the memory model forces us to run either at the low 2g or the highest ones <youpi> but linux has 3/1, so we don't need that <braunr> otherwise we need an mcmodel=medium <braunr> we could do with mcmodel=medium though, for a time <braunr> hm actually no, it would require mcmodel=large <braunr> hum, that's stupid, we can make the kernel run at -2g, and use 3g up to the sign extension hole for the kernel map ## IRC, freenode, #hurd, 2013-12-03 <azeem> I believe the main issue is redoing the RPCs in 64bit, i.e. the Mach/Hurd interface <braunr> mach has always been 64-bits capable <braunr> the problem is both mach and the hurd <braunr> it's at the system interface (the .defs of the RPCs) <braunr> azeem: ah, actually that's why you also say <braunr> but i consider it to be a hurd problem <braunr> the hurd itself is defined as being a set of interfaces and servers implementing them, i wouldn't exclude the interfaces <braunr> that's what* # Structured Data ## IRC, freenode, #hurd, 2013-06-25 <teythoon> is there a nice way to get structured data through mig that I haven't found yet? <teythoon> say an array of string triples <braunr> no <teythoon> :/ <braunr> but you shouldn't need that <teythoon> my use case is getting info about fs translators from init to procfs [[hurd/translator/mtab]], [[hurd/translator/mtab/discussion]]. <teythoon> should I go for an iterator like interface instead? <braunr> depends <braunr> how many do you need ? <braunr> you could go for a variable sized array too <braunr> have a look at what already exists <teythoon> records, maybe 10-15, depends on many fs translators are running <braunr> a variable sized array is ok if the size isn't too big (and when i say too big, i mean hundreds of MiB) <braunr> an iterator is ok too if there aren't too many items <braunr> you may want to combine both (i think that's what proc does) <braunr> be aware that the maximum size of a message is limited to 512 MiB <teythoon> yeah I saw the array[] of stuff stuff, but array[] of string_t does not work, I guess b/c string_t is also an array <teythoon> how would I send an array of variable length strings? <braunr> i'm not sure you can <braunr> or maybe out of line <teythoon> somehow I expected mig to serialize arbitrary data structures, maybe it's to old for that? <teythoon> yeah, I read about uot of line, but that seems overkill <braunr> it is old yes <braunr> and not very user friendly in the end <braunr> let me check <teythoon> we could stuff json into mig... <braunr> see proc_getallpids for example <braunr> we could get rid of low level serialization altogether :p <teythoon> hah, exactly what I was looking at <braunr> (which is what i'll do in x15) <braunr> type pidarray_t = array[] of pid_t; <teythoon> but that is trivial b/c its array[] of pid_t <braunr> and always have the server writing guide near you <teythoon> yes <braunr> well, make one big string and an array of lengths :p <teythoon> thought about that and said to myself, there must be a better way that I haven't found yet <braunr> or one big string filled with real null-terminated c strings that you keep parsing until you ate all input bytes <braunr> i'm almost certain there isn't <braunr> type string_t = c_string[1024]; /* XXX */ <teythoon> yes <braunr> even that isn't really variable sized <teythoon> you think anyone would object to me putting a json encoder in /hurd/init? it is probably better than me at serializing stuff... <braunr> try with mig anyway <braunr> the less dependencies we have for core stuff, the simpler it is <braunr> but i agree, mig is painful <teythoon> would it be too hacky if I abused the argz functions? they do exactly what I'd need ## IRC, freenode, #hurd, 2013-06-26 <teythoon> there is https://code.google.com/p/protobuf-c/ and it has a rpc mechanism and I believe one could plug arbitrary transports easily <braunr> please don't think about it <braunr> we really don't want to add another layer of serialization <braunr> it's better to completely redesign mach ipc anyway <braunr> and there is a project for that :p <teythoon> ive seen x15 <teythoon> just food for thought <braunr> i've studied google protocol buffers <braunr> and fyi, no, it wouldn't be easy to plug arbitrary transports on top of mach <braunr> there is a lot of knowledge about mach ports in mig [[hurd/translator/mtab]], [[hurd/translator/mtab/discussion]]. <teythoon> but again I face the challenge of serializing a arbitrary sized list of arbitrary sized strings <braunr> yes <teythoon> list of ports is easier ;) but I think its worthwile <teythoon> so what about abusing argz* for this? you think it's too bad a hack? <braunr> no since it's in glibc <teythoon> awesome :) <braunr> but i don't remember the details well and i'm not sure the way you use it is safe <teythoon> yeah, I might have got the details wrong, I hadn't had the chance to test it ;) <braunr> about this dynamic size problem <braunr> a "simple" varying size array should do <braunr> you can easily put all your strings in there <teythoon> seperated by 0? <braunr> yes <teythoon> that's exactly what the argz stuff does <braunr> you'll get the size of the array anyway, and consume it until there is no byte left <braunr> good <braunr> but be careful with this too <braunr> since translators can be run by users, they somtimes can't be trusted <braunr> and even a translator running as root may behave badly <braunr> so careful with parsing <teythoon> noted