[[!meta copyright="Copyright © 2011 Free Software Foundation, Inc."]]

[[!meta license="""[[!toggle id="license" text="GFDL 1.2+"]][[!toggleable
id="license" text="Permission is granted to copy, distribute and/or modify this
document under the terms of the GNU Free Documentation License, Version 1.2 or
any later version published by the Free Software Foundation; with no Invariant
Sections, no Front-Cover Texts, and no Back-Cover Texts.  A copy of the license
is included in the section entitled [[GNU Free Documentation
License|/fdl]]."]]"""]]

[[!tag open_issue_glibc open_issue_gnumach open_issue_hurd]]

IRC, freenode, #hurd, 2011-09-07

    <slpz> tschwinge: do you think it would be possible/convenient to
      maintain hurd and glibc versions for OSF Mach as branches in the official
      git repo?
    <tschwinge> Is OSF Mach the MkLinux one?
    <slpz> Yes, it is
    <tschwinge> slpz: If there's a suitable license, then yes, of course!
    <tschwinge> Unless there is a proper upstream, of course.
    <tschwinge> But I don't assume there is?
    <tschwinge> slpz: What is interesting for us about OSF Mach?
    <slpz> tschwinge: Peter Bruin and Jose Marchesi did a gnuified version some
      time ago (gnu-osfmach), so I suppose the license is not a problem. But
      I'm going to check it, though
    <slpz> OSF Mach has a number of interesting features
    <slpz> like migrating threads, advisory pageout, clustered pageout, kernel
      loaded tasks, short circuited RPC...
    <tschwinge> Oh!
    <tschwinge> Good.
    <slpz> right now I'm testing if it's really worth the effort
    <tschwinge> Yes.
    <tschwinge> But if the core codebase is the same (is it?) it may be
      possible to merge some things?
    <tschwinge> If the changes can be identified reasonably...
    <slpz> comparing performance of the specialized RPC of OSF Mach with
      generic IPC
    <slpz> That was my first intention, but I think that porting all those
      features will be much more work than porting Hurd/glibc to it
    <braunr> slpz: ipc performance currently matters less than clustered
      pageouts
    <braunr> slpz: i'm really not sure ..
    <braunr> i'd personally adapt the kernel
    <slpz> braunr: well, clustered pageouts is one of the changes that can be
      easily ported
    <slpz> braunr: We can consider OSF Mach code as reasonably stable, and
      porting its features to GNU Mach will take us to the point of having to
      debug all that code again
    <slpz> probably, the hardest feature to be ported is migrating threads
    <braunr> isn't that what was tried for gnu mach 2 ? or was it only about
      oskit ?
    <slpz> IIRC only oskit
    <tschwinge> slpz: But there have been some advancements in GNU Mach, too.
      For example the Xen port.
    <tschwinge> But we can experiment with it, of course.
    <slpz> tschwinge: I find it easier to move the Xen support from GNU Mach to
      OSF Mach than to port MT in the other direction
    <tschwinge> slpz: And I think MkLinux is a single-server, so I don't think
      they used IPC as much as we did?
    <tschwinge> slpz: OK, I see.
    <braunr> slpz: MT aren't as needed as clustered pageouts :p
    <braunr> gnumach already has ipc handoff, so MT would just consume less
      stack space, and only slightly improve raw ipc performance
    <tschwinge> slpz: But we will surely accept patches that get the Hurd/glibc
      ported to OSF Mach, no question.
    <braunr> (it's required for other issues we discussed already, but not a
      priority imo)
    <slpz> tschwinge: MkLinux makes heavy use of IPC, but it tries to
      "short-circuit" it when running as a kernel loaded task
    <tschwinge> And it's obviously best to keep it in one place.  Luckily it's
      not CVS branches anymore...  :-)
    <slpz> braunr: well, I'm a bit obsessed with IPC performance, if the RPC on
      OSF Mach really makes a difference, I want it for Hurd right now
    <slpz> braunr: clustered pages can be implemented at any time :-)
    <slpz> tschwinge: great!
    <tschwinge> slpz: In fact, haven't there already been some Savannah
      repositories created, several (five?) years ago?
    <braunr> slpz: the biggest performance issue on the hurd is I/O
    <braunr> and the easiest way to improve that is better VM transfers
    <slpz> tschwinge: yes, the HARD project, but I think it wasn't too well
      received...
    <tschwinge> slpz: Quite some things changed since then, I'd say.
    <slpz> braunr: I agree, but IPC is the hardest part to optimize
    <slpz> braunr: If we have a fast IPC, the rest of improvements are way
      easier
    <braunr> slpz: i don't see how faster IPC makes I/O faster :(
    <braunr> slpz: read
      http://www.sceen.net/~rbraun/the_increasing_irrelevance_of_ipc_performance_for_microkernel_based_operating_systems.pdf
      again :)
    <slpz> braunr: IPC puts the upper limit of how fast I/O could be
    <braunr> the abstract for my thesis on x15 mach was that the ipc code was
      the most focused part of the kernel
    <braunr> so my approach was to optimize everything *else*
    <braunr> the improvements in UVM (and most notably clustered page
      transfers) show global system improvements up to 30% in netbsd
    <braunr> we should really focus on the VM first (which btw, is a pain in
      the ass with the crappy panicking swap code in place)
    <braunr> and then complete the I/O system
    <slpz> braunr: If a system can't transfer data between translators faster
      than 100 MB/s, faster devices don't make much sense
    <guillem> has anyone considered switching the syscalls to use
      sysenter/syscall instead of soft interrupts?
    <slpz> braunr: but I agree on the VM part
    <braunr> guillem: it's in my thesis .. but only there :)
    <braunr> slpz: let's reach 100 MiB/s first, then improve IPC
    <slpz> guillem: that's a must do, also moving to 64 bits :-)
    <braunr> guillem: there are many tiny observations in it, like the use of
      global page table entries, which was added by youpi around that time
    <guillem> slpz: I wanted to fix all warnings first before sending my first
      batch of 64 bit fixes, but I think I'll just send them after checking
      they don't introduce regressions on i386
    <guillem> braunr: interesting I think I might have skimmed over your
      thesis, maybe I should read it properly some time :)
    <slpz> braunr: I see it exactly the opposite way. First push IPC to its
      limit, then improve devices/VM
    <slpz> guillem: that's great :-)
    <braunr> slpz: improving ipc now will bring *nothing*, whereas improving
      vm/io now will make the system considerably more useable
    <guillem> but then fixing 64-bit issues in the Linux code is pretty
      annoying given that the latest code from upstream has that already fixed,
      and we are “supposed” to drop the linux code from gnumach at some point
      :)
    <braunr> slpz: that's a basic principle in profiling, improve what brings
      the best gains
    <slpz> braunr: I'm not thinking about today, I'm thinking about how fast
      Hurd could be when running on Mach. And, as I said, IPC is the absolute
      upper limit.
    <braunr> i'm really not convinced
    <braunr> there are that many tasks making extensive use of IPCs
    <braunr> most are cpu/IO bound
    <slpz> but I have to acknowledge that this concern has been really
      alleviated by the EPT improvement discovery
    <braunr> there aren't* that many tasks
    <slpz> braunr: create a ramdisk and write some files on it
    <slpz> braunr: there's no I/O in that case, and performance is really low
      too
    <braunr> well, ramdisks don't even work correctly iirc
    <slpz> I must say that I consider improvements in OOL data moving as if it
      were in IPC itself
    <slpz> braunr: you can simulate one with storeio
    <braunr> slpz: then measure what's slow
    <braunr> slpz: it couldn't simply be the vm layer
    <slpz> braunr:
      http://www.gnu.org/s/hurd/hurd/libstore/examples/ramdisk.html
    <braunr> ok, it's not a true ramdisk
    <braunr> it's a stack of a ramdisk and extfs servers
    <braunr> ext2fs*
    <braunr> i was thinking about tmpfs
    <slpz> True, but one of the Hurd's main advantages is the ability to do
      that kind of thing
    <slpz> so they must work with a reasonable performance
    <braunr> other systems can too ..
    <braunr> anyway
    <braunr> i get your point, you want faster IPCs, like everyone does
    <slpz> braunr: yes, and I also want to know how fast it could be, to have
      a reference when profiling complex services
    <antrik> slpz: really improving IPC performance probably requires changing
      the semantics... but we don't know which semantics we want until we have
      actually tried fixing the existing bottlenecks
    <antrik> well, not only bottlenecks... also other issues such as resource
      management
    <slpz> antrik: I think fixing bottlenecks would probably require changes in
      some Mach interfaces, not in the IPC subsystem
    <slpz> antrik: I mean, IPC semantics just provide the basis for messaging,
      I don't think we will need to change them further
    <antrik> slpz: right, but only once we have addressed the bottlenecks (and
      other major shortcomings), we will know how the IPC mechanisms need to
      change to get further improvements...
    <antrik> of course improving Mach IPC performance is interesting too -- if
      nothing else, then to see how much of a difference it really makes... I
      just don't think it should be considered an overriding priority :-)
    <youpi> slpz: I agree with braunr, I don't think improving IPC will bring
      much on the short term
    <youpi> the buildds are slow mostly because of bad VM
    <youpi> like lack of read-ahead, the randomness of object cache pageout,
      etc.
    <youpi> that doesn't mean IPC shouldn't be improved of course
    <youpi> but we have a big margin for iow
    <youpi> s/iow/now
    <slpz> youpi: I agree with you and with braunr in that regard. I'm not
      looking for an immediate improvement, I just want to see how fast the IPC
      (especially OOL data transfers) could be.
    <slpz> also, migrating threads will help to fix some problems related to
      resource management
    <antrik> slpz: BTW, what about Apple's Mach? isn't it essentially OSF Mach
      with some further improvements?...
    <slpz> antrik: IPC is an area with very little room for improvement, so I
      don't think we will fix those bottlenecks by applying some changes there
    <antrik> well, for large OOL transfers, the limiting factor is certainly
      also VM rather than the thread model?...
    <slpz> antrik: yes, but I think it is encumbered by the APSLv2 license
    <antrik> ugh
    <slpz> antrik: for OOL transfers, VM plays a big role, but IPC also has a
      great deal of responsibility
    <antrik> as for resource management, migrating threads do not really help
      much IMHO, as they only affect CPU scheduling. memory usage is a much
      more pressing issue
    <antrik> BTW, I have thought about passive objects in the past, but didn't
      reach any conclusion... so I'm a bit ambivalent about migrating threads
      :-)
    <slpz> As an example, in Hurd on GNU Mach, an io_read can't take advantage
      of copy-on-write, as buffers from the translator always arrive outside
      the user's buffer
    <slpz> antrik: well, I think cpu scheduling is a big deal ;-)
    <slpz> antrik: and for memory management, until a better design is
      implemented, some fixes could be applied to get us to the same level as a
      monolithic kernel
    <antrik> to get even close to monolithic systems, we need either a way to
      account server resources used on client's behalf, or to make servers use
      client-provided resources. both require changes in the IPC mechanism I
      think...
    <antrik> (though *if* we go for the latter option, the CPU scheduling
      changes of migrating threads would of course be necessary, in addition to
      any changes regarding memory management...)
    <antrik> slpz: BTW, I didn't get the point about io_read and COW...
    <slpz> antrik: AFAIK, the FS cache (which is our primary concern) in most
      monolithic systems is agnostic with respect to the users, and only deals
      with absolute numbers. In our case we can do almost the same by combining
      Mach and pagers knowledge.
    <antrik> slpz: my primary concern is that any program having a hiccup
      crashes the system... and I'm not sure this can be properly fixed without
      working memory accounting
    <antrik> (I guess in can be worked around to some extent by introducing
      various static limits on processes... but I'm not sure how well)
    <antrik> it can
    <slpz> antrik: monolithic systems also suffer from that problem (remember
      fork bombs) and it's "solved" by imposing static limits on user processes
      (ulimit).
    <slpz> antrik: we do have more problems due to port management, but I think
      some degree of control can be achieved with a reasonable amount of
      changes.
    <antrik> slpz: in a client-server architecture static limits are much less
      effective... that problem exists on traditional systems too, but only in
      some specific cases (such as X server); while on a microkernel system
      it's ubiquitous... that's why we need a *better* solution to this problem
      to get anywhere close to monolithic systems
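
The disagreement in the log above (whether per-message IPC cost or the VM/data-transfer path limits I/O throughput) can be illustrated with a very rough model. The sketch below is not taken from the discussion: the function name, the model itself (a fixed per-message cost plus a per-byte transfer cost), and every number in it are assumptions chosen only for illustration, and nothing here was measured on GNU Mach or OSF Mach.

```python
def effective_throughput(message_bytes, ipc_overhead_s, copy_bandwidth_bps):
    """Return bytes per second under a simple, assumed cost model.

    Each message is assumed to pay a fixed IPC cost (ipc_overhead_s) plus
    a transfer cost proportional to its size (message_bytes divided by
    copy_bandwidth_bps). Real kernels are far more complicated than this.
    """
    if message_bytes <= 0 or copy_bandwidth_bps <= 0 or ipc_overhead_s < 0:
        raise ValueError("sizes and bandwidth must be positive, overhead >= 0")
    time_per_message = ipc_overhead_s + message_bytes / copy_bandwidth_bps
    return message_bytes / time_per_message


if __name__ == "__main__":
    # Hypothetical figures: 5 microseconds per message, 1 GB/s transfer path.
    overhead = 5e-6
    bandwidth = 1e9
    for size in (4096, 65536, 1 << 20):
        rate = effective_throughput(size, overhead, bandwidth)
        print(f"{size:>8} bytes/message: {rate / 1e6:8.1f} MB/s")
```

Under these assumed figures, the fixed per-message cost dominates for page-sized messages, while for large transfers the result approaches the bandwidth of the transfer path. Which regime the Hurd is actually in can only be settled by measurement, as several participants point out.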