rev |
line source |
meillo@36
|
1 .nr PS 11
|
meillo@36
|
2 .nr VS 13
|
meillo@0
|
3 .nr lu 0
|
meillo@0
|
4 .de CW
|
meillo@0
|
5 .nr PQ \\n(.f
|
meillo@0
|
6 .if t .ft CW
|
meillo@17
|
7 .ie ^\\$1^^ .if n .ul 999
|
meillo@0
|
8 .el .if n .ul 1
|
meillo@17
|
9 .if t .if !^\\$1^^ \&\\$1\f\\n(PQ\\$2
|
meillo@0
|
10 .if n .if \\n(.$=1 \&\\$1
|
meillo@0
|
11 .if n .if \\n(.$>1 \&\\$1\c
|
meillo@0
|
12 .if n .if \\n(.$>1 \&\\$2
|
meillo@0
|
13 ..
|
meillo@0
|
14 .ds [. \ [
|
meillo@0
|
15 .ds .] ]
|
meillo@42
|
16
|
meillo@42
|
17 .rn NH _N
|
meillo@42
|
18 .de NH
|
meillo@42
|
19 .if '\\$1'1' .sp 2v
|
meillo@42
|
20 .if '\\$1'1' .nr PS +2
|
meillo@42
|
21 ._N \\$1
|
meillo@42
|
22 .if '\\$1'1' .nr PS -2
|
meillo@42
|
23 ..
|
meillo@42
|
24
|
meillo@1
|
25 .\"----------------------------------------
|
meillo@42
|
26
|
meillo@0
|
27 .TL
|
meillo@42
|
28 .ps +4
|
meillo@6
|
29 Why the Unix Philosophy still matters
|
meillo@0
|
30 .AU
|
meillo@0
|
31 markus schnalke <meillo@marmaro.de>
|
meillo@0
|
32 .AB
|
meillo@1
|
33 .ti \n(.iu
|
meillo@39
|
34 This paper explains the importance of the Unix Philosophy for software design.
|
meillo@0
|
35 Today, few software designers are aware of these concepts,
|
meillo@39
|
36 and thus a lot of modern software is more limited than necessary
|
meillo@39
|
37 and makes less use of software leverage than possible.
|
meillo@38
|
38 Knowing and following the guidelines of the Unix Philosophy makes software more valuable.
|
meillo@0
|
39 .AE
|
meillo@0
|
40
|
meillo@2
|
41 .FS
|
meillo@2
|
42 .ps -1
|
meillo@39
|
43 This paper was prepared for the ``Software Analysis'' seminar at University Ulm.
|
meillo@39
|
44 Mentor was professor Schweiggert. 2010-04-05
|
meillo@2
|
45 .br
|
meillo@39
|
46 You may retrieve this document from
|
meillo@39
|
47 .CW \s-1http://marmaro.de/docs \ .
|
meillo@2
|
48 .FE
|
meillo@2
|
49
|
meillo@0
|
50 .NH 1
|
meillo@0
|
51 Introduction
|
meillo@42
|
52 .XS
|
meillo@42
|
53 \*(SN Introduction
|
meillo@42
|
54 .XE
|
meillo@0
|
55 .LP
|
meillo@40
|
56 The Unix Philosophy is the essence of how the Unix operating system,
|
meillo@40
|
57 especially its toolchest, was designed.
|
meillo@40
|
58 It is no limited set of fixed rules,
|
meillo@40
|
59 but a loose set of guidelines which tell how to write software that
|
meillo@40
|
60 suites well into Unix.
|
meillo@40
|
61 Actually, the Unix Philosophy describes what is common to typical Unix software.
|
meillo@40
|
62 The Wikipedia has an accurate definition:
|
meillo@40
|
63 .[
|
meillo@40
|
64 %A Wikipedia
|
meillo@40
|
65 %T Unix philosophy
|
meillo@40
|
66 %P Wikipedia, The Free Encyclopedia
|
meillo@40
|
67 %D 2010-03-21 17:20 UTC
|
meillo@40
|
68 %O .CW \s-1http://en.wikipedia.org/w/index.php?title=Unix_philosophy&oldid=351189719
|
meillo@40
|
69 .]
|
meillo@40
|
70 .QP
|
meillo@41
|
71 .ps -1
|
meillo@40
|
72 The \fIUnix philosophy\fP is a set of cultural norms and philosophical
|
meillo@40
|
73 approaches to developing software based on the experience of leading
|
meillo@40
|
74 developers of the Unix operating system.
|
meillo@1
|
75 .PP
|
meillo@40
|
76 As there is no single definition of the Unix Philosophy,
|
meillo@40
|
77 several people have stated their view on what it comprises.
|
meillo@1
|
78 Best known are:
|
meillo@1
|
79 .IP \(bu
|
meillo@1
|
80 Doug McIlroy's summary: ``Write programs that do one thing and do it well.''
|
meillo@1
|
81 .[
|
meillo@1
|
82 %A M. D. McIlroy
|
meillo@1
|
83 %A E. N. Pinson
|
meillo@1
|
84 %A B. A. Taque
|
meillo@1
|
85 %T UNIX Time-Sharing System Forward
|
meillo@1
|
86 %J The Bell System Technical Journal
|
meillo@1
|
87 %D 1978
|
meillo@1
|
88 %V 57
|
meillo@1
|
89 %N 6
|
meillo@1
|
90 %P 1902
|
meillo@1
|
91 .]
|
meillo@1
|
92 .IP \(bu
|
meillo@1
|
93 Mike Gancarz' book ``The UNIX Philosophy''.
|
meillo@1
|
94 .[
|
meillo@1
|
95 %A Mike Gancarz
|
meillo@1
|
96 %T The UNIX Philosophy
|
meillo@1
|
97 %D 1995
|
meillo@1
|
98 %I Digital Press
|
meillo@1
|
99 .]
|
meillo@1
|
100 .IP \(bu
|
meillo@1
|
101 Eric S. Raymond's book ``The Art of UNIX Programming''.
|
meillo@1
|
102 .[
|
meillo@1
|
103 %A Eric S. Raymond
|
meillo@1
|
104 %T The Art of UNIX Programming
|
meillo@1
|
105 %D 2003
|
meillo@1
|
106 %I Addison-Wesley
|
meillo@2
|
107 %O .CW \s-1http://www.faqs.org/docs/artu/
|
meillo@1
|
108 .]
|
meillo@0
|
109 .LP
|
meillo@1
|
110 These different views on the Unix Philosophy have much in common.
|
meillo@40
|
111 Especially, the main concepts are similar in all of them.
|
meillo@40
|
112 McIlroy's definition can surely be called the core of the Unix Philosophy,
|
meillo@40
|
113 but the fundamental idea behind it all, is ``small is beautiful''.
|
meillo@40
|
114
|
meillo@40
|
115 .PP
|
meillo@40
|
116 The Unix Philosophy tells how to design and write good software for Unix.
|
meillo@40
|
117 Many concepts described here base on facilities of Unix.
|
meillo@40
|
118 Other operating systems may not offer such facilities,
|
meillo@41
|
119 hence it may not be possible to design software in the way of the
|
meillo@41
|
120 Unix Philosophy for them.
|
meillo@40
|
121 .PP
|
meillo@41
|
122 The Unix Philosophy has an idea of how the process of software development
|
meillo@41
|
123 should look like, but large parts of the philosophy are quite independent
|
meillo@41
|
124 from the development process used.
|
meillo@41
|
125 However, one will soon recognize that some development processes work well
|
meillo@41
|
126 with the ideas of the Unix Philosophy and support them, while others are
|
meillo@41
|
127 at cross-purposes.
|
meillo@41
|
128 Kent Beck's books about Extreme Programming are valuable supplimental
|
meillo@41
|
129 resources.
|
meillo@1
|
130 .PP
|
meillo@41
|
131 The question of how to actually write code and how the code should looks
|
meillo@41
|
132 like internally, are out of focus here.
|
meillo@41
|
133 ``The Practice of Programming'' by Kernighan and Pike,
|
meillo@41
|
134 .[
|
meillo@41
|
135 %A Brian W. Kernighan
|
meillo@41
|
136 %A Rob Pike
|
meillo@41
|
137 %T The Practice of Programming
|
meillo@41
|
138 %I Addison-Wesley
|
meillo@41
|
139 %D 1999
|
meillo@41
|
140 .]
|
meillo@41
|
141 is a good book that covers this topic.
|
meillo@41
|
142 Its point of view matches to the one of this paper.
|
meillo@0
|
143
|
meillo@0
|
144 .NH 1
|
meillo@6
|
145 Importance of software design in general
|
meillo@42
|
146 .XS
|
meillo@42
|
147 .sp .5v
|
meillo@42
|
148 \*(SN Importance of software design in general
|
meillo@42
|
149 .XE
|
meillo@0
|
150 .LP
|
meillo@40
|
151 Software design is the planning of how the internal structure
|
meillo@40
|
152 and external interfaces of a software should look like.
|
meillo@39
|
153 It has nothing to do with visual appearance.
|
meillo@39
|
154 If we take a program as a car, then its color is of no matter.
|
meillo@39
|
155 Its design would be the car's size, its shape, the locations of doors,
|
meillo@39
|
156 the passenger/space ratio, the luggage capacity, and so forth.
|
meillo@39
|
157 .PP
|
meillo@39
|
158 Why should software get designed at all?
|
meillo@6
|
159 It is general knowledge, that even a bad plan is better than no plan.
|
meillo@39
|
160 Not designing software means programming without plan.
|
meillo@39
|
161 This will pretty sure lead to horrible results.
|
meillo@39
|
162 Horrible to use and horrible to maintain.
|
meillo@39
|
163 These two aspects are the visible ones.
|
meillo@39
|
164 Often invisible are the wasted possible gains.
|
meillo@39
|
165 Good software design can make these gains available.
|
meillo@2
|
166 .PP
|
meillo@39
|
167 A software's design deals with quality properties.
|
meillo@39
|
168 Good design leads to good quality, and quality is important.
|
meillo@39
|
169 Any car may be able to drive from A to B,
|
meillo@39
|
170 but it depends on the car's properties whether it is a good choice
|
meillo@39
|
171 for passenger transport or not.
|
meillo@39
|
172 It depends on its properties if it is a good choice
|
meillo@39
|
173 for a rough mountain area.
|
meillo@39
|
174 And it depends on its properties if the ride will be fun.
|
meillo@39
|
175
|
meillo@2
|
176 .PP
|
meillo@39
|
177 Requirements for a software are twofold:
|
meillo@39
|
178 functional and non-functional.
|
meillo@39
|
179 .IP \(bu
|
meillo@39
|
180 Functional requirements define directly the software's functions.
|
meillo@39
|
181 They are the reason why software gets written.
|
meillo@39
|
182 Someone has a problem and needs a tool to solve it.
|
meillo@39
|
183 Being able to solve the problem is the main functional goal.
|
meillo@39
|
184 It is the driving force behind all programming effort.
|
meillo@39
|
185 Functional requirements are easier to define and to verify.
|
meillo@39
|
186 .IP \(bu
|
meillo@39
|
187 Non-functional requirements are also called \fIquality\fP requirements.
|
meillo@39
|
188 The quality of a software are the properties that are not directly related to
|
meillo@39
|
189 the software's basic functions.
|
meillo@39
|
190 Tools of bad quality often solve the problems they were written for,
|
meillo@39
|
191 but introduce problems and difficulties for usage and development, later on.
|
meillo@39
|
192 Quality aspects are often overlooked at first sight,
|
meillo@39
|
193 and they are often difficult to define clearly and to verify.
|
meillo@2
|
194 .PP
|
meillo@39
|
195 Quality is of few matter when the software gets built initially,
|
meillo@39
|
196 but it is of matter for usage and maintenance of the software.
|
meillo@6
|
197 A short-sighted might see in developing a software mainly building something up.
|
meillo@39
|
198 But experience shows, that building the software the first time is
|
meillo@39
|
199 only a small amount of the overall work.
|
meillo@39
|
200 Bug fixing, extending, rebuilding of parts
|
meillo@39
|
201 \(en maintenance work, for short \(en
|
meillo@6
|
202 does soon take over the major part of the time spent on a software.
|
meillo@6
|
203 Not to forget the usage of the software.
|
meillo@6
|
204 These processes are highly influenced by the software's quality.
|
meillo@39
|
205 Thus, quality must not be neglected.
|
meillo@39
|
206 The problem with quality is that you hardly ``stumble over''
|
meillo@39
|
207 bad quality during the first build,
|
meillo@6
|
208 but this is the time when you should care about good quality most.
|
meillo@6
|
209 .PP
|
meillo@39
|
210 Software design is less the basic function of a software \(en
|
meillo@39
|
211 this requirement will get satisfied anyway.
|
meillo@39
|
212 Software design is more about quality aspects of the software.
|
meillo@39
|
213 Good design leads to good quality, bad design to bad quality.
|
meillo@6
|
214 The primary functions of the software will be affected modestly by bad quality,
|
meillo@39
|
215 but good quality can provide a lot of additional gain,
|
meillo@6
|
216 even at places where one never expected it.
|
meillo@6
|
217 .PP
|
meillo@6
|
218 The ISO/IEC 9126-1 standard, part 1,
|
meillo@6
|
219 .[
|
meillo@9
|
220 %I International Organization for Standardization
|
meillo@6
|
221 %T ISO Standard 9126: Software Engineering \(en Product Quality, part 1
|
meillo@6
|
222 %C Geneve
|
meillo@6
|
223 %D 2001
|
meillo@6
|
224 .]
|
meillo@6
|
225 defines the quality model as consisting out of:
|
meillo@6
|
226 .IP \(bu
|
meillo@6
|
227 .I Functionality
|
meillo@6
|
228 (suitability, accuracy, inter\%operability, security)
|
meillo@6
|
229 .IP \(bu
|
meillo@6
|
230 .I Reliability
|
meillo@6
|
231 (maturity, fault tolerance, recoverability)
|
meillo@6
|
232 .IP \(bu
|
meillo@6
|
233 .I Usability
|
meillo@6
|
234 (understandability, learnability, operability, attractiveness)
|
meillo@6
|
235 .IP \(bu
|
meillo@6
|
236 .I Efficiency
|
meillo@9
|
237 (time behavior, resource utilization)
|
meillo@6
|
238 .IP \(bu
|
meillo@6
|
239 .I Maintainability
|
meillo@23
|
240 (analyzability, changeability, stability, testability)
|
meillo@6
|
241 .IP \(bu
|
meillo@6
|
242 .I Portability
|
meillo@6
|
243 (adaptability, installability, co-existence, replaceability)
|
meillo@6
|
244 .LP
|
meillo@39
|
245 Good design can improve these properties of a software,
|
meillo@39
|
246 bad designed software probably suffers from not having them.
|
meillo@7
|
247 .PP
|
meillo@7
|
248 One further goal of software design is consistency.
|
meillo@7
|
249 Consistency eases understanding, working on, and using things.
|
meillo@39
|
250 Consistent internal structure and consistent interfaces to the outside
|
meillo@39
|
251 can be provided by good design.
|
meillo@7
|
252 .PP
|
meillo@39
|
253 Software should be well designed because good design avoids many
|
meillo@39
|
254 problems during the software's lifetime.
|
meillo@39
|
255 And software should be well designed because good design can offer
|
meillo@39
|
256 much additional gain.
|
meillo@39
|
257 Indeed, much effort should be spent into good design to make software more valuable.
|
meillo@39
|
258 The Unix Philosophy shows a way of how to design software well.
|
meillo@7
|
259 It offers guidelines to achieve good quality and high gain for the effort spent.
|
meillo@0
|
260
|
meillo@0
|
261
|
meillo@0
|
262 .NH 1
|
meillo@0
|
263 The Unix Philosophy
|
meillo@42
|
264 .XS
|
meillo@42
|
265 .sp .5v
|
meillo@42
|
266 \*(SN The Unix Philosophy
|
meillo@42
|
267 .XE
|
meillo@4
|
268 .LP
|
meillo@4
|
269 The origins of the Unix Philosophy were already introduced.
|
meillo@8
|
270 This chapter explains the philosophy, oriented on Gancarz,
|
meillo@8
|
271 and shows concrete examples of its application.
|
meillo@5
|
272
|
meillo@16
|
273 .NH 2
|
meillo@14
|
274 Pipes
|
meillo@42
|
275 .XS
|
meillo@42
|
276 \*(SN Pipes
|
meillo@42
|
277 .XE
|
meillo@4
|
278 .LP
|
meillo@4
|
279 Following are some examples to demonstrate how applied Unix Philosophy feels like.
|
meillo@4
|
280 Knowledge of using the Unix shell is assumed.
|
meillo@4
|
281 .PP
|
meillo@4
|
282 Counting the number of files in the current directory:
|
meillo@41
|
283 .DS
|
meillo@4
|
284 .CW
|
meillo@9
|
285 .ps -1
|
meillo@4
|
286 ls | wc -l
|
meillo@4
|
287 .DE
|
meillo@4
|
288 The
|
meillo@4
|
289 .CW ls
|
meillo@4
|
290 command lists all files in the current directory, one per line,
|
meillo@4
|
291 and
|
meillo@4
|
292 .CW "wc -l
|
meillo@8
|
293 counts the number of lines.
|
meillo@4
|
294 .PP
|
meillo@8
|
295 Counting the number of files that do not contain ``foo'' in their name:
|
meillo@41
|
296 .DS
|
meillo@4
|
297 .CW
|
meillo@9
|
298 .ps -1
|
meillo@4
|
299 ls | grep -v foo | wc -l
|
meillo@4
|
300 .DE
|
meillo@4
|
301 Here, the list of files is filtered by
|
meillo@4
|
302 .CW grep
|
meillo@4
|
303 to remove all that contain ``foo''.
|
meillo@4
|
304 The rest is the same as in the previous example.
|
meillo@4
|
305 .PP
|
meillo@4
|
306 Finding the five largest entries in the current directory.
|
meillo@41
|
307 .DS
|
meillo@4
|
308 .CW
|
meillo@9
|
309 .ps -1
|
meillo@4
|
310 du -s * | sort -nr | sed 5q
|
meillo@4
|
311 .DE
|
meillo@4
|
312 .CW "du -s *
|
meillo@4
|
313 returns the recursively summed sizes of all files
|
meillo@8
|
314 \(en no matter if they are regular files or directories.
|
meillo@4
|
315 .CW "sort -nr
|
meillo@4
|
316 sorts the list numerically in reverse order.
|
meillo@4
|
317 Finally,
|
meillo@4
|
318 .CW "sed 5q
|
meillo@4
|
319 quits after it has printed the fifth line.
|
meillo@4
|
320 .PP
|
meillo@4
|
321 The presented command lines are examples of what Unix people would use
|
meillo@4
|
322 to get the desired output.
|
meillo@4
|
323 There are also other ways to get the same output.
|
meillo@4
|
324 It's a user's decision which way to go.
|
meillo@14
|
325 .PP
|
meillo@8
|
326 The examples show that many tasks on a Unix system
|
meillo@4
|
327 are accomplished by combining several small programs.
|
meillo@4
|
328 The connection between the single programs is denoted by the pipe operator `|'.
|
meillo@4
|
329 .PP
|
meillo@4
|
330 Pipes, and their extensive and easy use, are one of the great
|
meillo@4
|
331 achievements of the Unix system.
|
meillo@4
|
332 Pipes between programs have been possible in earlier operating systems,
|
meillo@4
|
333 but it has never been a so central part of the concept.
|
meillo@4
|
334 When, in the early seventies, Doug McIlroy introduced pipes for the
|
meillo@4
|
335 Unix system,
|
meillo@4
|
336 ``it was this concept and notation for linking several programs together
|
meillo@4
|
337 that transformed Unix from a basic file-sharing system to an entirely new way of computing.''
|
meillo@4
|
338 .[
|
meillo@4
|
339 %T Unix: An Oral History
|
meillo@5
|
340 %O .CW \s-1http://www.princeton.edu/~hos/frs122/unixhist/finalhis.htm
|
meillo@4
|
341 .]
|
meillo@4
|
342 .PP
|
meillo@4
|
343 Being able to specify pipelines in an easy way is,
|
meillo@4
|
344 however, not enough by itself.
|
meillo@5
|
345 It is only one half.
|
meillo@4
|
346 The other is the design of the programs that are used in the pipeline.
|
meillo@8
|
347 They have to interfaces that allows them to be used in such a way.
|
meillo@5
|
348
|
meillo@16
|
349 .NH 2
|
meillo@14
|
350 Interface design
|
meillo@42
|
351 .XS
|
meillo@42
|
352 \*(SN Interface design
|
meillo@42
|
353 .XE
|
meillo@5
|
354 .LP
|
meillo@11
|
355 Unix is, first of all, simple \(en Everything is a file.
|
meillo@5
|
356 Files are sequences of bytes, without any special structure.
|
meillo@5
|
357 Programs should be filters, which read a stream of bytes from ``standard input'' (stdin)
|
meillo@5
|
358 and write a stream of bytes to ``standard output'' (stdout).
|
meillo@5
|
359 .PP
|
meillo@8
|
360 If the files \fIare\fP sequences of bytes,
|
meillo@8
|
361 and the programs \fIare\fP filters on byte streams,
|
meillo@11
|
362 then there is exactly one standardized data interface.
|
meillo@5
|
363 Thus it is possible to combine them in any desired way.
|
meillo@5
|
364 .PP
|
meillo@5
|
365 Even a handful of small programs will yield a large set of combinations,
|
meillo@5
|
366 and thus a large set of different functions.
|
meillo@5
|
367 This is leverage!
|
meillo@5
|
368 If the programs are orthogonal to each other \(en the best case \(en
|
meillo@5
|
369 then the set of different functions is greatest.
|
meillo@5
|
370 .PP
|
meillo@11
|
371 Programs might also have a separate control interface,
|
meillo@11
|
372 besides their data interface.
|
meillo@11
|
373 The control interface is often called ``user interface'',
|
meillo@11
|
374 because it is usually designed to be used by humans.
|
meillo@11
|
375 The Unix Philosophy discourages to assume the user to be human.
|
meillo@11
|
376 Interactive use of software is slow use of software,
|
meillo@11
|
377 because the program waits for user input most of the time.
|
meillo@11
|
378 Interactive software requires the user to be in front of the computer
|
meillo@11
|
379 all the time.
|
meillo@11
|
380 Interactive software occupy the user's attention while they are running.
|
meillo@11
|
381 .PP
|
meillo@11
|
382 Now we come back to the idea of using several small programs, combined,
|
meillo@11
|
383 to have a more specific function.
|
meillo@11
|
384 If these single tools would all be interactive,
|
meillo@11
|
385 how would the user control them?
|
meillo@11
|
386 It is not only a problem to control several programs at once if they run at the same time,
|
meillo@11
|
387 it also very inefficient to have to control each of the single programs
|
meillo@11
|
388 that are intended to work as one large program.
|
meillo@11
|
389 Hence, the Unix Philosophy discourages programs to demand interactive use.
|
meillo@11
|
390 The behavior of programs should be defined at invocation.
|
meillo@11
|
391 This is done by specifying arguments (``command line switches'') to the program call.
|
meillo@11
|
392 Gancarz discusses this topic as ``avoid captive user interfaces''.
|
meillo@11
|
393 .[
|
meillo@11
|
394 %A Mike Gancarz
|
meillo@11
|
395 %T The UNIX Philosophy
|
meillo@11
|
396 %I Digital Press
|
meillo@11
|
397 %D 1995
|
meillo@11
|
398 %P 88 ff.
|
meillo@11
|
399 .]
|
meillo@11
|
400 .PP
|
meillo@11
|
401 Non-interactive use is, during development, also an advantage for testing.
|
meillo@11
|
402 Testing of interactive programs is much more complicated,
|
meillo@11
|
403 than testing of non-interactive programs.
|
meillo@5
|
404
|
meillo@16
|
405 .NH 2
|
meillo@8
|
406 The toolchest approach
|
meillo@42
|
407 .XS
|
meillo@42
|
408 \*(SN The toolchest approach
|
meillo@42
|
409 .XE
|
meillo@5
|
410 .LP
|
meillo@5
|
411 A toolchest is a set of tools.
|
meillo@5
|
412 Instead of having one big tool for all tasks, one has many small tools,
|
meillo@5
|
413 each for one task.
|
meillo@5
|
414 Difficult tasks are solved by combining several of the small, simple tools.
|
meillo@5
|
415 .PP
|
meillo@11
|
416 The Unix toolchest \fIis\fP a set of small, (mostly) non-interactive programs
|
meillo@11
|
417 that are filters on byte streams.
|
meillo@11
|
418 They are, to a large extend, unrelated in their function.
|
meillo@11
|
419 Hence, the Unix toolchest provides a large set of functions
|
meillo@11
|
420 that can be accessed by combining the programs in the desired way.
|
meillo@11
|
421 .PP
|
meillo@11
|
422 There are also advantages for developing small toolchest programs.
|
meillo@5
|
423 It is easier and less error-prone to write small programs.
|
meillo@5
|
424 It is also easier and less error-prone to write a large set of small programs,
|
meillo@5
|
425 than to write one large program with all the functionality included.
|
meillo@5
|
426 If the small programs are combinable, then they offer even a larger set
|
meillo@5
|
427 of functions than the single large program.
|
meillo@5
|
428 Hence, one gets two advantages out of writing small, combinable programs.
|
meillo@5
|
429 .PP
|
meillo@5
|
430 There are two drawbacks of the toolchest approach.
|
meillo@8
|
431 First, one simple, standardized, unidirectional interface has to be sufficient.
|
meillo@5
|
432 If one feels the need for more ``logic'' than a stream of bytes,
|
meillo@8
|
433 then a different approach might be of need.
|
meillo@13
|
434 But it is also possible, that he just can not imagine a design where
|
meillo@8
|
435 a stream of bytes is sufficient.
|
meillo@8
|
436 By becoming more familiar with the ``Unix style of thinking'',
|
meillo@8
|
437 developers will more often and easier find simple designs where
|
meillo@8
|
438 a stream of bytes is a sufficient interface.
|
meillo@8
|
439 .PP
|
meillo@8
|
440 The second drawback of a toolchest affects the users.
|
meillo@5
|
441 A toolchest is often more difficult to use for novices.
|
meillo@9
|
442 It is necessary to become familiar with each of the tools,
|
meillo@5
|
443 to be able to use the right one in a given situation.
|
meillo@9
|
444 Additionally, one needs to combine the tools in a senseful way on its own.
|
meillo@9
|
445 This is like a sharp knife \(en it is a powerful tool in the hand of a master,
|
meillo@5
|
446 but of no good value in the hand of an unskilled.
|
meillo@5
|
447 .PP
|
meillo@8
|
448 However, learning single, small tool of the toolchest is easier than
|
meillo@8
|
449 learning a complex tool.
|
meillo@8
|
450 The user will have a basic understanding of a yet unknown tool,
|
meillo@8
|
451 if the several tools of the toolchest have a common style.
|
meillo@8
|
452 He will be able to transfer knowledge over one tool to another.
|
meillo@8
|
453 .PP
|
meillo@8
|
454 Moreover, the second drawback can be removed easily by adding wrappers
|
meillo@8
|
455 around the single tools.
|
meillo@5
|
456 Novice users do not need to learn several tools if a professional wraps
|
meillo@8
|
457 the single commands into a more high-level script.
|
meillo@5
|
458 Note that the wrapper script still calls the small tools;
|
meillo@5
|
459 the wrapper script is just like a skin around.
|
meillo@8
|
460 No complexity is added this way,
|
meillo@8
|
461 but new programs can get created out of existing one with very low effort.
|
meillo@5
|
462 .PP
|
meillo@5
|
463 A wrapper script for finding the five largest entries in the current directory
|
meillo@5
|
464 could look like this:
|
meillo@41
|
465 .DS
|
meillo@5
|
466 .CW
|
meillo@9
|
467 .ps -1
|
meillo@5
|
468 #!/bin/sh
|
meillo@5
|
469 du -s * | sort -nr | sed 5q
|
meillo@5
|
470 .DE
|
meillo@5
|
471 The script itself is just a text file that calls the command line
|
meillo@5
|
472 a professional user would type in directly.
|
meillo@8
|
473 Making the program flexible on the number of entries it prints,
|
meillo@8
|
474 is easily possible:
|
meillo@41
|
475 .DS
|
meillo@8
|
476 .CW
|
meillo@9
|
477 .ps -1
|
meillo@8
|
478 #!/bin/sh
|
meillo@8
|
479 num=5
|
meillo@8
|
480 [ $# -eq 1 ] && num="$1"
|
meillo@8
|
481 du -sh * | sort -nr | sed "${num}q"
|
meillo@8
|
482 .DE
|
meillo@8
|
483 This script acts like the one before, when called without an argument.
|
meillo@8
|
484 But one can also specify a numerical argument to define the number of lines to print.
|
meillo@5
|
485
|
meillo@16
|
486 .NH 2
|
meillo@8
|
487 A powerful shell
|
meillo@42
|
488 .XS
|
meillo@42
|
489 \*(SN A powerful shell
|
meillo@42
|
490 .XE
|
meillo@8
|
491 .LP
|
meillo@10
|
492 It was already said, that the Unix shell provides the possibility to
|
meillo@10
|
493 combine small programs into large ones easily.
|
meillo@10
|
494 A powerful shell is a great feature in other ways, too.
|
meillo@8
|
495 .PP
|
meillo@10
|
496 For instance by including a scripting language.
|
meillo@10
|
497 The control statements are build into the shell.
|
meillo@8
|
498 The functions, however, are the normal programs, everyone can use on the system.
|
meillo@10
|
499 Thus, the programs are known, so learning to program in the shell is easy.
|
meillo@8
|
500 Using normal programs as functions in the shell programming language
|
meillo@10
|
501 is only possible because they are small and combinable tools in a toolchest style.
|
meillo@8
|
502 .PP
|
meillo@8
|
503 The Unix shell encourages to write small scripts out of other programs,
|
meillo@8
|
504 because it is so easy to do.
|
meillo@8
|
505 This is a great step towards automation.
|
meillo@8
|
506 It is wonderful if the effort to automate a task equals the effort
|
meillo@8
|
507 it takes to do it the second time by hand.
|
meillo@8
|
508 If it is so, then the user will be happy to automate everything he does more than once.
|
meillo@8
|
509 .PP
|
meillo@8
|
510 Small programs that do one job well, standardized interfaces between them,
|
meillo@8
|
511 a mechanism to combine parts to larger parts, and an easy way to automate tasks,
|
meillo@8
|
512 this will inevitably produce software leverage.
|
meillo@8
|
513 Getting multiple times the benefit of an investment is a great offer.
|
meillo@10
|
514 .PP
|
meillo@10
|
515 The shell also encourages rapid prototyping.
|
meillo@10
|
516 Many well known programs started as quickly hacked shell scripts,
|
meillo@10
|
517 and turned into ``real'' programs, written in C, later.
|
meillo@10
|
518 Building a prototype first is a way to avoid the biggest problems
|
meillo@10
|
519 in application development.
|
meillo@10
|
520 Fred Brooks writes in ``No Silver Bullet'':
|
meillo@10
|
521 .[
|
meillo@10
|
522 %A Frederick P. Brooks, Jr.
|
meillo@10
|
523 %T No Silver Bullet: Essence and Accidents of Software Engineering
|
meillo@10
|
524 %B Information Processing 1986, the Proceedings of the IFIP Tenth World Computing Conference
|
meillo@10
|
525 %E H.-J. Kugler
|
meillo@10
|
526 %D 1986
|
meillo@10
|
527 %P 1069\(en1076
|
meillo@10
|
528 %I Elsevier Science B.V.
|
meillo@10
|
529 %C Amsterdam, The Netherlands
|
meillo@10
|
530 .]
|
meillo@10
|
531 .QP
|
meillo@41
|
532 .ps -1
|
meillo@10
|
533 The hardest single part of building a software system is deciding precisely what to build.
|
meillo@10
|
534 No other part of the conceptual work is so difficult as establishing the detailed
|
meillo@10
|
535 technical requirements, [...].
|
meillo@10
|
536 No other part of the work so cripples the resulting system if done wrong.
|
meillo@10
|
537 No other part is more difficult to rectify later.
|
meillo@10
|
538 .PP
|
meillo@10
|
539 Writing a prototype is a great method to become familiar with the requirements
|
meillo@10
|
540 and to actually run into real problems.
|
meillo@10
|
541 Today, prototyping is often seen as a first step in building a software.
|
meillo@10
|
542 This is, of course, good.
|
meillo@10
|
543 However, the Unix Philosophy has an \fIadditional\fP perspective on prototyping:
|
meillo@10
|
544 After having built the prototype, one might notice, that the prototype is already
|
meillo@10
|
545 \fIgood enough\fP.
|
meillo@10
|
546 Hence, no reimplementation, in a more sophisticated programming language, might be of need,
|
meillo@10
|
547 for the moment.
|
meillo@23
|
548 Maybe later, it might be necessary to rewrite the software, but not now.
|
meillo@10
|
549 .PP
|
meillo@10
|
550 By delaying further work, one keeps the flexibility to react easily on
|
meillo@10
|
551 changing requirements.
|
meillo@10
|
552 Software parts that are not written will not miss the requirements.
|
meillo@10
|
553
|
meillo@16
|
554 .NH 2
|
meillo@10
|
555 Worse is better
|
meillo@42
|
556 .XS
|
meillo@42
|
557 \*(SN Worse is better
|
meillo@42
|
558 .XE
|
meillo@10
|
559 .LP
|
meillo@10
|
560 The Unix Philosophy aims for the 80% solution;
|
meillo@10
|
561 others call it the ``Worse is better'' approach.
|
meillo@10
|
562 .PP
|
meillo@10
|
563 First, practical experience shows, that it is almost never possible to define the
|
meillo@10
|
564 requirements completely and correctly the first time.
|
meillo@10
|
565 Hence one should not try to; it will fail anyway.
|
meillo@10
|
566 Second, practical experience shows, that requirements change during time.
|
meillo@10
|
567 Hence it is best to delay requirement-based design decisions as long as possible.
|
meillo@10
|
568 Also, the software should be small and flexible as long as possible
|
meillo@10
|
569 to react on changing requirements.
|
meillo@10
|
570 Shell scripts, for example, are more easily adjusted as C programs.
|
meillo@10
|
571 Third, practical experience shows, that maintenance is hard work.
|
meillo@10
|
572 Hence, one should keep the amount of software as small as possible;
|
meillo@10
|
573 it should just fulfill the \fIcurrent\fP requirements.
|
meillo@10
|
574 Software parts that will be written later, do not need maintenance now.
|
meillo@10
|
575 .PP
|
meillo@10
|
576 Starting with a prototype in a scripting language has several advantages:
|
meillo@10
|
577 .IP \(bu
|
meillo@10
|
578 As the initial effort is low, one will likely start right away.
|
meillo@10
|
579 .IP \(bu
|
meillo@10
|
580 As working parts are available soon, the real requirements can get identified soon.
|
meillo@10
|
581 .IP \(bu
|
meillo@10
|
582 When a software is usable, it gets used, and thus tested.
|
meillo@10
|
583 Hence problems will be found at early stages of the development.
|
meillo@10
|
584 .IP \(bu
|
meillo@10
|
585 The prototype might be enough for the moment,
|
meillo@10
|
586 thus further work on the software can be delayed to a time
|
meillo@10
|
587 when one knows better about the requirements and problems,
|
meillo@10
|
588 than now.
|
meillo@10
|
589 .IP \(bu
|
meillo@10
|
590 Implementing now only the parts that are actually needed now,
|
meillo@10
|
591 requires fewer maintenance work.
|
meillo@10
|
592 .IP \(bu
|
meillo@10
|
593 If the global situation changes so that the software is not needed anymore,
|
meillo@10
|
594 then less effort was spent into the project, than it would have be
|
meillo@10
|
595 when a different approach had been used.
|
meillo@10
|
596
|
meillo@16
|
597 .NH 2
|
meillo@11
|
598 Upgrowth and survival of software
|
meillo@42
|
599 .XS
|
meillo@42
|
600 \*(SN Upgrowth and survival of software
|
meillo@42
|
601 .XE
|
meillo@11
|
602 .LP
|
meillo@12
|
603 So far it was talked about \fIwriting\fP or \fIbuilding\fP software.
|
meillo@13
|
604 Although these are just verbs, they do imply a specific view on the work process
|
meillo@13
|
605 they describe.
|
meillo@12
|
606 The better verb, however, is to \fIgrow\fP.
|
meillo@12
|
607 .PP
|
meillo@12
|
608 Creating software in the sense of the Unix Philosophy is an incremental process.
|
meillo@12
|
609 It starts with a first prototype, which evolves as requirements change.
|
meillo@12
|
610 A quickly hacked shell script might become a large, sophisticated,
|
meillo@13
|
611 compiled program this way.
|
meillo@13
|
612 Its lifetime begins with the initial prototype and ends when the software is not used anymore.
|
meillo@13
|
613 While being alive it will get extended, rearranged, rebuilt (from scratch).
|
meillo@12
|
614 Growing software matches the view that ``software is never finished. It is only released.''
|
meillo@12
|
615 .[
|
meillo@13
|
616 %O FIXME
|
meillo@13
|
617 %A Mike Gancarz
|
meillo@13
|
618 %T The UNIX Philosophy
|
meillo@13
|
619 %P 26
|
meillo@12
|
620 .]
|
meillo@12
|
621 .PP
|
meillo@13
|
622 Software can be seen as being controlled by evolutionary processes.
|
meillo@13
|
623 Successful software is software that is used by many for a long time.
|
meillo@12
|
624 This implies that the software is needed, useful, and better than alternatives.
|
meillo@12
|
625 Darwin talks about: ``The survival of the fittest.''
|
meillo@12
|
626 .[
|
meillo@13
|
627 %O FIXME
|
meillo@13
|
628 %A Charles Darwin
|
meillo@12
|
629 .]
|
meillo@12
|
630 Transferred to software: The most successful software, is the fittest,
|
meillo@12
|
631 is the one that survives.
|
meillo@13
|
632 (This may be at the level of one creature, or at the level of one species.)
|
meillo@13
|
633 The fitness of software is affected mainly by four properties:
|
meillo@15
|
634 portability of code, portability of data, range of usability, and reusability of parts.
|
meillo@15
|
635 .\" .IP \(bu
|
meillo@15
|
636 .\" portability of code
|
meillo@15
|
637 .\" .IP \(bu
|
meillo@15
|
638 .\" portability of data
|
meillo@15
|
639 .\" .IP \(bu
|
meillo@15
|
640 .\" range of usability
|
meillo@15
|
641 .\" .IP \(bu
|
meillo@15
|
642 .\" reuseability of parts
|
meillo@13
|
643 .PP
|
meillo@15
|
644 (1)
|
meillo@15
|
645 .I "Portability of code
|
meillo@15
|
646 means, using high-level programming languages,
|
meillo@13
|
647 sticking to the standard,
|
meillo@13
|
648 and avoiding optimizations that introduce dependencies on specific hardware.
|
meillo@13
|
649 Hardware has a much lower lifetime than software.
|
meillo@13
|
650 By chaining software to a specific hardware,
|
meillo@13
|
651 the software's lifetime gets shortened to that of this hardware.
|
meillo@13
|
652 In contrast, software should be easy to port \(en
|
meillo@23
|
653 adaptation is the key to success.
|
meillo@13
|
654 .\" cf. practice of prog: ch08
|
meillo@13
|
655 .PP
|
meillo@15
|
656 (2)
|
meillo@15
|
657 .I "Portability of data
|
meillo@15
|
658 is best achieved by avoiding binary representations
|
meillo@13
|
659 to store data, because binary representations differ from machine to machine.
|
meillo@23
|
660 Textual representation is favored.
|
meillo@13
|
661 Historically, ASCII was the charset of choice.
|
meillo@13
|
662 In the future, UTF-8 might be the better choice, however.
|
meillo@13
|
663 Important is that it is a plain text representation in a
|
meillo@13
|
664 very common charset encoding.
|
meillo@13
|
665 Apart from being able to transfer data between machines,
|
meillo@13
|
666 readable data has the great advantage, that humans are able
|
meillo@13
|
667 to directly edit it with text editors and other tools from the Unix toolchest.
|
meillo@13
|
668 .\" gancarz tenet 5
|
meillo@13
|
669 .PP
|
meillo@15
|
670 (3)
|
meillo@15
|
671 A large
|
meillo@15
|
672 .I "range of usability
|
meillo@23
|
673 ensures good adaptation, and thus good survival.
|
meillo@13
|
674 It is a special distinction if a software becomes used in fields of action,
|
meillo@13
|
675 the original authors did never imagine.
|
meillo@13
|
676 Software that solves problems in a general way will likely be used
|
meillo@13
|
677 for all kinds of similar problems.
|
meillo@13
|
678 Being too specific limits the range of uses.
|
meillo@13
|
679 Requirements change through time, thus use cases change or even vanish.
|
meillo@13
|
680 A good example in this point is Allman's sendmail.
|
meillo@13
|
681 Allman identifies flexibility to be one major reason for sendmail's success:
|
meillo@13
|
682 .[
|
meillo@13
|
683 %O FIXME
|
meillo@13
|
684 %A Allman
|
meillo@13
|
685 %T sendmail
|
meillo@13
|
686 .]
|
meillo@13
|
687 .QP
|
meillo@41
|
688 .ps -1
|
meillo@13
|
689 Second, I limited myself to the routing function [...].
|
meillo@13
|
690 This was a departure from the dominant thought of the time, [...].
|
meillo@13
|
691 .QP
|
meillo@41
|
692 .ps -1
|
meillo@13
|
693 Third, the sendmail configuration file was flexible enough to adopt
|
meillo@13
|
694 to a rapidly changing world [...].
|
meillo@12
|
695 .LP
|
meillo@13
|
696 Successful software adopts itself to the changing world.
|
meillo@13
|
697 .PP
|
meillo@15
|
698 (4)
|
meillo@15
|
699 .I "Reuse of parts
|
meillo@15
|
700 is even one step further.
|
meillo@13
|
701 A software may completely lose its field of action,
|
meillo@13
|
702 but parts of which the software is build may be general and independent enough
|
meillo@13
|
703 to survive this death.
|
meillo@13
|
704 If software is build by combining small independent programs,
|
meillo@13
|
705 then there are parts readily available for reuse.
|
meillo@13
|
706 Who cares if the large program is a failure,
|
meillo@13
|
707 but parts of it become successful instead?
|
meillo@10
|
708
|
meillo@16
|
709 .NH 2
|
meillo@14
|
710 Summary
|
meillo@42
|
711 .XS
|
meillo@42
|
712 \*(SN Summary
|
meillo@42
|
713 .XE
|
meillo@0
|
714 .LP
|
meillo@14
|
715 This chapter explained the central ideas of the Unix Philosophy.
|
meillo@14
|
716 For each of the ideas, it was exposed what advantages they introduce.
|
meillo@14
|
717 The Unix Philosophy are guidelines that help to write valuable software.
|
meillo@14
|
718 From the view point of a software developer or software designer,
|
meillo@14
|
719 the Unix Philosophy provides answers to many software design problem.
|
meillo@14
|
720 .PP
|
meillo@14
|
721 The various ideas of the Unix Philosophy are very interweaved
|
meillo@14
|
722 and can hardly be applied independently.
|
meillo@14
|
723 However, the probably most important messages are:
|
meillo@14
|
724 .I "``Do one thing well!''" ,
|
meillo@14
|
725 .I "``Keep it simple!''" ,
|
meillo@14
|
726 and
|
meillo@14
|
727 .I "``Use software leverage!''
|
meillo@0
|
728
|
meillo@8
|
729
|
meillo@8
|
730
|
meillo@0
|
731 .NH 1
|
meillo@19
|
732 Case study: \s-1MH\s0
|
meillo@42
|
733 .XS
|
meillo@42
|
734 .sp .5v
|
meillo@42
|
735 \*(SN Case study: \s-1MH\s0
|
meillo@42
|
736 .XE
|
meillo@18
|
737 .LP
|
meillo@30
|
738 The previous chapter introduced and explained the Unix Philosophy
|
meillo@18
|
739 from a general point of view.
|
meillo@30
|
740 The driving force were the guidelines; references to
|
meillo@18
|
741 existing software were given only sparsely.
|
meillo@18
|
742 In this and the next chapter, concrete software will be
|
meillo@18
|
743 the driving force in the discussion.
|
meillo@18
|
744 .PP
|
meillo@23
|
745 This first case study is about the mail user agents (\s-1MUA\s0)
|
meillo@23
|
746 \s-1MH\s0 (``mail handler'') and its descendent \fInmh\fP
|
meillo@23
|
747 (``new mail handler'').
|
meillo@23
|
748 \s-1MUA\s0s provide functions to read, compose, and organize mail,
|
meillo@23
|
749 but (ideally) not to transfer.
|
meillo@19
|
750 In this document, the name \s-1MH\s0 will be used for both of them.
|
meillo@19
|
751 A distinction will only be made if differences between
|
meillo@19
|
752 them are described.
|
meillo@18
|
753
|
meillo@0
|
754
|
meillo@0
|
755 .NH 2
|
meillo@19
|
756 Historical background
|
meillo@42
|
757 .XS
|
meillo@42
|
758 \*(SN Historical background
|
meillo@42
|
759 .XE
|
meillo@0
|
760 .LP
|
meillo@19
|
761 Electronic mail was available in Unix very early.
|
meillo@30
|
762 The first \s-1MUA\s0 on Unix was \f(CWmail\fP,
|
meillo@30
|
763 which was already present in the First Edition.
|
meillo@30
|
764 .[
|
meillo@30
|
765 %A Peter H. Salus
|
meillo@30
|
766 %T A Quarter Century of UNIX
|
meillo@30
|
767 %D 1994
|
meillo@30
|
768 %I Addison-Wesley
|
meillo@30
|
769 %P 41 f.
|
meillo@30
|
770 .]
|
meillo@30
|
771 It was a small program that either prints the user's mailbox file
|
meillo@19
|
772 or appends text to someone elses mailbox file,
|
meillo@19
|
773 depending on the command line arguments.
|
meillo@19
|
774 .[
|
meillo@19
|
775 %O http://cm.bell-labs.com/cm/cs/who/dmr/pdfs/man12.pdf
|
meillo@19
|
776 .]
|
meillo@19
|
777 It was a program that did one job well.
|
meillo@23
|
778 This job was emailing, which was very simple then.
|
meillo@19
|
779 .PP
|
meillo@23
|
780 Later, emailing became more powerful, and thus more complex.
|
meillo@19
|
781 The simple \f(CWmail\fP, which knew nothing of subjects,
|
meillo@19
|
782 independent handling of single messages,
|
meillo@19
|
783 and long-time storage of them, was not powerful enough anymore.
|
meillo@19
|
784 At Berkeley, Kurt Shoens wrote \fIMail\fP (with capital `M')
|
meillo@19
|
785 in 1978 to provide additional functions for emailing.
|
meillo@19
|
786 Mail was still one program, but now it was large and did
|
meillo@19
|
787 several jobs.
|
meillo@23
|
788 Its user interface is modeled after the one of \fIed\fP.
|
meillo@19
|
789 It is designed for humans, but is still scriptable.
|
meillo@23
|
790 \fImailx\fP is the adaptation of Berkeley Mail into System V.
|
meillo@19
|
791 .[
|
meillo@19
|
792 %A Gunnar Ritter
|
meillo@19
|
793 %O http://heirloom.sourceforge.net/mailx_history.html
|
meillo@19
|
794 .]
|
meillo@30
|
795 Elm, pine, mutt, and a whole bunch of graphical \s-1MUA\s0s
|
meillo@19
|
796 followed Mail's direction.
|
meillo@19
|
797 They are large, monolithic programs which include all emailing functions.
|
meillo@19
|
798 .PP
|
meillo@23
|
799 A different way was taken by the people of \s-1RAND\s0 Corporation.
|
meillo@38
|
800 In the beginning, they also had used a monolithic mail system,
|
meillo@30
|
801 called \s-1MS\s0 (for ``mail system'').
|
meillo@19
|
802 But in 1977, Stockton Gaines and Norman Shapiro
|
meillo@19
|
803 came up with a proposal of a new email system concept \(en
|
meillo@19
|
804 one that honors the Unix Philosophy.
|
meillo@19
|
805 The concept was implemented by Bruce Borden in 1978 and 1979.
|
meillo@19
|
806 This was the birth of \s-1MH\s0 \(en the ``mail handler''.
|
meillo@18
|
807 .PP
|
meillo@18
|
808 Since then, \s-1RAND\s0, the University of California at Irvine and
|
meillo@19
|
809 at Berkeley, and several others have contributed to the software.
|
meillo@18
|
810 However, it's core concepts remained the same.
|
meillo@23
|
811 In the late 90s, when development of \s-1MH\s0 slowed down,
|
meillo@19
|
812 Richard Coleman started with \fInmh\fP, the new mail handler.
|
meillo@19
|
813 His goal was to improve \s-1MH\s0, especially in regard of
|
meillo@23
|
814 the requirements of modern emailing.
|
meillo@19
|
815 Today, nmh is developed by various people on the Internet.
|
meillo@18
|
816 .[
|
meillo@18
|
817 %T RAND and the Information Evolution: A History in Essays and Vignettes
|
meillo@18
|
818 %A Willis H. Ware
|
meillo@18
|
819 %D 2008
|
meillo@18
|
820 %I The RAND Corporation
|
meillo@18
|
821 %P 128\(en137
|
meillo@18
|
822 %O .CW \s-1http://www.rand.org/pubs/corporate_pubs/CP537/
|
meillo@18
|
823 .]
|
meillo@18
|
824 .[
|
meillo@18
|
825 %T MH & xmh: Email for Users & Programmers
|
meillo@18
|
826 %A Jerry Peek
|
meillo@18
|
827 %D 1991, 1992, 1995
|
meillo@18
|
828 %I O'Reilly & Associates, Inc.
|
meillo@18
|
829 %P Appendix B
|
meillo@18
|
830 %O Also available online: \f(CW\s-2http://rand-mh.sourceforge.net/book/\fP
|
meillo@18
|
831 .]
|
meillo@0
|
832
|
meillo@0
|
833 .NH 2
|
meillo@20
|
834 Contrasts to monolithic mail systems
|
meillo@42
|
835 .XS
|
meillo@42
|
836 \*(SN Contrasts to monolithic mail systems
|
meillo@42
|
837 .XE
|
meillo@0
|
838 .LP
|
meillo@19
|
839 All \s-1MUA\s0s are monolithic, except \s-1MH\s0.
|
meillo@38
|
840 Although there might actually exist further, very little known,
|
meillo@30
|
841 toolchest \s-1MUA\s0s, this statement reflects the situation pretty well.
|
meillo@19
|
842 .PP
|
meillo@30
|
843 Monolithic \s-1MUA\s0s gather all their functions in one program.
|
meillo@30
|
844 In contrast, \s-1MH\s0 is a toolchest of many small tools \(en one for each job.
|
meillo@23
|
845 Following is a list of important programs of \s-1MH\s0's toolchest
|
meillo@30
|
846 and their function.
|
meillo@30
|
847 It gives a feeling of how the toolchest looks like.
|
meillo@19
|
848 .IP \(bu
|
meillo@19
|
849 .CW inc :
|
meillo@30
|
850 incorporate new mail (this is how mail enters the system)
|
meillo@19
|
851 .IP \(bu
|
meillo@19
|
852 .CW scan :
|
meillo@19
|
853 list messages in folder
|
meillo@19
|
854 .IP \(bu
|
meillo@19
|
855 .CW show :
|
meillo@19
|
856 show message
|
meillo@19
|
857 .IP \(bu
|
meillo@19
|
858 .CW next\fR/\fPprev :
|
meillo@19
|
859 show next/previous message
|
meillo@19
|
860 .IP \(bu
|
meillo@19
|
861 .CW folder :
|
meillo@19
|
862 change current folder
|
meillo@19
|
863 .IP \(bu
|
meillo@19
|
864 .CW refile :
|
meillo@19
|
865 refile message into folder
|
meillo@19
|
866 .IP \(bu
|
meillo@19
|
867 .CW rmm :
|
meillo@19
|
868 remove message
|
meillo@19
|
869 .IP \(bu
|
meillo@19
|
870 .CW comp :
|
meillo@19
|
871 compose a new message
|
meillo@19
|
872 .IP \(bu
|
meillo@19
|
873 .CW repl :
|
meillo@19
|
874 reply to a message
|
meillo@19
|
875 .IP \(bu
|
meillo@19
|
876 .CW forw :
|
meillo@19
|
877 forward a message
|
meillo@19
|
878 .IP \(bu
|
meillo@19
|
879 .CW send :
|
meillo@30
|
880 send a prepared message (this is how mail leaves the system)
|
meillo@0
|
881 .LP
|
meillo@19
|
882 \s-1MH\s0 has no special user interface like monolithic \s-1MUA\s0s have.
|
meillo@19
|
883 The user does not leave the shell to run \s-1MH\s0,
|
meillo@30
|
884 but he uses the various \s-1MH\s0 programs within the shell.
|
meillo@23
|
885 Using a monolithic program with a captive user interface
|
meillo@23
|
886 means ``entering'' the program, using it, and ``exiting'' the program.
|
meillo@23
|
887 Using toolchests like \s-1MH\s0 means running programs,
|
meillo@38
|
888 alone or in combination with others, even from other toolchests,
|
meillo@23
|
889 without leaving the shell.
|
meillo@30
|
890
|
meillo@30
|
891 .NH 2
|
meillo@30
|
892 Data storage
|
meillo@42
|
893 .XS
|
meillo@42
|
894 \*(SN Data storage
|
meillo@42
|
895 .XE
|
meillo@30
|
896 .LP
|
meillo@34
|
897 \s-1MH\s0's mail storage is a directory tree under the user's
|
meillo@34
|
898 \s-1MH\s0 directory (usually \f(CW$HOME/Mail\fP),
|
meillo@34
|
899 where mail folders are directories and mail messages are text files
|
meillo@34
|
900 within them.
|
meillo@34
|
901 Each mail folder contains a file \f(CW.mh_sequences\fP which lists
|
meillo@34
|
902 the public message sequences of that folder, for instance new messages.
|
meillo@34
|
903 Mail messages are text files located in a mail folder.
|
meillo@34
|
904 The files contain the messages as they were received.
|
meillo@34
|
905 They are numbered in ascending order in each folder.
|
meillo@19
|
906 .PP
|
meillo@30
|
907 This mailbox format is called ``\s-1MH\s0'' after the \s-1MUA\s0.
|
meillo@30
|
908 Alternatives are \fImbox\fP and \fImaildir\fP.
|
meillo@30
|
909 In the mbox format all messages are stored within one file.
|
meillo@30
|
910 This was a good solution in the early days, when messages
|
meillo@30
|
911 were only a few lines of text and were deleted soon.
|
meillo@30
|
912 Today, when single messages often include several megabytes
|
meillo@30
|
913 of attachments, it is a bad solution.
|
meillo@30
|
914 Another disadvantage of the mbox format is that it is
|
meillo@30
|
915 more difficult to write tools that work on mail messages,
|
meillo@30
|
916 because it is always necessary to first find and extract
|
meillo@30
|
917 the relevant message in the mbox file.
|
meillo@30
|
918 With the \s-1MH\s0 mailbox format,
|
meillo@30
|
919 each message is a self-standing item, by definition.
|
meillo@30
|
920 Also, the problem of concurrent access to one mailbox is
|
meillo@30
|
921 reduced to the problem of concurrent access to one message.
|
meillo@30
|
922 Maildir is generally similar to \s-1MH\s0's format,
|
meillo@30
|
923 but modified towards guaranteed reliability.
|
meillo@30
|
924 This involves some complexity, unfortunately.
|
meillo@34
|
925 .PP
|
meillo@34
|
926 Working with \s-1MH\s0's toolchest on mailboxes is much like
|
meillo@34
|
927 working with Unix' toolchest on directory trees:
|
meillo@34
|
928 \f(CWscan\fP is like \f(CWls\fP,
|
meillo@34
|
929 \f(CWshow\fP is like \f(CWcat\fP,
|
meillo@34
|
930 \f(CWfolder\fP is like \f(CWcd\fP and \f(CWpwd\fP,
|
meillo@34
|
931 \f(CWrefile\fP is like \f(CWmv\fP,
|
meillo@34
|
932 and \f(CWrmm\fP is like \f(CWrm\fP.
|
meillo@34
|
933 .PP
|
meillo@34
|
934 The context of tools in Unix consists mainly the current working directory,
|
meillo@34
|
935 the user identification, and the environment variables.
|
meillo@34
|
936 \s-1MH\s0 extends this context by two more items:
|
meillo@34
|
937 .IP \(bu
|
meillo@34
|
938 The current mail folder, which is similar to the current working directory.
|
meillo@34
|
939 For mail folders, \f(CWfolder\fP provides the corresponding functionality
|
meillo@34
|
940 of \f(CWcd\fP and \f(CWpwd\fP for directories.
|
meillo@34
|
941 .IP \(bu
|
meillo@34
|
942 Sequences, which are named sets of messages in a mail folder.
|
meillo@34
|
943 The current message, relative to a mail folder, is a special sequence.
|
meillo@34
|
944 It enables commands like \f(CWnext\fP and \f(CWprev\fP.
|
meillo@34
|
945 .LP
|
meillo@34
|
946 In contrast to Unix' context, which is chained to the shell session,
|
meillo@34
|
947 \s-1MH\s0's context is independent.
|
meillo@34
|
948 Usually there is one context for each user, but a user can have many
|
meillo@34
|
949 contexts.
|
meillo@34
|
950 Public sequences are an exception, as they belong to the mail folder.
|
meillo@34
|
951 .[
|
meillo@34
|
952 %O mh-profile(5) and mh-sequence(5)
|
meillo@34
|
953 .]
|
meillo@20
|
954
|
meillo@0
|
955 .NH 2
|
meillo@20
|
956 Discussion of the design
|
meillo@42
|
957 .XS
|
meillo@42
|
958 \*(SN Discussion of the design
|
meillo@42
|
959 .XE
|
meillo@0
|
960 .LP
|
meillo@20
|
961 The following paragraphs discuss \s-1MH\s0 in regard to the tenets
|
meillo@23
|
962 of the Unix Philosophy which Gancarz identified.
|
meillo@20
|
963
|
meillo@20
|
964 .PP
|
meillo@33
|
965 .B "Small is beautiful
|
meillo@20
|
966 and
|
meillo@33
|
967 .B "do one thing well
|
meillo@20
|
968 are two design goals that are directly visible in \s-1MH\s0.
|
meillo@20
|
969 Gancarz actually presents \s-1MH\s0 as example under the headline
|
meillo@20
|
970 ``Making UNIX Do One Thing Well'':
|
meillo@41
|
971 .[
|
meillo@41
|
972 %A Mike Gancarz
|
meillo@41
|
973 %T unix-phil
|
meillo@41
|
974 %P 125
|
meillo@41
|
975 .]
|
meillo@20
|
976 .QP
|
meillo@41
|
977 .ps -1
|
meillo@20
|
978 [\s-1MH\s0] consists of a series of programs which
|
meillo@20
|
979 when combined give the user an enormous ability
|
meillo@20
|
980 to manipulate electronic mail messages.
|
meillo@20
|
981 A complex application, it shows that not only is it
|
meillo@20
|
982 possible to build large applications from smaller
|
meillo@20
|
983 components, but also that such designs are actually preferable.
|
meillo@20
|
984 .LP
|
meillo@20
|
985 The various small programs of \s-1MH\s0 were relatively easy
|
meillo@23
|
986 to write, because each of them is small, limited to one function,
|
meillo@23
|
987 and has clear boundaries.
|
meillo@20
|
988 For the same reasons, they are also good to maintain.
|
meillo@20
|
989 Further more, the system can easily get extended.
|
meillo@20
|
990 One only needs to put a new program into the toolchest.
|
meillo@23
|
991 This was done, for instance, when \s-1MIME\s0 support was added
|
meillo@20
|
992 (e.g. \f(CWmhbuild\fP).
|
meillo@20
|
993 Also, different programs can exist to do the basically same job
|
meillo@20
|
994 in different ways (e.g. in nmh: \f(CWshow\fP and \f(CWmhshow\fP).
|
meillo@20
|
995 If someone needs a mail system with some additionally
|
meillo@23
|
996 functions that are available nowhere yet, he best takes a
|
meillo@20
|
997 toolchest system like \s-1MH\s0 where he can add the
|
meillo@20
|
998 functionality with little work.
|
meillo@20
|
999
|
meillo@20
|
1000 .PP
|
meillo@34
|
1001 .B "Store data in flat text files
|
meillo@34
|
1002 is followed by \s-1MH\s0.
|
meillo@34
|
1003 This is not surprising, because email messages are already plain text.
|
meillo@34
|
1004 \s-1MH\s0 stores the messages as it receives them,
|
meillo@34
|
1005 thus any other tool that works on RFC 2822 mail messages can operate
|
meillo@34
|
1006 on the messages in an \s-1MH\s0 mailbox.
|
meillo@34
|
1007 All other files \s-1MH\s0 uses are plain text too.
|
meillo@34
|
1008 It is therefore possible and encouraged to use the text processing
|
meillo@34
|
1009 tools of Unix' toolchest to extend \s-1MH\s0's toolchest.
|
meillo@20
|
1010
|
meillo@20
|
1011 .PP
|
meillo@33
|
1012 .B "Avoid captive user interfaces" .
|
meillo@19
|
1013 \s-1MH\s0 is perfectly suited for non-interactive use.
|
meillo@19
|
1014 It offers all functions directly and without captive user interfaces.
|
meillo@30
|
1015 If, nonetheless, users want a graphical user interface,
|
meillo@20
|
1016 they can have it with \fIxmh\fP or \fIexmh\fP, too.
|
meillo@19
|
1017 These are graphical frontends for the \s-1MH\s0 toolchest.
|
meillo@19
|
1018 This means, all email-related work is still done by \s-1MH\s0 tools,
|
meillo@20
|
1019 but the frontend issues the appropriate calls when the user
|
meillo@30
|
1020 clicks on buttons.
|
meillo@20
|
1021 Providing easy-to-use user interfaces in form of frontends is a good
|
meillo@19
|
1022 approach, because it does not limit the power of the backend itself.
|
meillo@20
|
1023 The frontend will anyway only be able to make a subset of the
|
meillo@23
|
1024 backend's power and flexibility available to the user.
|
meillo@20
|
1025 But if it is a separate program,
|
meillo@20
|
1026 then the missing parts can still be accessed at the backend directly.
|
meillo@19
|
1027 If it is integrated, then this will hardly be possible.
|
meillo@30
|
1028 Further more, it is possible to have different frontends to the same
|
meillo@30
|
1029 backend.
|
meillo@19
|
1030
|
meillo@19
|
1031 .PP
|
meillo@33
|
1032 .B "Choose portability over efficiency
|
meillo@20
|
1033 and
|
meillo@33
|
1034 .B "use shell scripts to increase leverage and portability" .
|
meillo@20
|
1035 These two tenets are indirectly, but nicely, demonstrated by
|
meillo@30
|
1036 Bolsky and Korn in their book about the Korn Shell.
|
meillo@20
|
1037 .[
|
meillo@20
|
1038 %T The KornShell: command and programming language
|
meillo@20
|
1039 %A Morris I. Bolsky
|
meillo@20
|
1040 %A David G. Korn
|
meillo@20
|
1041 %I Prentice Hall
|
meillo@20
|
1042 %D 1989
|
meillo@30
|
1043 %P 254\(en290
|
meillo@20
|
1044 %O \s-1ISBN\s0: 0-13-516972-0
|
meillo@20
|
1045 .]
|
meillo@30
|
1046 They demonstrated, in chapter 18 of the book, a basic implementation
|
meillo@20
|
1047 of a subset of \s-1MH\s0 in ksh scripts.
|
meillo@20
|
1048 Of course, this was just a demonstration, but a brilliant one.
|
meillo@20
|
1049 It shows how quickly one can implement such a prototype with shell scripts,
|
meillo@20
|
1050 and how readable they are.
|
meillo@20
|
1051 The implementation in the scripting language may not be very fast,
|
meillo@20
|
1052 but it can be fast enough though, and this is all that matters.
|
meillo@20
|
1053 By having the code in an interpreted language, like the shell,
|
meillo@20
|
1054 portability becomes a minor issue, if we assume the interpreter
|
meillo@20
|
1055 to be widespread.
|
meillo@20
|
1056 This demonstration also shows how easy it is to create single programs
|
meillo@20
|
1057 of a toolchest software.
|
meillo@30
|
1058 There are eight tools (two of them have multiple names) and 16 functions
|
meillo@30
|
1059 with supporting code.
|
meillo@30
|
1060 Each tool comprises between 12 and 38 lines of ksh,
|
meillo@30
|
1061 in total about 200 lines.
|
meillo@30
|
1062 The functions comprise between 3 and 78 lines of ksh,
|
meillo@30
|
1063 in total about 450 lines.
|
meillo@20
|
1064 Such small software is easy to write, easy to understand,
|
meillo@20
|
1065 and thus easy to maintain.
|
meillo@23
|
1066 A toolchest improves the possibility to only write some parts
|
meillo@20
|
1067 and though create a working result.
|
meillo@20
|
1068 Expanding the toolchest without global changes will likely be
|
meillo@20
|
1069 possible, too.
|
meillo@20
|
1070
|
meillo@20
|
1071 .PP
|
meillo@33
|
1072 .B "Use software leverage to your advantage
|
meillo@20
|
1073 and the lesser tenet
|
meillo@33
|
1074 .B "allow the user to tailor the environment
|
meillo@20
|
1075 are ideally followed in the design of \s-1MH\s0.
|
meillo@21
|
1076 Tailoring the environment is heavily encouraged by the ability to
|
meillo@30
|
1077 directly define default options to programs.
|
meillo@30
|
1078 It is even possible to define different default options
|
meillo@21
|
1079 depending on the name under which the program was called.
|
meillo@21
|
1080 Software leverage is heavily encouraged by the ease it is to
|
meillo@21
|
1081 create shell scripts that run a specific command line,
|
meillo@30
|
1082 built of several \s-1MH\s0 programs.
|
meillo@21
|
1083 There is few software that so much wants users to tailor their
|
meillo@21
|
1084 environment and to leverage the use of the software, like \s-1MH\s0.
|
meillo@21
|
1085 Just to make one example:
|
meillo@23
|
1086 One might prefer a different listing format for the \f(CWscan\fP
|
meillo@21
|
1087 program.
|
meillo@30
|
1088 It is possible to take one of the distributed format files
|
meillo@21
|
1089 or to write one yourself.
|
meillo@21
|
1090 To use the format as default for \f(CWscan\fP, a single line,
|
meillo@21
|
1091 reading
|
meillo@21
|
1092 .DS
|
meillo@21
|
1093 .CW
|
meillo@21
|
1094 scan: -form FORMATFILE
|
meillo@21
|
1095 .DE
|
meillo@21
|
1096 must be added to \f(CW.mh_profile\fP.
|
meillo@21
|
1097 If one wants this different format as an additional command,
|
meillo@23
|
1098 instead of changing the default, he needs to create a link to
|
meillo@23
|
1099 \f(CWscan\fP, for instance titled \f(CWscan2\fP.
|
meillo@21
|
1100 The line in \f(CW.mh_profile\fP would then start with \f(CWscan2\fP,
|
meillo@30
|
1101 as the option should only be in effect when scan is called as
|
meillo@21
|
1102 \f(CWscan2\fP.
|
meillo@20
|
1103
|
meillo@20
|
1104 .PP
|
meillo@33
|
1105 .B "Make every program a filter
|
meillo@21
|
1106 is hard to find in \s-1MH\s0.
|
meillo@21
|
1107 The reason therefore is that most of \s-1MH\s0's tools provide
|
meillo@21
|
1108 basic file system operations for the mailboxes.
|
meillo@30
|
1109 The reason is the same because of which
|
meillo@21
|
1110 \f(CWls\fP, \f(CWcp\fP, \f(CWmv\fP, and \f(CWrm\fP
|
meillo@21
|
1111 aren't filters neither.
|
meillo@23
|
1112 However, they build a basis on which filters can operate.
|
meillo@23
|
1113 \s-1MH\s0 does not provide many filters itself, but it is a basis
|
meillo@23
|
1114 to write filters for.
|
meillo@30
|
1115 An example would be a mail message text highlighter,
|
meillo@30
|
1116 that means a program that makes use of a color terminal to display
|
meillo@30
|
1117 header lines, quotations, and signatures in distinct colors.
|
meillo@30
|
1118 The author's version of this program, for instance,
|
meillo@30
|
1119 is a 25 line awk script.
|
meillo@21
|
1120
|
meillo@21
|
1121 .PP
|
meillo@33
|
1122 .B "Build a prototype as soon as possible
|
meillo@21
|
1123 was again well followed by \s-1MH\s0.
|
meillo@21
|
1124 This tenet, of course, focuses on early development, which is
|
meillo@21
|
1125 long time ago for \s-1MH\s0.
|
meillo@21
|
1126 But without following this guideline at the very beginning,
|
meillo@23
|
1127 Bruce Borden may have not convinced the management of \s-1RAND\s0
|
meillo@23
|
1128 to ever create \s-1MH\s0.
|
meillo@23
|
1129 In Bruce' own words:
|
meillo@41
|
1130 .[
|
meillo@41
|
1131 %O FIXME
|
meillo@41
|
1132 .]
|
meillo@21
|
1133 .QP
|
meillo@41
|
1134 .ps -1
|
meillo@30
|
1135 [...] but they [Stockton Gaines and Norm Shapiro] were not able
|
meillo@23
|
1136 to convince anyone that such a system would be fast enough to be usable.
|
meillo@21
|
1137 I proposed a very short project to prove the basic concepts,
|
meillo@21
|
1138 and my management agreed.
|
meillo@21
|
1139 Looking back, I realize that I had been very lucky with my first design.
|
meillo@21
|
1140 Without nearly enough design work,
|
meillo@21
|
1141 I built a working environment and some header files
|
meillo@21
|
1142 with key structures and wrote the first few \s-1MH\s0 commands:
|
meillo@21
|
1143 inc, show/next/prev, and comp.
|
meillo@21
|
1144 [...]
|
meillo@21
|
1145 With these three, I was able to convince people that the structure was viable.
|
meillo@21
|
1146 This took about three weeks.
|
meillo@0
|
1147
|
meillo@0
|
1148 .NH 2
|
meillo@0
|
1149 Problems
|
meillo@42
|
1150 .XS
|
meillo@42
|
1151 \*(SN Problems
|
meillo@42
|
1152 .XE
|
meillo@0
|
1153 .LP
|
meillo@22
|
1154 \s-1MH\s0, for sure is not without problems.
|
meillo@30
|
1155 There are two main problems: one is technical, the other is about human behavior.
|
meillo@22
|
1156 .PP
|
meillo@22
|
1157 \s-1MH\s0 is old and email today is very different to email in the time
|
meillo@22
|
1158 when \s-1MH\s0 was designed.
|
meillo@22
|
1159 \s-1MH\s0 adopted to the changes pretty well, but it is limited.
|
meillo@22
|
1160 For example in development resources.
|
meillo@22
|
1161 \s-1MIME\s0 support and support for different character encodings
|
meillo@22
|
1162 is available, but only on a moderate level.
|
meillo@22
|
1163 More active developers could quickly improve there.
|
meillo@22
|
1164 It is also limited by design, which is the larger problem.
|
meillo@22
|
1165 \s-1IMAP\s0, for example, conflicts with \s-1MH\s0's design to a large extend.
|
meillo@22
|
1166 These design conflicts are not easily solvable.
|
meillo@22
|
1167 Possibly, they require a redesign.
|
meillo@30
|
1168 Maybe \s-1IMAP\s0 is too different to the classic mail model which \s-1MH\s0 covers,
|
meillo@30
|
1169 hence \s-1MH\s0 may never work well with \s-1IMAP\s0.
|
meillo@22
|
1170 .PP
|
meillo@22
|
1171 The other kind of problem is human habits.
|
meillo@22
|
1172 When in this world almost all \s-1MUA\s0s are monolithic,
|
meillo@22
|
1173 it is very difficult to convince people to use a toolbox style \s-1MUA\s0
|
meillo@22
|
1174 like \s-1MH\s0.
|
meillo@22
|
1175 The habits are so strong, that even people who understood the concept
|
meillo@30
|
1176 and advantages of \s-1MH\s0 do not like to switch,
|
meillo@30
|
1177 simply because \s-1MH\s0 is different.
|
meillo@30
|
1178 Unfortunately, the frontends to \s-1MH\s0, which could provide familiar look'n'feel,
|
meillo@30
|
1179 are quite outdated and thus not very appealing compared to the modern interfaces
|
meillo@30
|
1180 which monolithic \s-1MUA\s0s offer.
|
meillo@20
|
1181
|
meillo@20
|
1182 .NH 2
|
meillo@20
|
1183 Summary \s-1MH\s0
|
meillo@42
|
1184 .XS
|
meillo@42
|
1185 \*(SN Summary \s-1MH\s0
|
meillo@42
|
1186 .XE
|
meillo@20
|
1187 .LP
|
meillo@31
|
1188 \s-1MH\s0 is an \s-1MUA\s0 that follows the Unix Philosophy in its design
|
meillo@31
|
1189 and implementation.
|
meillo@31
|
1190 It consists of a toolchest of small tools, each of them does one job well.
|
meillo@31
|
1191 The tools are orthogonal to each other, to a large extend.
|
meillo@31
|
1192 However, for historical reasons, there also exist distinct tools
|
meillo@31
|
1193 that cover the same task.
|
meillo@31
|
1194 .PP
|
meillo@31
|
1195 The toolchest approach offers great flexibility to the user.
|
meillo@31
|
1196 He can use the complete power of the Unix shell with \s-1MH\s0.
|
meillo@31
|
1197 This makes \s-1MH\s0 a very powerful mail system.
|
meillo@31
|
1198 Extending and customizing \s-1MH\s0 is easy and encouraged, too.
|
meillo@31
|
1199 .PP
|
meillo@31
|
1200 Apart from the user's perspective, \s-1MH\s0 is development-friendly.
|
meillo@31
|
1201 Its overall design follows clear rules.
|
meillo@31
|
1202 The single tools do only one job, thus they are easy to understand,
|
meillo@31
|
1203 easy to write, and good to maintain.
|
meillo@31
|
1204 They are all independent and do not interfere with the others.
|
meillo@31
|
1205 Automated testing of their function is a straight forward task.
|
meillo@31
|
1206 .PP
|
meillo@31
|
1207 It is sad, that \s-1MH\s0's differentness is its largest problem,
|
meillo@31
|
1208 as its differentness is also its largest advantage.
|
meillo@31
|
1209 Unfortunately, for most people their habits are stronger
|
meillo@31
|
1210 than the attraction of the clear design and the power, \s-1MH\s0 offers.
|
meillo@0
|
1211
|
meillo@8
|
1212
|
meillo@8
|
1213
|
meillo@0
|
1214 .NH 1
|
meillo@0
|
1215 Case study: uzbl
|
meillo@42
|
1216 .XS
|
meillo@42
|
1217 .sp .5v
|
meillo@42
|
1218 \*(SN Case study: uzbl
|
meillo@42
|
1219 .XE
|
meillo@32
|
1220 .LP
|
meillo@32
|
1221 The last chapter took a look on the \s-1MUA\s0 \s-1MH\s0,
|
meillo@32
|
1222 this chapter is about uzbl, a web browser that adheres to the Unix Philosophy.
|
meillo@32
|
1223 ``uzbl'' is the \fIlolcat\fP's word for the English adjective ``usable''.
|
meillo@32
|
1224 It is pronounced the identical.
|
meillo@0
|
1225
|
meillo@0
|
1226 .NH 2
|
meillo@32
|
1227 Historical background
|
meillo@42
|
1228 .XS
|
meillo@42
|
1229 \*(SN Historical background
|
meillo@42
|
1230 .XE
|
meillo@0
|
1231 .LP
|
meillo@32
|
1232 Uzbl was started by Dieter Plaetinck in April 2009.
|
meillo@32
|
1233 The idea was born in a thread in the Arch Linux forum.
|
meillo@32
|
1234 .[
|
meillo@32
|
1235 %O http://bbs.archlinux.org/viewtopic.php?id=67463
|
meillo@32
|
1236 .]
|
meillo@32
|
1237 After some discussion about failures of well known web browsers,
|
meillo@32
|
1238 Plaetinck (alias Dieter@be) came up with a very sketchy proposal
|
meillo@32
|
1239 of how a better web browser could look like.
|
meillo@32
|
1240 To the question of another member, if Plaetinck would write that program,
|
meillo@32
|
1241 because it would sound fantastic, Plaetinck replied:
|
meillo@32
|
1242 ``Maybe, if I find the time ;-)''.
|
meillo@32
|
1243 .PP
|
meillo@32
|
1244 Fortunately, he found the time.
|
meillo@32
|
1245 One day later, the first prototype was out.
|
meillo@32
|
1246 One week later, uzbl had an own website.
|
meillo@32
|
1247 One month after the first code showed up,
|
meillo@32
|
1248 a mailing list was installed to coordinate and discuss further development.
|
meillo@32
|
1249 A wiki was set up to store documentation and scripts that showed up on the
|
meillo@32
|
1250 mailing list and elsewhere.
|
meillo@32
|
1251 .PP
|
meillo@38
|
1252 In the, now, one year of uzbl's existence, it was heavily developed in various branches.
|
meillo@32
|
1253 Plaetinck's task became more and more to only merge the best code from the
|
meillo@32
|
1254 different branches into his main branch, and to apply patches.
|
meillo@32
|
1255 About once a month, Plaetinck released a new version.
|
meillo@32
|
1256 In September 2009, he presented several forks of uzbl.
|
meillo@38
|
1257 Uzbl, actually, opened the field for a whole family of web browsers with similar shape.
|
meillo@32
|
1258 .PP
|
meillo@32
|
1259 In July 2009, \fILinux Weekly News\fP published an interview with Plaetinck about uzbl.
|
meillo@32
|
1260 In September 2009, the uzbl web browser was on \fISlashdot\fP.
|
meillo@0
|
1261
|
meillo@0
|
1262 .NH 2
|
meillo@32
|
1263 Contrasts to other web browsers
|
meillo@42
|
1264 .XS
|
meillo@42
|
1265 \*(SN Contrasts to other web browsers
|
meillo@42
|
1266 .XE
|
meillo@0
|
1267 .LP
|
meillo@32
|
1268 Like most \s-1MUA\s0s are monolithic, but \s-1MH\s0 is a toolchest,
|
meillo@32
|
1269 most web browsers are monolithic, but uzbl is a frontend to a toolchest.
|
meillo@32
|
1270 .PP
|
meillo@32
|
1271 Today, uzbl is divided into uzbl-core and uzbl-browser.
|
meillo@32
|
1272 Uzbl-core is, how its name already indicates, the core of uzbl.
|
meillo@32
|
1273 It handles commands and events to interface other programs,
|
meillo@32
|
1274 and also displays webpages by using webkit as render engine.
|
meillo@32
|
1275 Uzbl-browser combines uzbl-core with a bunch of handler scripts, a status bar,
|
meillo@32
|
1276 an event manager, yanking, pasting, page searching, zooming, and more stuff,
|
meillo@32
|
1277 to form a ``complete'' web browser.
|
meillo@32
|
1278 In the following text, the term ``uzbl'' usually stands for uzbl-browser,
|
meillo@32
|
1279 so uzbl-core is included.
|
meillo@32
|
1280 .PP
|
meillo@32
|
1281 Unlike most other web browsers, uzbl is mainly the mediator between the
|
meillo@32
|
1282 various tools that cover single jobs of web browsing.
|
meillo@35
|
1283 Therefore, uzbl listens for commands on a named pipe (fifo), a Unix socket,
|
meillo@35
|
1284 and on stdin, and it writes events to a Unix socket and to stdout.
|
meillo@35
|
1285 The graphical rendering of the webpage is done by webkit, a web content engine.
|
meillo@35
|
1286 Uzbl-core is build around this library.
|
meillo@35
|
1287 Loading a webpage in a running uzbl instance requires only:
|
meillo@32
|
1288 .DS
|
meillo@32
|
1289 .CW
|
meillo@32
|
1290 echo 'uri http://example.org' >/path/to/uzbl-fifo
|
meillo@32
|
1291 .DE
|
meillo@32
|
1292 .PP
|
meillo@32
|
1293 Downloads, browsing history, bookmarks, and thelike are not provided
|
meillo@32
|
1294 by uzbl-core itself, as they are in other web browsers.
|
meillo@35
|
1295 Uzbl-browser also only provides, so called, handler scripts that wrap
|
meillo@35
|
1296 external applications which provide the actual functionality.
|
meillo@32
|
1297 For instance, \fIwget\fP is used to download files and uzbl-browser
|
meillo@32
|
1298 includes a script that calls wget with appropriate options in
|
meillo@32
|
1299 a prepared environment.
|
meillo@32
|
1300 .PP
|
meillo@32
|
1301 Modern web browsers are proud to have addons, plugins, and modules, instead.
|
meillo@32
|
1302 This is their effort to achieve similar goals.
|
meillo@35
|
1303 But instead of using existing, external programs, modern web browsers
|
meillo@35
|
1304 include these functions, although they might be loaded at runtime.
|
meillo@0
|
1305
|
meillo@0
|
1306 .NH 2
|
meillo@32
|
1307 Discussion of the design
|
meillo@42
|
1308 .XS
|
meillo@42
|
1309 \*(SN Discussion of the design
|
meillo@42
|
1310 .XE
|
meillo@0
|
1311 .LP
|
meillo@32
|
1312 This section discusses uzbl in regard of the Unix Philosophy,
|
meillo@32
|
1313 as identified by Gancarz.
|
meillo@32
|
1314
|
meillo@32
|
1315 .PP
|
meillo@35
|
1316 .B "Make each program do one thing well" .
|
meillo@35
|
1317 Uzbl tries to be a web browser and nothing else.
|
meillo@36
|
1318 The common definition of a web browser is, of course, highly influenced by
|
meillo@36
|
1319 existing implementations of web browsers, although they are degenerated.
|
meillo@35
|
1320 Web browsers should be programs to browse the web, and nothing more.
|
meillo@35
|
1321 This is the one thing they should do, as demanded by the Unix Philosophy.
|
meillo@36
|
1322 .PP
|
meillo@36
|
1323 Web browsers should, for instance, not manage downloads.
|
meillo@35
|
1324 This is the job download managers exist for.
|
meillo@35
|
1325 Download managers do primary care about being good in downloading files.
|
meillo@35
|
1326 Modern web browsers provide download management only as a secondary feature.
|
meillo@35
|
1327 How could they perform this job better, than programs that exist only for
|
meillo@35
|
1328 this very job?
|
meillo@35
|
1329 And how could anyone want less than the best download manager available?
|
meillo@32
|
1330 .PP
|
meillo@35
|
1331 A web browser's job is to let the user browse the web.
|
meillo@35
|
1332 This means, navigating through websites by following links.
|
meillo@36
|
1333 Rendering the \s-1HTML\s0 sources is a different job, too.
|
meillo@36
|
1334 It is covered by the webkit render engine, in uzbl's case.
|
meillo@35
|
1335 Audio and video content and files like PostScript, \s-1PDF\s0, and the like,
|
meillo@36
|
1336 are also not the job of a web browser.
|
meillo@36
|
1337 They should be handled by external applications \(en
|
meillo@36
|
1338 ones which's job is to handle such data.
|
meillo@35
|
1339 Uzbl strives to do it this way.
|
meillo@36
|
1340 .PP
|
meillo@36
|
1341 Remember Doug McIlroy:
|
meillo@35
|
1342 .I
|
meillo@35
|
1343 ``Write programs that do one thing and do it well.
|
meillo@35
|
1344 Write programs to work together.''
|
meillo@35
|
1345 .R
|
meillo@35
|
1346 .PP
|
meillo@35
|
1347 The lesser tenet
|
meillo@35
|
1348 .B "allow the user to tailor the environment
|
meillo@35
|
1349 matches good here.
|
meillo@35
|
1350 There was the question, how anyone could want anything less than the
|
meillo@35
|
1351 best program for the job.
|
meillo@36
|
1352 But as personal preferences matter much,
|
meillo@36
|
1353 it is probably more important to ask:
|
meillo@35
|
1354 How could anyone want something else than his preferred program for the job?
|
meillo@36
|
1355 .PP
|
meillo@35
|
1356 Usually users want one program for one job.
|
meillo@35
|
1357 Hence, whenever the task is, for instance, downloading,
|
meillo@36
|
1358 exactly one download manager should be used.
|
meillo@35
|
1359 More advanced users might want to have this download manager in this
|
meillo@35
|
1360 situation and that one in that situation.
|
meillo@35
|
1361 They should be able to configure it this way.
|
meillo@35
|
1362 With uzbl, one can use any download manager the user wants.
|
meillo@36
|
1363 To switch to a different one, only one line in a small handler script
|
meillo@35
|
1364 needs to be changed.
|
meillo@36
|
1365 Alternatively it would be possible to query an entry in a global file
|
meillo@36
|
1366 or an environment variable, which specifies the download manager to use,
|
meillo@35
|
1367 in the handler script.
|
meillo@36
|
1368 .PP
|
meillo@35
|
1369 As uzbl does neither have its own download manager nor depends on a
|
meillo@35
|
1370 specific one, thus uzbl's browsing abilities will not be lowered by having
|
meillo@35
|
1371 a bad download manager.
|
meillo@36
|
1372 Uzbl's download capabilities will just as good as the ones of the best
|
meillo@36
|
1373 download manager available on the system.
|
meillo@38
|
1374 Of course, this applies to all of the other supplementary tools, too.
|
meillo@32
|
1375
|
meillo@32
|
1376 .PP
|
meillo@36
|
1377 .B "Use software leverage to your advantage" .
|
meillo@36
|
1378 Shell scripts are a good choice to extend uzbl.
|
meillo@36
|
1379 Uzbl is designed to be extended by external tools.
|
meillo@36
|
1380 These external tools are usually wrapped by small handler shell scripts.
|
meillo@36
|
1381 Shell scripts are the glue in this approach.
|
meillo@36
|
1382 They make the various parts fit together.
|
meillo@36
|
1383 .PP
|
meillo@36
|
1384 As an example, the history mechanism of uzbl shall be presented.
|
meillo@36
|
1385 Uzbl is configured to spawn a script to append an entry to the history
|
meillo@36
|
1386 whenever the event of a fully loaded page occurs.
|
meillo@36
|
1387 The script to append the entry to the history not much more than:
|
meillo@36
|
1388 .DS
|
meillo@36
|
1389 .CW
|
meillo@36
|
1390 #!/bin/sh
|
meillo@36
|
1391 file=/path/to/uzbl-history
|
meillo@36
|
1392 echo `date +'%Y-%m-%d %H:%M:%S'`" $6 $7" >> $file
|
meillo@36
|
1393 .DE
|
meillo@36
|
1394 \f(CW$6\fP and \f(CW$7\fP expand to the \s-1URL\s0 and the page title.
|
meillo@36
|
1395 For loading an entry, a key is bound to spawn a load from history script.
|
meillo@36
|
1396 The script reverses the history to have newer entries first,
|
meillo@36
|
1397 then displays \fIdmenu\fP to select an item,
|
meillo@36
|
1398 and afterwards writes the selected \s-1URL\s0 into uzbl's command input pipe.
|
meillo@36
|
1399 With error checking and corner cases removed, the script looks like this:
|
meillo@36
|
1400 .DS
|
meillo@36
|
1401 .CW
|
meillo@36
|
1402 #!/bin/sh
|
meillo@36
|
1403 file=/path/to/uzbl-history
|
meillo@36
|
1404 goto=`tac $file | dmenu | cut -d' ' -f 3`
|
meillo@36
|
1405 echo "uri $goto" > $4
|
meillo@36
|
1406 .DE
|
meillo@36
|
1407 \f(CW$4\fP expands to the path of the command input pipe of the current
|
meillo@36
|
1408 uzbl instance.
|
meillo@32
|
1409
|
meillo@32
|
1410 .PP
|
meillo@33
|
1411 .B "Avoid captive user interfaces" .
|
meillo@36
|
1412 One could say, that uzbl, to a large extend, actually \fIis\fP
|
meillo@36
|
1413 a captive user interface.
|
meillo@37
|
1414 But the difference to most other web browsers is, that uzbl is only
|
meillo@37
|
1415 the captive user interface frontend and the core of the backend.
|
meillo@38
|
1416 Many parts of the backend are independent of uzbl.
|
meillo@37
|
1417 Some are distributed with uzbl, for some external programs, handler scripts
|
meillo@37
|
1418 are distributed, arbitrary additional functionality can be added if desired.
|
meillo@37
|
1419 .PP
|
meillo@37
|
1420 The frontend is captive \(en that is true.
|
meillo@37
|
1421 This is okay for the task of browsing the web, as this task is only relevant
|
meillo@37
|
1422 for humans.
|
meillo@37
|
1423 Automated programs would \fIcrawl\fP the web.
|
meillo@37
|
1424 That means, they read the source directly.
|
meillo@37
|
1425 The source includes all the semantics.
|
meillo@37
|
1426 The graphical representation is just for humans to transfer the semantics
|
meillo@37
|
1427 more intuitively.
|
meillo@32
|
1428
|
meillo@32
|
1429 .PP
|
meillo@33
|
1430 .B "Make every program a filter" .
|
meillo@37
|
1431 Graphical web browsers are almost dead ends in the chain of information flow.
|
meillo@37
|
1432 Thus it is difficult to see what graphical web browsers should filter.
|
meillo@37
|
1433 Graphical web browsers exist almost only for interactive use by humans.
|
meillo@37
|
1434 The only case when one might want to automate the rendering function is
|
meillo@37
|
1435 to generate images of rendered webpages.
|
meillo@37
|
1436
|
meillo@37
|
1437 .PP
|
meillo@37
|
1438 .B "Small is beautiful"
|
meillo@38
|
1439 is not easy to apply to a web browser, primary because modern web technology
|
meillo@38
|
1440 is very complex; hence the rendering task is very complex.
|
meillo@37
|
1441 Modern web browsers will always consist of many thousand lines of code,
|
meillo@37
|
1442 unfortunately.
|
meillo@37
|
1443 Using the toolchest approach and wrappers can split the browser into
|
meillo@37
|
1444 several small parts, tough.
|
meillo@37
|
1445 .PP
|
meillo@37
|
1446 Uzbl-core consists of about 3\,500 lines of C code.
|
meillo@37
|
1447 The distribution includes another 3\,500 lines of Shell and Python code,
|
meillo@37
|
1448 which are the handler scripts and plugins like a modal interface.
|
meillo@38
|
1449 Further more, uzbl uses functionality of external tools like
|
meillo@38
|
1450 \fIwget\fP and \fInetcat\fP.
|
meillo@37
|
1451 Up to this point, uzbl looks pretty neat and small.
|
meillo@38
|
1452 The ugly part of uzbl is the web content renderer, webkit.
|
meillo@37
|
1453 Webkit consists of roughly 400\,000 (!) lines of code.
|
meillo@38
|
1454 Unfortunately, small web render engines are not possible anymore
|
meillo@38
|
1455 because of the modern web.
|
meillo@38
|
1456 The problems section will explain this in more detail.
|
meillo@35
|
1457
|
meillo@35
|
1458 .PP
|
meillo@35
|
1459 .B "Build a prototype as soon as possible" .
|
meillo@35
|
1460 Plaetinck made his code public, right from the beginning.
|
meillo@38
|
1461 Discussion and development was, and still is, open to everyone interested.
|
meillo@38
|
1462 Development versions of uzbl can be obtained very simply from the code
|
meillo@38
|
1463 repository.
|
meillo@38
|
1464 Within the first year of uzbl's existence, a new version was released
|
meillo@35
|
1465 more often than once a month.
|
meillo@38
|
1466 Different forks and branches arose.
|
meillo@38
|
1467 They introduced new features, which were tested for suitability.
|
meillo@35
|
1468 The experiences of using prototypes influenced further development.
|
meillo@35
|
1469 Actually, all development was community driven.
|
meillo@38
|
1470 Plaetinck says, three months after uzbl's birth:
|
meillo@35
|
1471 ``Right now I hardly code anything myself for Uzbl.
|
meillo@35
|
1472 I just merge in other people's code, ponder a lot, and lead the discussions.''
|
meillo@35
|
1473 .[
|
meillo@36
|
1474 %A FIXME
|
meillo@35
|
1475 %O http://lwn.net/Articles/341245/
|
meillo@35
|
1476 .]
|
meillo@32
|
1477
|
meillo@0
|
1478
|
meillo@0
|
1479 .NH 2
|
meillo@0
|
1480 Problems
|
meillo@42
|
1481 .XS
|
meillo@42
|
1482 \*(SN Problems
|
meillo@42
|
1483 .XE
|
meillo@0
|
1484 .LP
|
meillo@38
|
1485 Similar to \s-1MH\s0, uzbl, too suffers from being different.
|
meillo@38
|
1486 It is sad, but people use what they know.
|
meillo@38
|
1487 Fortunately, uzbl's user interface can look and feel very much the
|
meillo@38
|
1488 same as the one of the well known web browsers,
|
meillo@38
|
1489 hiding the internal differences.
|
meillo@38
|
1490 But uzbl has to provide this similar look and feel to be accepted
|
meillo@38
|
1491 as a ``normal'' browser by ``normal'' users.
|
meillo@37
|
1492 .PP
|
meillo@38
|
1493 The more important problem is the modern web.
|
meillo@38
|
1494 The modern web is simply broken.
|
meillo@38
|
1495 It has state in a state-less protocol,
|
meillo@38
|
1496 it misuses technologies,
|
meillo@38
|
1497 and it is helplessly overloaded.
|
meillo@38
|
1498 The result are web content render engines that must consist
|
meillo@38
|
1499 of hundreds of thousands lines of code.
|
meillo@38
|
1500 They also must combine and integrate many different technologies,
|
meillo@38
|
1501 only to make our modern web usable.
|
meillo@38
|
1502 Website to image converter are hardly possible to run without
|
meillo@38
|
1503 human interaction because of state in sessions, impossible
|
meillo@38
|
1504 deep-linking, and unautomatable technologies.
|
meillo@37
|
1505 .PP
|
meillo@38
|
1506 The web was misused to provide all kinds of imaginable wishes.
|
meillo@38
|
1507 Now web browsers, and eventually the users, suffer from it.
|
meillo@37
|
1508
|
meillo@8
|
1509
|
meillo@32
|
1510 .NH 2
|
meillo@32
|
1511 Summary uzbl
|
meillo@42
|
1512 .XS
|
meillo@42
|
1513 \*(SN Summary uzbl
|
meillo@42
|
1514 .XE
|
meillo@32
|
1515 .LP
|
meillo@38
|
1516 ``Uzbl is a browser that adheres to the Unix Philosophy'',
|
meillo@38
|
1517 that is how uzbl is seen by its authors.
|
meillo@38
|
1518 Indeed, uzbl follows the Unix Philosophy in many ways.
|
meillo@38
|
1519 It consists of independent parts that work together,
|
meillo@38
|
1520 its core is mainly a mediator which glues the parts together.
|
meillo@38
|
1521 .PP
|
meillo@38
|
1522 Software leverage can excellently be seen in uzbl.
|
meillo@38
|
1523 It makes use of external tools, separates independent tasks
|
meillo@38
|
1524 in independent parts, and glues them together with small
|
meillo@38
|
1525 handler scripts, around uzbl-core.
|
meillo@38
|
1526 .PP
|
meillo@38
|
1527 As uzbl, more or less, consists of a set of tools and a bit
|
meillo@38
|
1528 of glue, anyone can put the parts together and expand it
|
meillo@38
|
1529 in any desired way.
|
meillo@38
|
1530 Uzbl is very flexible and customizable.
|
meillo@38
|
1531 These properties make it valuable for advanced users,
|
meillo@38
|
1532 but may keep novice users from using it.
|
meillo@38
|
1533 .PP
|
meillo@38
|
1534 Uzbl's main problem is the modern web, that makes it hard
|
meillo@38
|
1535 to design a sane web browser.
|
meillo@38
|
1536 Despite this bad situation, uzbl does a fairly good job.
|
meillo@32
|
1537
|
meillo@8
|
1538
|
meillo@0
|
1539 .NH 1
|
meillo@0
|
1540 Final thoughts
|
meillo@42
|
1541 .XS
|
meillo@42
|
1542 .sp .5v
|
meillo@42
|
1543 \*(SN Final thoughts
|
meillo@42
|
1544 .XE
|
meillo@0
|
1545
|
meillo@0
|
1546 .NH 2
|
meillo@0
|
1547 Quick summary
|
meillo@42
|
1548 .XS
|
meillo@42
|
1549 \*(SN Quick summary
|
meillo@42
|
1550 .XE
|
meillo@0
|
1551 .LP
|
meillo@0
|
1552 good design
|
meillo@0
|
1553 .LP
|
meillo@0
|
1554 unix phil
|
meillo@0
|
1555 .LP
|
meillo@0
|
1556 case studies
|
meillo@0
|
1557
|
meillo@0
|
1558 .NH 2
|
meillo@0
|
1559 Why people should choose
|
meillo@42
|
1560 .XS
|
meillo@42
|
1561 \*(SN Why people should choose
|
meillo@42
|
1562 .XE
|
meillo@0
|
1563 .LP
|
meillo@0
|
1564 Make the right choice!
|
meillo@0
|
1565
|
meillo@42
|
1566 .nr PI .3i
|
meillo@0
|
1567 .rm ]<
|
meillo@0
|
1568 .de ]<
|
meillo@0
|
1569 .LP
|
meillo@0
|
1570 .de FP
|
meillo@0
|
1571 .IP \\\\$1.
|
meillo@0
|
1572 \\..
|
meillo@0
|
1573 .rm FS FE
|
meillo@0
|
1574 ..
|
meillo@42
|
1575 .ds CH "
|
meillo@42
|
1576 .bp
|
meillo@42
|
1577 .rs
|
meillo@42
|
1578 .sp .3i
|
meillo@42
|
1579 .TL
|
meillo@0
|
1580 References
|
meillo@42
|
1581 .LP
|
meillo@42
|
1582 .XS
|
meillo@42
|
1583 .sp .5v
|
meillo@42
|
1584 References
|
meillo@42
|
1585 .XE
|
meillo@42
|
1586 .sp 2v
|
meillo@42
|
1587 .nr PS -1
|
meillo@42
|
1588 .nr VS -1
|
meillo@0
|
1589 .[
|
meillo@0
|
1590 $LIST$
|
meillo@0
|
1591 .]
|
meillo@42
|
1592 .\".wh -1p
|
meillo@42
|
1593 .bp
|
meillo@42
|
1594 .PX
|