]> code.delx.au - gnu-emacs/blob - doc/lispref/text.texi
Minor quoting etc. fixes to lispref manual
[gnu-emacs] / doc / lispref / text.texi
1 @c -*-texinfo-*-
2 @c This is part of the GNU Emacs Lisp Reference Manual.
3 @c Copyright (C) 1990-1995, 1998-2015 Free Software Foundation, Inc.
4 @c See the file elisp.texi for copying conditions.
5 @node Text
6 @chapter Text
7 @cindex text
8
9 This chapter describes the functions that deal with the text in a
10 buffer. Most examine, insert, or delete text in the current buffer,
11 often operating at point or on text adjacent to point. Many are
12 interactive. All the functions that change the text provide for undoing
13 the changes (@pxref{Undo}).
14
15 Many text-related functions operate on a region of text defined by two
16 buffer positions passed in arguments named @var{start} and @var{end}.
17 These arguments should be either markers (@pxref{Markers}) or numeric
18 character positions (@pxref{Positions}). The order of these arguments
19 does not matter; it is all right for @var{start} to be the end of the
20 region and @var{end} the beginning. For example, @code{(delete-region 1
21 10)} and @code{(delete-region 10 1)} are equivalent. An
22 @code{args-out-of-range} error is signaled if either @var{start} or
23 @var{end} is outside the accessible portion of the buffer. In an
24 interactive call, point and the mark are used for these arguments.
25
26 @cindex buffer contents
27 Throughout this chapter, ``text'' refers to the characters in the
28 buffer, together with their properties (when relevant). Keep in mind
29 that point is always between two characters, and the cursor appears on
30 the character after point.
31
32 @menu
33 * Near Point:: Examining text in the vicinity of point.
34 * Buffer Contents:: Examining text in a general fashion.
35 * Comparing Text:: Comparing substrings of buffers.
36 * Insertion:: Adding new text to a buffer.
37 * Commands for Insertion:: User-level commands to insert text.
38 * Deletion:: Removing text from a buffer.
39 * User-Level Deletion:: User-level commands to delete text.
40 * The Kill Ring:: Where removed text sometimes is saved for later use.
41 * Undo:: Undoing changes to the text of a buffer.
42 * Maintaining Undo:: How to enable and disable undo information.
43 How to control how much information is kept.
44 * Filling:: Functions for explicit filling.
45 * Margins:: How to specify margins for filling commands.
46 * Adaptive Fill:: Adaptive Fill mode chooses a fill prefix from context.
47 * Auto Filling:: How auto-fill mode is implemented to break lines.
48 * Sorting:: Functions for sorting parts of the buffer.
49 * Columns:: Computing horizontal positions, and using them.
50 * Indentation:: Functions to insert or adjust indentation.
51 * Case Changes:: Case conversion of parts of the buffer.
52 * Text Properties:: Assigning Lisp property lists to text characters.
53 * Substitution:: Replacing a given character wherever it appears.
54 * Registers:: How registers are implemented. Accessing the text or
55 position stored in a register.
56 * Transposition:: Swapping two portions of a buffer.
57 * Decompression:: Dealing with compressed data.
58 * Base 64:: Conversion to or from base 64 encoding.
59 * Checksum/Hash:: Computing cryptographic hashes.
60 * Parsing HTML/XML:: Parsing HTML and XML.
61 * Atomic Changes:: Installing several buffer changes "atomically".
62 * Change Hooks:: Supplying functions to be run when text is changed.
63 @end menu
64
65 @node Near Point
66 @section Examining Text Near Point
67 @cindex text near point
68
69 Many functions are provided to look at the characters around point.
70 Several simple functions are described here. See also @code{looking-at}
71 in @ref{Regexp Search}.
72
73 In the following four functions, ``beginning'' or ``end'' of buffer
74 refers to the beginning or end of the accessible portion.
75
76 @defun char-after &optional position
77 This function returns the character in the current buffer at (i.e.,
78 immediately after) position @var{position}. If @var{position} is out of
79 range for this purpose, either before the beginning of the buffer, or at
80 or beyond the end, then the value is @code{nil}. The default for
81 @var{position} is point.
82
83 In the following example, assume that the first character in the
84 buffer is @samp{@@}:
85
86 @example
87 @group
88 (string (char-after 1))
89 @result{} "@@"
90 @end group
91 @end example
92 @end defun
93
94 @defun char-before &optional position
95 This function returns the character in the current buffer immediately
96 before position @var{position}. If @var{position} is out of range for
97 this purpose, either at or before the beginning of the buffer, or beyond
98 the end, then the value is @code{nil}. The default for
99 @var{position} is point.
100 @end defun
101
102 @defun following-char
103 This function returns the character following point in the current
104 buffer. This is similar to @code{(char-after (point))}. However, if
105 point is at the end of the buffer, then @code{following-char} returns 0.
106
107 Remember that point is always between characters, and the cursor
108 normally appears over the character following point. Therefore, the
109 character returned by @code{following-char} is the character the
110 cursor is over.
111
112 In this example, point is between the @samp{a} and the @samp{c}.
113
114 @example
115 @group
116 ---------- Buffer: foo ----------
117 Gentlemen may cry ``Pea@point{}ce! Peace!,''
118 but there is no peace.
119 ---------- Buffer: foo ----------
120 @end group
121
122 @group
123 (string (preceding-char))
124 @result{} "a"
125 (string (following-char))
126 @result{} "c"
127 @end group
128 @end example
129 @end defun
130
131 @defun preceding-char
132 This function returns the character preceding point in the current
133 buffer. See above, under @code{following-char}, for an example. If
134 point is at the beginning of the buffer, @code{preceding-char} returns
135 0.
136 @end defun
137
138 @defun bobp
139 This function returns @code{t} if point is at the beginning of the
140 buffer. If narrowing is in effect, this means the beginning of the
141 accessible portion of the text. See also @code{point-min} in
142 @ref{Point}.
143 @end defun
144
145 @defun eobp
146 This function returns @code{t} if point is at the end of the buffer.
147 If narrowing is in effect, this means the end of accessible portion of
148 the text. See also @code{point-max} in @xref{Point}.
149 @end defun
150
151 @defun bolp
152 This function returns @code{t} if point is at the beginning of a line.
153 @xref{Text Lines}. The beginning of the buffer (or of its accessible
154 portion) always counts as the beginning of a line.
155 @end defun
156
157 @defun eolp
158 This function returns @code{t} if point is at the end of a line. The
159 end of the buffer (or of its accessible portion) is always considered
160 the end of a line.
161 @end defun
162
163 @node Buffer Contents
164 @section Examining Buffer Contents
165 @cindex buffer portion as string
166
167 This section describes functions that allow a Lisp program to
168 convert any portion of the text in the buffer into a string.
169
170 @defun buffer-substring start end
171 This function returns a string containing a copy of the text of the
172 region defined by positions @var{start} and @var{end} in the current
173 buffer. If the arguments are not positions in the accessible portion
174 of the buffer, @code{buffer-substring} signals an
175 @code{args-out-of-range} error.
176
177 Here's an example which assumes Font-Lock mode is not enabled:
178
179 @example
180 @group
181 ---------- Buffer: foo ----------
182 This is the contents of buffer foo
183
184 ---------- Buffer: foo ----------
185 @end group
186
187 @group
188 (buffer-substring 1 10)
189 @result{} "This is t"
190 @end group
191 @group
192 (buffer-substring (point-max) 10)
193 @result{} "he contents of buffer foo\n"
194 @end group
195 @end example
196
197 If the text being copied has any text properties, these are copied into
198 the string along with the characters they belong to. @xref{Text
199 Properties}. However, overlays (@pxref{Overlays}) in the buffer and
200 their properties are ignored, not copied.
201
202 For example, if Font-Lock mode is enabled, you might get results like
203 these:
204
205 @example
206 @group
207 (buffer-substring 1 10)
208 @result{} #("This is t" 0 1 (fontified t) 1 9 (fontified t))
209 @end group
210 @end example
211 @end defun
212
213 @defun buffer-substring-no-properties start end
214 This is like @code{buffer-substring}, except that it does not copy text
215 properties, just the characters themselves. @xref{Text Properties}.
216 @end defun
217
218 @defun buffer-string
219 This function returns the contents of the entire accessible portion of
220 the current buffer, as a string.
221 @end defun
222
223 If you need to make sure the resulting string, when copied to a
224 different location, will not change its visual appearance due to
225 reordering of bidirectional text, use the
226 @code{buffer-substring-with-bidi-context} function
227 (@pxref{Bidirectional Display, buffer-substring-with-bidi-context}).
228
229 @defun filter-buffer-substring start end &optional delete
230 This function filters the buffer text between @var{start} and @var{end}
231 using a function specified by the variable
232 @code{filter-buffer-substring-function}, and returns the result.
233
234 The default filter function consults the obsolete wrapper hook
235 @code{filter-buffer-substring-functions}, and the obsolete variable
236 @code{buffer-substring-filters}. If both of these are @code{nil}, it
237 returns the unaltered text from the buffer, i.e., what
238 @code{buffer-substring} would return.
239
240 If @var{delete} is non-@code{nil}, the function deletes the text
241 between @var{start} and @var{end} after copying it, like
242 @code{delete-and-extract-region}.
243
244 Lisp code should use this function instead of @code{buffer-substring},
245 @code{buffer-substring-no-properties},
246 or @code{delete-and-extract-region} when copying into user-accessible
247 data structures such as the kill-ring, X clipboard, and registers.
248 Major and minor modes can modify @code{filter-buffer-substring-function}
249 to alter such text as it is copied out of the buffer.
250 @end defun
251
252 @defvar filter-buffer-substring-function
253 The value of this variable is a function that @code{filter-buffer-substring}
254 will call to do the actual work. The function receives three
255 arguments, the same as those of @code{filter-buffer-substring},
256 which it should treat as per the documentation of that function. It
257 should return the filtered text (and optionally delete the source text).
258 @end defvar
259
260 @noindent The following two variables are obsoleted by
261 @code{filter-buffer-substring-function}, but are still supported for
262 backward compatibility.
263
264 @defvar filter-buffer-substring-functions
265 This obsolete variable is a wrapper hook, whose members should be functions
266 that accept four arguments: @var{fun}, @var{start}, @var{end}, and
267 @var{delete}. @var{fun} is a function that takes three arguments
268 (@var{start}, @var{end}, and @var{delete}), and returns a string. In
269 both cases, the @var{start}, @var{end}, and @var{delete} arguments are
270 the same as those of @code{filter-buffer-substring}.
271
272 The first hook function is passed a @var{fun} that is equivalent to
273 the default operation of @code{filter-buffer-substring}, i.e., it
274 returns the buffer-substring between @var{start} and @var{end}
275 (processed by any @code{buffer-substring-filters}) and optionally
276 deletes the original text from the buffer. In most cases, the hook
277 function will call @var{fun} once, and then do its own processing of
278 the result. The next hook function receives a @var{fun} equivalent to
279 this, and so on. The actual return value is the result of all the
280 hook functions acting in sequence.
281 @end defvar
282
283 @defvar buffer-substring-filters
284 The value of this obsolete variable should be a list of functions
285 that accept a single string argument and return another string.
286 The default @code{filter-buffer-substring} function passes the buffer
287 substring to the first function in this list, and the return value of
288 each function is passed to the next function. The return value of the
289 last function is passed to @code{filter-buffer-substring-functions}.
290 @end defvar
291
292 @defun current-word &optional strict really-word
293 This function returns the symbol (or word) at or near point, as a
294 string. The return value includes no text properties.
295
296 If the optional argument @var{really-word} is non-@code{nil}, it finds a
297 word; otherwise, it finds a symbol (which includes both word
298 characters and symbol constituent characters).
299
300 If the optional argument @var{strict} is non-@code{nil}, then point
301 must be in or next to the symbol or word---if no symbol or word is
302 there, the function returns @code{nil}. Otherwise, a nearby symbol or
303 word on the same line is acceptable.
304 @end defun
305
306 @defun thing-at-point thing
307 Return the @var{thing} around or next to point, as a string.
308
309 The argument @var{thing} is a symbol which specifies a kind of syntactic
310 entity. Possibilities include @code{symbol}, @code{list}, @code{sexp},
311 @code{defun}, @code{filename}, @code{url}, @code{word}, @code{sentence},
312 @code{whitespace}, @code{line}, @code{page}, and others.
313
314 @example
315 ---------- Buffer: foo ----------
316 Gentlemen may cry ``Pea@point{}ce! Peace!,''
317 but there is no peace.
318 ---------- Buffer: foo ----------
319
320 (thing-at-point 'word)
321 @result{} "Peace"
322 (thing-at-point 'line)
323 @result{} "Gentlemen may cry ``Peace! Peace!,''\n"
324 (thing-at-point 'whitespace)
325 @result{} nil
326 @end example
327 @end defun
328
329 @node Comparing Text
330 @section Comparing Text
331 @cindex comparing buffer text
332
333 This function lets you compare portions of the text in a buffer, without
334 copying them into strings first.
335
336 @defun compare-buffer-substrings buffer1 start1 end1 buffer2 start2 end2
337 This function lets you compare two substrings of the same buffer or two
338 different buffers. The first three arguments specify one substring,
339 giving a buffer (or a buffer name) and two positions within the
340 buffer. The last three arguments specify the other substring in the
341 same way. You can use @code{nil} for @var{buffer1}, @var{buffer2}, or
342 both to stand for the current buffer.
343
344 The value is negative if the first substring is less, positive if the
345 first is greater, and zero if they are equal. The absolute value of
346 the result is one plus the index of the first differing characters
347 within the substrings.
348
349 This function ignores case when comparing characters
350 if @code{case-fold-search} is non-@code{nil}. It always ignores
351 text properties.
352
353 Suppose you have the text @w{@samp{foobarbar haha!rara!}} in the
354 current buffer; then in this example the two substrings are @samp{rbar
355 } and @samp{rara!}. The value is 2 because the first substring is
356 greater at the second character.
357
358 @example
359 (compare-buffer-substrings nil 6 11 nil 16 21)
360 @result{} 2
361 @end example
362 @end defun
363
364 @node Insertion
365 @section Inserting Text
366 @cindex insertion of text
367 @cindex text insertion
368
369 @cindex insertion before point
370 @cindex before point, insertion
371 @dfn{Insertion} means adding new text to a buffer. The inserted text
372 goes at point---between the character before point and the character
373 after point. Some insertion functions leave point before the inserted
374 text, while other functions leave it after. We call the former
375 insertion @dfn{after point} and the latter insertion @dfn{before point}.
376
377 Insertion moves markers located at positions after the insertion
378 point, so that they stay with the surrounding text (@pxref{Markers}).
379 When a marker points at the place of insertion, insertion may or may
380 not relocate the marker, depending on the marker's insertion type
381 (@pxref{Marker Insertion Types}). Certain special functions such as
382 @code{insert-before-markers} relocate all such markers to point after
383 the inserted text, regardless of the markers' insertion type.
384
385 Insertion functions signal an error if the current buffer is
386 read-only (@pxref{Read Only Buffers}) or if they insert within
387 read-only text (@pxref{Special Properties}).
388
389 These functions copy text characters from strings and buffers along
390 with their properties. The inserted characters have exactly the same
391 properties as the characters they were copied from. By contrast,
392 characters specified as separate arguments, not part of a string or
393 buffer, inherit their text properties from the neighboring text.
394
395 The insertion functions convert text from unibyte to multibyte in
396 order to insert in a multibyte buffer, and vice versa---if the text
397 comes from a string or from a buffer. However, they do not convert
398 unibyte character codes 128 through 255 to multibyte characters, not
399 even if the current buffer is a multibyte buffer. @xref{Converting
400 Representations}.
401
402 @defun insert &rest args
403 This function inserts the strings and/or characters @var{args} into the
404 current buffer, at point, moving point forward. In other words, it
405 inserts the text before point. An error is signaled unless all
406 @var{args} are either strings or characters. The value is @code{nil}.
407 @end defun
408
409 @defun insert-before-markers &rest args
410 This function inserts the strings and/or characters @var{args} into the
411 current buffer, at point, moving point forward. An error is signaled
412 unless all @var{args} are either strings or characters. The value is
413 @code{nil}.
414
415 This function is unlike the other insertion functions in that it
416 relocates markers initially pointing at the insertion point, to point
417 after the inserted text. If an overlay begins at the insertion point,
418 the inserted text falls outside the overlay; if a nonempty overlay
419 ends at the insertion point, the inserted text falls inside that
420 overlay.
421 @end defun
422
423 @deffn Command insert-char character &optional count inherit
424 This command inserts @var{count} instances of @var{character} into the
425 current buffer before point. The argument @var{count} must be an
426 integer, and @var{character} must be a character.
427
428 If called interactively, this command prompts for @var{character}
429 using its Unicode name or its code point. @xref{Inserting Text,,,
430 emacs, The GNU Emacs Manual}.
431
432 This function does not convert unibyte character codes 128 through 255
433 to multibyte characters, not even if the current buffer is a multibyte
434 buffer. @xref{Converting Representations}.
435
436 If @var{inherit} is non-@code{nil}, the inserted characters inherit
437 sticky text properties from the two characters before and after the
438 insertion point. @xref{Sticky Properties}.
439 @end deffn
440
441 @defun insert-buffer-substring from-buffer-or-name &optional start end
442 This function inserts a portion of buffer @var{from-buffer-or-name}
443 into the current buffer before point. The text inserted is the region
444 between @var{start} (inclusive) and @var{end} (exclusive). (These
445 arguments default to the beginning and end of the accessible portion
446 of that buffer.) This function returns @code{nil}.
447
448 In this example, the form is executed with buffer @samp{bar} as the
449 current buffer. We assume that buffer @samp{bar} is initially empty.
450
451 @example
452 @group
453 ---------- Buffer: foo ----------
454 We hold these truths to be self-evident, that all
455 ---------- Buffer: foo ----------
456 @end group
457
458 @group
459 (insert-buffer-substring "foo" 1 20)
460 @result{} nil
461
462 ---------- Buffer: bar ----------
463 We hold these truth@point{}
464 ---------- Buffer: bar ----------
465 @end group
466 @end example
467 @end defun
468
469 @defun insert-buffer-substring-no-properties from-buffer-or-name &optional start end
470 This is like @code{insert-buffer-substring} except that it does not
471 copy any text properties.
472 @end defun
473
474 @xref{Sticky Properties}, for other insertion functions that inherit
475 text properties from the nearby text in addition to inserting it.
476 Whitespace inserted by indentation functions also inherits text
477 properties.
478
479 @node Commands for Insertion
480 @section User-Level Insertion Commands
481
482 This section describes higher-level commands for inserting text,
483 commands intended primarily for the user but useful also in Lisp
484 programs.
485
486 @deffn Command insert-buffer from-buffer-or-name
487 This command inserts the entire accessible contents of
488 @var{from-buffer-or-name} (which must exist) into the current buffer
489 after point. It leaves the mark after the inserted text. The value
490 is @code{nil}.
491 @end deffn
492
493 @deffn Command self-insert-command count
494 @cindex character insertion
495 @cindex self-insertion
496 This command inserts the last character typed; it does so @var{count}
497 times, before point, and returns @code{nil}. Most printing characters
498 are bound to this command. In routine use, @code{self-insert-command}
499 is the most frequently called function in Emacs, but programs rarely use
500 it except to install it on a keymap.
501
502 In an interactive call, @var{count} is the numeric prefix argument.
503
504 @c FIXME: This variable is obsolete since 23.1.
505 Self-insertion translates the input character through
506 @code{translation-table-for-input}. @xref{Translation of Characters}.
507
508 This command calls @code{auto-fill-function} whenever that is
509 non-@code{nil} and the character inserted is in the table
510 @code{auto-fill-chars} (@pxref{Auto Filling}).
511
512 @c Cross refs reworded to prevent overfull hbox. --rjc 15mar92
513 This command performs abbrev expansion if Abbrev mode is enabled and
514 the inserted character does not have word-constituent
515 syntax. (@xref{Abbrevs}, and @ref{Syntax Class Table}.) It is also
516 responsible for calling @code{blink-paren-function} when the inserted
517 character has close parenthesis syntax (@pxref{Blinking}).
518
519 @vindex post-self-insert-hook
520 The final thing this command does is to run the hook
521 @code{post-self-insert-hook}. You could use this to automatically
522 reindent text as it is typed, for example.
523
524 Do not try substituting your own definition of
525 @code{self-insert-command} for the standard one. The editor command
526 loop handles this function specially.
527 @end deffn
528
529 @deffn Command newline &optional number-of-newlines
530 This command inserts newlines into the current buffer before point.
531 If @var{number-of-newlines} is supplied, that many newline characters
532 are inserted.
533
534 @cindex newline and Auto Fill mode
535 This function calls @code{auto-fill-function} if the current column
536 number is greater than the value of @code{fill-column} and
537 @var{number-of-newlines} is @code{nil}. Typically what
538 @code{auto-fill-function} does is insert a newline; thus, the overall
539 result in this case is to insert two newlines at different places: one
540 at point, and another earlier in the line. @code{newline} does not
541 auto-fill if @var{number-of-newlines} is non-@code{nil}.
542
543 This command indents to the left margin if that is not zero.
544 @xref{Margins}.
545
546 The value returned is @code{nil}. In an interactive call, @var{count}
547 is the numeric prefix argument.
548 @end deffn
549
550 @defvar overwrite-mode
551 This variable controls whether overwrite mode is in effect. The value
552 should be @code{overwrite-mode-textual}, @code{overwrite-mode-binary},
553 or @code{nil}. @code{overwrite-mode-textual} specifies textual
554 overwrite mode (treats newlines and tabs specially), and
555 @code{overwrite-mode-binary} specifies binary overwrite mode (treats
556 newlines and tabs like any other characters).
557 @end defvar
558
559 @node Deletion
560 @section Deleting Text
561 @cindex text deletion
562
563 @cindex deleting text vs killing
564 Deletion means removing part of the text in a buffer, without saving
565 it in the kill ring (@pxref{The Kill Ring}). Deleted text can't be
566 yanked, but can be reinserted using the undo mechanism (@pxref{Undo}).
567 Some deletion functions do save text in the kill ring in some special
568 cases.
569
570 All of the deletion functions operate on the current buffer.
571
572 @deffn Command erase-buffer
573 This function deletes the entire text of the current buffer
574 (@emph{not} just the accessible portion), leaving it
575 empty. If the buffer is read-only, it signals a @code{buffer-read-only}
576 error; if some of the text in it is read-only, it signals a
577 @code{text-read-only} error. Otherwise, it deletes the text without
578 asking for any confirmation. It returns @code{nil}.
579
580 Normally, deleting a large amount of text from a buffer inhibits further
581 auto-saving of that buffer ``because it has shrunk''. However,
582 @code{erase-buffer} does not do this, the idea being that the future
583 text is not really related to the former text, and its size should not
584 be compared with that of the former text.
585 @end deffn
586
587 @deffn Command delete-region start end
588 This command deletes the text between positions @var{start} and
589 @var{end} in the current buffer, and returns @code{nil}. If point was
590 inside the deleted region, its value afterward is @var{start}.
591 Otherwise, point relocates with the surrounding text, as markers do.
592 @end deffn
593
594 @defun delete-and-extract-region start end
595 This function deletes the text between positions @var{start} and
596 @var{end} in the current buffer, and returns a string containing the
597 text just deleted.
598
599 If point was inside the deleted region, its value afterward is
600 @var{start}. Otherwise, point relocates with the surrounding text, as
601 markers do.
602 @end defun
603
604 @deffn Command delete-char count &optional killp
605 This command deletes @var{count} characters directly after point, or
606 before point if @var{count} is negative. If @var{killp} is
607 non-@code{nil}, then it saves the deleted characters in the kill ring.
608
609 In an interactive call, @var{count} is the numeric prefix argument, and
610 @var{killp} is the unprocessed prefix argument. Therefore, if a prefix
611 argument is supplied, the text is saved in the kill ring. If no prefix
612 argument is supplied, then one character is deleted, but not saved in
613 the kill ring.
614
615 The value returned is always @code{nil}.
616 @end deffn
617
618 @deffn Command delete-backward-char count &optional killp
619 @cindex deleting previous char
620 This command deletes @var{count} characters directly before point, or
621 after point if @var{count} is negative. If @var{killp} is
622 non-@code{nil}, then it saves the deleted characters in the kill ring.
623
624 In an interactive call, @var{count} is the numeric prefix argument, and
625 @var{killp} is the unprocessed prefix argument. Therefore, if a prefix
626 argument is supplied, the text is saved in the kill ring. If no prefix
627 argument is supplied, then one character is deleted, but not saved in
628 the kill ring.
629
630 The value returned is always @code{nil}.
631 @end deffn
632
633 @deffn Command backward-delete-char-untabify count &optional killp
634 @cindex tab deletion
635 This command deletes @var{count} characters backward, changing tabs
636 into spaces. When the next character to be deleted is a tab, it is
637 first replaced with the proper number of spaces to preserve alignment
638 and then one of those spaces is deleted instead of the tab. If
639 @var{killp} is non-@code{nil}, then the command saves the deleted
640 characters in the kill ring.
641
642 Conversion of tabs to spaces happens only if @var{count} is positive.
643 If it is negative, exactly @minus{}@var{count} characters after point
644 are deleted.
645
646 In an interactive call, @var{count} is the numeric prefix argument, and
647 @var{killp} is the unprocessed prefix argument. Therefore, if a prefix
648 argument is supplied, the text is saved in the kill ring. If no prefix
649 argument is supplied, then one character is deleted, but not saved in
650 the kill ring.
651
652 The value returned is always @code{nil}.
653 @end deffn
654
655 @defopt backward-delete-char-untabify-method
656 This option specifies how @code{backward-delete-char-untabify} should
657 deal with whitespace. Possible values include @code{untabify}, the
658 default, meaning convert a tab to many spaces and delete one;
659 @code{hungry}, meaning delete all tabs and spaces before point with
660 one command; @code{all} meaning delete all tabs, spaces and newlines
661 before point, and @code{nil}, meaning do nothing special for
662 whitespace characters.
663 @end defopt
664
665 @node User-Level Deletion
666 @section User-Level Deletion Commands
667
668 This section describes higher-level commands for deleting text,
669 commands intended primarily for the user but useful also in Lisp
670 programs.
671
672 @deffn Command delete-horizontal-space &optional backward-only
673 @cindex deleting whitespace
674 This function deletes all spaces and tabs around point. It returns
675 @code{nil}.
676
677 If @var{backward-only} is non-@code{nil}, the function deletes
678 spaces and tabs before point, but not after point.
679
680 In the following examples, we call @code{delete-horizontal-space} four
681 times, once on each line, with point between the second and third
682 characters on the line each time.
683
684 @example
685 @group
686 ---------- Buffer: foo ----------
687 I @point{}thought
688 I @point{} thought
689 We@point{} thought
690 Yo@point{}u thought
691 ---------- Buffer: foo ----------
692 @end group
693
694 @group
695 (delete-horizontal-space) ; @r{Four times.}
696 @result{} nil
697
698 ---------- Buffer: foo ----------
699 Ithought
700 Ithought
701 Wethought
702 You thought
703 ---------- Buffer: foo ----------
704 @end group
705 @end example
706 @end deffn
707
708 @deffn Command delete-indentation &optional join-following-p
709 This function joins the line point is on to the previous line, deleting
710 any whitespace at the join and in some cases replacing it with one
711 space. If @var{join-following-p} is non-@code{nil},
712 @code{delete-indentation} joins this line to the following line
713 instead. The function returns @code{nil}.
714
715 If there is a fill prefix, and the second of the lines being joined
716 starts with the prefix, then @code{delete-indentation} deletes the
717 fill prefix before joining the lines. @xref{Margins}.
718
719 In the example below, point is located on the line starting
720 @samp{events}, and it makes no difference if there are trailing spaces
721 in the preceding line.
722
723 @smallexample
724 @group
725 ---------- Buffer: foo ----------
726 When in the course of human
727 @point{} events, it becomes necessary
728 ---------- Buffer: foo ----------
729 @end group
730
731 (delete-indentation)
732 @result{} nil
733
734 @group
735 ---------- Buffer: foo ----------
736 When in the course of human@point{} events, it becomes necessary
737 ---------- Buffer: foo ----------
738 @end group
739 @end smallexample
740
741 After the lines are joined, the function @code{fixup-whitespace} is
742 responsible for deciding whether to leave a space at the junction.
743 @end deffn
744
745 @deffn Command fixup-whitespace
746 This function replaces all the horizontal whitespace surrounding point
747 with either one space or no space, according to the context. It
748 returns @code{nil}.
749
750 At the beginning or end of a line, the appropriate amount of space is
751 none. Before a character with close parenthesis syntax, or after a
752 character with open parenthesis or expression-prefix syntax, no space is
753 also appropriate. Otherwise, one space is appropriate. @xref{Syntax
754 Class Table}.
755
756 In the example below, @code{fixup-whitespace} is called the first time
757 with point before the word @samp{spaces} in the first line. For the
758 second invocation, point is directly after the @samp{(}.
759
760 @smallexample
761 @group
762 ---------- Buffer: foo ----------
763 This has too many @point{}spaces
764 This has too many spaces at the start of (@point{} this list)
765 ---------- Buffer: foo ----------
766 @end group
767
768 @group
769 (fixup-whitespace)
770 @result{} nil
771 (fixup-whitespace)
772 @result{} nil
773 @end group
774
775 @group
776 ---------- Buffer: foo ----------
777 This has too many spaces
778 This has too many spaces at the start of (this list)
779 ---------- Buffer: foo ----------
780 @end group
781 @end smallexample
782 @end deffn
783
784 @deffn Command just-one-space &optional n
785 @comment !!SourceFile simple.el
786 This command replaces any spaces and tabs around point with a single
787 space, or @var{n} spaces if @var{n} is specified. It returns
788 @code{nil}.
789 @end deffn
790
791 @c There is also cycle-spacing, but I cannot see it being useful in
792 @c Lisp programs, so it is not mentioned here.
793
794 @deffn Command delete-blank-lines
795 This function deletes blank lines surrounding point. If point is on a
796 blank line with one or more blank lines before or after it, then all but
797 one of them are deleted. If point is on an isolated blank line, then it
798 is deleted. If point is on a nonblank line, the command deletes all
799 blank lines immediately following it.
800
801 A blank line is defined as a line containing only tabs and spaces.
802 @c and the Newline character?
803
804 @code{delete-blank-lines} returns @code{nil}.
805 @end deffn
806
807 @deffn Command delete-trailing-whitespace start end
808 Delete trailing whitespace in the region defined by @var{start} and
809 @var{end}.
810
811 This command deletes whitespace characters after the last
812 non-whitespace character in each line in the region.
813
814 If this command acts on the entire buffer (i.e., if called
815 interactively with the mark inactive, or called from Lisp with
816 @var{end} @code{nil}), it also deletes all trailing lines at the end of the
817 buffer if the variable @code{delete-trailing-lines} is non-@code{nil}.
818 @end deffn
819
820 @node The Kill Ring
821 @section The Kill Ring
822 @cindex kill ring
823
824 @dfn{Kill functions} delete text like the deletion functions, but save
825 it so that the user can reinsert it by @dfn{yanking}. Most of these
826 functions have @samp{kill-} in their name. By contrast, the functions
827 whose names start with @samp{delete-} normally do not save text for
828 yanking (though they can still be undone); these are ``deletion''
829 functions.
830
831 Most of the kill commands are primarily for interactive use, and are
832 not described here. What we do describe are the functions provided for
833 use in writing such commands. You can use these functions to write
834 commands for killing text. When you need to delete text for internal
835 purposes within a Lisp function, you should normally use deletion
836 functions, so as not to disturb the kill ring contents.
837 @xref{Deletion}.
838
839 Killed text is saved for later yanking in the @dfn{kill ring}. This
840 is a list that holds a number of recent kills, not just the last text
841 kill. We call this a ``ring'' because yanking treats it as having
842 elements in a cyclic order. The list is kept in the variable
843 @code{kill-ring}, and can be operated on with the usual functions for
844 lists; there are also specialized functions, described in this section,
845 that treat it as a ring.
846
847 Some people think this use of the word ``kill'' is unfortunate, since
848 it refers to operations that specifically @emph{do not} destroy the
849 entities ``killed''. This is in sharp contrast to ordinary life, in
850 which death is permanent and ``killed'' entities do not come back to
851 life. Therefore, other metaphors have been proposed. For example, the
852 term ``cut ring'' makes sense to people who, in pre-computer days, used
853 scissors and paste to cut up and rearrange manuscripts. However, it
854 would be difficult to change the terminology now.
855
856 @menu
857 * Kill Ring Concepts:: What text looks like in the kill ring.
858 * Kill Functions:: Functions that kill text.
859 * Yanking:: How yanking is done.
860 * Yank Commands:: Commands that access the kill ring.
861 * Low-Level Kill Ring:: Functions and variables for kill ring access.
862 * Internals of Kill Ring:: Variables that hold kill ring data.
863 @end menu
864
865 @node Kill Ring Concepts
866 @subsection Kill Ring Concepts
867
868 The kill ring records killed text as strings in a list, most recent
869 first. A short kill ring, for example, might look like this:
870
871 @example
872 ("some text" "a different piece of text" "even older text")
873 @end example
874
875 @noindent
876 When the list reaches @code{kill-ring-max} entries in length, adding a
877 new entry automatically deletes the last entry.
878
879 When kill commands are interwoven with other commands, each kill
880 command makes a new entry in the kill ring. Multiple kill commands in
881 succession build up a single kill ring entry, which would be yanked as a
882 unit; the second and subsequent consecutive kill commands add text to
883 the entry made by the first one.
884
885 For yanking, one entry in the kill ring is designated the ``front'' of
886 the ring. Some yank commands ``rotate'' the ring by designating a
887 different element as the ``front''. But this virtual rotation doesn't
888 change the list itself---the most recent entry always comes first in the
889 list.
890
891 @node Kill Functions
892 @subsection Functions for Killing
893
894 @code{kill-region} is the usual subroutine for killing text. Any
895 command that calls this function is a ``kill command'' (and should
896 probably have @samp{kill} in its name). @code{kill-region} puts the
897 newly killed text in a new element at the beginning of the kill ring or
898 adds it to the most recent element. It determines automatically (using
899 @code{last-command}) whether the previous command was a kill command,
900 and if so appends the killed text to the most recent entry.
901
902 @deffn Command kill-region start end
903 This function kills the text in the region defined by @var{start} and
904 @var{end}. The text is deleted but saved in the kill ring, along with
905 its text properties. The value is always @code{nil}.
906
907 In an interactive call, @var{start} and @var{end} are point and
908 the mark.
909
910 If the buffer or text is read-only, @code{kill-region} modifies the kill
911 ring just the same, then signals an error without modifying the buffer.
912 This is convenient because it lets the user use a series of kill
913 commands to copy text from a read-only buffer into the kill ring.
914 @end deffn
915
916 @defopt kill-read-only-ok
917 If this option is non-@code{nil}, @code{kill-region} does not signal an
918 error if the buffer or text is read-only. Instead, it simply returns,
919 updating the kill ring but not changing the buffer.
920 @end defopt
921
922 @deffn Command copy-region-as-kill start end
923 This command saves the region defined by @var{start} and @var{end} on
924 the kill ring (including text properties), but does not delete the text
925 from the buffer. It returns @code{nil}.
926
927 The command does not set @code{this-command} to @code{kill-region}, so a
928 subsequent kill command does not append to the same kill ring entry.
929
930 @c FIXME Why is it better? Why isn't copy-region-as-kill obsolete then?
931 @c Why is it used in many places in Emacs?
932 In Lisp programs, it is better to use @code{kill-new} or
933 @code{kill-append} instead of this command. @xref{Low-Level Kill Ring}.
934 @end deffn
935
936 @node Yanking
937 @subsection Yanking
938
939 Yanking means inserting text from the kill ring, but it does not
940 insert the text blindly. The @code{yank} command, and related
941 commands, use @code{insert-for-yank} to perform special processing on
942 the text before it is inserted.
943
944 @defun insert-for-yank string
945 This function works like @code{insert}, except that it processes the
946 text in @var{string} according to the @code{yank-handler} text
947 property, as well as the variables @code{yank-handled-properties} and
948 @code{yank-excluded-properties} (see below), before inserting the
949 result into the current buffer.
950 @end defun
951
952 @defun insert-buffer-substring-as-yank buf &optional start end
953 This function resembles @code{insert-buffer-substring}, except that it
954 processes the text according to @code{yank-handled-properties} and
955 @code{yank-excluded-properties}. (It does not handle the
956 @code{yank-handler} property, which does not normally occur in buffer
957 text anyway.)
958 @end defun
959
960 @c FIXME: Add an index for yank-handler.
961 If you put a @code{yank-handler} text property on all or part of a
962 string, that alters how @code{insert-for-yank} inserts the string. If
963 different parts of the string have different @code{yank-handler}
964 values (comparison being done with @code{eq}), each substring is
965 handled separately. The property value must be a list of one to four
966 elements, with the following format (where elements after the first
967 may be omitted):
968
969 @example
970 (@var{function} @var{param} @var{noexclude} @var{undo})
971 @end example
972
973 Here is what the elements do:
974
975 @table @var
976 @item function
977 When @var{function} is non-@code{nil}, it is called instead of
978 @code{insert} to insert the string, with one argument---the string to
979 insert.
980
981 @item param
982 If @var{param} is present and non-@code{nil}, it replaces @var{string}
983 (or the substring of @var{string} being processed) as the object
984 passed to @var{function} (or @code{insert}). For example, if
985 @var{function} is @code{yank-rectangle}, @var{param} should be a list
986 of strings to insert as a rectangle.
987
988 @item noexclude
989 If @var{noexclude} is present and non-@code{nil}, that disables the
990 normal action of @code{yank-handled-properties} and
991 @code{yank-excluded-properties} on the inserted string.
992
993 @item undo
994 If @var{undo} is present and non-@code{nil}, it is a function that will be
995 called by @code{yank-pop} to undo the insertion of the current object.
996 It is called with two arguments, the start and end of the current
997 region. @var{function} can set @code{yank-undo-function} to override
998 the @var{undo} value.
999 @end table
1000
1001 @cindex yanking and text properties
1002 @defopt yank-handled-properties
1003 This variable specifies special text property handling conditions for
1004 yanked text. It takes effect after the text has been inserted (either
1005 normally, or via the @code{yank-handler} property), and prior to
1006 @code{yank-excluded-properties} taking effect.
1007
1008 The value should be an alist of elements @code{(@var{prop}
1009 . @var{fun})}. Each alist element is handled in order. The inserted
1010 text is scanned for stretches of text having text properties @code{eq}
1011 to @var{prop}; for each such stretch, @var{fun} is called with three
1012 arguments: the value of the property, and the start and end positions
1013 of the text.
1014 @end defopt
1015
1016 @defopt yank-excluded-properties
1017 The value of this variable is the list of properties to remove from
1018 inserted text. Its default value contains properties that might lead
1019 to annoying results, such as causing the text to respond to the mouse
1020 or specifying key bindings. It takes effect after
1021 @code{yank-handled-properties}.
1022 @end defopt
1023
1024
1025 @node Yank Commands
1026 @subsection Functions for Yanking
1027
1028 This section describes higher-level commands for yanking, which are
1029 intended primarily for the user but useful also in Lisp programs.
1030 Both @code{yank} and @code{yank-pop} honor the
1031 @code{yank-excluded-properties} variable and @code{yank-handler} text
1032 property (@pxref{Yanking}).
1033
1034 @deffn Command yank &optional arg
1035 @cindex inserting killed text
1036 This command inserts before point the text at the front of the kill
1037 ring. It sets the mark at the beginning of that text, using
1038 @code{push-mark} (@pxref{The Mark}), and puts point at the end.
1039
1040 If @var{arg} is a non-@code{nil} list (which occurs interactively when
1041 the user types @kbd{C-u} with no digits), then @code{yank} inserts the
1042 text as described above, but puts point before the yanked text and
1043 sets the mark after it.
1044
1045 If @var{arg} is a number, then @code{yank} inserts the @var{arg}th
1046 most recently killed text---the @var{arg}th element of the kill ring
1047 list, counted cyclically from the front, which is considered the
1048 first element for this purpose.
1049
1050 @code{yank} does not alter the contents of the kill ring, unless it
1051 used text provided by another program, in which case it pushes that text
1052 onto the kill ring. However if @var{arg} is an integer different from
1053 one, it rotates the kill ring to place the yanked string at the front.
1054
1055 @code{yank} returns @code{nil}.
1056 @end deffn
1057
1058 @deffn Command yank-pop &optional arg
1059 This command replaces the just-yanked entry from the kill ring with a
1060 different entry from the kill ring.
1061
1062 This is allowed only immediately after a @code{yank} or another
1063 @code{yank-pop}. At such a time, the region contains text that was just
1064 inserted by yanking. @code{yank-pop} deletes that text and inserts in
1065 its place a different piece of killed text. It does not add the deleted
1066 text to the kill ring, since it is already in the kill ring somewhere.
1067 It does however rotate the kill ring to place the newly yanked string at
1068 the front.
1069
1070 If @var{arg} is @code{nil}, then the replacement text is the previous
1071 element of the kill ring. If @var{arg} is numeric, the replacement is
1072 the @var{arg}th previous kill. If @var{arg} is negative, a more recent
1073 kill is the replacement.
1074
1075 The sequence of kills in the kill ring wraps around, so that after the
1076 oldest one comes the newest one, and before the newest one goes the
1077 oldest.
1078
1079 The return value is always @code{nil}.
1080 @end deffn
1081
1082 @defvar yank-undo-function
1083 If this variable is non-@code{nil}, the function @code{yank-pop} uses
1084 its value instead of @code{delete-region} to delete the text
1085 inserted by the previous @code{yank} or
1086 @code{yank-pop} command. The value must be a function of two
1087 arguments, the start and end of the current region.
1088
1089 The function @code{insert-for-yank} automatically sets this variable
1090 according to the @var{undo} element of the @code{yank-handler}
1091 text property, if there is one.
1092 @end defvar
1093
1094 @node Low-Level Kill Ring
1095 @subsection Low-Level Kill Ring
1096
1097 These functions and variables provide access to the kill ring at a
1098 lower level, but are still convenient for use in Lisp programs,
1099 because they take care of interaction with window system selections
1100 (@pxref{Window System Selections}).
1101
1102 @defun current-kill n &optional do-not-move
1103 The function @code{current-kill} rotates the yanking pointer, which
1104 designates the ``front'' of the kill ring, by @var{n} places (from newer
1105 kills to older ones), and returns the text at that place in the ring.
1106
1107 If the optional second argument @var{do-not-move} is non-@code{nil},
1108 then @code{current-kill} doesn't alter the yanking pointer; it just
1109 returns the @var{n}th kill, counting from the current yanking pointer.
1110
1111 If @var{n} is zero, indicating a request for the latest kill,
1112 @code{current-kill} calls the value of
1113 @code{interprogram-paste-function} (documented below) before
1114 consulting the kill ring. If that value is a function and calling it
1115 returns a string or a list of several string, @code{current-kill}
1116 pushes the strings onto the kill ring and returns the first string.
1117 It also sets the yanking pointer to point to the kill-ring entry of
1118 the first string returned by @code{interprogram-paste-function},
1119 regardless of the value of @var{do-not-move}. Otherwise,
1120 @code{current-kill} does not treat a zero value for @var{n} specially:
1121 it returns the entry pointed at by the yanking pointer and does not
1122 move the yanking pointer.
1123 @end defun
1124
1125 @defun kill-new string &optional replace
1126 This function pushes the text @var{string} onto the kill ring and
1127 makes the yanking pointer point to it. It discards the oldest entry
1128 if appropriate. It also invokes the value of
1129 @code{interprogram-cut-function} (see below).
1130
1131 If @var{replace} is non-@code{nil}, then @code{kill-new} replaces the
1132 first element of the kill ring with @var{string}, rather than pushing
1133 @var{string} onto the kill ring.
1134 @end defun
1135
1136 @defun kill-append string before-p
1137 This function appends the text @var{string} to the first entry in the
1138 kill ring and makes the yanking pointer point to the combined entry.
1139 Normally @var{string} goes at the end of the entry, but if
1140 @var{before-p} is non-@code{nil}, it goes at the beginning. This
1141 function also invokes the value of @code{interprogram-cut-function}
1142 (see below).
1143 @end defun
1144
1145 @defvar interprogram-paste-function
1146 This variable provides a way of transferring killed text from other
1147 programs, when you are using a window system. Its value should be
1148 @code{nil} or a function of no arguments.
1149
1150 If the value is a function, @code{current-kill} calls it to get the
1151 ``most recent kill''. If the function returns a non-@code{nil} value,
1152 then that value is used as the ``most recent kill''. If it returns
1153 @code{nil}, then the front of the kill ring is used.
1154
1155 To facilitate support for window systems that support multiple
1156 selections, this function may also return a list of strings. In that
1157 case, the first string is used as the ``most recent kill'', and all
1158 the other strings are pushed onto the kill ring, for easy access by
1159 @code{yank-pop}.
1160
1161 The normal use of this function is to get the window system's
1162 clipboard as the most recent kill, even if the selection belongs to
1163 another application. @xref{Window System Selections}. However, if
1164 the clipboard contents come from the current Emacs session, this
1165 function should return @code{nil}.
1166 @end defvar
1167
1168 @defvar interprogram-cut-function
1169 This variable provides a way of communicating killed text to other
1170 programs, when you are using a window system. Its value should be
1171 @code{nil} or a function of one required argument.
1172
1173 If the value is a function, @code{kill-new} and @code{kill-append} call
1174 it with the new first element of the kill ring as the argument.
1175
1176 The normal use of this function is to put newly killed text in the
1177 window system's clipboard. @xref{Window System Selections}.
1178 @end defvar
1179
1180 @node Internals of Kill Ring
1181 @subsection Internals of the Kill Ring
1182
1183 The variable @code{kill-ring} holds the kill ring contents, in the
1184 form of a list of strings. The most recent kill is always at the front
1185 of the list.
1186
1187 The @code{kill-ring-yank-pointer} variable points to a link in the
1188 kill ring list, whose @sc{car} is the text to yank next. We say it
1189 identifies the ``front'' of the ring. Moving
1190 @code{kill-ring-yank-pointer} to a different link is called
1191 @dfn{rotating the kill ring}. We call the kill ring a ``ring'' because
1192 the functions that move the yank pointer wrap around from the end of the
1193 list to the beginning, or vice-versa. Rotation of the kill ring is
1194 virtual; it does not change the value of @code{kill-ring}.
1195
1196 Both @code{kill-ring} and @code{kill-ring-yank-pointer} are Lisp
1197 variables whose values are normally lists. The word ``pointer'' in the
1198 name of the @code{kill-ring-yank-pointer} indicates that the variable's
1199 purpose is to identify one element of the list for use by the next yank
1200 command.
1201
1202 The value of @code{kill-ring-yank-pointer} is always @code{eq} to one
1203 of the links in the kill ring list. The element it identifies is the
1204 @sc{car} of that link. Kill commands, which change the kill ring, also
1205 set this variable to the value of @code{kill-ring}. The effect is to
1206 rotate the ring so that the newly killed text is at the front.
1207
1208 Here is a diagram that shows the variable @code{kill-ring-yank-pointer}
1209 pointing to the second entry in the kill ring @code{("some text" "a
1210 different piece of text" "yet older text")}.
1211
1212 @example
1213 @group
1214 kill-ring ---- kill-ring-yank-pointer
1215 | |
1216 | v
1217 | --- --- --- --- --- ---
1218 --> | | |------> | | |--> | | |--> nil
1219 --- --- --- --- --- ---
1220 | | |
1221 | | |
1222 | | -->"yet older text"
1223 | |
1224 | --> "a different piece of text"
1225 |
1226 --> "some text"
1227 @end group
1228 @end example
1229
1230 @noindent
1231 This state of affairs might occur after @kbd{C-y} (@code{yank})
1232 immediately followed by @kbd{M-y} (@code{yank-pop}).
1233
1234 @defvar kill-ring
1235 This variable holds the list of killed text sequences, most recently
1236 killed first.
1237 @end defvar
1238
1239 @defvar kill-ring-yank-pointer
1240 This variable's value indicates which element of the kill ring is at the
1241 ``front'' of the ring for yanking. More precisely, the value is a tail
1242 of the value of @code{kill-ring}, and its @sc{car} is the kill string
1243 that @kbd{C-y} should yank.
1244 @end defvar
1245
1246 @defopt kill-ring-max
1247 The value of this variable is the maximum length to which the kill
1248 ring can grow, before elements are thrown away at the end. The default
1249 value for @code{kill-ring-max} is 60.
1250 @end defopt
1251
1252 @node Undo
1253 @section Undo
1254 @cindex redo
1255
1256 Most buffers have an @dfn{undo list}, which records all changes made
1257 to the buffer's text so that they can be undone. (The buffers that
1258 don't have one are usually special-purpose buffers for which Emacs
1259 assumes that undoing is not useful. In particular, any buffer whose
1260 name begins with a space has its undo recording off by default;
1261 see @ref{Buffer Names}.) All the primitives that modify the
1262 text in the buffer automatically add elements to the front of the undo
1263 list, which is in the variable @code{buffer-undo-list}.
1264
1265 @defvar buffer-undo-list
1266 This buffer-local variable's value is the undo list of the current
1267 buffer. A value of @code{t} disables the recording of undo information.
1268 @end defvar
1269
1270 Here are the kinds of elements an undo list can have:
1271
1272 @table @code
1273 @item @var{position}
1274 This kind of element records a previous value of point; undoing this
1275 element moves point to @var{position}. Ordinary cursor motion does not
1276 make any sort of undo record, but deletion operations use these entries
1277 to record where point was before the command.
1278
1279 @item (@var{beg} . @var{end})
1280 This kind of element indicates how to delete text that was inserted.
1281 Upon insertion, the text occupied the range @var{beg}--@var{end} in the
1282 buffer.
1283
1284 @item (@var{text} . @var{position})
1285 This kind of element indicates how to reinsert text that was deleted.
1286 The deleted text itself is the string @var{text}. The place to
1287 reinsert it is @code{(abs @var{position})}. If @var{position} is
1288 positive, point was at the beginning of the deleted text, otherwise it
1289 was at the end. Zero or more (@var{marker} . @var{adjustment})
1290 elements follow immediately after this element.
1291
1292 @item (t . @var{time-flag})
1293 This kind of element indicates that an unmodified buffer became
1294 modified. A @var{time-flag} of the form
1295 @code{(@var{sec-high} @var{sec-low} @var{microsec}
1296 @var{picosec})} represents the visited file's modification time as of
1297 when it was previously visited or saved, using the same format as
1298 @code{current-time}; see @ref{Time of Day}.
1299 A @var{time-flag} of 0 means the buffer does not correspond to any file;
1300 @minus{}1 means the visited file previously did not exist.
1301 @code{primitive-undo} uses these
1302 values to determine whether to mark the buffer as unmodified once again;
1303 it does so only if the file's status matches that of @var{time-flag}.
1304
1305 @item (nil @var{property} @var{value} @var{beg} . @var{end})
1306 This kind of element records a change in a text property.
1307 Here's how you might undo the change:
1308
1309 @example
1310 (put-text-property @var{beg} @var{end} @var{property} @var{value})
1311 @end example
1312
1313 @item (@var{marker} . @var{adjustment})
1314 This kind of element records the fact that the marker @var{marker} was
1315 relocated due to deletion of surrounding text, and that it moved
1316 @var{adjustment} character positions. If the marker's location is
1317 consistent with the (@var{text} . @var{position}) element preceding it
1318 in the undo list, then undoing this element moves @var{marker}
1319 @minus{} @var{adjustment} characters.
1320
1321 @item (apply @var{funname} . @var{args})
1322 This is an extensible undo item, which is undone by calling
1323 @var{funname} with arguments @var{args}.
1324
1325 @item (apply @var{delta} @var{beg} @var{end} @var{funname} . @var{args})
1326 This is an extensible undo item, which records a change limited to the
1327 range @var{beg} to @var{end}, which increased the size of the buffer
1328 by @var{delta} characters. It is undone by calling @var{funname} with
1329 arguments @var{args}.
1330
1331 This kind of element enables undo limited to a region to determine
1332 whether the element pertains to that region.
1333
1334 @item nil
1335 This element is a boundary. The elements between two boundaries are
1336 called a @dfn{change group}; normally, each change group corresponds to
1337 one keyboard command, and undo commands normally undo an entire group as
1338 a unit.
1339 @end table
1340
1341 @defun undo-boundary
1342 This function places a boundary element in the undo list. The undo
1343 command stops at such a boundary, and successive undo commands undo
1344 to earlier and earlier boundaries. This function returns @code{nil}.
1345
1346 The editor command loop automatically calls @code{undo-boundary} just
1347 before executing each key sequence, so that each undo normally undoes
1348 the effects of one command. As an exception, the command
1349 @code{self-insert-command}, which produces self-inserting input
1350 characters (@pxref{Commands for Insertion}), may remove the boundary
1351 inserted by the command loop: a boundary is accepted for the first
1352 such character, the next 19 consecutive self-inserting input
1353 characters do not have boundaries, and then the 20th does; and so on
1354 as long as the self-inserting characters continue. Hence, sequences
1355 of consecutive character insertions can be undone as a group.
1356
1357 All buffer modifications add a boundary whenever the previous undoable
1358 change was made in some other buffer. This is to ensure that
1359 each command makes a boundary in each buffer where it makes changes.
1360
1361 Calling this function explicitly is useful for splitting the effects of
1362 a command into more than one unit. For example, @code{query-replace}
1363 calls @code{undo-boundary} after each replacement, so that the user can
1364 undo individual replacements one by one.
1365 @end defun
1366
1367 @defvar undo-in-progress
1368 This variable is normally @code{nil}, but the undo commands bind it to
1369 @code{t}. This is so that various kinds of change hooks can tell when
1370 they're being called for the sake of undoing.
1371 @end defvar
1372
1373 @defun primitive-undo count list
1374 This is the basic function for undoing elements of an undo list.
1375 It undoes the first @var{count} elements of @var{list}, returning
1376 the rest of @var{list}.
1377
1378 @code{primitive-undo} adds elements to the buffer's undo list when it
1379 changes the buffer. Undo commands avoid confusion by saving the undo
1380 list value at the beginning of a sequence of undo operations. Then the
1381 undo operations use and update the saved value. The new elements added
1382 by undoing are not part of this saved value, so they don't interfere with
1383 continuing to undo.
1384
1385 This function does not bind @code{undo-in-progress}.
1386 @end defun
1387
1388 @node Maintaining Undo
1389 @section Maintaining Undo Lists
1390
1391 This section describes how to enable and disable undo information for
1392 a given buffer. It also explains how the undo list is truncated
1393 automatically so it doesn't get too big.
1394
1395 Recording of undo information in a newly created buffer is normally
1396 enabled to start with; but if the buffer name starts with a space, the
1397 undo recording is initially disabled. You can explicitly enable or
1398 disable undo recording with the following two functions, or by setting
1399 @code{buffer-undo-list} yourself.
1400
1401 @deffn Command buffer-enable-undo &optional buffer-or-name
1402 This command enables recording undo information for buffer
1403 @var{buffer-or-name}, so that subsequent changes can be undone. If no
1404 argument is supplied, then the current buffer is used. This function
1405 does nothing if undo recording is already enabled in the buffer. It
1406 returns @code{nil}.
1407
1408 In an interactive call, @var{buffer-or-name} is the current buffer.
1409 You cannot specify any other buffer.
1410 @end deffn
1411
1412 @deffn Command buffer-disable-undo &optional buffer-or-name
1413 @cindex disabling undo
1414 This function discards the undo list of @var{buffer-or-name}, and disables
1415 further recording of undo information. As a result, it is no longer
1416 possible to undo either previous changes or any subsequent changes. If
1417 the undo list of @var{buffer-or-name} is already disabled, this function
1418 has no effect.
1419
1420 In an interactive call, BUFFER-OR-NAME is the current buffer. You
1421 cannot specify any other buffer. This function returns @code{nil}.
1422 @end deffn
1423
1424 As editing continues, undo lists get longer and longer. To prevent
1425 them from using up all available memory space, garbage collection trims
1426 them back to size limits you can set. (For this purpose, the ``size''
1427 of an undo list measures the cons cells that make up the list, plus the
1428 strings of deleted text.) Three variables control the range of acceptable
1429 sizes: @code{undo-limit}, @code{undo-strong-limit} and
1430 @code{undo-outer-limit}. In these variables, size is counted as the
1431 number of bytes occupied, which includes both saved text and other
1432 data.
1433
1434 @defopt undo-limit
1435 This is the soft limit for the acceptable size of an undo list. The
1436 change group at which this size is exceeded is the last one kept.
1437 @end defopt
1438
1439 @defopt undo-strong-limit
1440 This is the upper limit for the acceptable size of an undo list. The
1441 change group at which this size is exceeded is discarded itself (along
1442 with all older change groups). There is one exception: the very latest
1443 change group is only discarded if it exceeds @code{undo-outer-limit}.
1444 @end defopt
1445
1446 @defopt undo-outer-limit
1447 If at garbage collection time the undo info for the current command
1448 exceeds this limit, Emacs discards the info and displays a warning.
1449 This is a last ditch limit to prevent memory overflow.
1450 @end defopt
1451
1452 @defopt undo-ask-before-discard
1453 If this variable is non-@code{nil}, when the undo info exceeds
1454 @code{undo-outer-limit}, Emacs asks in the echo area whether to
1455 discard the info. The default value is @code{nil}, which means to
1456 discard it automatically.
1457
1458 This option is mainly intended for debugging. Garbage collection is
1459 inhibited while the question is asked, which means that Emacs might
1460 leak memory if the user waits too long before answering the question.
1461 @end defopt
1462
1463 @node Filling
1464 @section Filling
1465 @cindex filling text
1466
1467 @dfn{Filling} means adjusting the lengths of lines (by moving the line
1468 breaks) so that they are nearly (but no greater than) a specified
1469 maximum width. Additionally, lines can be @dfn{justified}, which means
1470 inserting spaces to make the left and/or right margins line up
1471 precisely. The width is controlled by the variable @code{fill-column}.
1472 For ease of reading, lines should be no longer than 70 or so columns.
1473
1474 You can use Auto Fill mode (@pxref{Auto Filling}) to fill text
1475 automatically as you insert it, but changes to existing text may leave
1476 it improperly filled. Then you must fill the text explicitly.
1477
1478 Most of the commands in this section return values that are not
1479 meaningful. All the functions that do filling take note of the current
1480 left margin, current right margin, and current justification style
1481 (@pxref{Margins}). If the current justification style is
1482 @code{none}, the filling functions don't actually do anything.
1483
1484 Several of the filling functions have an argument @var{justify}.
1485 If it is non-@code{nil}, that requests some kind of justification. It
1486 can be @code{left}, @code{right}, @code{full}, or @code{center}, to
1487 request a specific style of justification. If it is @code{t}, that
1488 means to use the current justification style for this part of the text
1489 (see @code{current-justification}, below). Any other value is treated
1490 as @code{full}.
1491
1492 When you call the filling functions interactively, using a prefix
1493 argument implies the value @code{full} for @var{justify}.
1494
1495 @deffn Command fill-paragraph &optional justify region
1496 This command fills the paragraph at or after point. If
1497 @var{justify} is non-@code{nil}, each line is justified as well.
1498 It uses the ordinary paragraph motion commands to find paragraph
1499 boundaries. @xref{Paragraphs,,, emacs, The GNU Emacs Manual}.
1500
1501 When @var{region} is non-@code{nil}, then if Transient Mark mode is
1502 enabled and the mark is active, this command calls @code{fill-region}
1503 to fill all the paragraphs in the region, instead of filling only the
1504 current paragraph. When this command is called interactively,
1505 @var{region} is @code{t}.
1506 @end deffn
1507
1508 @deffn Command fill-region start end &optional justify nosqueeze to-eop
1509 This command fills each of the paragraphs in the region from @var{start}
1510 to @var{end}. It justifies as well if @var{justify} is
1511 non-@code{nil}.
1512
1513 If @var{nosqueeze} is non-@code{nil}, that means to leave whitespace
1514 other than line breaks untouched. If @var{to-eop} is non-@code{nil},
1515 that means to keep filling to the end of the paragraph---or the next hard
1516 newline, if @code{use-hard-newlines} is enabled (see below).
1517
1518 The variable @code{paragraph-separate} controls how to distinguish
1519 paragraphs. @xref{Standard Regexps}.
1520 @end deffn
1521
1522 @deffn Command fill-individual-paragraphs start end &optional justify citation-regexp
1523 This command fills each paragraph in the region according to its
1524 individual fill prefix. Thus, if the lines of a paragraph were indented
1525 with spaces, the filled paragraph will remain indented in the same
1526 fashion.
1527
1528 The first two arguments, @var{start} and @var{end}, are the beginning
1529 and end of the region to be filled. The third and fourth arguments,
1530 @var{justify} and @var{citation-regexp}, are optional. If
1531 @var{justify} is non-@code{nil}, the paragraphs are justified as
1532 well as filled. If @var{citation-regexp} is non-@code{nil}, it means the
1533 function is operating on a mail message and therefore should not fill
1534 the header lines. If @var{citation-regexp} is a string, it is used as
1535 a regular expression; if it matches the beginning of a line, that line
1536 is treated as a citation marker.
1537
1538 @c FIXME: "That mode" is confusing. It isn't a major/minor mode.
1539 Ordinarily, @code{fill-individual-paragraphs} regards each change in
1540 indentation as starting a new paragraph. If
1541 @code{fill-individual-varying-indent} is non-@code{nil}, then only
1542 separator lines separate paragraphs. That mode can handle indented
1543 paragraphs with additional indentation on the first line.
1544 @end deffn
1545
1546 @defopt fill-individual-varying-indent
1547 This variable alters the action of @code{fill-individual-paragraphs} as
1548 described above.
1549 @end defopt
1550
1551 @deffn Command fill-region-as-paragraph start end &optional justify nosqueeze squeeze-after
1552 This command considers a region of text as a single paragraph and fills
1553 it. If the region was made up of many paragraphs, the blank lines
1554 between paragraphs are removed. This function justifies as well as
1555 filling when @var{justify} is non-@code{nil}.
1556
1557 If @var{nosqueeze} is non-@code{nil}, that means to leave whitespace
1558 other than line breaks untouched. If @var{squeeze-after} is
1559 non-@code{nil}, it specifies a position in the region, and means don't
1560 canonicalize spaces before that position.
1561
1562 In Adaptive Fill mode, this command calls @code{fill-context-prefix} to
1563 choose a fill prefix by default. @xref{Adaptive Fill}.
1564 @end deffn
1565
1566 @deffn Command justify-current-line &optional how eop nosqueeze
1567 This command inserts spaces between the words of the current line so
1568 that the line ends exactly at @code{fill-column}. It returns
1569 @code{nil}.
1570
1571 The argument @var{how}, if non-@code{nil} specifies explicitly the style
1572 of justification. It can be @code{left}, @code{right}, @code{full},
1573 @code{center}, or @code{none}. If it is @code{t}, that means to do
1574 follow specified justification style (see @code{current-justification},
1575 below). @code{nil} means to do full justification.
1576
1577 If @var{eop} is non-@code{nil}, that means do only left-justification
1578 if @code{current-justification} specifies full justification. This is
1579 used for the last line of a paragraph; even if the paragraph as a
1580 whole is fully justified, the last line should not be.
1581
1582 If @var{nosqueeze} is non-@code{nil}, that means do not change interior
1583 whitespace.
1584 @end deffn
1585
1586 @defopt default-justification
1587 This variable's value specifies the style of justification to use for
1588 text that doesn't specify a style with a text property. The possible
1589 values are @code{left}, @code{right}, @code{full}, @code{center}, or
1590 @code{none}. The default value is @code{left}.
1591 @end defopt
1592
1593 @defun current-justification
1594 This function returns the proper justification style to use for filling
1595 the text around point.
1596
1597 This returns the value of the @code{justification} text property at
1598 point, or the variable @var{default-justification} if there is no such
1599 text property. However, it returns @code{nil} rather than @code{none}
1600 to mean ``don't justify''.
1601 @end defun
1602
1603 @defopt sentence-end-double-space
1604 @anchor{Definition of sentence-end-double-space}
1605 If this variable is non-@code{nil}, a period followed by just one space
1606 does not count as the end of a sentence, and the filling functions
1607 avoid breaking the line at such a place.
1608 @end defopt
1609
1610 @defopt sentence-end-without-period
1611 If this variable is non-@code{nil}, a sentence can end without a
1612 period. This is used for languages like Thai, where sentences end
1613 with a double space but without a period.
1614 @end defopt
1615
1616 @defopt sentence-end-without-space
1617 If this variable is non-@code{nil}, it should be a string of
1618 characters that can end a sentence without following spaces.
1619 @end defopt
1620
1621 @defvar fill-paragraph-function
1622 This variable provides a way to override the filling of paragraphs.
1623 If its value is non-@code{nil}, @code{fill-paragraph} calls this
1624 function to do the work. If the function returns a non-@code{nil}
1625 value, @code{fill-paragraph} assumes the job is done, and immediately
1626 returns that value.
1627
1628 The usual use of this feature is to fill comments in programming
1629 language modes. If the function needs to fill a paragraph in the usual
1630 way, it can do so as follows:
1631
1632 @example
1633 (let ((fill-paragraph-function nil))
1634 (fill-paragraph arg))
1635 @end example
1636 @end defvar
1637
1638 @defvar fill-forward-paragraph-function
1639 This variable provides a way to override how the filling functions,
1640 such as @code{fill-region} and @code{fill-paragraph}, move forward to
1641 the next paragraph. Its value should be a function, which is called
1642 with a single argument @var{n}, the number of paragraphs to move, and
1643 should return the difference between @var{n} and the number of
1644 paragraphs actually moved. The default value of this variable is
1645 @code{forward-paragraph}. @xref{Paragraphs,,, emacs, The GNU Emacs
1646 Manual}.
1647 @end defvar
1648
1649 @defvar use-hard-newlines
1650 If this variable is non-@code{nil}, the filling functions do not delete
1651 newlines that have the @code{hard} text property. These ``hard
1652 newlines'' act as paragraph separators. @xref{Hard and Soft
1653 Newlines,, Hard and Soft Newlines, emacs, The GNU Emacs Manual}.
1654 @end defvar
1655
1656 @node Margins
1657 @section Margins for Filling
1658 @cindex margins, filling
1659
1660 @defopt fill-prefix
1661 This buffer-local variable, if non-@code{nil}, specifies a string of
1662 text that appears at the beginning of normal text lines and should be
1663 disregarded when filling them. Any line that fails to start with the
1664 fill prefix is considered the start of a paragraph; so is any line
1665 that starts with the fill prefix followed by additional whitespace.
1666 Lines that start with the fill prefix but no additional whitespace are
1667 ordinary text lines that can be filled together. The resulting filled
1668 lines also start with the fill prefix.
1669
1670 The fill prefix follows the left margin whitespace, if any.
1671 @end defopt
1672
1673 @defopt fill-column
1674 This buffer-local variable specifies the maximum width of filled lines.
1675 Its value should be an integer, which is a number of columns. All the
1676 filling, justification, and centering commands are affected by this
1677 variable, including Auto Fill mode (@pxref{Auto Filling}).
1678
1679 As a practical matter, if you are writing text for other people to
1680 read, you should set @code{fill-column} to no more than 70. Otherwise
1681 the line will be too long for people to read comfortably, and this can
1682 make the text seem clumsy.
1683
1684 The default value for @code{fill-column} is 70.
1685 @end defopt
1686
1687 @deffn Command set-left-margin from to margin
1688 This sets the @code{left-margin} property on the text from @var{from} to
1689 @var{to} to the value @var{margin}. If Auto Fill mode is enabled, this
1690 command also refills the region to fit the new margin.
1691 @end deffn
1692
1693 @deffn Command set-right-margin from to margin
1694 This sets the @code{right-margin} property on the text from @var{from}
1695 to @var{to} to the value @var{margin}. If Auto Fill mode is enabled,
1696 this command also refills the region to fit the new margin.
1697 @end deffn
1698
1699 @defun current-left-margin
1700 This function returns the proper left margin value to use for filling
1701 the text around point. The value is the sum of the @code{left-margin}
1702 property of the character at the start of the current line (or zero if
1703 none), and the value of the variable @code{left-margin}.
1704 @end defun
1705
1706 @defun current-fill-column
1707 This function returns the proper fill column value to use for filling
1708 the text around point. The value is the value of the @code{fill-column}
1709 variable, minus the value of the @code{right-margin} property of the
1710 character after point.
1711 @end defun
1712
1713 @deffn Command move-to-left-margin &optional n force
1714 This function moves point to the left margin of the current line. The
1715 column moved to is determined by calling the function
1716 @code{current-left-margin}. If the argument @var{n} is non-@code{nil},
1717 @code{move-to-left-margin} moves forward @var{n}@minus{}1 lines first.
1718
1719 If @var{force} is non-@code{nil}, that says to fix the line's
1720 indentation if that doesn't match the left margin value.
1721 @end deffn
1722
1723 @defun delete-to-left-margin &optional from to
1724 This function removes left margin indentation from the text between
1725 @var{from} and @var{to}. The amount of indentation to delete is
1726 determined by calling @code{current-left-margin}. In no case does this
1727 function delete non-whitespace. If @var{from} and @var{to} are omitted,
1728 they default to the whole buffer.
1729 @end defun
1730
1731 @defun indent-to-left-margin
1732 This function adjusts the indentation at the beginning of the current
1733 line to the value specified by the variable @code{left-margin}. (That
1734 may involve either inserting or deleting whitespace.) This function
1735 is value of @code{indent-line-function} in Paragraph-Indent Text mode.
1736 @end defun
1737
1738 @defopt left-margin
1739 This variable specifies the base left margin column. In Fundamental
1740 mode, @kbd{RET} indents to this column. This variable automatically
1741 becomes buffer-local when set in any fashion.
1742 @end defopt
1743
1744 @defopt fill-nobreak-predicate
1745 This variable gives major modes a way to specify not to break a line
1746 at certain places. Its value should be a list of functions. Whenever
1747 filling considers breaking the line at a certain place in the buffer,
1748 it calls each of these functions with no arguments and with point
1749 located at that place. If any of the functions returns
1750 non-@code{nil}, then the line won't be broken there.
1751 @end defopt
1752
1753 @node Adaptive Fill
1754 @section Adaptive Fill Mode
1755 @c @cindex Adaptive Fill mode "adaptive-fill-mode" is adjacent.
1756
1757 When @dfn{Adaptive Fill Mode} is enabled, Emacs determines the fill
1758 prefix automatically from the text in each paragraph being filled
1759 rather than using a predetermined value. During filling, this fill
1760 prefix gets inserted at the start of the second and subsequent lines
1761 of the paragraph as described in @ref{Filling}, and in @ref{Auto
1762 Filling}.
1763
1764 @defopt adaptive-fill-mode
1765 Adaptive Fill mode is enabled when this variable is non-@code{nil}.
1766 It is @code{t} by default.
1767 @end defopt
1768
1769 @defun fill-context-prefix from to
1770 This function implements the heart of Adaptive Fill mode; it chooses a
1771 fill prefix based on the text between @var{from} and @var{to},
1772 typically the start and end of a paragraph. It does this by looking
1773 at the first two lines of the paragraph, based on the variables
1774 described below.
1775 @c The optional argument first-line-regexp is not documented
1776 @c because it exists for internal purposes and might be eliminated
1777 @c in the future.
1778
1779 Usually, this function returns the fill prefix, a string. However,
1780 before doing this, the function makes a final check (not specially
1781 mentioned in the following) that a line starting with this prefix
1782 wouldn't look like the start of a paragraph. Should this happen, the
1783 function signals the anomaly by returning @code{nil} instead.
1784
1785 In detail, @code{fill-context-prefix} does this:
1786
1787 @enumerate
1788 @item
1789 It takes a candidate for the fill prefix from the first line---it
1790 tries first the function in @code{adaptive-fill-function} (if any),
1791 then the regular expression @code{adaptive-fill-regexp} (see below).
1792 The first non-@code{nil} result of these, or the empty string if
1793 they're both @code{nil}, becomes the first line's candidate.
1794 @item
1795 If the paragraph has as yet only one line, the function tests the
1796 validity of the prefix candidate just found. The function then
1797 returns the candidate if it's valid, or a string of spaces otherwise.
1798 (see the description of @code{adaptive-fill-first-line-regexp} below).
1799 @item
1800 When the paragraph already has two lines, the function next looks for
1801 a prefix candidate on the second line, in just the same way it did for
1802 the first line. If it doesn't find one, it returns @code{nil}.
1803 @item
1804 The function now compares the two candidate prefixes heuristically: if
1805 the non-whitespace characters in the line 2 candidate occur in the
1806 same order in the line 1 candidate, the function returns the line 2
1807 candidate. Otherwise, it returns the largest initial substring which
1808 is common to both candidates (which might be the empty string).
1809 @end enumerate
1810 @end defun
1811
1812 @defopt adaptive-fill-regexp
1813 Adaptive Fill mode matches this regular expression against the text
1814 starting after the left margin whitespace (if any) on a line; the
1815 characters it matches are that line's candidate for the fill prefix.
1816
1817 The default value matches whitespace with certain punctuation
1818 characters intermingled.
1819 @end defopt
1820
1821 @defopt adaptive-fill-first-line-regexp
1822 Used only in one-line paragraphs, this regular expression acts as an
1823 additional check of the validity of the one available candidate fill
1824 prefix: the candidate must match this regular expression, or match
1825 @code{comment-start-skip}. If it doesn't, @code{fill-context-prefix}
1826 replaces the candidate with a string of spaces ``of the same width''
1827 as it.
1828
1829 The default value of this variable is @w{@code{"\\`[ \t]*\\'"}}, which
1830 matches only a string of whitespace. The effect of this default is to
1831 force the fill prefixes found in one-line paragraphs always to be pure
1832 whitespace.
1833 @end defopt
1834
1835 @defopt adaptive-fill-function
1836 You can specify more complex ways of choosing a fill prefix
1837 automatically by setting this variable to a function. The function is
1838 called with point after the left margin (if any) of a line, and it
1839 must preserve point. It should return either ``that line's'' fill
1840 prefix or @code{nil}, meaning it has failed to determine a prefix.
1841 @end defopt
1842
1843 @node Auto Filling
1844 @section Auto Filling
1845 @cindex filling, automatic
1846 @cindex Auto Fill mode
1847
1848 @c FIXME: I don't think any of the variables below is a/an normal/abnormal hook.
1849 Auto Fill mode is a minor mode that fills lines automatically as text
1850 is inserted. This section describes the hook used by Auto Fill mode.
1851 For a description of functions that you can call explicitly to fill and
1852 justify existing text, see @ref{Filling}.
1853
1854 Auto Fill mode also enables the functions that change the margins and
1855 justification style to refill portions of the text. @xref{Margins}.
1856
1857 @defvar auto-fill-function
1858 The value of this buffer-local variable should be a function (of no
1859 arguments) to be called after self-inserting a character from the table
1860 @code{auto-fill-chars}. It may be @code{nil}, in which case nothing
1861 special is done in that case.
1862
1863 The value of @code{auto-fill-function} is @code{do-auto-fill} when
1864 Auto-Fill mode is enabled. That is a function whose sole purpose is to
1865 implement the usual strategy for breaking a line.
1866 @end defvar
1867
1868 @defvar normal-auto-fill-function
1869 This variable specifies the function to use for
1870 @code{auto-fill-function}, if and when Auto Fill is turned on. Major
1871 modes can set buffer-local values for this variable to alter how Auto
1872 Fill works.
1873 @end defvar
1874
1875 @defvar auto-fill-chars
1876 A char table of characters which invoke @code{auto-fill-function} when
1877 self-inserted---space and newline in most language environments. They
1878 have an entry @code{t} in the table.
1879 @end defvar
1880
1881 @node Sorting
1882 @section Sorting Text
1883 @cindex sorting text
1884
1885 The sorting functions described in this section all rearrange text in
1886 a buffer. This is in contrast to the function @code{sort}, which
1887 rearranges the order of the elements of a list (@pxref{Rearrangement}).
1888 The values returned by these functions are not meaningful.
1889
1890 @defun sort-subr reverse nextrecfun endrecfun &optional startkeyfun endkeyfun predicate
1891 This function is the general text-sorting routine that subdivides a
1892 buffer into records and then sorts them. Most of the commands in this
1893 section use this function.
1894
1895 To understand how @code{sort-subr} works, consider the whole accessible
1896 portion of the buffer as being divided into disjoint pieces called
1897 @dfn{sort records}. The records may or may not be contiguous, but they
1898 must not overlap. A portion of each sort record (perhaps all of it) is
1899 designated as the sort key. Sorting rearranges the records in order by
1900 their sort keys.
1901
1902 Usually, the records are rearranged in order of ascending sort key.
1903 If the first argument to the @code{sort-subr} function, @var{reverse},
1904 is non-@code{nil}, the sort records are rearranged in order of
1905 descending sort key.
1906
1907 The next four arguments to @code{sort-subr} are functions that are
1908 called to move point across a sort record. They are called many times
1909 from within @code{sort-subr}.
1910
1911 @enumerate
1912 @item
1913 @var{nextrecfun} is called with point at the end of a record. This
1914 function moves point to the start of the next record. The first record
1915 is assumed to start at the position of point when @code{sort-subr} is
1916 called. Therefore, you should usually move point to the beginning of
1917 the buffer before calling @code{sort-subr}.
1918
1919 This function can indicate there are no more sort records by leaving
1920 point at the end of the buffer.
1921
1922 @item
1923 @var{endrecfun} is called with point within a record. It moves point to
1924 the end of the record.
1925
1926 @item
1927 @var{startkeyfun} is called to move point from the start of a record to
1928 the start of the sort key. This argument is optional; if it is omitted,
1929 the whole record is the sort key. If supplied, the function should
1930 either return a non-@code{nil} value to be used as the sort key, or
1931 return @code{nil} to indicate that the sort key is in the buffer
1932 starting at point. In the latter case, @var{endkeyfun} is called to
1933 find the end of the sort key.
1934
1935 @item
1936 @var{endkeyfun} is called to move point from the start of the sort key
1937 to the end of the sort key. This argument is optional. If
1938 @var{startkeyfun} returns @code{nil} and this argument is omitted (or
1939 @code{nil}), then the sort key extends to the end of the record. There
1940 is no need for @var{endkeyfun} if @var{startkeyfun} returns a
1941 non-@code{nil} value.
1942 @end enumerate
1943
1944 The argument @var{predicate} is the function to use to compare keys.
1945 If keys are numbers, it defaults to @code{<}; otherwise it defaults to
1946 @code{string<}.
1947
1948 As an example of @code{sort-subr}, here is the complete function
1949 definition for @code{sort-lines}:
1950
1951 @example
1952 @group
1953 ;; @r{Note that the first two lines of doc string}
1954 ;; @r{are effectively one line when viewed by a user.}
1955 (defun sort-lines (reverse beg end)
1956 "Sort lines in region alphabetically;\
1957 argument means descending order.
1958 Called from a program, there are three arguments:
1959 @end group
1960 @group
1961 REVERSE (non-nil means reverse order),\
1962 BEG and END (region to sort).
1963 The variable `sort-fold-case' determines\
1964 whether alphabetic case affects
1965 the sort order."
1966 @end group
1967 @group
1968 (interactive "P\nr")
1969 (save-excursion
1970 (save-restriction
1971 (narrow-to-region beg end)
1972 (goto-char (point-min))
1973 (let ((inhibit-field-text-motion t))
1974 (sort-subr reverse 'forward-line 'end-of-line)))))
1975 @end group
1976 @end example
1977
1978 Here @code{forward-line} moves point to the start of the next record,
1979 and @code{end-of-line} moves point to the end of record. We do not pass
1980 the arguments @var{startkeyfun} and @var{endkeyfun}, because the entire
1981 record is used as the sort key.
1982
1983 The @code{sort-paragraphs} function is very much the same, except that
1984 its @code{sort-subr} call looks like this:
1985
1986 @example
1987 @group
1988 (sort-subr reverse
1989 (function
1990 (lambda ()
1991 (while (and (not (eobp))
1992 (looking-at paragraph-separate))
1993 (forward-line 1))))
1994 'forward-paragraph)
1995 @end group
1996 @end example
1997
1998 Markers pointing into any sort records are left with no useful
1999 position after @code{sort-subr} returns.
2000 @end defun
2001
2002 @defopt sort-fold-case
2003 If this variable is non-@code{nil}, @code{sort-subr} and the other
2004 buffer sorting functions ignore case when comparing strings.
2005 @end defopt
2006
2007 @deffn Command sort-regexp-fields reverse record-regexp key-regexp start end
2008 This command sorts the region between @var{start} and @var{end}
2009 alphabetically as specified by @var{record-regexp} and @var{key-regexp}.
2010 If @var{reverse} is a negative integer, then sorting is in reverse
2011 order.
2012
2013 Alphabetical sorting means that two sort keys are compared by
2014 comparing the first characters of each, the second characters of each,
2015 and so on. If a mismatch is found, it means that the sort keys are
2016 unequal; the sort key whose character is less at the point of first
2017 mismatch is the lesser sort key. The individual characters are compared
2018 according to their numerical character codes in the Emacs character set.
2019
2020 The value of the @var{record-regexp} argument specifies how to divide
2021 the buffer into sort records. At the end of each record, a search is
2022 done for this regular expression, and the text that matches it is taken
2023 as the next record. For example, the regular expression @samp{^.+$},
2024 which matches lines with at least one character besides a newline, would
2025 make each such line into a sort record. @xref{Regular Expressions}, for
2026 a description of the syntax and meaning of regular expressions.
2027
2028 The value of the @var{key-regexp} argument specifies what part of each
2029 record is the sort key. The @var{key-regexp} could match the whole
2030 record, or only a part. In the latter case, the rest of the record has
2031 no effect on the sorted order of records, but it is carried along when
2032 the record moves to its new position.
2033
2034 The @var{key-regexp} argument can refer to the text matched by a
2035 subexpression of @var{record-regexp}, or it can be a regular expression
2036 on its own.
2037
2038 If @var{key-regexp} is:
2039
2040 @table @asis
2041 @item @samp{\@var{digit}}
2042 then the text matched by the @var{digit}th @samp{\(...\)} parenthesis
2043 grouping in @var{record-regexp} is the sort key.
2044
2045 @item @samp{\&}
2046 then the whole record is the sort key.
2047
2048 @item a regular expression
2049 then @code{sort-regexp-fields} searches for a match for the regular
2050 expression within the record. If such a match is found, it is the sort
2051 key. If there is no match for @var{key-regexp} within a record then
2052 that record is ignored, which means its position in the buffer is not
2053 changed. (The other records may move around it.)
2054 @end table
2055
2056 For example, if you plan to sort all the lines in the region by the
2057 first word on each line starting with the letter @samp{f}, you should
2058 set @var{record-regexp} to @samp{^.*$} and set @var{key-regexp} to
2059 @samp{\<f\w*\>}. The resulting expression looks like this:
2060
2061 @example
2062 @group
2063 (sort-regexp-fields nil "^.*$" "\\<f\\w*\\>"
2064 (region-beginning)
2065 (region-end))
2066 @end group
2067 @end example
2068
2069 If you call @code{sort-regexp-fields} interactively, it prompts for
2070 @var{record-regexp} and @var{key-regexp} in the minibuffer.
2071 @end deffn
2072
2073 @deffn Command sort-lines reverse start end
2074 This command alphabetically sorts lines in the region between
2075 @var{start} and @var{end}. If @var{reverse} is non-@code{nil}, the sort
2076 is in reverse order.
2077 @end deffn
2078
2079 @deffn Command sort-paragraphs reverse start end
2080 This command alphabetically sorts paragraphs in the region between
2081 @var{start} and @var{end}. If @var{reverse} is non-@code{nil}, the sort
2082 is in reverse order.
2083 @end deffn
2084
2085 @deffn Command sort-pages reverse start end
2086 This command alphabetically sorts pages in the region between
2087 @var{start} and @var{end}. If @var{reverse} is non-@code{nil}, the sort
2088 is in reverse order.
2089 @end deffn
2090
2091 @deffn Command sort-fields field start end
2092 This command sorts lines in the region between @var{start} and
2093 @var{end}, comparing them alphabetically by the @var{field}th field
2094 of each line. Fields are separated by whitespace and numbered starting
2095 from 1. If @var{field} is negative, sorting is by the
2096 @w{@minus{}@var{field}th} field from the end of the line. This command
2097 is useful for sorting tables.
2098 @end deffn
2099
2100 @deffn Command sort-numeric-fields field start end
2101 This command sorts lines in the region between @var{start} and
2102 @var{end}, comparing them numerically by the @var{field}th field of
2103 each line. Fields are separated by whitespace and numbered starting
2104 from 1. The specified field must contain a number in each line of the
2105 region. Numbers starting with 0 are treated as octal, and numbers
2106 starting with @samp{0x} are treated as hexadecimal.
2107
2108 If @var{field} is negative, sorting is by the
2109 @w{@minus{}@var{field}th} field from the end of the line. This
2110 command is useful for sorting tables.
2111 @end deffn
2112
2113 @defopt sort-numeric-base
2114 This variable specifies the default radix for
2115 @code{sort-numeric-fields} to parse numbers.
2116 @end defopt
2117
2118 @deffn Command sort-columns reverse &optional beg end
2119 This command sorts the lines in the region between @var{beg} and
2120 @var{end}, comparing them alphabetically by a certain range of
2121 columns. The column positions of @var{beg} and @var{end} bound the
2122 range of columns to sort on.
2123
2124 If @var{reverse} is non-@code{nil}, the sort is in reverse order.
2125
2126 One unusual thing about this command is that the entire line
2127 containing position @var{beg}, and the entire line containing position
2128 @var{end}, are included in the region sorted.
2129
2130 Note that @code{sort-columns} rejects text that contains tabs, because
2131 tabs could be split across the specified columns. Use @kbd{M-x
2132 untabify} to convert tabs to spaces before sorting.
2133
2134 When possible, this command actually works by calling the @code{sort}
2135 utility program.
2136 @end deffn
2137
2138 @node Columns
2139 @section Counting Columns
2140 @cindex columns
2141 @cindex counting columns
2142 @cindex horizontal position
2143
2144 The column functions convert between a character position (counting
2145 characters from the beginning of the buffer) and a column position
2146 (counting screen characters from the beginning of a line).
2147
2148 These functions count each character according to the number of
2149 columns it occupies on the screen. This means control characters count
2150 as occupying 2 or 4 columns, depending upon the value of
2151 @code{ctl-arrow}, and tabs count as occupying a number of columns that
2152 depends on the value of @code{tab-width} and on the column where the tab
2153 begins. @xref{Usual Display}.
2154
2155 Column number computations ignore the width of the window and the
2156 amount of horizontal scrolling. Consequently, a column value can be
2157 arbitrarily high. The first (or leftmost) column is numbered 0. They
2158 also ignore overlays and text properties, aside from invisibility.
2159
2160 @defun current-column
2161 This function returns the horizontal position of point, measured in
2162 columns, counting from 0 at the left margin. The column position is the
2163 sum of the widths of all the displayed representations of the characters
2164 between the start of the current line and point.
2165
2166 For an example of using @code{current-column}, see the description of
2167 @code{count-lines} in @ref{Text Lines}.
2168 @end defun
2169
2170 @deffn Command move-to-column column &optional force
2171 This function moves point to @var{column} in the current line. The
2172 calculation of @var{column} takes into account the widths of the
2173 displayed representations of the characters between the start of the
2174 line and point.
2175
2176 When called interactively, @var{column} is the value of prefix numeric
2177 argument. If @var{column} is not an integer, an error is signaled.
2178
2179 @c This behavior used to be documented until 2013/08.
2180 @ignore
2181 If column @var{column} is beyond the end of the line, point moves to
2182 the end of the line. If @var{column} is negative, point moves to the
2183 beginning of the line.
2184 @end ignore
2185
2186 If it is impossible to move to column @var{column} because that is in
2187 the middle of a multicolumn character such as a tab, point moves to the
2188 end of that character. However, if @var{force} is non-@code{nil}, and
2189 @var{column} is in the middle of a tab, then @code{move-to-column}
2190 converts the tab into spaces so that it can move precisely to column
2191 @var{column}. Other multicolumn characters can cause anomalies despite
2192 @var{force}, since there is no way to split them.
2193
2194 The argument @var{force} also has an effect if the line isn't long
2195 enough to reach column @var{column}; if it is @code{t}, that means to
2196 add whitespace at the end of the line to reach that column.
2197
2198 The return value is the column number actually moved to.
2199 @end deffn
2200
2201 @node Indentation
2202 @section Indentation
2203 @cindex indentation
2204
2205 The indentation functions are used to examine, move to, and change
2206 whitespace that is at the beginning of a line. Some of the functions
2207 can also change whitespace elsewhere on a line. Columns and indentation
2208 count from zero at the left margin.
2209
2210 @menu
2211 * Primitive Indent:: Functions used to count and insert indentation.
2212 * Mode-Specific Indent:: Customize indentation for different modes.
2213 * Region Indent:: Indent all the lines in a region.
2214 * Relative Indent:: Indent the current line based on previous lines.
2215 * Indent Tabs:: Adjustable, typewriter-like tab stops.
2216 * Motion by Indent:: Move to first non-blank character.
2217 @end menu
2218
2219 @node Primitive Indent
2220 @subsection Indentation Primitives
2221
2222 This section describes the primitive functions used to count and
2223 insert indentation. The functions in the following sections use these
2224 primitives. @xref{Size of Displayed Text}, for related functions.
2225
2226 @defun current-indentation
2227 @comment !!Type Primitive Function
2228 @comment !!SourceFile indent.c
2229 This function returns the indentation of the current line, which is
2230 the horizontal position of the first nonblank character. If the
2231 contents are entirely blank, then this is the horizontal position of the
2232 end of the line.
2233 @end defun
2234
2235 @deffn Command indent-to column &optional minimum
2236 @comment !!Type Primitive Function
2237 @comment !!SourceFile indent.c
2238 This function indents from point with tabs and spaces until @var{column}
2239 is reached. If @var{minimum} is specified and non-@code{nil}, then at
2240 least that many spaces are inserted even if this requires going beyond
2241 @var{column}. Otherwise the function does nothing if point is already
2242 beyond @var{column}. The value is the column at which the inserted
2243 indentation ends.
2244
2245 The inserted whitespace characters inherit text properties from the
2246 surrounding text (usually, from the preceding text only). @xref{Sticky
2247 Properties}.
2248 @end deffn
2249
2250 @defopt indent-tabs-mode
2251 @comment !!SourceFile indent.c
2252 If this variable is non-@code{nil}, indentation functions can insert
2253 tabs as well as spaces. Otherwise, they insert only spaces. Setting
2254 this variable automatically makes it buffer-local in the current buffer.
2255 @end defopt
2256
2257 @node Mode-Specific Indent
2258 @subsection Indentation Controlled by Major Mode
2259
2260 An important function of each major mode is to customize the @key{TAB}
2261 key to indent properly for the language being edited. This section
2262 describes the mechanism of the @key{TAB} key and how to control it.
2263 The functions in this section return unpredictable values.
2264
2265 @deffn Command indent-for-tab-command &optional rigid
2266 This is the command bound to @key{TAB} in most editing modes. Its
2267 usual action is to indent the current line, but it can alternatively
2268 insert a tab character or indent a region.
2269
2270 Here is what it does:
2271
2272 @itemize
2273 @item
2274 First, it checks whether Transient Mark mode is enabled and the region
2275 is active. If so, it called @code{indent-region} to indent all the
2276 text in the region (@pxref{Region Indent}).
2277
2278 @item
2279 Otherwise, if the indentation function in @code{indent-line-function}
2280 is @code{indent-to-left-margin} (a trivial command that inserts a tab
2281 character), or if the variable @code{tab-always-indent} specifies that
2282 a tab character ought to be inserted (see below), then it inserts a
2283 tab character.
2284
2285 @item
2286 Otherwise, it indents the current line; this is done by calling the
2287 function in @code{indent-line-function}. If the line is already
2288 indented, and the value of @code{tab-always-indent} is @code{complete}
2289 (see below), it tries completing the text at point.
2290 @end itemize
2291
2292 If @var{rigid} is non-@code{nil} (interactively, with a prefix
2293 argument), then after this command indents a line or inserts a tab, it
2294 also rigidly indents the entire balanced expression which starts at
2295 the beginning of the current line, in order to reflect the new
2296 indentation. This argument is ignored if the command indents the
2297 region.
2298 @end deffn
2299
2300 @defvar indent-line-function
2301 This variable's value is the function to be used by
2302 @code{indent-for-tab-command}, and various other indentation commands,
2303 to indent the current line. It is usually assigned by the major mode;
2304 for instance, Lisp mode sets it to @code{lisp-indent-line}, C mode
2305 sets it to @code{c-indent-line}, and so on. The default value is
2306 @code{indent-relative}. @xref{Auto-Indentation}.
2307 @end defvar
2308
2309 @deffn Command indent-according-to-mode
2310 This command calls the function in @code{indent-line-function} to
2311 indent the current line in a way appropriate for the current major mode.
2312 @end deffn
2313
2314 @deffn Command newline-and-indent
2315 This function inserts a newline, then indents the new line (the one
2316 following the newline just inserted) according to the major mode. It
2317 does indentation by calling @code{indent-according-to-mode}.
2318 @end deffn
2319
2320 @deffn Command reindent-then-newline-and-indent
2321 This command reindents the current line, inserts a newline at point,
2322 and then indents the new line (the one following the newline just
2323 inserted). It does indentation on both lines by calling
2324 @code{indent-according-to-mode}.
2325 @end deffn
2326
2327 @defopt tab-always-indent
2328 This variable can be used to customize the behavior of the @key{TAB}
2329 (@code{indent-for-tab-command}) command. If the value is @code{t}
2330 (the default), the command normally just indents the current line. If
2331 the value is @code{nil}, the command indents the current line only if
2332 point is at the left margin or in the line's indentation; otherwise,
2333 it inserts a tab character. If the value is @code{complete}, the
2334 command first tries to indent the current line, and if the line was
2335 already indented, it calls @code{completion-at-point} to complete the
2336 text at point (@pxref{Completion in Buffers}).
2337 @end defopt
2338
2339 @node Region Indent
2340 @subsection Indenting an Entire Region
2341
2342 This section describes commands that indent all the lines in the
2343 region. They return unpredictable values.
2344
2345 @deffn Command indent-region start end &optional to-column
2346 This command indents each nonblank line starting between @var{start}
2347 (inclusive) and @var{end} (exclusive). If @var{to-column} is
2348 @code{nil}, @code{indent-region} indents each nonblank line by calling
2349 the current mode's indentation function, the value of
2350 @code{indent-line-function}.
2351
2352 If @var{to-column} is non-@code{nil}, it should be an integer
2353 specifying the number of columns of indentation; then this function
2354 gives each line exactly that much indentation, by either adding or
2355 deleting whitespace.
2356
2357 If there is a fill prefix, @code{indent-region} indents each line
2358 by making it start with the fill prefix.
2359 @end deffn
2360
2361 @defvar indent-region-function
2362 The value of this variable is a function that can be used by
2363 @code{indent-region} as a short cut. It should take two arguments, the
2364 start and end of the region. You should design the function so
2365 that it will produce the same results as indenting the lines of the
2366 region one by one, but presumably faster.
2367
2368 If the value is @code{nil}, there is no short cut, and
2369 @code{indent-region} actually works line by line.
2370
2371 A short-cut function is useful in modes such as C mode and Lisp mode,
2372 where the @code{indent-line-function} must scan from the beginning of
2373 the function definition: applying it to each line would be quadratic in
2374 time. The short cut can update the scan information as it moves through
2375 the lines indenting them; this takes linear time. In a mode where
2376 indenting a line individually is fast, there is no need for a short cut.
2377
2378 @code{indent-region} with a non-@code{nil} argument @var{to-column} has
2379 a different meaning and does not use this variable.
2380 @end defvar
2381
2382 @deffn Command indent-rigidly start end count
2383 This function indents all lines starting between @var{start}
2384 (inclusive) and @var{end} (exclusive) sideways by @var{count} columns.
2385 This ``preserves the shape'' of the affected region, moving it as a
2386 rigid unit.
2387
2388 This is useful not only for indenting regions of unindented text, but
2389 also for indenting regions of formatted code. For example, if
2390 @var{count} is 3, this command adds 3 columns of indentation to every
2391 line that begins in the specified region.
2392
2393 If called interactively with no prefix argument, this command invokes
2394 a transient mode for adjusting indentation rigidly. @xref{Indentation
2395 Commands,,, emacs, The GNU Emacs Manual}.
2396 @end deffn
2397
2398 @deffn Command indent-code-rigidly start end columns &optional nochange-regexp
2399 This is like @code{indent-rigidly}, except that it doesn't alter lines
2400 that start within strings or comments.
2401
2402 In addition, it doesn't alter a line if @var{nochange-regexp} matches at
2403 the beginning of the line (if @var{nochange-regexp} is non-@code{nil}).
2404 @end deffn
2405
2406 @node Relative Indent
2407 @subsection Indentation Relative to Previous Lines
2408
2409 This section describes two commands that indent the current line
2410 based on the contents of previous lines.
2411
2412 @deffn Command indent-relative &optional unindented-ok
2413 This command inserts whitespace at point, extending to the same
2414 column as the next @dfn{indent point} of the previous nonblank line. An
2415 indent point is a non-whitespace character following whitespace. The
2416 next indent point is the first one at a column greater than the current
2417 column of point. For example, if point is underneath and to the left of
2418 the first non-blank character of a line of text, it moves to that column
2419 by inserting whitespace.
2420
2421 If the previous nonblank line has no next indent point (i.e., none at a
2422 great enough column position), @code{indent-relative} either does
2423 nothing (if @var{unindented-ok} is non-@code{nil}) or calls
2424 @code{tab-to-tab-stop}. Thus, if point is underneath and to the right
2425 of the last column of a short line of text, this command ordinarily
2426 moves point to the next tab stop by inserting whitespace.
2427
2428 The return value of @code{indent-relative} is unpredictable.
2429
2430 In the following example, point is at the beginning of the second
2431 line:
2432
2433 @example
2434 @group
2435 This line is indented twelve spaces.
2436 @point{}The quick brown fox jumped.
2437 @end group
2438 @end example
2439
2440 @noindent
2441 Evaluation of the expression @code{(indent-relative nil)} produces the
2442 following:
2443
2444 @example
2445 @group
2446 This line is indented twelve spaces.
2447 @point{}The quick brown fox jumped.
2448 @end group
2449 @end example
2450
2451 In this next example, point is between the @samp{m} and @samp{p} of
2452 @samp{jumped}:
2453
2454 @example
2455 @group
2456 This line is indented twelve spaces.
2457 The quick brown fox jum@point{}ped.
2458 @end group
2459 @end example
2460
2461 @noindent
2462 Evaluation of the expression @code{(indent-relative nil)} produces the
2463 following:
2464
2465 @example
2466 @group
2467 This line is indented twelve spaces.
2468 The quick brown fox jum @point{}ped.
2469 @end group
2470 @end example
2471 @end deffn
2472
2473 @deffn Command indent-relative-maybe
2474 @comment !!SourceFile indent.el
2475 This command indents the current line like the previous nonblank line,
2476 by calling @code{indent-relative} with @code{t} as the
2477 @var{unindented-ok} argument. The return value is unpredictable.
2478
2479 If the previous nonblank line has no indent points beyond the current
2480 column, this command does nothing.
2481 @end deffn
2482
2483 @node Indent Tabs
2484 @subsection Adjustable ``Tab Stops''
2485 @cindex tabs stops for indentation
2486
2487 This section explains the mechanism for user-specified ``tab stops''
2488 and the mechanisms that use and set them. The name ``tab stops'' is
2489 used because the feature is similar to that of the tab stops on a
2490 typewriter. The feature works by inserting an appropriate number of
2491 spaces and tab characters to reach the next tab stop column; it does not
2492 affect the display of tab characters in the buffer (@pxref{Usual
2493 Display}). Note that the @key{TAB} character as input uses this tab
2494 stop feature only in a few major modes, such as Text mode.
2495 @xref{Tab Stops,,, emacs, The GNU Emacs Manual}.
2496
2497 @deffn Command tab-to-tab-stop
2498 This command inserts spaces or tabs before point, up to the next tab
2499 stop column defined by @code{tab-stop-list}.
2500 @end deffn
2501
2502 @defopt tab-stop-list
2503 This variable defines the tab stop columns used by @code{tab-to-tab-stop}.
2504 It should be either @code{nil}, or a list of increasing integers,
2505 which need not be evenly spaced. The list is implicitly
2506 extended to infinity through repetition of the interval between the
2507 last and penultimate elements (or @code{tab-width} if the list has
2508 fewer than two elements). A value of @code{nil} means a tab stop
2509 every @code{tab-width} columns.
2510
2511 Use @kbd{M-x edit-tab-stops} to edit the location of tab stops interactively.
2512 @end defopt
2513
2514 @node Motion by Indent
2515 @subsection Indentation-Based Motion Commands
2516
2517 These commands, primarily for interactive use, act based on the
2518 indentation in the text.
2519
2520 @deffn Command back-to-indentation
2521 @comment !!SourceFile simple.el
2522 This command moves point to the first non-whitespace character in the
2523 current line (which is the line in which point is located). It returns
2524 @code{nil}.
2525 @end deffn
2526
2527 @deffn Command backward-to-indentation &optional arg
2528 @comment !!SourceFile simple.el
2529 This command moves point backward @var{arg} lines and then to the
2530 first nonblank character on that line. It returns @code{nil}.
2531 If @var{arg} is omitted or @code{nil}, it defaults to 1.
2532 @end deffn
2533
2534 @deffn Command forward-to-indentation &optional arg
2535 @comment !!SourceFile simple.el
2536 This command moves point forward @var{arg} lines and then to the first
2537 nonblank character on that line. It returns @code{nil}.
2538 If @var{arg} is omitted or @code{nil}, it defaults to 1.
2539 @end deffn
2540
2541 @node Case Changes
2542 @section Case Changes
2543 @cindex case conversion in buffers
2544
2545 The case change commands described here work on text in the current
2546 buffer. @xref{Case Conversion}, for case conversion functions that work
2547 on strings and characters. @xref{Case Tables}, for how to customize
2548 which characters are upper or lower case and how to convert them.
2549
2550 @deffn Command capitalize-region start end
2551 This function capitalizes all words in the region defined by
2552 @var{start} and @var{end}. To capitalize means to convert each word's
2553 first character to upper case and convert the rest of each word to lower
2554 case. The function returns @code{nil}.
2555
2556 If one end of the region is in the middle of a word, the part of the
2557 word within the region is treated as an entire word.
2558
2559 When @code{capitalize-region} is called interactively, @var{start} and
2560 @var{end} are point and the mark, with the smallest first.
2561
2562 @example
2563 @group
2564 ---------- Buffer: foo ----------
2565 This is the contents of the 5th foo.
2566 ---------- Buffer: foo ----------
2567 @end group
2568
2569 @group
2570 (capitalize-region 1 37)
2571 @result{} nil
2572
2573 ---------- Buffer: foo ----------
2574 This Is The Contents Of The 5th Foo.
2575 ---------- Buffer: foo ----------
2576 @end group
2577 @end example
2578 @end deffn
2579
2580 @deffn Command downcase-region start end
2581 This function converts all of the letters in the region defined by
2582 @var{start} and @var{end} to lower case. The function returns
2583 @code{nil}.
2584
2585 When @code{downcase-region} is called interactively, @var{start} and
2586 @var{end} are point and the mark, with the smallest first.
2587 @end deffn
2588
2589 @deffn Command upcase-region start end
2590 This function converts all of the letters in the region defined by
2591 @var{start} and @var{end} to upper case. The function returns
2592 @code{nil}.
2593
2594 When @code{upcase-region} is called interactively, @var{start} and
2595 @var{end} are point and the mark, with the smallest first.
2596 @end deffn
2597
2598 @deffn Command capitalize-word count
2599 This function capitalizes @var{count} words after point, moving point
2600 over as it does. To capitalize means to convert each word's first
2601 character to upper case and convert the rest of each word to lower case.
2602 If @var{count} is negative, the function capitalizes the
2603 @minus{}@var{count} previous words but does not move point. The value
2604 is @code{nil}.
2605
2606 If point is in the middle of a word, the part of the word before point
2607 is ignored when moving forward. The rest is treated as an entire word.
2608
2609 When @code{capitalize-word} is called interactively, @var{count} is
2610 set to the numeric prefix argument.
2611 @end deffn
2612
2613 @deffn Command downcase-word count
2614 This function converts the @var{count} words after point to all lower
2615 case, moving point over as it does. If @var{count} is negative, it
2616 converts the @minus{}@var{count} previous words but does not move point.
2617 The value is @code{nil}.
2618
2619 When @code{downcase-word} is called interactively, @var{count} is set
2620 to the numeric prefix argument.
2621 @end deffn
2622
2623 @deffn Command upcase-word count
2624 This function converts the @var{count} words after point to all upper
2625 case, moving point over as it does. If @var{count} is negative, it
2626 converts the @minus{}@var{count} previous words but does not move point.
2627 The value is @code{nil}.
2628
2629 When @code{upcase-word} is called interactively, @var{count} is set to
2630 the numeric prefix argument.
2631 @end deffn
2632
2633 @node Text Properties
2634 @section Text Properties
2635 @cindex text properties
2636 @cindex attributes of text
2637 @cindex properties of text
2638
2639 Each character position in a buffer or a string can have a @dfn{text
2640 property list}, much like the property list of a symbol (@pxref{Property
2641 Lists}). The properties belong to a particular character at a
2642 particular place, such as, the letter @samp{T} at the beginning of this
2643 sentence or the first @samp{o} in @samp{foo}---if the same character
2644 occurs in two different places, the two occurrences in general have
2645 different properties.
2646
2647 Each property has a name and a value. Both of these can be any Lisp
2648 object, but the name is normally a symbol. Typically each property
2649 name symbol is used for a particular purpose; for instance, the text
2650 property @code{face} specifies the faces for displaying the character
2651 (@pxref{Special Properties}). The usual way to access the property
2652 list is to specify a name and ask what value corresponds to it.
2653
2654 If a character has a @code{category} property, we call it the
2655 @dfn{property category} of the character. It should be a symbol. The
2656 properties of the symbol serve as defaults for the properties of the
2657 character.
2658
2659 Copying text between strings and buffers preserves the properties
2660 along with the characters; this includes such diverse functions as
2661 @code{substring}, @code{insert}, and @code{buffer-substring}.
2662
2663 @menu
2664 * Examining Properties:: Looking at the properties of one character.
2665 * Changing Properties:: Setting the properties of a range of text.
2666 * Property Search:: Searching for where a property changes value.
2667 * Special Properties:: Particular properties with special meanings.
2668 * Format Properties:: Properties for representing formatting of text.
2669 * Sticky Properties:: How inserted text gets properties from
2670 neighboring text.
2671 * Lazy Properties:: Computing text properties in a lazy fashion
2672 only when text is examined.
2673 * Clickable Text:: Using text properties to make regions of text
2674 do something when you click on them.
2675 * Fields:: The @code{field} property defines
2676 fields within the buffer.
2677 * Not Intervals:: Why text properties do not use
2678 Lisp-visible text intervals.
2679 @end menu
2680
2681 @node Examining Properties
2682 @subsection Examining Text Properties
2683 @cindex examining text properties
2684 @cindex text properties, examining
2685
2686 The simplest way to examine text properties is to ask for the value of
2687 a particular property of a particular character. For that, use
2688 @code{get-text-property}. Use @code{text-properties-at} to get the
2689 entire property list of a character. @xref{Property Search}, for
2690 functions to examine the properties of a number of characters at once.
2691
2692 These functions handle both strings and buffers. Keep in mind that
2693 positions in a string start from 0, whereas positions in a buffer start
2694 from 1.
2695
2696 @defun get-text-property pos prop &optional object
2697 This function returns the value of the @var{prop} property of the
2698 character after position @var{pos} in @var{object} (a buffer or
2699 string). The argument @var{object} is optional and defaults to the
2700 current buffer.
2701
2702 If there is no @var{prop} property strictly speaking, but the character
2703 has a property category that is a symbol, then @code{get-text-property} returns
2704 the @var{prop} property of that symbol.
2705 @end defun
2706
2707 @defun get-char-property position prop &optional object
2708 This function is like @code{get-text-property}, except that it checks
2709 overlays first and then text properties. @xref{Overlays}.
2710
2711 The argument @var{object} may be a string, a buffer, or a window. If
2712 it is a window, then the buffer displayed in that window is used for
2713 text properties and overlays, but only the overlays active for that
2714 window are considered. If @var{object} is a buffer, then overlays in
2715 that buffer are considered first, in order of decreasing priority,
2716 followed by the text properties. If @var{object} is a string, only
2717 text properties are considered, since strings never have overlays.
2718 @end defun
2719
2720 @defun get-pos-property position prop &optional object
2721 This function is like @code{get-char-property}, except that it pays
2722 attention to properties' stickiness and overlays' advancement settings
2723 instead of the property of the character at (i.e., right after)
2724 @var{position}.
2725 @end defun
2726
2727 @defun get-char-property-and-overlay position prop &optional object
2728 This is like @code{get-char-property}, but gives extra information
2729 about the overlay that the property value comes from.
2730
2731 Its value is a cons cell whose @sc{car} is the property value, the
2732 same value @code{get-char-property} would return with the same
2733 arguments. Its @sc{cdr} is the overlay in which the property was
2734 found, or @code{nil}, if it was found as a text property or not found
2735 at all.
2736
2737 If @var{position} is at the end of @var{object}, both the @sc{car} and
2738 the @sc{cdr} of the value are @code{nil}.
2739 @end defun
2740
2741 @defvar char-property-alias-alist
2742 This variable holds an alist which maps property names to a list of
2743 alternative property names. If a character does not specify a direct
2744 value for a property, the alternative property names are consulted in
2745 order; the first non-@code{nil} value is used. This variable takes
2746 precedence over @code{default-text-properties}, and @code{category}
2747 properties take precedence over this variable.
2748 @end defvar
2749
2750 @defun text-properties-at position &optional object
2751 This function returns the entire property list of the character at
2752 @var{position} in the string or buffer @var{object}. If @var{object} is
2753 @code{nil}, it defaults to the current buffer.
2754 @end defun
2755
2756 @defvar default-text-properties
2757 This variable holds a property list giving default values for text
2758 properties. Whenever a character does not specify a value for a
2759 property, neither directly, through a category symbol, or through
2760 @code{char-property-alias-alist}, the value stored in this list is
2761 used instead. Here is an example:
2762
2763 @example
2764 (setq default-text-properties '(foo 69)
2765 char-property-alias-alist nil)
2766 ;; @r{Make sure character 1 has no properties of its own.}
2767 (set-text-properties 1 2 nil)
2768 ;; @r{What we get, when we ask, is the default value.}
2769 (get-text-property 1 'foo)
2770 @result{} 69
2771 @end example
2772 @end defvar
2773
2774 @node Changing Properties
2775 @subsection Changing Text Properties
2776 @cindex changing text properties
2777 @cindex text properties, changing
2778
2779 The primitives for changing properties apply to a specified range of
2780 text in a buffer or string. The function @code{set-text-properties}
2781 (see end of section) sets the entire property list of the text in that
2782 range; more often, it is useful to add, change, or delete just certain
2783 properties specified by name.
2784
2785 Since text properties are considered part of the contents of the
2786 buffer (or string), and can affect how a buffer looks on the screen,
2787 any change in buffer text properties marks the buffer as modified.
2788 Buffer text property changes are undoable also (@pxref{Undo}).
2789 Positions in a string start from 0, whereas positions in a buffer
2790 start from 1.
2791
2792 @defun put-text-property start end prop value &optional object
2793 This function sets the @var{prop} property to @var{value} for the text
2794 between @var{start} and @var{end} in the string or buffer @var{object}.
2795 If @var{object} is @code{nil}, it defaults to the current buffer.
2796 @end defun
2797
2798 @defun add-text-properties start end props &optional object
2799 This function adds or overrides text properties for the text between
2800 @var{start} and @var{end} in the string or buffer @var{object}. If
2801 @var{object} is @code{nil}, it defaults to the current buffer.
2802
2803 The argument @var{props} specifies which properties to add. It should
2804 have the form of a property list (@pxref{Property Lists}): a list whose
2805 elements include the property names followed alternately by the
2806 corresponding values.
2807
2808 The return value is @code{t} if the function actually changed some
2809 property's value; @code{nil} otherwise (if @var{props} is @code{nil} or
2810 its values agree with those in the text).
2811
2812 For example, here is how to set the @code{comment} and @code{face}
2813 properties of a range of text:
2814
2815 @example
2816 (add-text-properties @var{start} @var{end}
2817 '(comment t face highlight))
2818 @end example
2819 @end defun
2820
2821 @defun remove-text-properties start end props &optional object
2822 This function deletes specified text properties from the text between
2823 @var{start} and @var{end} in the string or buffer @var{object}. If
2824 @var{object} is @code{nil}, it defaults to the current buffer.
2825
2826 The argument @var{props} specifies which properties to delete. It
2827 should have the form of a property list (@pxref{Property Lists}): a list
2828 whose elements are property names alternating with corresponding values.
2829 But only the names matter---the values that accompany them are ignored.
2830 For example, here's how to remove the @code{face} property.
2831
2832 @example
2833 (remove-text-properties @var{start} @var{end} '(face nil))
2834 @end example
2835
2836 The return value is @code{t} if the function actually changed some
2837 property's value; @code{nil} otherwise (if @var{props} is @code{nil} or
2838 if no character in the specified text had any of those properties).
2839
2840 To remove all text properties from certain text, use
2841 @code{set-text-properties} and specify @code{nil} for the new property
2842 list.
2843 @end defun
2844
2845 @defun remove-list-of-text-properties start end list-of-properties &optional object
2846 Like @code{remove-text-properties} except that
2847 @var{list-of-properties} is a list of property names only, not an
2848 alternating list of property names and values.
2849 @end defun
2850
2851 @defun set-text-properties start end props &optional object
2852 This function completely replaces the text property list for the text
2853 between @var{start} and @var{end} in the string or buffer @var{object}.
2854 If @var{object} is @code{nil}, it defaults to the current buffer.
2855
2856 The argument @var{props} is the new property list. It should be a list
2857 whose elements are property names alternating with corresponding values.
2858
2859 After @code{set-text-properties} returns, all the characters in the
2860 specified range have identical properties.
2861
2862 If @var{props} is @code{nil}, the effect is to get rid of all properties
2863 from the specified range of text. Here's an example:
2864
2865 @example
2866 (set-text-properties @var{start} @var{end} nil)
2867 @end example
2868
2869 Do not rely on the return value of this function.
2870 @end defun
2871
2872 @defun add-face-text-property start end face &optional appendp object
2873 This function acts on the text between @var{start} and @var{end},
2874 adding the face @var{face} to the @code{face} text property.
2875 @var{face} should be a valid value for the @code{face} property
2876 (@pxref{Special Properties}), such as a face name or an anonymous face
2877 (@pxref{Faces}).
2878
2879 If any text in the region already has a non-@code{nil} @code{face} property,
2880 those face(s) are retained. This function sets the @code{face}
2881 property to a list of faces, with @var{face} as the first element (by
2882 default) and the pre-existing faces as the remaining elements. If the
2883 optional argument @var{append} is non-@code{nil}, @var{face} is
2884 appended to the end of the list instead. Note that in a face list,
2885 the first occurring value for each attribute takes precedence.
2886
2887 For example, the following code would assign a italicized green face
2888 to the text between @var{start} and @var{end}:
2889
2890 @example
2891 (add-face-text-property @var{start} @var{end} 'italic)
2892 (add-face-text-property @var{start} @var{end} '(:foreground "red"))
2893 (add-face-text-property @var{start} @var{end} '(:foreground "green"))
2894 @end example
2895
2896 The optional argument @var{object}, if non-@code{nil}, specifies a
2897 buffer or string to act on, rather than the current buffer. If
2898 @var{object} is a string, then @var{start} and @var{end} are
2899 zero-based indices into the string.
2900 @end defun
2901
2902 The easiest way to make a string with text properties is with
2903 @code{propertize}:
2904
2905 @defun propertize string &rest properties
2906 This function returns a copy of @var{string} with the text properties
2907 @var{properties} added. These properties apply to all the characters
2908 in the string that is returned. Here is an example that constructs a
2909 string with a @code{face} property and a @code{mouse-face} property:
2910
2911 @smallexample
2912 (propertize "foo" 'face 'italic
2913 'mouse-face 'bold-italic)
2914 @result{} #("foo" 0 3 (mouse-face bold-italic face italic))
2915 @end smallexample
2916
2917 To put different properties on various parts of a string, you can
2918 construct each part with @code{propertize} and then combine them with
2919 @code{concat}:
2920
2921 @smallexample
2922 (concat
2923 (propertize "foo" 'face 'italic
2924 'mouse-face 'bold-italic)
2925 " and "
2926 (propertize "bar" 'face 'italic
2927 'mouse-face 'bold-italic))
2928 @result{} #("foo and bar"
2929 0 3 (face italic mouse-face bold-italic)
2930 3 8 nil
2931 8 11 (face italic mouse-face bold-italic))
2932 @end smallexample
2933 @end defun
2934
2935 @xref{Buffer Contents}, for the function
2936 @code{buffer-substring-no-properties}, which copies text from the
2937 buffer but does not copy its properties.
2938
2939 @node Property Search
2940 @subsection Text Property Search Functions
2941 @cindex searching text properties
2942 @cindex text properties, searching
2943
2944 In typical use of text properties, most of the time several or many
2945 consecutive characters have the same value for a property. Rather than
2946 writing your programs to examine characters one by one, it is much
2947 faster to process chunks of text that have the same property value.
2948
2949 Here are functions you can use to do this. They use @code{eq} for
2950 comparing property values. In all cases, @var{object} defaults to the
2951 current buffer.
2952
2953 For good performance, it's very important to use the @var{limit}
2954 argument to these functions, especially the ones that search for a
2955 single property---otherwise, they may spend a long time scanning to the
2956 end of the buffer, if the property you are interested in does not change.
2957
2958 These functions do not move point; instead, they return a position (or
2959 @code{nil}). Remember that a position is always between two characters;
2960 the position returned by these functions is between two characters with
2961 different properties.
2962
2963 @defun next-property-change pos &optional object limit
2964 The function scans the text forward from position @var{pos} in the
2965 string or buffer @var{object} until it finds a change in some text
2966 property, then returns the position of the change. In other words, it
2967 returns the position of the first character beyond @var{pos} whose
2968 properties are not identical to those of the character just after
2969 @var{pos}.
2970
2971 If @var{limit} is non-@code{nil}, then the scan ends at position
2972 @var{limit}. If there is no property change before that point, this
2973 function returns @var{limit}.
2974
2975 The value is @code{nil} if the properties remain unchanged all the way
2976 to the end of @var{object} and @var{limit} is @code{nil}. If the value
2977 is non-@code{nil}, it is a position greater than or equal to @var{pos}.
2978 The value equals @var{pos} only when @var{limit} equals @var{pos}.
2979
2980 Here is an example of how to scan the buffer by chunks of text within
2981 which all properties are constant:
2982
2983 @smallexample
2984 (while (not (eobp))
2985 (let ((plist (text-properties-at (point)))
2986 (next-change
2987 (or (next-property-change (point) (current-buffer))
2988 (point-max))))
2989 @r{Process text from point to @var{next-change}@dots{}}
2990 (goto-char next-change)))
2991 @end smallexample
2992 @end defun
2993
2994 @defun previous-property-change pos &optional object limit
2995 This is like @code{next-property-change}, but scans back from @var{pos}
2996 instead of forward. If the value is non-@code{nil}, it is a position
2997 less than or equal to @var{pos}; it equals @var{pos} only if @var{limit}
2998 equals @var{pos}.
2999 @end defun
3000
3001 @defun next-single-property-change pos prop &optional object limit
3002 The function scans text for a change in the @var{prop} property, then
3003 returns the position of the change. The scan goes forward from
3004 position @var{pos} in the string or buffer @var{object}. In other
3005 words, this function returns the position of the first character
3006 beyond @var{pos} whose @var{prop} property differs from that of the
3007 character just after @var{pos}.
3008
3009 If @var{limit} is non-@code{nil}, then the scan ends at position
3010 @var{limit}. If there is no property change before that point,
3011 @code{next-single-property-change} returns @var{limit}.
3012
3013 The value is @code{nil} if the property remains unchanged all the way to
3014 the end of @var{object} and @var{limit} is @code{nil}. If the value is
3015 non-@code{nil}, it is a position greater than or equal to @var{pos}; it
3016 equals @var{pos} only if @var{limit} equals @var{pos}.
3017 @end defun
3018
3019 @defun previous-single-property-change pos prop &optional object limit
3020 This is like @code{next-single-property-change}, but scans back from
3021 @var{pos} instead of forward. If the value is non-@code{nil}, it is a
3022 position less than or equal to @var{pos}; it equals @var{pos} only if
3023 @var{limit} equals @var{pos}.
3024 @end defun
3025
3026 @defun next-char-property-change pos &optional limit
3027 This is like @code{next-property-change} except that it considers
3028 overlay properties as well as text properties, and if no change is
3029 found before the end of the buffer, it returns the maximum buffer
3030 position rather than @code{nil} (in this sense, it resembles the
3031 corresponding overlay function @code{next-overlay-change}, rather than
3032 @code{next-property-change}). There is no @var{object} operand
3033 because this function operates only on the current buffer. It returns
3034 the next address at which either kind of property changes.
3035 @end defun
3036
3037 @defun previous-char-property-change pos &optional limit
3038 This is like @code{next-char-property-change}, but scans back from
3039 @var{pos} instead of forward, and returns the minimum buffer
3040 position if no change is found.
3041 @end defun
3042
3043 @defun next-single-char-property-change pos prop &optional object limit
3044 This is like @code{next-single-property-change} except that it
3045 considers overlay properties as well as text properties, and if no
3046 change is found before the end of the @var{object}, it returns the
3047 maximum valid position in @var{object} rather than @code{nil}. Unlike
3048 @code{next-char-property-change}, this function @emph{does} have an
3049 @var{object} operand; if @var{object} is not a buffer, only
3050 text-properties are considered.
3051 @end defun
3052
3053 @defun previous-single-char-property-change pos prop &optional object limit
3054 This is like @code{next-single-char-property-change}, but scans back
3055 from @var{pos} instead of forward, and returns the minimum valid
3056 position in @var{object} if no change is found.
3057 @end defun
3058
3059 @defun text-property-any start end prop value &optional object
3060 This function returns non-@code{nil} if at least one character between
3061 @var{start} and @var{end} has a property @var{prop} whose value is
3062 @var{value}. More precisely, it returns the position of the first such
3063 character. Otherwise, it returns @code{nil}.
3064
3065 The optional fifth argument, @var{object}, specifies the string or
3066 buffer to scan. Positions are relative to @var{object}. The default
3067 for @var{object} is the current buffer.
3068 @end defun
3069
3070 @defun text-property-not-all start end prop value &optional object
3071 This function returns non-@code{nil} if at least one character between
3072 @var{start} and @var{end} does not have a property @var{prop} with value
3073 @var{value}. More precisely, it returns the position of the first such
3074 character. Otherwise, it returns @code{nil}.
3075
3076 The optional fifth argument, @var{object}, specifies the string or
3077 buffer to scan. Positions are relative to @var{object}. The default
3078 for @var{object} is the current buffer.
3079 @end defun
3080
3081 @node Special Properties
3082 @subsection Properties with Special Meanings
3083
3084 Here is a table of text property names that have special built-in
3085 meanings. The following sections list a few additional special property
3086 names that control filling and property inheritance. All other names
3087 have no standard meaning, and you can use them as you like.
3088
3089 Note: the properties @code{composition}, @code{display},
3090 @code{invisible} and @code{intangible} can also cause point to move to
3091 an acceptable place, after each Emacs command. @xref{Adjusting
3092 Point}.
3093
3094 @table @code
3095 @cindex property category of text character
3096 @c FIXME: Isn't @kindex for keyboard commands?
3097 @kindex category @r{(text property)}
3098 @item category
3099 If a character has a @code{category} property, we call it the
3100 @dfn{property category} of the character. It should be a symbol. The
3101 properties of this symbol serve as defaults for the properties of the
3102 character.
3103
3104 @item face
3105 @cindex face codes of text
3106 @kindex face @r{(text property)}
3107 The @code{face} property controls the appearance of the character
3108 (@pxref{Faces}). The value of the property can be the following:
3109
3110 @itemize @bullet
3111 @item
3112 A face name (a symbol or string).
3113
3114 @item
3115 An anonymous face: a property list of the form @code{(@var{keyword}
3116 @var{value} @dots{})}, where each @var{keyword} is a face attribute
3117 name and @var{value} is a value for that attribute.
3118
3119 @item
3120 A list of faces. Each list element should be either a face name or an
3121 anonymous face. This specifies a face which is an aggregate of the
3122 attributes of each of the listed faces. Faces occurring earlier in
3123 the list have higher priority.
3124
3125 @item
3126 A cons cell of the form @code{(foreground-color . @var{color-name})}
3127 or @code{(background-color . @var{color-name})}. This specifies the
3128 foreground or background color, similar to @code{(:foreground
3129 @var{color-name})} or @code{(:background @var{color-name})}. This
3130 form is supported for backward compatibility only, and should be
3131 avoided.
3132 @end itemize
3133
3134 Font Lock mode (@pxref{Font Lock Mode}) works in most buffers by
3135 dynamically updating the @code{face} property of characters based on
3136 the context.
3137
3138 The @code{add-face-text-property} function provides a convenient way
3139 to set this text property. @xref{Changing Properties}.
3140
3141 @item font-lock-face
3142 @kindex font-lock-face @r{(text property)}
3143 This property specifies a value for the @code{face} property that Font
3144 Lock mode should apply to the underlying text. It is one of the
3145 fontification methods used by Font Lock mode, and is useful for
3146 special modes that implement their own highlighting.
3147 @xref{Precalculated Fontification}. When Font Lock mode is disabled,
3148 @code{font-lock-face} has no effect.
3149
3150 @item mouse-face
3151 @kindex mouse-face @r{(text property)}
3152 This property is used instead of @code{face} when the mouse is on or
3153 near the character. For this purpose, ``near'' means that all text
3154 between the character and where the mouse is have the same
3155 @code{mouse-face} property value.
3156
3157 Emacs ignores all face attributes from the @code{mouse-face} property
3158 that alter the text size (e.g., @code{:height}, @code{:weight}, and
3159 @code{:slant}). Those attributes are always the same as for the
3160 unhighlighted text.
3161
3162 @item fontified
3163 @kindex fontified @r{(text property)}
3164 This property says whether the text is ready for display. If
3165 @code{nil}, Emacs's redisplay routine calls the functions in
3166 @code{fontification-functions} (@pxref{Auto Faces}) to prepare this
3167 part of the buffer before it is displayed. It is used internally by
3168 the ``just in time'' font locking code.
3169
3170 @item display
3171 This property activates various features that change the
3172 way text is displayed. For example, it can make text appear taller
3173 or shorter, higher or lower, wider or narrow, or replaced with an image.
3174 @xref{Display Property}.
3175
3176 @item help-echo
3177 @kindex help-echo @r{(text property)}
3178 @cindex tooltip
3179 @anchor{Text help-echo}
3180 If text has a string as its @code{help-echo} property, then when you
3181 move the mouse onto that text, Emacs displays that string in the echo
3182 area, or in the tooltip window (@pxref{Tooltips,,, emacs, The GNU Emacs
3183 Manual}).
3184
3185 If the value of the @code{help-echo} property is a function, that
3186 function is called with three arguments, @var{window}, @var{object} and
3187 @var{pos} and should return a help string or @code{nil} for
3188 none. The first argument, @var{window} is the window in which
3189 the help was found. The second, @var{object}, is the buffer, overlay or
3190 string which had the @code{help-echo} property. The @var{pos}
3191 argument is as follows:
3192
3193 @itemize @bullet{}
3194 @item
3195 If @var{object} is a buffer, @var{pos} is the position in the buffer.
3196 @item
3197 If @var{object} is an overlay, that overlay has a @code{help-echo}
3198 property, and @var{pos} is the position in the overlay's buffer.
3199 @item
3200 If @var{object} is a string (an overlay string or a string displayed
3201 with the @code{display} property), @var{pos} is the position in that
3202 string.
3203 @end itemize
3204
3205 If the value of the @code{help-echo} property is neither a function nor
3206 a string, it is evaluated to obtain a help string.
3207
3208 You can alter the way help text is displayed by setting the variable
3209 @code{show-help-function} (@pxref{Help display}).
3210
3211 This feature is used in the mode line and for other active text.
3212
3213 @item keymap
3214 @cindex keymap of character
3215 @kindex keymap @r{(text property)}
3216 The @code{keymap} property specifies an additional keymap for
3217 commands. When this keymap applies, it is used for key lookup before
3218 the minor mode keymaps and before the buffer's local map.
3219 @xref{Active Keymaps}. If the property value is a symbol, the
3220 symbol's function definition is used as the keymap.
3221
3222 The property's value for the character before point applies if it is
3223 non-@code{nil} and rear-sticky, and the property's value for the
3224 character after point applies if it is non-@code{nil} and
3225 front-sticky. (For mouse clicks, the position of the click is used
3226 instead of the position of point.)
3227
3228 @item local-map
3229 @kindex local-map @r{(text property)}
3230 This property works like @code{keymap} except that it specifies a
3231 keymap to use @emph{instead of} the buffer's local map. For most
3232 purposes (perhaps all purposes), it is better to use the @code{keymap}
3233 property.
3234
3235 @item syntax-table
3236 The @code{syntax-table} property overrides what the syntax table says
3237 about this particular character. @xref{Syntax Properties}.
3238
3239 @item read-only
3240 @cindex read-only character
3241 @kindex read-only @r{(text property)}
3242 If a character has the property @code{read-only}, then modifying that
3243 character is not allowed. Any command that would do so gets an error,
3244 @code{text-read-only}. If the property value is a string, that string
3245 is used as the error message.
3246
3247 Insertion next to a read-only character is an error if inserting
3248 ordinary text there would inherit the @code{read-only} property due to
3249 stickiness. Thus, you can control permission to insert next to
3250 read-only text by controlling the stickiness. @xref{Sticky Properties}.
3251
3252 Since changing properties counts as modifying the buffer, it is not
3253 possible to remove a @code{read-only} property unless you know the
3254 special trick: bind @code{inhibit-read-only} to a non-@code{nil} value
3255 and then remove the property. @xref{Read Only Buffers}.
3256
3257 @item inhibit-read-only
3258 @kindex inhibit-read-only @r{(text property)}
3259 If a character has the property @code{inhibit-read-only}, and the
3260 buffer is read-only, editing the character in question is allowed.
3261
3262 @item invisible
3263 @kindex invisible @r{(text property)}
3264 A non-@code{nil} @code{invisible} property can make a character invisible
3265 on the screen. @xref{Invisible Text}, for details.
3266
3267 @item intangible
3268 @kindex intangible @r{(text property)}
3269 If a group of consecutive characters have equal and non-@code{nil}
3270 @code{intangible} properties, then you cannot place point between them.
3271 If you try to move point forward into the group, point actually moves to
3272 the end of the group. If you try to move point backward into the group,
3273 point actually moves to the start of the group.
3274
3275 If consecutive characters have unequal non-@code{nil}
3276 @code{intangible} properties, they belong to separate groups; each
3277 group is separately treated as described above.
3278
3279 When the variable @code{inhibit-point-motion-hooks} is non-@code{nil},
3280 the @code{intangible} property is ignored.
3281
3282 Beware: this property operates at a very low level, and affects a lot of code
3283 in unexpected ways. So use it with extreme caution. A common misuse is to put
3284 an intangible property on invisible text, which is actually unnecessary since
3285 the command loop will move point outside of the invisible text at the end of
3286 each command anyway. @xref{Adjusting Point}.
3287
3288 @item field
3289 @kindex field @r{(text property)}
3290 Consecutive characters with the same @code{field} property constitute a
3291 @dfn{field}. Some motion functions including @code{forward-word} and
3292 @code{beginning-of-line} stop moving at a field boundary.
3293 @xref{Fields}.
3294
3295 @item cursor
3296 @kindex cursor @r{(text property)}
3297 Normally, the cursor is displayed at the beginning or the end of any
3298 overlay and text property strings present at the current buffer
3299 position. You can place the cursor on any desired character of these
3300 strings by giving that character a non-@code{nil} @code{cursor} text
3301 property. In addition, if the value of the @code{cursor} property is
3302 an integer, it specifies the number of buffer's character
3303 positions, starting with the position where the overlay or the
3304 @code{display} property begins, for which the cursor should be
3305 displayed on that character. Specifically, if the value of the
3306 @code{cursor} property of a character is the number @var{n}, the
3307 cursor will be displayed on this character for any buffer position in
3308 the range @code{[@var{ovpos}..@var{ovpos}+@var{n})}, where @var{ovpos}
3309 is the overlay's starting position given by @code{overlay-start}
3310 (@pxref{Managing Overlays}), or the position where the @code{display}
3311 text property begins in the buffer.
3312
3313 In other words, the string character with the @code{cursor} property
3314 of any non-@code{nil} value is the character where to display the
3315 cursor. The value of the property says for which buffer positions to
3316 display the cursor there. If the value is an integer @var{n},
3317 the cursor is displayed there when point is anywhere between the
3318 beginning of the overlay or @code{display} property and @var{n}
3319 positions after that. If the value is anything else and
3320 non-@code{nil}, the cursor is displayed there only when point is at
3321 the beginning of the @code{display} property or at
3322 @code{overlay-start}.
3323
3324 @cindex cursor position for @code{display} properties and overlays
3325 When the buffer has many overlay strings (e.g., @pxref{Overlay
3326 Properties, before-string}) or @code{display} properties that are
3327 strings, it is a good idea to use the @code{cursor} property on these
3328 strings to cue the Emacs display about the places where to put the
3329 cursor while traversing these strings. This directly communicates to
3330 the display engine where the Lisp program wants to put the cursor, or
3331 where the user would expect the cursor.
3332
3333 @item pointer
3334 @kindex pointer @r{(text property)}
3335 This specifies a specific pointer shape when the mouse pointer is over
3336 this text or image. @xref{Pointer Shape}, for possible pointer
3337 shapes.
3338
3339 @item line-spacing
3340 @kindex line-spacing @r{(text property)}
3341 A newline can have a @code{line-spacing} text or overlay property that
3342 controls the height of the display line ending with that newline. The
3343 property value overrides the default frame line spacing and the buffer
3344 local @code{line-spacing} variable. @xref{Line Height}.
3345
3346 @item line-height
3347 @kindex line-height @r{(text property)}
3348 A newline can have a @code{line-height} text or overlay property that
3349 controls the total height of the display line ending in that newline.
3350 @xref{Line Height}.
3351
3352 @item wrap-prefix
3353 If text has a @code{wrap-prefix} property, the prefix it defines will
3354 be added at display time to the beginning of every continuation line
3355 due to text wrapping (so if lines are truncated, the wrap-prefix is
3356 never used). It may be a string or an image (@pxref{Other Display
3357 Specs}), or a stretch of whitespace such as specified by the
3358 @code{:width} or @code{:align-to} display properties (@pxref{Specified
3359 Space}).
3360
3361 A wrap-prefix may also be specified for an entire buffer using the
3362 @code{wrap-prefix} buffer-local variable (however, a
3363 @code{wrap-prefix} text-property takes precedence over the value of
3364 the @code{wrap-prefix} variable). @xref{Truncation}.
3365
3366 @item line-prefix
3367 If text has a @code{line-prefix} property, the prefix it defines will
3368 be added at display time to the beginning of every non-continuation
3369 line. It may be a string or an image (@pxref{Other Display
3370 Specs}), or a stretch of whitespace such as specified by the
3371 @code{:width} or @code{:align-to} display properties (@pxref{Specified
3372 Space}).
3373
3374 A line-prefix may also be specified for an entire buffer using the
3375 @code{line-prefix} buffer-local variable (however, a
3376 @code{line-prefix} text-property takes precedence over the value of
3377 the @code{line-prefix} variable). @xref{Truncation}.
3378
3379 @item modification-hooks
3380 @cindex change hooks for a character
3381 @cindex hooks for changing a character
3382 @kindex modification-hooks @r{(text property)}
3383 If a character has the property @code{modification-hooks}, then its
3384 value should be a list of functions; modifying that character calls
3385 all of those functions before the actual modification. Each function
3386 receives two arguments: the beginning and end of the part of the
3387 buffer being modified. Note that if a particular modification hook
3388 function appears on several characters being modified by a single
3389 primitive, you can't predict how many times the function will
3390 be called.
3391 Furthermore, insertion will not modify any existing character, so this
3392 hook will only be run when removing some characters, replacing them
3393 with others, or changing their text-properties.
3394
3395 If these functions modify the buffer, they should bind
3396 @code{inhibit-modification-hooks} to @code{t} around doing so, to
3397 avoid confusing the internal mechanism that calls these hooks.
3398
3399 Overlays also support the @code{modification-hooks} property, but the
3400 details are somewhat different (@pxref{Overlay Properties}).
3401
3402 @item insert-in-front-hooks
3403 @itemx insert-behind-hooks
3404 @kindex insert-in-front-hooks @r{(text property)}
3405 @kindex insert-behind-hooks @r{(text property)}
3406 The operation of inserting text in a buffer also calls the functions
3407 listed in the @code{insert-in-front-hooks} property of the following
3408 character and in the @code{insert-behind-hooks} property of the
3409 preceding character. These functions receive two arguments, the
3410 beginning and end of the inserted text. The functions are called
3411 @emph{after} the actual insertion takes place.
3412
3413 See also @ref{Change Hooks}, for other hooks that are called
3414 when you change text in a buffer.
3415
3416 @item point-entered
3417 @itemx point-left
3418 @cindex hooks for motion of point
3419 @kindex point-entered @r{(text property)}
3420 @kindex point-left @r{(text property)}
3421 The special properties @code{point-entered} and @code{point-left}
3422 record hook functions that report motion of point. Each time point
3423 moves, Emacs compares these two property values:
3424
3425 @itemize @bullet
3426 @item
3427 the @code{point-left} property of the character after the old location,
3428 and
3429 @item
3430 the @code{point-entered} property of the character after the new
3431 location.
3432 @end itemize
3433
3434 @noindent
3435 If these two values differ, each of them is called (if not @code{nil})
3436 with two arguments: the old value of point, and the new one.
3437
3438 The same comparison is made for the characters before the old and new
3439 locations. The result may be to execute two @code{point-left} functions
3440 (which may be the same function) and/or two @code{point-entered}
3441 functions (which may be the same function). In any case, all the
3442 @code{point-left} functions are called first, followed by all the
3443 @code{point-entered} functions.
3444
3445 It is possible to use @code{char-after} to examine characters at various
3446 buffer positions without moving point to those positions. Only an
3447 actual change in the value of point runs these hook functions.
3448
3449 The variable @code{inhibit-point-motion-hooks} can inhibit running the
3450 @code{point-left} and @code{point-entered} hooks, see @ref{Inhibit
3451 point motion hooks}.
3452
3453 @item composition
3454 @kindex composition @r{(text property)}
3455 This text property is used to display a sequence of characters as a
3456 single glyph composed from components. But the value of the property
3457 itself is completely internal to Emacs and should not be manipulated
3458 directly by, for instance, @code{put-text-property}.
3459
3460 @end table
3461
3462 @defvar inhibit-point-motion-hooks
3463 @anchor{Inhibit point motion hooks} When this variable is
3464 non-@code{nil}, @code{point-left} and @code{point-entered} hooks are
3465 not run, and the @code{intangible} property has no effect. Do not set
3466 this variable globally; bind it with @code{let}.
3467 @end defvar
3468
3469 @defvar show-help-function
3470 @anchor{Help display} If this variable is non-@code{nil}, it specifies a
3471 function called to display help strings. These may be @code{help-echo}
3472 properties, menu help strings (@pxref{Simple Menu Items},
3473 @pxref{Extended Menu Items}), or tool bar help strings (@pxref{Tool
3474 Bar}). The specified function is called with one argument, the help
3475 string to display. Tooltip mode (@pxref{Tooltips,,, emacs, The GNU Emacs
3476 Manual}) provides an example.
3477 @end defvar
3478
3479 @node Format Properties
3480 @subsection Formatted Text Properties
3481
3482 These text properties affect the behavior of the fill commands. They
3483 are used for representing formatted text. @xref{Filling}, and
3484 @ref{Margins}.
3485
3486 @table @code
3487 @item hard
3488 If a newline character has this property, it is a ``hard'' newline.
3489 The fill commands do not alter hard newlines and do not move words
3490 across them. However, this property takes effect only if the
3491 @code{use-hard-newlines} minor mode is enabled. @xref{Hard and Soft
3492 Newlines,, Hard and Soft Newlines, emacs, The GNU Emacs Manual}.
3493
3494 @item right-margin
3495 This property specifies an extra right margin for filling this part of the
3496 text.
3497
3498 @item left-margin
3499 This property specifies an extra left margin for filling this part of the
3500 text.
3501
3502 @item justification
3503 This property specifies the style of justification for filling this part
3504 of the text.
3505 @end table
3506
3507 @node Sticky Properties
3508 @subsection Stickiness of Text Properties
3509 @cindex sticky text properties
3510 @cindex inheritance, text property
3511
3512 Self-inserting characters normally take on the same properties as the
3513 preceding character. This is called @dfn{inheritance} of properties.
3514
3515 A Lisp program can do insertion with inheritance or without,
3516 depending on the choice of insertion primitive. The ordinary text
3517 insertion functions, such as @code{insert}, do not inherit any
3518 properties. They insert text with precisely the properties of the
3519 string being inserted, and no others. This is correct for programs
3520 that copy text from one context to another---for example, into or out
3521 of the kill ring. To insert with inheritance, use the special
3522 primitives described in this section. Self-inserting characters
3523 inherit properties because they work using these primitives.
3524
3525 When you do insertion with inheritance, @emph{which} properties are
3526 inherited, and from where, depends on which properties are @dfn{sticky}.
3527 Insertion after a character inherits those of its properties that are
3528 @dfn{rear-sticky}. Insertion before a character inherits those of its
3529 properties that are @dfn{front-sticky}. When both sides offer different
3530 sticky values for the same property, the previous character's value
3531 takes precedence.
3532
3533 By default, a text property is rear-sticky but not front-sticky; thus,
3534 the default is to inherit all the properties of the preceding character,
3535 and nothing from the following character.
3536
3537 You can control the stickiness of various text properties with two
3538 specific text properties, @code{front-sticky} and @code{rear-nonsticky},
3539 and with the variable @code{text-property-default-nonsticky}. You can
3540 use the variable to specify a different default for a given property.
3541 You can use those two text properties to make any specific properties
3542 sticky or nonsticky in any particular part of the text.
3543
3544 If a character's @code{front-sticky} property is @code{t}, then all
3545 its properties are front-sticky. If the @code{front-sticky} property is
3546 a list, then the sticky properties of the character are those whose
3547 names are in the list. For example, if a character has a
3548 @code{front-sticky} property whose value is @code{(face read-only)},
3549 then insertion before the character can inherit its @code{face} property
3550 and its @code{read-only} property, but no others.
3551
3552 The @code{rear-nonsticky} property works the opposite way. Most
3553 properties are rear-sticky by default, so the @code{rear-nonsticky}
3554 property says which properties are @emph{not} rear-sticky. If a
3555 character's @code{rear-nonsticky} property is @code{t}, then none of its
3556 properties are rear-sticky. If the @code{rear-nonsticky} property is a
3557 list, properties are rear-sticky @emph{unless} their names are in the
3558 list.
3559
3560 @defvar text-property-default-nonsticky
3561 This variable holds an alist which defines the default rear-stickiness
3562 of various text properties. Each element has the form
3563 @code{(@var{property} . @var{nonstickiness})}, and it defines the
3564 stickiness of a particular text property, @var{property}.
3565
3566 If @var{nonstickiness} is non-@code{nil}, this means that the property
3567 @var{property} is rear-nonsticky by default. Since all properties are
3568 front-nonsticky by default, this makes @var{property} nonsticky in both
3569 directions by default.
3570
3571 The text properties @code{front-sticky} and @code{rear-nonsticky}, when
3572 used, take precedence over the default @var{nonstickiness} specified in
3573 @code{text-property-default-nonsticky}.
3574 @end defvar
3575
3576 Here are the functions that insert text with inheritance of properties:
3577
3578 @defun insert-and-inherit &rest strings
3579 Insert the strings @var{strings}, just like the function @code{insert},
3580 but inherit any sticky properties from the adjoining text.
3581 @end defun
3582
3583 @defun insert-before-markers-and-inherit &rest strings
3584 Insert the strings @var{strings}, just like the function
3585 @code{insert-before-markers}, but inherit any sticky properties from the
3586 adjoining text.
3587 @end defun
3588
3589 @xref{Insertion}, for the ordinary insertion functions which do not
3590 inherit.
3591
3592 @node Lazy Properties
3593 @subsection Lazy Computation of Text Properties
3594
3595 Instead of computing text properties for all the text in the buffer,
3596 you can arrange to compute the text properties for parts of the text
3597 when and if something depends on them.
3598
3599 The primitive that extracts text from the buffer along with its
3600 properties is @code{buffer-substring}. Before examining the properties,
3601 this function runs the abnormal hook @code{buffer-access-fontify-functions}.
3602
3603 @defvar buffer-access-fontify-functions
3604 This variable holds a list of functions for computing text properties.
3605 Before @code{buffer-substring} copies the text and text properties for a
3606 portion of the buffer, it calls all the functions in this list. Each of
3607 the functions receives two arguments that specify the range of the
3608 buffer being accessed. (The buffer itself is always the current
3609 buffer.)
3610 @end defvar
3611
3612 The function @code{buffer-substring-no-properties} does not call these
3613 functions, since it ignores text properties anyway.
3614
3615 In order to prevent the hook functions from being called more than
3616 once for the same part of the buffer, you can use the variable
3617 @code{buffer-access-fontified-property}.
3618
3619 @defvar buffer-access-fontified-property
3620 If this variable's value is non-@code{nil}, it is a symbol which is used
3621 as a text property name. A non-@code{nil} value for that text property
3622 means, ``the other text properties for this character have already been
3623 computed''.
3624
3625 If all the characters in the range specified for @code{buffer-substring}
3626 have a non-@code{nil} value for this property, @code{buffer-substring}
3627 does not call the @code{buffer-access-fontify-functions} functions. It
3628 assumes these characters already have the right text properties, and
3629 just copies the properties they already have.
3630
3631 The normal way to use this feature is that the
3632 @code{buffer-access-fontify-functions} functions add this property, as
3633 well as others, to the characters they operate on. That way, they avoid
3634 being called over and over for the same text.
3635 @end defvar
3636
3637 @node Clickable Text
3638 @subsection Defining Clickable Text
3639 @cindex clickable text
3640 @cindex follow links
3641 @cindex mouse-1
3642
3643 @dfn{Clickable text} is text that can be clicked, with either the
3644 mouse or via a keyboard command, to produce some result. Many major
3645 modes use clickable text to implement textual hyper-links, or
3646 @dfn{links} for short.
3647
3648 The easiest way to insert and manipulate links is to use the
3649 @code{button} package. @xref{Buttons}. In this section, we will
3650 explain how to manually set up clickable text in a buffer, using text
3651 properties. For simplicity, we will refer to the clickable text as a
3652 @dfn{link}.
3653
3654 Implementing a link involves three separate steps: (1) indicating
3655 clickability when the mouse moves over the link; (2) making @key{RET}
3656 or @kbd{Mouse-2} on that link do something; and (3) setting up a
3657 @code{follow-link} condition so that the link obeys
3658 @code{mouse-1-click-follows-link}.
3659
3660 To indicate clickability, add the @code{mouse-face} text property to
3661 the text of the link; then Emacs will highlight the link when the
3662 mouse moves over it. In addition, you should define a tooltip or echo
3663 area message, using the @code{help-echo} text property. @xref{Special
3664 Properties}. For instance, here is how Dired indicates that file
3665 names are clickable:
3666
3667 @smallexample
3668 (if (dired-move-to-filename)
3669 (add-text-properties
3670 (point)
3671 (save-excursion
3672 (dired-move-to-end-of-filename)
3673 (point))
3674 '(mouse-face highlight
3675 help-echo "mouse-2: visit this file in other window")))
3676 @end smallexample
3677
3678 To make the link clickable, bind @key{RET} and @kbd{Mouse-2} to
3679 commands that perform the desired action. Each command should check
3680 to see whether it was called on a link, and act accordingly. For
3681 instance, Dired's major mode keymap binds @kbd{Mouse-2} to the
3682 following command:
3683
3684 @smallexample
3685 (defun dired-mouse-find-file-other-window (event)
3686 "In Dired, visit the file or directory name you click on."
3687 (interactive "e")
3688 (let ((window (posn-window (event-end event)))
3689 (pos (posn-point (event-end event)))
3690 file)
3691 (if (not (windowp window))
3692 (error "No file chosen"))
3693 (with-current-buffer (window-buffer window)
3694 (goto-char pos)
3695 (setq file (dired-get-file-for-visit)))
3696 (if (file-directory-p file)
3697 (or (and (cdr dired-subdir-alist)
3698 (dired-goto-subdir file))
3699 (progn
3700 (select-window window)
3701 (dired-other-window file)))
3702 (select-window window)
3703 (find-file-other-window (file-name-sans-versions file t)))))
3704 @end smallexample
3705
3706 @noindent
3707 This command uses the functions @code{posn-window} and
3708 @code{posn-point} to determine where the click occurred, and
3709 @code{dired-get-file-for-visit} to determine which file to visit.
3710
3711 Instead of binding the mouse command in a major mode keymap, you can
3712 bind it within the link text, using the @code{keymap} text property
3713 (@pxref{Special Properties}). For instance:
3714
3715 @example
3716 (let ((map (make-sparse-keymap)))
3717 (define-key map [mouse-2] 'operate-this-button)
3718 (put-text-property link-start link-end 'keymap map))
3719 @end example
3720
3721 @noindent
3722 With this method, you can easily define different commands for
3723 different links. Furthermore, the global definition of @key{RET} and
3724 @kbd{Mouse-2} remain available for the rest of the text in the buffer.
3725
3726 @vindex mouse-1-click-follows-link
3727 The basic Emacs command for clicking on links is @kbd{Mouse-2}.
3728 However, for compatibility with other graphical applications, Emacs
3729 also recognizes @kbd{Mouse-1} clicks on links, provided the user
3730 clicks on the link quickly without moving the mouse. This behavior is
3731 controlled by the user option @code{mouse-1-click-follows-link}.
3732 @xref{Mouse References,,, emacs, The GNU Emacs Manual}.
3733
3734 @cindex follow-link (text or overlay property)
3735 To set up the link so that it obeys
3736 @code{mouse-1-click-follows-link}, you must either (1) apply a
3737 @code{follow-link} text or overlay property to the link text, or (2)
3738 bind the @code{follow-link} event to a keymap (which can be a major
3739 mode keymap or a local keymap specified via the @code{keymap} text
3740 property). The value of the @code{follow-link} property, or the
3741 binding for the @code{follow-link} event, acts as a ``condition'' for
3742 the link action. This condition tells Emacs two things: the
3743 circumstances under which a @kbd{Mouse-1} click should be regarded as
3744 occurring ``inside'' the link, and how to compute an ``action code''
3745 that says what to translate the @kbd{Mouse-1} click into. The link
3746 action condition can be one of the following:
3747
3748 @table @asis
3749 @item @code{mouse-face}
3750 If the condition is the symbol @code{mouse-face}, a position is inside
3751 a link if there is a non-@code{nil} @code{mouse-face} property at that
3752 position. The action code is always @code{t}.
3753
3754 For example, here is how Info mode handles @key{Mouse-1}:
3755
3756 @smallexample
3757 (define-key Info-mode-map [follow-link] 'mouse-face)
3758 @end smallexample
3759
3760 @item a function
3761 If the condition is a function, @var{func}, then a position @var{pos}
3762 is inside a link if @code{(@var{func} @var{pos})} evaluates to
3763 non-@code{nil}. The value returned by @var{func} serves as the action
3764 code.
3765
3766 For example, here is how pcvs enables @kbd{Mouse-1} to follow links on
3767 file names only:
3768
3769 @smallexample
3770 (define-key map [follow-link]
3771 (lambda (pos)
3772 (eq (get-char-property pos 'face) 'cvs-filename-face)))
3773 @end smallexample
3774
3775 @item anything else
3776 If the condition value is anything else, then the position is inside a
3777 link and the condition itself is the action code. Clearly, you should
3778 specify this kind of condition only when applying the condition via a
3779 text or property overlay on the link text (so that it does not apply
3780 to the entire buffer).
3781 @end table
3782
3783 @noindent
3784 The action code tells @kbd{Mouse-1} how to follow the link:
3785
3786 @table @asis
3787 @item a string or vector
3788 If the action code is a string or vector, the @kbd{Mouse-1} event is
3789 translated into the first element of the string or vector; i.e., the
3790 action of the @kbd{Mouse-1} click is the local or global binding of
3791 that character or symbol. Thus, if the action code is @code{"foo"},
3792 @kbd{Mouse-1} translates into @kbd{f}. If it is @code{[foo]},
3793 @kbd{Mouse-1} translates into @key{foo}.
3794
3795 @item anything else
3796 For any other non-@code{nil} action code, the @kbd{Mouse-1} event is
3797 translated into a @kbd{Mouse-2} event at the same position.
3798 @end table
3799
3800 To define @kbd{Mouse-1} to activate a button defined with
3801 @code{define-button-type}, give the button a @code{follow-link}
3802 property. The property value should be a link action condition, as
3803 described above. @xref{Buttons}. For example, here is how Help mode
3804 handles @kbd{Mouse-1}:
3805
3806 @smallexample
3807 (define-button-type 'help-xref
3808 'follow-link t
3809 'action #'help-button-action)
3810 @end smallexample
3811
3812 To define @kbd{Mouse-1} on a widget defined with
3813 @code{define-widget}, give the widget a @code{:follow-link} property.
3814 The property value should be a link action condition, as described
3815 above. For example, here is how the @code{link} widget specifies that
3816 a @key{Mouse-1} click shall be translated to @key{RET}:
3817
3818 @smallexample
3819 (define-widget 'link 'item
3820 "An embedded link."
3821 :button-prefix 'widget-link-prefix
3822 :button-suffix 'widget-link-suffix
3823 :follow-link "\C-m"
3824 :help-echo "Follow the link."
3825 :format "%[%t%]")
3826 @end smallexample
3827
3828 @defun mouse-on-link-p pos
3829 This function returns non-@code{nil} if position @var{pos} in the
3830 current buffer is on a link. @var{pos} can also be a mouse event
3831 location, as returned by @code{event-start} (@pxref{Accessing Mouse}).
3832 @end defun
3833
3834 @node Fields
3835 @subsection Defining and Using Fields
3836 @cindex fields
3837
3838 A field is a range of consecutive characters in the buffer that are
3839 identified by having the same value (comparing with @code{eq}) of the
3840 @code{field} property (either a text-property or an overlay property).
3841 This section describes special functions that are available for
3842 operating on fields.
3843
3844 You specify a field with a buffer position, @var{pos}. We think of
3845 each field as containing a range of buffer positions, so the position
3846 you specify stands for the field containing that position.
3847
3848 When the characters before and after @var{pos} are part of the same
3849 field, there is no doubt which field contains @var{pos}: the one those
3850 characters both belong to. When @var{pos} is at a boundary between
3851 fields, which field it belongs to depends on the stickiness of the
3852 @code{field} properties of the two surrounding characters (@pxref{Sticky
3853 Properties}). The field whose property would be inherited by text
3854 inserted at @var{pos} is the field that contains @var{pos}.
3855
3856 There is an anomalous case where newly inserted text at @var{pos}
3857 would not inherit the @code{field} property from either side. This
3858 happens if the previous character's @code{field} property is not
3859 rear-sticky, and the following character's @code{field} property is not
3860 front-sticky. In this case, @var{pos} belongs to neither the preceding
3861 field nor the following field; the field functions treat it as belonging
3862 to an empty field whose beginning and end are both at @var{pos}.
3863
3864 In all of these functions, if @var{pos} is omitted or @code{nil}, the
3865 value of point is used by default. If narrowing is in effect, then
3866 @var{pos} should fall within the accessible portion. @xref{Narrowing}.
3867
3868 @defun field-beginning &optional pos escape-from-edge limit
3869 This function returns the beginning of the field specified by @var{pos}.
3870
3871 If @var{pos} is at the beginning of its field, and
3872 @var{escape-from-edge} is non-@code{nil}, then the return value is
3873 always the beginning of the preceding field that @emph{ends} at @var{pos},
3874 regardless of the stickiness of the @code{field} properties around
3875 @var{pos}.
3876
3877 If @var{limit} is non-@code{nil}, it is a buffer position; if the
3878 beginning of the field is before @var{limit}, then @var{limit} will be
3879 returned instead.
3880 @end defun
3881
3882 @defun field-end &optional pos escape-from-edge limit
3883 This function returns the end of the field specified by @var{pos}.
3884
3885 If @var{pos} is at the end of its field, and @var{escape-from-edge} is
3886 non-@code{nil}, then the return value is always the end of the following
3887 field that @emph{begins} at @var{pos}, regardless of the stickiness of
3888 the @code{field} properties around @var{pos}.
3889
3890 If @var{limit} is non-@code{nil}, it is a buffer position; if the end
3891 of the field is after @var{limit}, then @var{limit} will be returned
3892 instead.
3893 @end defun
3894
3895 @defun field-string &optional pos
3896 This function returns the contents of the field specified by @var{pos},
3897 as a string.
3898 @end defun
3899
3900 @defun field-string-no-properties &optional pos
3901 This function returns the contents of the field specified by @var{pos},
3902 as a string, discarding text properties.
3903 @end defun
3904
3905 @defun delete-field &optional pos
3906 This function deletes the text of the field specified by @var{pos}.
3907 @end defun
3908
3909 @defun constrain-to-field new-pos old-pos &optional escape-from-edge only-in-line inhibit-capture-property
3910 This function ``constrains'' @var{new-pos} to the field that
3911 @var{old-pos} belongs to---in other words, it returns the position
3912 closest to @var{new-pos} that is in the same field as @var{old-pos}.
3913
3914 If @var{new-pos} is @code{nil}, then @code{constrain-to-field} uses
3915 the value of point instead, and moves point to the resulting position
3916 in addition to returning that position.
3917
3918 If @var{old-pos} is at the boundary of two fields, then the acceptable
3919 final positions depend on the argument @var{escape-from-edge}. If
3920 @var{escape-from-edge} is @code{nil}, then @var{new-pos} must be in
3921 the field whose @code{field} property equals what new characters
3922 inserted at @var{old-pos} would inherit. (This depends on the
3923 stickiness of the @code{field} property for the characters before and
3924 after @var{old-pos}.) If @var{escape-from-edge} is non-@code{nil},
3925 @var{new-pos} can be anywhere in the two adjacent fields.
3926 Additionally, if two fields are separated by another field with the
3927 special value @code{boundary}, then any point within this special
3928 field is also considered to be ``on the boundary''.
3929
3930 Commands like @kbd{C-a} with no argument, that normally move backward
3931 to a specific kind of location and stay there once there, probably
3932 should specify @code{nil} for @var{escape-from-edge}. Other motion
3933 commands that check fields should probably pass @code{t}.
3934
3935 If the optional argument @var{only-in-line} is non-@code{nil}, and
3936 constraining @var{new-pos} in the usual way would move it to a different
3937 line, @var{new-pos} is returned unconstrained. This used in commands
3938 that move by line, such as @code{next-line} and
3939 @code{beginning-of-line}, so that they respect field boundaries only in
3940 the case where they can still move to the right line.
3941
3942 If the optional argument @var{inhibit-capture-property} is
3943 non-@code{nil}, and @var{old-pos} has a non-@code{nil} property of that
3944 name, then any field boundaries are ignored.
3945
3946 You can cause @code{constrain-to-field} to ignore all field boundaries
3947 (and so never constrain anything) by binding the variable
3948 @code{inhibit-field-text-motion} to a non-@code{nil} value.
3949 @end defun
3950
3951 @node Not Intervals
3952 @subsection Why Text Properties are not Intervals
3953 @cindex intervals
3954
3955 Some editors that support adding attributes to text in the buffer do
3956 so by letting the user specify ``intervals'' within the text, and adding
3957 the properties to the intervals. Those editors permit the user or the
3958 programmer to determine where individual intervals start and end. We
3959 deliberately provided a different sort of interface in Emacs Lisp to
3960 avoid certain paradoxical behavior associated with text modification.
3961
3962 If the actual subdivision into intervals is meaningful, that means you
3963 can distinguish between a buffer that is just one interval with a
3964 certain property, and a buffer containing the same text subdivided into
3965 two intervals, both of which have that property.
3966
3967 Suppose you take the buffer with just one interval and kill part of
3968 the text. The text remaining in the buffer is one interval, and the
3969 copy in the kill ring (and the undo list) becomes a separate interval.
3970 Then if you yank back the killed text, you get two intervals with the
3971 same properties. Thus, editing does not preserve the distinction
3972 between one interval and two.
3973
3974 Suppose we ``fix'' this problem by coalescing the two intervals when
3975 the text is inserted. That works fine if the buffer originally was a
3976 single interval. But suppose instead that we have two adjacent
3977 intervals with the same properties, and we kill the text of one interval
3978 and yank it back. The same interval-coalescence feature that rescues
3979 the other case causes trouble in this one: after yanking, we have just
3980 one interval. Once again, editing does not preserve the distinction
3981 between one interval and two.
3982
3983 Insertion of text at the border between intervals also raises
3984 questions that have no satisfactory answer.
3985
3986 However, it is easy to arrange for editing to behave consistently
3987 for questions of the form, ``What are the properties of text at this
3988 buffer or string position?'' So we have decided these are the only
3989 questions that make sense; we have not implemented asking questions
3990 about where intervals start or end.
3991
3992 In practice, you can usually use the text property search functions in
3993 place of explicit interval boundaries. You can think of them as finding
3994 the boundaries of intervals, assuming that intervals are always
3995 coalesced whenever possible. @xref{Property Search}.
3996
3997 Emacs also provides explicit intervals as a presentation feature; see
3998 @ref{Overlays}.
3999
4000 @node Substitution
4001 @section Substituting for a Character Code
4002 @cindex replace characters in region
4003 @cindex substitute characters
4004
4005 The following functions replace characters within a specified region
4006 based on their character codes.
4007
4008 @defun subst-char-in-region start end old-char new-char &optional noundo
4009 @cindex replace characters
4010 This function replaces all occurrences of the character @var{old-char}
4011 with the character @var{new-char} in the region of the current buffer
4012 defined by @var{start} and @var{end}.
4013
4014 @cindex undo avoidance
4015 If @var{noundo} is non-@code{nil}, then @code{subst-char-in-region} does
4016 not record the change for undo and does not mark the buffer as modified.
4017 This was useful for controlling the old selective display feature
4018 (@pxref{Selective Display}).
4019
4020 @code{subst-char-in-region} does not move point and returns
4021 @code{nil}.
4022
4023 @example
4024 @group
4025 ---------- Buffer: foo ----------
4026 This is the contents of the buffer before.
4027 ---------- Buffer: foo ----------
4028 @end group
4029
4030 @group
4031 (subst-char-in-region 1 20 ?i ?X)
4032 @result{} nil
4033
4034 ---------- Buffer: foo ----------
4035 ThXs Xs the contents of the buffer before.
4036 ---------- Buffer: foo ----------
4037 @end group
4038 @end example
4039 @end defun
4040
4041 @deffn Command translate-region start end table
4042 This function applies a translation table to the characters in the
4043 buffer between positions @var{start} and @var{end}.
4044
4045 The translation table @var{table} is a string or a char-table;
4046 @code{(aref @var{table} @var{ochar})} gives the translated character
4047 corresponding to @var{ochar}. If @var{table} is a string, any
4048 characters with codes larger than the length of @var{table} are not
4049 altered by the translation.
4050
4051 The return value of @code{translate-region} is the number of
4052 characters that were actually changed by the translation. This does
4053 not count characters that were mapped into themselves in the
4054 translation table.
4055 @end deffn
4056
4057 @node Registers
4058 @section Registers
4059 @cindex registers
4060
4061 A register is a sort of variable used in Emacs editing that can hold a
4062 variety of different kinds of values. Each register is named by a
4063 single character. All @acronym{ASCII} characters and their meta variants
4064 (but with the exception of @kbd{C-g}) can be used to name registers.
4065 Thus, there are 255 possible registers. A register is designated in
4066 Emacs Lisp by the character that is its name.
4067
4068 @defvar register-alist
4069 This variable is an alist of elements of the form @code{(@var{name} .
4070 @var{contents})}. Normally, there is one element for each Emacs
4071 register that has been used.
4072
4073 The object @var{name} is a character (an integer) identifying the
4074 register.
4075 @end defvar
4076
4077 The @var{contents} of a register can have several possible types:
4078
4079 @table @asis
4080 @item a number
4081 A number stands for itself. If @code{insert-register} finds a number
4082 in the register, it converts the number to decimal.
4083
4084 @item a marker
4085 A marker represents a buffer position to jump to.
4086
4087 @item a string
4088 A string is text saved in the register.
4089
4090 @item a rectangle
4091 A rectangle is represented by a list of strings.
4092
4093 @item @code{(@var{window-configuration} @var{position})}
4094 This represents a window configuration to restore in one frame, and a
4095 position to jump to in the current buffer.
4096
4097 @c FIXME: Mention frameset here.
4098 @item @code{(@var{frame-configuration} @var{position})}
4099 This represents a frame configuration to restore, and a position
4100 to jump to in the current buffer.
4101
4102 @item (file @var{filename})
4103 This represents a file to visit; jumping to this value visits file
4104 @var{filename}.
4105
4106 @item (file-query @var{filename} @var{position})
4107 This represents a file to visit and a position in it; jumping to this
4108 value visits file @var{filename} and goes to buffer position
4109 @var{position}. Restoring this type of position asks the user for
4110 confirmation first.
4111 @end table
4112
4113 The functions in this section return unpredictable values unless
4114 otherwise stated.
4115
4116 @defun get-register reg
4117 This function returns the contents of the register
4118 @var{reg}, or @code{nil} if it has no contents.
4119 @end defun
4120
4121 @defun set-register reg value
4122 This function sets the contents of register @var{reg} to @var{value}.
4123 A register can be set to any value, but the other register functions
4124 expect only certain data types. The return value is @var{value}.
4125 @end defun
4126
4127 @deffn Command view-register reg
4128 This command displays what is contained in register @var{reg}.
4129 @end deffn
4130
4131 @deffn Command insert-register reg &optional beforep
4132 This command inserts contents of register @var{reg} into the current
4133 buffer.
4134
4135 Normally, this command puts point before the inserted text, and the
4136 mark after it. However, if the optional second argument @var{beforep}
4137 is non-@code{nil}, it puts the mark before and point after.
4138
4139 When called interactively, the command defaults to putting point after
4140 text, and a prefix argument inverts this behavior.
4141
4142 If the register contains a rectangle, then the rectangle is inserted
4143 with its upper left corner at point. This means that text is inserted
4144 in the current line and underneath it on successive lines.
4145
4146 If the register contains something other than saved text (a string) or
4147 a rectangle (a list), currently useless things happen. This may be
4148 changed in the future.
4149 @end deffn
4150
4151 @defun register-read-with-preview prompt
4152 @cindex register preview
4153 This function reads and returns a register name, prompting with
4154 @var{prompt} and possibly showing a preview of the existing registers
4155 and their contents. The preview is shown in a temporary window, after
4156 the delay specified by the user option @code{register-preview-delay},
4157 if its value and @code{register-alist} are both non-@code{nil}. The
4158 preview is also shown if the user requests help (e.g., by typing the
4159 help character). We recommend that all interactive commands which
4160 read register names use this function.
4161 @end defun
4162
4163 @node Transposition
4164 @section Transposition of Text
4165
4166 This function can be used to transpose stretches of text:
4167
4168 @defun transpose-regions start1 end1 start2 end2 &optional leave-markers
4169 This function exchanges two nonoverlapping portions of the buffer.
4170 Arguments @var{start1} and @var{end1} specify the bounds of one portion
4171 and arguments @var{start2} and @var{end2} specify the bounds of the
4172 other portion.
4173
4174 Normally, @code{transpose-regions} relocates markers with the transposed
4175 text; a marker previously positioned within one of the two transposed
4176 portions moves along with that portion, thus remaining between the same
4177 two characters in their new position. However, if @var{leave-markers}
4178 is non-@code{nil}, @code{transpose-regions} does not do this---it leaves
4179 all markers unrelocated.
4180 @end defun
4181
4182 @node Decompression
4183 @section Dealing With Compressed Data
4184
4185 When @code{auto-compression-mode} is enabled, Emacs automatically
4186 uncompresses compressed files when you visit them, and automatically
4187 recompresses them if you alter and save them. @xref{Compressed
4188 Files,,, emacs, The GNU Emacs Manual}.
4189
4190 The above feature works by calling an external executable (e.g.,
4191 @command{gzip}). Emacs can also be compiled with support for built-in
4192 decompression using the zlib library, which is faster than calling an
4193 external program.
4194
4195 @defun zlib-available-p
4196 This function returns non-@code{nil} if built-in zlib decompression is
4197 available.
4198 @end defun
4199
4200 @defun zlib-decompress-region start end
4201 This function decompresses the region between @var{start} and
4202 @var{end}, using built-in zlib decompression. The region should
4203 contain data that were compressed with gzip or zlib. On success, the
4204 function replaces the contents of the region with the decompressed
4205 data. On failure, the function leaves the region unchanged and
4206 returns @code{nil}. This function can be called only in unibyte
4207 buffers.
4208 @end defun
4209
4210
4211 @node Base 64
4212 @section Base 64 Encoding
4213 @cindex base 64 encoding
4214
4215 Base 64 code is used in email to encode a sequence of 8-bit bytes as
4216 a longer sequence of @acronym{ASCII} graphic characters. It is defined in
4217 Internet RFC@footnote{
4218 An RFC, an acronym for @dfn{Request for Comments}, is a numbered
4219 Internet informational document describing a standard. RFCs are
4220 usually written by technical experts acting on their own initiative,
4221 and are traditionally written in a pragmatic, experience-driven
4222 manner.
4223 }2045. This section describes the functions for
4224 converting to and from this code.
4225
4226 @deffn Command base64-encode-region beg end &optional no-line-break
4227 This function converts the region from @var{beg} to @var{end} into base
4228 64 code. It returns the length of the encoded text. An error is
4229 signaled if a character in the region is multibyte, i.e., in a
4230 multibyte buffer the region must contain only characters from the
4231 charsets @code{ascii}, @code{eight-bit-control} and
4232 @code{eight-bit-graphic}.
4233
4234 Normally, this function inserts newline characters into the encoded
4235 text, to avoid overlong lines. However, if the optional argument
4236 @var{no-line-break} is non-@code{nil}, these newlines are not added, so
4237 the output is just one long line.
4238 @end deffn
4239
4240 @defun base64-encode-string string &optional no-line-break
4241 This function converts the string @var{string} into base 64 code. It
4242 returns a string containing the encoded text. As for
4243 @code{base64-encode-region}, an error is signaled if a character in the
4244 string is multibyte.
4245
4246 Normally, this function inserts newline characters into the encoded
4247 text, to avoid overlong lines. However, if the optional argument
4248 @var{no-line-break} is non-@code{nil}, these newlines are not added, so
4249 the result string is just one long line.
4250 @end defun
4251
4252 @deffn Command base64-decode-region beg end
4253 This function converts the region from @var{beg} to @var{end} from base
4254 64 code into the corresponding decoded text. It returns the length of
4255 the decoded text.
4256
4257 The decoding functions ignore newline characters in the encoded text.
4258 @end deffn
4259
4260 @defun base64-decode-string string
4261 This function converts the string @var{string} from base 64 code into
4262 the corresponding decoded text. It returns a unibyte string containing the
4263 decoded text.
4264
4265 The decoding functions ignore newline characters in the encoded text.
4266 @end defun
4267
4268 @node Checksum/Hash
4269 @section Checksum/Hash
4270 @cindex MD5 checksum
4271 @cindex SHA hash
4272 @cindex hash, cryptographic
4273 @cindex cryptographic hash
4274
4275 Emacs has built-in support for computing @dfn{cryptographic hashes}.
4276 A cryptographic hash, or @dfn{checksum}, is a digital ``fingerprint''
4277 of a piece of data (e.g., a block of text) which can be used to check
4278 that you have an unaltered copy of that data.
4279
4280 @cindex message digest
4281 Emacs supports several common cryptographic hash algorithms: MD5,
4282 SHA-1, SHA-2, SHA-224, SHA-256, SHA-384 and SHA-512. MD5 is the
4283 oldest of these algorithms, and is commonly used in @dfn{message
4284 digests} to check the integrity of messages transmitted over a
4285 network. MD5 is not ``collision resistant'' (i.e., it is possible to
4286 deliberately design different pieces of data which have the same MD5
4287 hash), so you should not used it for anything security-related. A
4288 similar theoretical weakness also exists in SHA-1. Therefore, for
4289 security-related applications you should use the other hash types,
4290 such as SHA-2.
4291
4292 @defun secure-hash algorithm object &optional start end binary
4293 This function returns a hash for @var{object}. The argument
4294 @var{algorithm} is a symbol stating which hash to compute: one of
4295 @code{md5}, @code{sha1}, @code{sha224}, @code{sha256}, @code{sha384}
4296 or @code{sha512}. The argument @var{object} should be a buffer or a
4297 string.
4298
4299 The optional arguments @var{start} and @var{end} are character
4300 positions specifying the portion of @var{object} to compute the
4301 message digest for. If they are @code{nil} or omitted, the hash is
4302 computed for the whole of @var{object}.
4303
4304 If the argument @var{binary} is omitted or @code{nil}, the function
4305 returns the @dfn{text form} of the hash, as an ordinary Lisp string.
4306 If @var{binary} is non-@code{nil}, it returns the hash in @dfn{binary
4307 form}, as a sequence of bytes stored in a unibyte string.
4308
4309 This function does not compute the hash directly from the internal
4310 representation of @var{object}'s text (@pxref{Text Representations}).
4311 Instead, it encodes the text using a coding system (@pxref{Coding
4312 Systems}), and computes the hash from that encoded text. If
4313 @var{object} is a buffer, the coding system used is the one which
4314 would be chosen by default for writing the text into a file. If
4315 @var{object} is a string, the user's preferred coding system is used
4316 (@pxref{Recognize Coding,,, emacs, GNU Emacs Manual}).
4317 @end defun
4318
4319 @defun md5 object &optional start end coding-system noerror
4320 This function returns an MD5 hash. It is semi-obsolete, since for
4321 most purposes it is equivalent to calling @code{secure-hash} with
4322 @code{md5} as the @var{algorithm} argument. The @var{object},
4323 @var{start} and @var{end} arguments have the same meanings as in
4324 @code{secure-hash}.
4325
4326 If @var{coding-system} is non-@code{nil}, it specifies a coding system
4327 to use to encode the text; if omitted or @code{nil}, the default
4328 coding system is used, like in @code{secure-hash}.
4329
4330 Normally, @code{md5} signals an error if the text can't be encoded
4331 using the specified or chosen coding system. However, if
4332 @var{noerror} is non-@code{nil}, it silently uses @code{raw-text}
4333 coding instead.
4334 @end defun
4335
4336 @node Parsing HTML/XML
4337 @section Parsing HTML and XML
4338 @cindex parsing html
4339
4340 When Emacs is compiled with libxml2 support, the following functions
4341 are available to parse HTML or XML text into Lisp object trees.
4342
4343 @defun libxml-parse-html-region start end &optional base-url discard-comments
4344 This function parses the text between @var{start} and @var{end} as
4345 HTML, and returns a list representing the HTML @dfn{parse tree}. It
4346 attempts to handle ``real world'' HTML by robustly coping with syntax
4347 mistakes.
4348
4349 The optional argument @var{base-url}, if non-@code{nil}, should be a
4350 string specifying the base URL for relative URLs occurring in links.
4351
4352 If the optional argument @var{discard-comments} is non-@code{nil},
4353 then the parse tree is created without any comments.
4354
4355 In the parse tree, each HTML node is represented by a list in which
4356 the first element is a symbol representing the node name, the second
4357 element is an alist of node attributes, and the remaining elements are
4358 the subnodes.
4359
4360 The following example demonstrates this. Given this (malformed) HTML
4361 document:
4362
4363 @example
4364 <html><head></head><body width=101><div class=thing>Foo<div>Yes
4365 @end example
4366
4367 @noindent
4368 A call to @code{libxml-parse-html-region} returns this @acronym{DOM}
4369 (document object model):
4370
4371 @example
4372 (html nil
4373 (head nil)
4374 (body ((width . "101"))
4375 (div ((class . "thing"))
4376 "Foo"
4377 (div nil
4378 "Yes"))))
4379 @end example
4380 @end defun
4381
4382 @cindex rendering html
4383 @defun shr-insert-document dom
4384 This function renders the parsed HTML in @var{dom} into the current
4385 buffer. The argument @var{dom} should be a list as generated by
4386 @code{libxml-parse-html-region}. This function is, e.g., used by
4387 @ref{Top, EWW,, eww, The Emacs Web Wowser Manual}.
4388 @end defun
4389
4390 @cindex parsing xml
4391 @defun libxml-parse-xml-region start end &optional base-url discard-comments
4392 This function is the same as @code{libxml-parse-html-region}, except
4393 that it parses the text as XML rather than HTML (so it is stricter
4394 about syntax).
4395 @end defun
4396
4397 @menu
4398 * Document Object Model:: Access, manipulate and search the @acronym{DOM}.
4399 @end menu
4400
4401 @node Document Object Model
4402 @subsection Document Object Model
4403 @cindex HTML DOM
4404 @cindex XML DOM
4405 @cindex DOM
4406 @cindex Document Object Model
4407
4408 The @acronym{DOM} returned by @code{libxml-parse-html-region} (and the
4409 other @acronym{XML} parsing functions) is a tree structure where each
4410 node has a node name (called a @dfn{tag}), and optional key/value
4411 @dfn{attribute} list, and then a list of @dfn{child nodes}. The child
4412 nodes are either strings or @acronym{DOM} objects.
4413
4414 @example
4415 (body ((width . "101"))
4416 (div ((class . "thing"))
4417 "Foo"
4418 (div nil
4419 "Yes")))
4420 @end example
4421
4422 @defun dom-node tag &optional attributes &rest children
4423 This function creates a @acronym{DOM} node of type @var{tag}. If
4424 given, @var{attributes} should be a key/value pair list.
4425 If given, @var{children} should be @acronym{DOM} nodes.
4426 @end defun
4427
4428 The following functions can be used to work with this structure. Each
4429 function takes a @acronym{DOM} node, or a list of nodes. In the
4430 latter case, only the first node in the list is used.
4431
4432 Simple accessors:
4433
4434 @table @code
4435 @item dom-tag @var{node}
4436 Return the @dfn{tag} (also called ``node name'') of the node.
4437
4438 @item dom-attr @var{node} @var{attribute}
4439 Return the value of @var{attribute} in the node. A common usage
4440 would be:
4441
4442 @lisp
4443 (dom-attr img 'href)
4444 => "http://fsf.org/logo.png"
4445 @end lisp
4446
4447 @item dom-children @var{node}
4448 Return all the children of the node.
4449
4450 @item dom-non-text-children @var{node}
4451 Return all the non-string children of the node.
4452
4453 @item dom-attributes @var{node}
4454 Return the key/value pair list of attributes of the node.
4455
4456 @item dom-text @var{node}
4457 Return all the textual elements of the node as a concatenated string.
4458
4459 @item dom-texts @var{node}
4460 Return all the textual elements of the node, as well as the textual
4461 elements of all the children of the node, recursively, as a
4462 concatenated string. This function also takes an optional separator
4463 to be inserted between the textual elements.
4464
4465 @item dom-parent @var{dom} @var{node}
4466 Return the parent of @var{node} in @var{dom}.
4467 @end table
4468
4469 The following are functions for altering the @acronym{DOM}.
4470
4471 @table @code
4472 @item dom-set-attribute @var{node} @var{attribute} @var{value}
4473 Set the @var{attribute} of the node to @var{value}.
4474
4475 @item dom-append-child @var{node} @var{child}
4476 Append @var{child} as the last child of @var{node}.
4477
4478 @item dom-add-child-before @var{node} @var{child} @var{before}
4479 Add @var{child} to @var{node}'s child list before the @var{before}
4480 node. If @var{before} is @code{nil}, make @var{child} the first child.
4481
4482 @item dom-set-attributes @var{node} @var{attributes}
4483 Replace all the attributes of the node with a new key/value list.
4484 @end table
4485
4486 The following are functions for searching for elements in the
4487 @acronym{DOM}. They all return lists of matching nodes.
4488
4489 @table @code
4490 @item dom-by-tag @var{dom} @var{tag}
4491 Return all nodes in @var{dom} that are of type @var{tag}. A typical
4492 use would be:
4493
4494 @lisp
4495 (dom-by-tag dom 'td)
4496 => '((td ...) (td ...) (td ...))
4497 @end lisp
4498
4499 @item dom-by-class @var{dom} @var{match}
4500 Return all nodes in @var{dom} that have class names that match
4501 @var{match}, which is a regular expression.
4502
4503 @item dom-by-style @var{dom} @var{style}
4504 Return all nodes in @var{dom} that have styles that match @var{match},
4505 which is a regular expression.
4506
4507 @item dom-by-id @var{dom} @var{style}
4508 Return all nodes in @var{dom} that have IDs that match @var{match},
4509 which is a regular expression.
4510
4511 @item dom-strings @var{dom}
4512 Return all strings in @var{DOM}.
4513
4514 @end table
4515
4516 Utility functions:
4517
4518 @table @code
4519 @item dom-pp @var{dom} &optional @var{remove-empty}
4520 Pretty-print @var{dom} at point. If @var{remove-empty}, don't print
4521 textual nodes that just contain white-space.
4522 @end table
4523
4524
4525 @node Atomic Changes
4526 @section Atomic Change Groups
4527 @cindex atomic changes
4528
4529 In database terminology, an @dfn{atomic} change is an indivisible
4530 change---it can succeed entirely or it can fail entirely, but it
4531 cannot partly succeed. A Lisp program can make a series of changes to
4532 one or several buffers as an @dfn{atomic change group}, meaning that
4533 either the entire series of changes will be installed in their buffers
4534 or, in case of an error, none of them will be.
4535
4536 To do this for one buffer, the one already current, simply write a
4537 call to @code{atomic-change-group} around the code that makes the
4538 changes, like this:
4539
4540 @example
4541 (atomic-change-group
4542 (insert foo)
4543 (delete-region x y))
4544 @end example
4545
4546 @noindent
4547 If an error (or other nonlocal exit) occurs inside the body of
4548 @code{atomic-change-group}, it unmakes all the changes in that buffer
4549 that were during the execution of the body. This kind of change group
4550 has no effect on any other buffers---any such changes remain.
4551
4552 If you need something more sophisticated, such as to make changes in
4553 various buffers constitute one atomic group, you must directly call
4554 lower-level functions that @code{atomic-change-group} uses.
4555
4556 @defun prepare-change-group &optional buffer
4557 This function sets up a change group for buffer @var{buffer}, which
4558 defaults to the current buffer. It returns a ``handle'' that
4559 represents the change group. You must use this handle to activate the
4560 change group and subsequently to finish it.
4561 @end defun
4562
4563 To use the change group, you must @dfn{activate} it. You must do
4564 this before making any changes in the text of @var{buffer}.
4565
4566 @defun activate-change-group handle
4567 This function activates the change group that @var{handle} designates.
4568 @end defun
4569
4570 After you activate the change group, any changes you make in that
4571 buffer become part of it. Once you have made all the desired changes
4572 in the buffer, you must @dfn{finish} the change group. There are two
4573 ways to do this: you can either accept (and finalize) all the changes,
4574 or cancel them all.
4575
4576 @defun accept-change-group handle
4577 This function accepts all the changes in the change group specified by
4578 @var{handle}, making them final.
4579 @end defun
4580
4581 @defun cancel-change-group handle
4582 This function cancels and undoes all the changes in the change group
4583 specified by @var{handle}.
4584 @end defun
4585
4586 Your code should use @code{unwind-protect} to make sure the group is
4587 always finished. The call to @code{activate-change-group} should be
4588 inside the @code{unwind-protect}, in case the user types @kbd{C-g}
4589 just after it runs. (This is one reason why
4590 @code{prepare-change-group} and @code{activate-change-group} are
4591 separate functions, because normally you would call
4592 @code{prepare-change-group} before the start of that
4593 @code{unwind-protect}.) Once you finish the group, don't use the
4594 handle again---in particular, don't try to finish the same group
4595 twice.
4596
4597 To make a multibuffer change group, call @code{prepare-change-group}
4598 once for each buffer you want to cover, then use @code{nconc} to
4599 combine the returned values, like this:
4600
4601 @example
4602 (nconc (prepare-change-group buffer-1)
4603 (prepare-change-group buffer-2))
4604 @end example
4605
4606 You can then activate the multibuffer change group with a single call
4607 to @code{activate-change-group}, and finish it with a single call to
4608 @code{accept-change-group} or @code{cancel-change-group}.
4609
4610 Nested use of several change groups for the same buffer works as you
4611 would expect. Non-nested use of change groups for the same buffer
4612 will get Emacs confused, so don't let it happen; the first change
4613 group you start for any given buffer should be the last one finished.
4614
4615 @node Change Hooks
4616 @section Change Hooks
4617 @cindex change hooks
4618 @cindex hooks for text changes
4619
4620 These hook variables let you arrange to take notice of all changes in
4621 all buffers (or in a particular buffer, if you make them buffer-local).
4622 See also @ref{Special Properties}, for how to detect changes to specific
4623 parts of the text.
4624
4625 The functions you use in these hooks should save and restore the match
4626 data if they do anything that uses regular expressions; otherwise, they
4627 will interfere in bizarre ways with the editing operations that call
4628 them.
4629
4630 @defvar before-change-functions
4631 This variable holds a list of functions to call before any buffer
4632 modification. Each function gets two arguments, the beginning and end
4633 of the region that is about to change, represented as integers. The
4634 buffer that is about to change is always the current buffer.
4635 @end defvar
4636
4637 @defvar after-change-functions
4638 This variable holds a list of functions to call after any buffer
4639 modification. Each function receives three arguments: the beginning
4640 and end of the region just changed, and the length of the text that
4641 existed before the change. All three arguments are integers. The
4642 buffer that has been changed is always the current buffer.
4643
4644 The length of the old text is the difference between the buffer
4645 positions before and after that text as it was before the change. As
4646 for the changed text, its length is simply the difference between the
4647 first two arguments.
4648 @end defvar
4649
4650 Output of messages into the @file{*Messages*} buffer does not
4651 call these functions.
4652
4653 @defmac combine-after-change-calls body@dots{}
4654 The macro executes @var{body} normally, but arranges to call the
4655 after-change functions just once for a series of several changes---if
4656 that seems safe.
4657
4658 If a program makes several text changes in the same area of the buffer,
4659 using the macro @code{combine-after-change-calls} around that part of
4660 the program can make it run considerably faster when after-change hooks
4661 are in use. When the after-change hooks are ultimately called, the
4662 arguments specify a portion of the buffer including all of the changes
4663 made within the @code{combine-after-change-calls} body.
4664
4665 @strong{Warning:} You must not alter the values of
4666 @code{after-change-functions} within
4667 the body of a @code{combine-after-change-calls} form.
4668
4669 @strong{Warning:} if the changes you combine occur in widely scattered
4670 parts of the buffer, this will still work, but it is not advisable,
4671 because it may lead to inefficient behavior for some change hook
4672 functions.
4673 @end defmac
4674
4675 @defvar first-change-hook
4676 This variable is a normal hook that is run whenever a buffer is changed
4677 that was previously in the unmodified state.
4678 @end defvar
4679
4680 @defvar inhibit-modification-hooks
4681 If this variable is non-@code{nil}, all of the change hooks are
4682 disabled; none of them run. This affects all the hook variables
4683 described above in this section, as well as the hooks attached to
4684 certain special text properties (@pxref{Special Properties}) and overlay
4685 properties (@pxref{Overlay Properties}).
4686
4687 Also, this variable is bound to non-@code{nil} while running those
4688 same hook variables, so that by default modifying the buffer from
4689 a modification hook does not cause other modification hooks to be run.
4690 If you do want modification hooks to be run in a particular piece of
4691 code that is itself run from a modification hook, then rebind locally
4692 @code{inhibit-modification-hooks} to @code{nil}.
4693 @end defvar