]> code.delx.au - gnu-emacs/blob - doc/emacs/search.texi
Improve and update documentation of search commands
[gnu-emacs] / doc / emacs / search.texi
1 @c This is part of the Emacs manual.
2 @c Copyright (C) 1985-1987, 1993-1995, 1997, 2000-2015 Free Software
3 @c Foundation, Inc.
4 @c See file emacs.texi for copying conditions.
5 @node Search
6 @chapter Searching and Replacement
7 @cindex searching
8 @cindex finding strings within text
9
10 Like other editors, Emacs has commands to search for occurrences of
11 a string. Emacs also has commands to replace occurrences of a string
12 with a different string. There are also commands that do the same
13 thing, but search for patterns instead of fixed strings.
14
15 You can also search multiple files under the control of a tags table
16 (@pxref{Tags Search}) or through the Dired @kbd{A} command
17 (@pxref{Operating on Files}), or ask the @code{grep} program to do it
18 (@pxref{Grep Searching}).
19
20 @menu
21 * Incremental Search:: Search happens as you type the string.
22 * Nonincremental Search:: Specify entire string and then search.
23 * Word Search:: Search for sequence of words.
24 * Symbol Search:: Search for a source code symbol.
25 * Regexp Search:: Search for match for a regexp.
26 * Regexps:: Syntax of regular expressions.
27 * Regexp Backslash:: Regular expression constructs starting with `\'.
28 * Regexp Example:: A complex regular expression explained.
29 * Lax Search:: Search ignores some distinctions between
30 similar characters, like letter-case.
31 * Replace:: Search, and replace some or all matches.
32 * Other Repeating Search:: Operating on all matches for some regexp.
33 * Search Customizations:: Various search customizations.
34 @end menu
35
36 @node Incremental Search
37 @section Incremental Search
38 @cindex incremental search
39 @cindex isearch
40
41 The principal search command in Emacs is @dfn{incremental}: it
42 begins searching as soon as you type the first character of the search
43 string. As you type in the search string, Emacs shows you where the
44 string (as you have typed it so far) would be found. When you have
45 typed enough characters to identify the place you want, you can stop.
46 Depending on what you plan to do next, you may or may not need to
47 terminate the search explicitly with @key{RET}.
48
49 @table @kbd
50 @item C-s
51 Incremental search forward (@code{isearch-forward}).
52 @item C-r
53 Incremental search backward (@code{isearch-backward}).
54 @end table
55
56 @menu
57 * Basic Isearch:: Basic incremental search commands.
58 * Repeat Isearch:: Searching for the same string again.
59 * Isearch Yank:: Commands that grab text into the search string
60 or else edit the search string.
61 * Error in Isearch:: When your string is not found.
62 * Special Isearch:: Special input in incremental search.
63 * Not Exiting Isearch:: Prefix argument and scrolling commands.
64 * Isearch Minibuffer:: Incremental search of the minibuffer history.
65 @end menu
66
67 @node Basic Isearch
68 @subsection Basics of Incremental Search
69
70 @table @kbd
71 @item C-s
72 Begin incremental search (@code{isearch-forward}).
73 @item C-r
74 Begin reverse incremental search (@code{isearch-backward}).
75 @end table
76
77 @kindex C-s
78 @findex isearch-forward
79 @kbd{C-s} (@code{isearch-forward}) starts a forward incremental
80 search. It reads characters from the keyboard, and moves point just
81 past the end of the next occurrence of those characters in the buffer.
82
83 For instance, if you type @kbd{C-s} and then @kbd{F}, that puts the
84 cursor after the first @samp{F} that occurs in the buffer after the
85 starting point. If you then type @kbd{O}, the cursor moves to just
86 after the first @samp{FO}; the @samp{F} in that @samp{FO} might not be
87 the first @samp{F} previously found. After another @kbd{O}, the
88 cursor moves to just after the first @samp{FOO}.
89
90 @cindex faces for highlighting search matches
91 @cindex isearch face
92 At each step, Emacs highlights the @dfn{current match}---the buffer
93 text that matches the search string---using the @code{isearch} face
94 (@pxref{Faces}). @xref{Search Customizations}, for various options
95 that customize this highlighting. The current search string is also
96 displayed in the echo area.
97
98 If you make a mistake typing the search string, type @key{DEL}.
99 Each @key{DEL} cancels the last character of the search string.
100 @xref{Error in Isearch}, for more about dealing with unsuccessful
101 search.
102
103 @cindex exit incremental search
104 @cindex incremental search, exiting
105 When you are satisfied with the place you have reached, type
106 @key{RET}. This stops searching, leaving the cursor where the search
107 brought it. Also, any command not specially meaningful in searches
108 stops the searching and is then executed. Thus, typing @kbd{C-a}
109 exits the search and then moves to the beginning of the line; typing
110 one of the arrow keys exits the search and performs the respective
111 movement command; etc. @key{RET} is necessary only if the next
112 command you want to type is a printing character, @key{DEL},
113 @key{RET}, or another character that is special within searches
114 (@kbd{C-q}, @kbd{C-w}, @kbd{C-r}, @kbd{C-s}, @kbd{C-y}, @kbd{M-y},
115 @kbd{M-r}, @kbd{M-c}, @kbd{M-e}, and some others described below).
116 You can fine-tune the commands that exit the search; see @ref{Not
117 Exiting Isearch}.
118
119 As a special exception, entering @key{RET} when the search string is
120 empty launches nonincremental search (@pxref{Nonincremental Search}).
121 (This can be customized; see @ref{Search Customizations}.)
122
123 To abandon the search and return to the place where you started,
124 type @kbd{@key{ESC} @key{ESC} @key{ESC}} (@code{isearch-cancel}) or
125 @kbd{C-g C-g} (@code{isearch-abort}).
126
127 When you exit the incremental search, it adds the original value of
128 point to the mark ring, without activating the mark; you can thus use
129 @kbd{C-u C-@key{SPC}} or @kbd{C-x C-x} to return to where you were
130 before beginning the search. @xref{Mark Ring}. (Emacs only does this
131 if the mark was not already active; if the mark was active when you
132 started the search, both @kbd{C-u C-@key{SPC}} and @kbd{C-x C-x} will
133 go to the mark.)
134
135 @kindex C-r
136 @findex isearch-backward
137 To search backwards, use @kbd{C-r} (@code{isearch-backward}) instead
138 of @kbd{C-s} to start the search. A backward search finds matches
139 that end before the starting point, just as a forward search finds
140 matches that begin after it.
141
142 @node Repeat Isearch
143 @subsection Repeating Incremental Search
144
145 Suppose you search forward for @samp{FOO} and find a match, but not
146 the one you expected to find: the @samp{FOO} you were aiming for
147 occurs later in the buffer. In this event, type another @kbd{C-s} to
148 move to the next occurrence of the search string. You can repeat this
149 any number of times. If you overshoot, you can cancel some @kbd{C-s}
150 characters with @key{DEL}. Similarly, each @kbd{C-r} in a backward
151 incremental search repeats the backward search.
152
153 @cindex lazy search highlighting
154 If you pause for a little while during incremental search, Emacs
155 highlights all the other possible matches for the search string that
156 are present on the screen. This helps you anticipate where you can
157 get to by typing @kbd{C-s} or @kbd{C-r} to repeat the search. The
158 other matches are highlighted differently from the current match,
159 using the customizable face @code{lazy-highlight} (@pxref{Faces}). If
160 you don't like this feature, you can disable it by setting
161 @code{isearch-lazy-highlight} to @code{nil}. For other customizations
162 related to highlighting matches, see @ref{Search Customizations}.
163
164 After exiting a search, you can search for the same string again by
165 typing just @kbd{C-s C-s}. The first @kbd{C-s} is the key that
166 invokes incremental search, and the second @kbd{C-s} means to search
167 again for the last search string. Similarly, @kbd{C-r C-r} searches
168 backward for the last search string. In determining the last search
169 string, it doesn't matter whether that string was searched for with
170 @kbd{C-s} or @kbd{C-r}.
171
172 If you are searching forward but you realize you were looking for
173 something before the starting point, type @kbd{C-r} to switch to a
174 backward search, leaving the search string unchanged. Similarly,
175 @kbd{C-s} in a backward search switches to a forward search.
176
177 @cindex search, wrapping around
178 @cindex search, overwrapped
179 @cindex wrapped search
180 @cindex overwrapped search
181 If a search is failing and you ask to repeat it by typing another
182 @kbd{C-s}, it starts again from the beginning of the buffer.
183 Repeating a failing reverse search with @kbd{C-r} starts again from
184 the end. This is called @dfn{wrapping around}, and @samp{Wrapped}
185 appears in the search prompt once this has happened. If you keep on
186 going past the original starting point of the search, it changes to
187 @samp{Overwrapped}, which means that you are revisiting matches that
188 you have already seen.
189
190 @cindex search ring
191 @kindex M-n @r{(Incremental search)}
192 @kindex M-p @r{(Incremental search)}
193 @vindex search-ring-max
194 To reuse earlier search strings, use the @dfn{search ring}. The
195 commands @kbd{M-p} and @kbd{M-n} move through the ring to pick a
196 search string to reuse. These commands leave the selected search ring
197 element in the minibuffer, where you can edit it. Type
198 @kbd{C-s}/@kbd{C-r} or @key{RET} to accept the string and start
199 searching for it. The number of most recently used search strings
200 saved in the search ring is specified by the variable
201 @code{search-ring-max}, 16 by default.
202
203 @cindex incremental search, edit search string
204 @cindex interactively edit search string
205 @kindex M-e @r{(Incremental search)}
206 @kindex Mouse-1 @r{in the minibuffer (Incremental Search)}
207 To edit the current search string in the minibuffer without
208 replacing it with items from the search ring, type @kbd{M-e} or click
209 @kbd{Mouse-1} in the minibuffer. Type @key{RET}, @kbd{C-s} or
210 @kbd{C-r} to finish editing the string and search for it. Type
211 @kbd{C-f} or @kbd{@key{RIGHT}} to add to the search string characters
212 following point from the buffer from which you started the search.
213
214 @node Isearch Yank
215 @subsection Isearch Yanking
216
217 In many cases, you will want to use text at or near point as your
218 search string. The commands described in this subsection let you do
219 that conveniently.
220
221 @kindex C-w @r{(Incremental search)}
222 @findex isearch-yank-word-or-char
223 @kbd{C-w} (@code{isearch-yank-word-or-char}) appends the next
224 character or word at point to the search string. This is an easy way
225 to search for another occurrence of the text at point. (The decision
226 of whether to copy a character or a word is heuristic.)
227
228 @kindex M-s C-e @r{(Incremental search)}
229 @findex isearch-yank-line
230 Similarly, @kbd{M-s C-e} (@code{isearch-yank-line}) appends the rest
231 of the current line to the search string. If point is already at the
232 end of a line, it appends the next line. With a prefix argument
233 @var{n}, it appends the next @var{n} lines.
234
235 @kindex C-y @r{(Incremental search)}
236 @kindex M-y @r{(Incremental search)}
237 @kindex Mouse-2 @r{in the minibuffer (Incremental search)}
238 @findex isearch-yank-kill
239 @findex isearch-yank-pop
240 @findex isearch-yank-x-selection
241 Within incremental search, @kbd{C-y} (@code{isearch-yank-kill})
242 appends the current kill to the search string. @kbd{M-y}
243 (@code{isearch-yank-pop}), if called after @kbd{C-y}, replaces that
244 appended text with an earlier kill, similar to the usual @kbd{M-y}
245 (@code{yank-pop}) command (@pxref{Yanking}). Clicking @kbd{Mouse-2}
246 in the echo area appends the current X selection (@pxref{Primary
247 Selection}) to the search string (@code{isearch-yank-x-selection}).
248
249 @kindex C-M-w @r{(Incremental search)}
250 @kindex C-M-y @r{(Incremental search)}
251 @findex isearch-del-char
252 @findex isearch-yank-char
253 @kbd{C-M-w} (@code{isearch-del-char}) deletes the last character
254 from the search string, and @kbd{C-M-y} (@code{isearch-yank-char})
255 appends the character after point to the search string. An
256 alternative method to add the character after point is to enter the
257 minibuffer with @kbd{M-e} (@pxref{Repeat Isearch}) and type @kbd{C-f}
258 or @kbd{@key{RIGHT}} at the end of the search string in the
259 minibuffer. Each @kbd{C-f} or @kbd{@key{RIGHT}} you type adds another
260 character following point to the search string.
261
262 Normally, when the search is case-insensitive, text yanked into the
263 search string is converted to lower case, so that the search remains
264 case-insensitive (@pxref{Lax Search, case folding}). However, if the
265 value of the variable @code{search-upper-case} (@pxref{Lax Search,
266 search-upper-case}) is other than @code{not-yanks}, that disables this
267 down-casing.
268
269 @node Error in Isearch
270 @subsection Errors in Incremental Search
271
272 @cindex isearch-fail face
273 If your string is not found at all, the echo area says @samp{Failing
274 I-Search}, and the cursor moves past the place where Emacs found as
275 much of your string as it could. Thus, if you search for @samp{FOOT},
276 and there is no @samp{FOOT}, you might see the cursor after the
277 @samp{FOO} in @samp{FOOL}. In the echo area, the part of the search
278 string that failed to match is highlighted using the face
279 @code{isearch-fail}.
280
281 At this point, there are several things you can do. If your string
282 was mistyped, you can use @key{DEL} to erase some of it and correct
283 it, or you can type @kbd{M-e} and edit it. If you like the place you
284 have found, you can type @key{RET} to remain there. Or you can type
285 @kbd{C-g}, which removes from the search string the characters that
286 could not be found (the @samp{T} in @samp{FOOT}), leaving those that
287 were found (the @samp{FOO} in @samp{FOOT}). A second @kbd{C-g} at
288 that point cancels the search entirely, returning point to where it
289 was when the search started.
290
291 @cindex quitting (in search)
292 @kindex C-g @r{(Incremental search)}
293 The quit command, @kbd{C-g}, does special things during searches;
294 just what it does depends on the status of the search. If the search
295 has found what you specified and is waiting for input, @kbd{C-g}
296 cancels the entire search, moving the cursor back to where you started
297 the search. If @kbd{C-g} is typed when there are characters in the
298 search string that have not been found---because Emacs is still
299 searching for them, or because it has failed to find them---then the
300 search string characters which have not been found are discarded from
301 the search string. With them gone, the search is now successful and
302 waiting for more input, so a second @kbd{C-g} will cancel the entire
303 search.
304
305 @node Special Isearch
306 @subsection Special Input for Incremental Search
307
308 In addition to characters described in the previous subsections,
309 some of the other characters you type during incremental search have
310 special effects. They are described here.
311
312 To toggle lax space matching (@pxref{Lax Search, lax space
313 matching}), type @kbd{M-s @key{SPC}}.
314
315 To toggle case sensitivity of the search, type @kbd{M-c} or
316 @kbd{M-s c}. @xref{Lax Search, case folding}. If the search string
317 includes upper-case letters, the search is case-sensitive by default.
318
319 To toggle whether or not the search will consider similar and
320 equivalent characters as a match, type @kbd{M-s '}. @xref{Lax Search,
321 character folding}. If the search string includes accented
322 characters, that disables character folding during that search.
323
324 @cindex invisible text, searching for
325 @kindex M-s i @r{(Incremental search)}
326 @findex isearch-toggle-invisible
327 To toggle whether or not invisible text is searched, type
328 @kbd{M-s i} (@code{isearch-toggle-invisible}). @xref{Outline Search}.
329
330 @kindex M-r @r{(Incremental Search)}
331 @kindex M-s r @r{(Incremental Search)}
332 @findex isearch-toggle-regexp
333 To toggle between non-regexp and regexp incremental search, type
334 @kbd{M-r} or @kbd{M-s r} (@code{isearch-toggle-regexp}).
335 @xref{Regexp Search}.
336
337 To toggle symbol mode, type @kbd{M-s _}. @xref{Symbol Search}.
338
339 To search for a newline character, type @kbd{C-j} as part of the
340 search string.
341
342 To search for non-@acronym{ASCII} characters, use one of the
343 following methods:
344
345 @itemize @bullet
346 @item
347 Type @kbd{C-q}, followed by a non-graphic character or a sequence of
348 octal digits. This adds a character to the search string, similar to
349 inserting into a buffer using @kbd{C-q} (@pxref{Inserting Text}). For
350 example, @kbd{C-q C-s} during incremental search adds the
351 @samp{control-S} character to the search string.
352
353 @item
354 Type @kbd{C-x 8 @key{RET}}, followed by a Unicode name or code-point
355 in hex. This adds the specified character into the search string,
356 similar to the usual @code{insert-char} command (@pxref{Inserting
357 Text}).
358
359 @item
360 @kindex C-^ @r{(Incremental Search)}
361 @findex isearch-toggle-input-method
362 @findex isearch-toggle-specified-input-method
363 Use an input method (@pxref{Input Methods}). If an input method is
364 enabled in the current buffer when you start the search, the same
365 method will be active in the minibuffer when you type the search
366 string. While typing the search string, you can toggle the input
367 method with @kbd{C-\} (@code{isearch-toggle-input-method}). You can
368 also turn on a non-default input method with @kbd{C-^}
369 (@code{isearch-toggle-specified-input-method}), which prompts for the
370 name of the input method. When an input method is active during
371 incremental search, the search prompt includes the input method
372 mnemonic, like this:
373
374 @example
375 I-search [@var{im}]:
376 @end example
377
378 @noindent
379 where @var{im} is the mnemonic of the active input method. Any input
380 method you enable during incremental search remains enabled in the
381 current buffer afterwards.
382 @end itemize
383
384 @kindex M-s o @r{(Incremental Search)}
385 @findex isearch-occur
386 Typing @kbd{M-s o} in incremental search invokes
387 @code{isearch-occur}, which runs @code{occur} with the current search
388 string. @xref{Other Repeating Search, occur}.
389
390 @kindex M-% @r{(Incremental search)}
391 Typing @kbd{M-%} in incremental search invokes @code{query-replace}
392 or @code{query-replace-regexp} (depending on search mode) with the
393 current search string used as the string to replace. A negative
394 prefix argument means to replace backward. @xref{Query Replace}.
395
396 @kindex M-TAB @r{(Incremental search)}
397 Typing @kbd{M-@key{TAB}} in incremental search invokes
398 @code{isearch-complete}, which attempts to complete the search string
399 using the search ring (the previous search strings you used) as a list
400 of completion alternatives. @xref{Completion}. In many operating
401 systems, the @kbd{M-@key{TAB}} key sequence is captured by the window
402 manager; you then need to rebind @code{isearch-complete} to another
403 key sequence if you want to use it (@pxref{Rebinding}).
404
405 @kindex M-s h r @r{(Incremental Search)}
406 @findex isearch-highlight-regexp
407 You can exit the search while leaving the matches for the last
408 search string highlighted on display. To this end, type @kbd{M-s h r}
409 (@code{isearch-highlight-regexp}), which will run
410 @code{highlight-regexp} (@pxref{Highlight Interactively}) passing
411 it the regexp derived from the last search string and prompting you
412 for the face to use for highlighting. To remove the highlighting,
413 type @kbd{M-s h u} (@code{unhighlight-regexp}).
414
415 @cindex incremental search, help on special keys
416 @kindex C-h C-h @r{(Incremental Search)}
417 @findex isearch-help-map
418 @vindex isearch-mode-map
419 When incremental search is active, you can type @kbd{C-h C-h}
420 (@code{isearch-help-map}) to access interactive help options,
421 including a list of special key bindings. These key bindings are part
422 of the keymap @code{isearch-mode-map} (@pxref{Keymaps}).
423
424 @node Not Exiting Isearch
425 @subsection Not Exiting Incremental Search
426
427 This subsection describes how to control whether typing a command not
428 specifically meaningful is searches exits the search before executing
429 the command. It also describes two categories of commands which you
430 can type without exiting the current incremental search, even though
431 they are not themselves part of incremental search.
432
433 @vindex search-exit-option
434 Normally, typing a command that is not bound by the incremental
435 search exits the search before executing the command. Thus, the
436 command operates on the buffer from which you invoked the search.
437 However, if you customize the variable @code{search-exit-option} to a
438 @code{nil} value, the characters which you type that are not
439 interpreted by the incremental search are simply appended to the
440 search string. This is so you could include in the search string
441 control characters, such as @kbd{C-a}, that would normally exit the
442 search and invoke the command bound to them on the buffer.
443
444 @table @asis
445 @item Prefix Arguments
446 @cindex prefix argument commands, during incremental search
447 @vindex isearch-allow-prefix
448 In incremental search, when you type a command that specifies a
449 prefix argument (@pxref{Arguments}), by default it will apply either
450 to the next action in the search or to the command that exits the
451 search. In other words, entering a prefix argument will not by itself
452 terminate the search.
453
454 In previous versions of Emacs, entering a prefix argument always
455 terminated the search. You can revert to this behavior by setting the
456 variable @code{isearch-allow-prefix} to @code{nil}.
457
458 When @code{isearch-allow-scroll} is non-@code{nil} (see below),
459 prefix arguments always have the default behavior described above,
460 i.e., they don't terminate the search, even if
461 @code{isearch-allow-prefix} is @code{nil}.
462
463 @item Scrolling Commands
464 @cindex scrolling commands, during incremental search
465 @vindex isearch-allow-scroll
466 Normally, scrolling commands exit incremental search. If you change
467 the variable @code{isearch-allow-scroll} to a non-@code{nil} value,
468 that enables the use of the scroll-bar, as well as keyboard scrolling
469 commands like @kbd{C-v}, @kbd{M-v}, and @kbd{C-l} (@pxref{Scrolling}).
470 This applies only to calling these commands via their bound key
471 sequences---typing @kbd{M-x} will still exit the search. You can give
472 prefix arguments to these commands in the usual way. This feature
473 won't let you scroll the current match out of visibility, however.
474
475 The @code{isearch-allow-scroll} feature also affects some other
476 commands, such as @kbd{C-x 2} (@code{split-window-below}) and
477 @kbd{C-x ^} (@code{enlarge-window}), which don't exactly scroll but do
478 affect where the text appears on the screen. It applies to any
479 command whose name has a non-@code{nil} @code{isearch-scroll}
480 property. So you can control which commands are affected by changing
481 these properties.
482
483 @cindex prevent commands from exiting incremental search
484 For example, to make @kbd{C-h l} usable within an incremental search
485 in all future Emacs sessions, use @kbd{C-h c} to find what command it
486 runs (@pxref{Key Help}), which is @code{view-lossage}. Then you can
487 put the following line in your init file (@pxref{Init File}):
488
489 @example
490 (put 'view-lossage 'isearch-scroll t)
491 @end example
492
493 @noindent
494 This feature can be applied to any command that doesn't permanently
495 change point, the buffer contents, the match data, the current buffer,
496 or the selected window and frame. The command must not itself attempt
497 an incremental search. This feature is disabled if
498 @code{isearch-allow-scroll} is @code{nil} (which it is by default).
499 @end table
500
501 @node Isearch Minibuffer
502 @subsection Searching the Minibuffer
503 @cindex minibuffer history, searching
504
505 If you start an incremental search while the minibuffer is active,
506 Emacs searches the contents of the minibuffer. Unlike searching an
507 ordinary buffer, the search string is not shown in the echo area,
508 because that is used to display the minibuffer.
509
510 If an incremental search fails in the minibuffer, it tries searching
511 the minibuffer history. @xref{Minibuffer History}. You can visualize
512 the minibuffer and its history as a series of pages, with the
513 earliest history element on the first page and the current minibuffer
514 on the last page. A forward search, @kbd{C-s}, searches forward to
515 later pages; a reverse search, @kbd{C-r}, searches backwards to
516 earlier pages. Like in ordinary buffer search, a failing search can
517 wrap around, going from the last page to the first page or vice versa.
518
519 When the current match is on a history element, that history element
520 is pulled into the minibuffer. If you exit the incremental search
521 normally (e.g., by typing @key{RET}), it remains in the minibuffer
522 afterwards. Canceling the search, with @kbd{C-g}, restores the
523 contents of the minibuffer when you began the search.
524
525 @node Nonincremental Search
526 @section Nonincremental Search
527 @cindex nonincremental search
528
529 Emacs also has conventional nonincremental search commands, which require
530 you to type the entire search string before searching begins.
531
532 @table @kbd
533 @item C-s @key{RET} @var{string} @key{RET}
534 Search for @var{string}.
535 @item C-r @key{RET} @var{string} @key{RET}
536 Search backward for @var{string}.
537 @end table
538
539 To start a nonincremental search, first type @kbd{C-s @key{RET}}.
540 This enters the minibuffer to read the search string; terminate the
541 string with @key{RET}, and then the search takes place. If the string
542 is not found, the search command signals an error.
543
544 @findex search-forward
545 @findex search-backward
546 When you type @kbd{C-s @key{RET}}, the @kbd{C-s} invokes incremental
547 search as usual. That command is specially programmed to invoke the
548 command for nonincremental search, @code{search-forward}, if the
549 string you specify is empty. (Such an empty argument would otherwise
550 be useless.) @kbd{C-r @key{RET}} does likewise, invoking the command
551 @code{search-backward}.
552
553 @node Word Search
554 @section Word Search
555 @cindex word search
556
557 A @dfn{word search} finds a sequence of words without regard to the
558 type of punctuation between them. For instance, if you enter a search
559 string that consists of two words separated by a single space, the
560 search matches any sequence of those two words separated by one or
561 more spaces, newlines, or other punctuation characters. This is
562 particularly useful for searching text documents, because you don't
563 have to worry whether the words you are looking for are separated by
564 newlines or spaces. Note that major modes for programming languages
565 or other specialized modes can modify the definition of a word to suit
566 their syntactic needs.
567
568 @table @kbd
569 @item M-s w
570 If incremental search is active, toggle word search mode
571 (@code{isearch-toggle-word}); otherwise, begin an incremental forward
572 word search (@code{isearch-forward-word}).
573 @item M-s w @key{RET} @var{words} @key{RET}
574 Search for @var{words}, using a forward nonincremental word search.
575 @item M-s w C-r @key{RET} @var{words} @key{RET}
576 Search backward for @var{words}, using a nonincremental word search.
577 @end table
578
579 @kindex M-s w
580 @findex isearch-forward-word
581 To begin a forward incremental word search, type @kbd{M-s w}. If
582 incremental search is not already active, this runs the command
583 @code{isearch-forward-word}. If incremental search is already active
584 (whether a forward or backward search), @kbd{M-s w} switches to a word
585 search while keeping the direction of the search and the current
586 search string unchanged. You can toggle word search back off by
587 typing @kbd{M-s w} again.
588
589 @findex word-search-forward
590 @findex word-search-backward
591 To begin a nonincremental word search, type @kbd{M-s w @key{RET}}
592 for a forward search, or @kbd{M-s w C-r @key{RET}} for a backward search.
593 These run the commands @code{word-search-forward} and
594 @code{word-search-backward} respectively.
595
596 Incremental and nonincremental word searches differ slightly in the
597 way they find a match. In a nonincremental word search, each word in
598 the search string must exactly match a whole word. In an incremental
599 word search, the matching is more lax: while you are typing the search
600 string, its first and last words need not match whole words. This is
601 so that the matching can proceed incrementally as you type. This
602 additional laxity does not apply to the lazy highlight
603 (@pxref{Incremental Search}), which always matches whole words.
604
605 @kindex M-s M-w
606 @findex eww-search-word
607 @vindex eww-search-prefix
608 Search the Web for the text in region. This command performs an
609 Internet search for the words in region using the search engine whose
610 @acronym{URL} is specified by the variable @code{eww-search-prefix}.
611 @xref{Basics, EWW, , eww, The Emacs Web Wowser Manual}.
612
613 @node Symbol Search
614 @section Symbol Search
615 @cindex symbol search
616
617 A @dfn{symbol search} is much like an ordinary search, except that
618 the boundaries of the search must match the boundaries of a symbol.
619 The meaning of @dfn{symbol} in this context depends on the major mode,
620 and usually refers to a source code token, such as a Lisp symbol in
621 Emacs Lisp mode. For instance, if you perform an incremental symbol
622 search for the Lisp symbol @code{forward-word}, it would not match
623 @code{isearch-forward-word}. This feature is thus mainly useful for
624 searching source code.
625
626 @table @kbd
627 @item M-s _
628 @findex isearch-toggle-symbol
629 If incremental search is active, toggle symbol search mode
630 (@code{isearch-toggle-symbol}); otherwise, begin an incremental
631 forward symbol search (@code{isearch-forward-symbol}).
632 @item M-s .
633 Start a symbol incremental search forward with the symbol found near
634 point added to the search string initially.
635 @item M-s _ @key{RET} @var{symbol} @key{RET}
636 Search forward for @var{symbol}, nonincrementally.
637 @item M-s _ C-r @key{RET} @var{symbol} @key{RET}
638 Search backward for @var{symbol}, nonincrementally.
639 @end table
640
641 @kindex M-s _
642 @kindex M-s .
643 @findex isearch-forward-symbol
644 @findex isearch-forward-symbol-at-point
645 To begin a forward incremental symbol search, type @kbd{M-s _} (or
646 @kbd{M-s .} if the symbol to search is near point). If incremental
647 search is not already active, this runs the command
648 @code{isearch-forward-symbol}. If incremental search is already
649 active, @kbd{M-s _} switches to a symbol search, preserving the
650 direction of the search and the current search string; you can disable
651 symbol search by typing @kbd{M-s _} again. In incremental symbol
652 search, only the beginning of the search string is required to match
653 the beginning of a symbol.
654
655 To begin a nonincremental symbol search, type @kbd{M-s _ @key{RET}}
656 for a forward search, or @kbd{M-s _ C-r @key{RET}} or a backward
657 search. In nonincremental symbol searches, the beginning and end of
658 the search string are required to match the beginning and end of a
659 symbol, respectively.
660
661 @node Regexp Search
662 @section Regular Expression Search
663 @cindex regexp search
664 @cindex search for a regular expression
665
666 A @dfn{regular expression} (or @dfn{regexp} for short) is a pattern
667 that denotes a class of alternative strings to match. Emacs
668 provides both incremental and nonincremental ways to search for a
669 match for a regexp. The syntax of regular expressions is explained in
670 the next section.
671
672 @table @kbd
673 @item C-M-s
674 Begin incremental regexp search (@code{isearch-forward-regexp}).
675 @item C-M-r
676 Begin reverse incremental regexp search (@code{isearch-backward-regexp}).
677 @end table
678
679 @kindex C-M-s
680 @findex isearch-forward-regexp
681 @kindex C-M-r
682 @findex isearch-backward-regexp
683 Incremental search for a regexp is done by typing @kbd{C-M-s}
684 (@code{isearch-forward-regexp}), by invoking @kbd{C-s} with a
685 prefix argument (whose value does not matter), or by typing @kbd{M-r}
686 within a forward incremental search. This command reads a
687 search string incrementally just like @kbd{C-s}, but it treats the
688 search string as a regexp rather than looking for an exact match
689 against the text in the buffer. Each time you add text to the search
690 string, you make the regexp longer, and the new regexp is searched
691 for. To search backward for a regexp, use @kbd{C-M-r}
692 (@code{isearch-backward-regexp}), @kbd{C-r} with a prefix argument,
693 or @kbd{M-r} within a backward incremental search.
694
695 @vindex regexp-search-ring-max
696 All of the special key sequences in an ordinary incremental search
697 (@pxref{Special Isearch}) do similar things in an incremental regexp
698 search. For instance, typing @kbd{C-s} immediately after starting the
699 search retrieves the last incremental search regexp used and searches
700 forward for it. Incremental regexp and non-regexp searches have
701 independent defaults. They also have separate search rings, which you
702 can access with @kbd{M-p} and @kbd{M-n}. The maximum number of search
703 regexps saved in the search ring is determined by the value of
704 @code{regexp-search-ring-max}, 16 by default.
705
706 Unlike ordinary incremental search, incremental regexp search
707 do not use lax space matching by default. To toggle this feature
708 use @kbd{M-s @key{SPC}} (@code{isearch-toggle-lax-whitespace}).
709 Then any @key{SPC} typed in incremental regexp search will match
710 any sequence of one or more whitespace characters. The variable
711 @code{search-whitespace-regexp} specifies the regexp for the lax
712 space matching. @xref{Special Isearch}.
713
714 In some cases, adding characters to the regexp in an incremental
715 regexp search can make the cursor move back and start again. For
716 example, if you have searched for @samp{foo} and you add @samp{\|bar},
717 the cursor backs up in case the first @samp{bar} precedes the first
718 @samp{foo}. @xref{Regexps}.
719
720 Forward and backward regexp search are not symmetrical, because
721 regexp matching in Emacs always operates forward, starting with the
722 beginning of the regexp. Thus, forward regexp search scans forward,
723 trying a forward match at each possible starting position. Backward
724 regexp search scans backward, trying a forward match at each possible
725 starting position. These search methods are not mirror images.
726
727 @findex re-search-forward
728 @findex re-search-backward
729 Nonincremental search for a regexp is done with the commands
730 @code{re-search-forward} and @code{re-search-backward}. You can
731 invoke these with @kbd{M-x}, or by way of incremental regexp search
732 with @kbd{C-M-s @key{RET}} and @kbd{C-M-r @key{RET}}.
733
734 If you use the incremental regexp search commands with a prefix
735 argument, they perform ordinary string search, like
736 @code{isearch-forward} and @code{isearch-backward}. @xref{Incremental
737 Search}.
738
739 @node Regexps
740 @section Syntax of Regular Expressions
741 @cindex syntax of regexps
742 @cindex regular expression
743 @cindex regexp
744
745 This manual describes regular expression features that users
746 typically use. @xref{Regular Expressions,,, elisp, The Emacs Lisp
747 Reference Manual}, for additional features used mainly in Lisp
748 programs.
749
750 Regular expressions have a syntax in which a few characters are
751 special constructs and the rest are @dfn{ordinary}. An ordinary
752 character matches that same character and nothing else. The special
753 characters are @samp{$^.*+?[\}. The character @samp{]} is special if
754 it ends a character alternative (see later). The character @samp{-}
755 is special inside a character alternative. Any other character
756 appearing in a regular expression is ordinary, unless a @samp{\}
757 precedes it. (When you use regular expressions in a Lisp program,
758 each @samp{\} must be doubled, see the example near the end of this
759 section.)
760
761 For example, @samp{f} is not a special character, so it is ordinary, and
762 therefore @samp{f} is a regular expression that matches the string
763 @samp{f} and no other string. (It does @emph{not} match the string
764 @samp{ff}.) Likewise, @samp{o} is a regular expression that matches
765 only @samp{o}. (When case distinctions are being ignored, these regexps
766 also match @samp{F} and @samp{O}, but we consider this a generalization
767 of ``the same string'', rather than an exception.)
768
769 Any two regular expressions @var{a} and @var{b} can be concatenated.
770 The result is a regular expression which matches a string if @var{a}
771 matches some amount of the beginning of that string and @var{b}
772 matches the rest of the string. For example, concatenating the
773 regular expressions @samp{f} and @samp{o} gives the regular expression
774 @samp{fo}, which matches only the string @samp{fo}. Still trivial.
775 To do something nontrivial, you need to use one of the special
776 characters. Here is a list of them.
777
778 @table @asis
779 @item @kbd{.}@: @r{(Period)}
780 is a special character that matches any single character except a
781 newline. For example, the regular expressions @samp{a.b} matches any
782 three-character string that begins with @samp{a} and ends with
783 @samp{b}.
784
785 @item @kbd{*}
786 is not a construct by itself; it is a postfix operator that means to
787 match the preceding regular expression repetitively any number of
788 times, as many times as possible. Thus, @samp{o*} matches any number
789 of @samp{o}s, including no @samp{o}s.
790
791 @samp{*} always applies to the @emph{smallest} possible preceding
792 expression. Thus, @samp{fo*} has a repeating @samp{o}, not a repeating
793 @samp{fo}. It matches @samp{f}, @samp{fo}, @samp{foo}, and so on.
794
795 The matcher processes a @samp{*} construct by matching, immediately,
796 as many repetitions as can be found. Then it continues with the rest
797 of the pattern. If that fails, backtracking occurs, discarding some
798 of the matches of the @samp{*}-modified construct in case that makes
799 it possible to match the rest of the pattern. For example, in matching
800 @samp{ca*ar} against the string @samp{caaar}, the @samp{a*} first
801 tries to match all three @samp{a}s; but the rest of the pattern is
802 @samp{ar} and there is only @samp{r} left to match, so this try fails.
803 The next alternative is for @samp{a*} to match only two @samp{a}s.
804 With this choice, the rest of the regexp matches successfully.
805
806 @item @kbd{+}
807 is a postfix operator, similar to @samp{*} except that it must match
808 the preceding expression at least once. Thus, @samp{ca+r} matches the
809 strings @samp{car} and @samp{caaaar} but not the string @samp{cr},
810 whereas @samp{ca*r} matches all three strings.
811
812 @item @kbd{?}
813 is a postfix operator, similar to @samp{*} except that it can match
814 the preceding expression either once or not at all. Thus, @samp{ca?r}
815 matches @samp{car} or @samp{cr}, and nothing else.
816
817 @item @kbd{*?}, @kbd{+?}, @kbd{??}
818 @cindex non-greedy regexp matching
819 are non-@dfn{greedy} variants of the operators above. The normal
820 operators @samp{*}, @samp{+}, @samp{?} match as much as they can, as
821 long as the overall regexp can still match. With a following
822 @samp{?}, they will match as little as possible.
823
824 Thus, both @samp{ab*} and @samp{ab*?} can match the string @samp{a}
825 and the string @samp{abbbb}; but if you try to match them both against
826 the text @samp{abbb}, @samp{ab*} will match it all (the longest valid
827 match), while @samp{ab*?} will match just @samp{a} (the shortest
828 valid match).
829
830 Non-greedy operators match the shortest possible string starting at a
831 given starting point; in a forward search, though, the earliest
832 possible starting point for match is always the one chosen. Thus, if
833 you search for @samp{a.*?$} against the text @samp{abbab} followed by
834 a newline, it matches the whole string. Since it @emph{can} match
835 starting at the first @samp{a}, it does.
836
837 @item @kbd{\@{@var{n}\@}}
838 is a postfix operator specifying @var{n} repetitions---that is, the
839 preceding regular expression must match exactly @var{n} times in a
840 row. For example, @samp{x\@{4\@}} matches the string @samp{xxxx} and
841 nothing else.
842
843 @item @kbd{\@{@var{n},@var{m}\@}}
844 is a postfix operator specifying between @var{n} and @var{m}
845 repetitions---that is, the preceding regular expression must match at
846 least @var{n} times, but no more than @var{m} times. If @var{m} is
847 omitted, then there is no upper limit, but the preceding regular
848 expression must match at least @var{n} times.@* @samp{\@{0,1\@}} is
849 equivalent to @samp{?}. @* @samp{\@{0,\@}} is equivalent to
850 @samp{*}. @* @samp{\@{1,\@}} is equivalent to @samp{+}.
851
852 @item @kbd{[ @dots{} ]}
853 is a @dfn{character set}, beginning with @samp{[} and terminated by
854 @samp{]}.
855
856 In the simplest case, the characters between the two brackets are what
857 this set can match. Thus, @samp{[ad]} matches either one @samp{a} or
858 one @samp{d}, and @samp{[ad]*} matches any string composed of just
859 @samp{a}s and @samp{d}s (including the empty string). It follows that
860 @samp{c[ad]*r} matches @samp{cr}, @samp{car}, @samp{cdr},
861 @samp{caddaar}, etc.
862
863 You can also include character ranges in a character set, by writing the
864 starting and ending characters with a @samp{-} between them. Thus,
865 @samp{[a-z]} matches any lower-case @acronym{ASCII} letter. Ranges may be
866 intermixed freely with individual characters, as in @samp{[a-z$%.]},
867 which matches any lower-case @acronym{ASCII} letter or @samp{$}, @samp{%} or
868 period.
869
870 You can also include certain special @dfn{character classes} in a
871 character set. A @samp{[:} and balancing @samp{:]} enclose a
872 character class inside a character alternative. For instance,
873 @samp{[[:alnum:]]} matches any letter or digit. @xref{Char Classes,,,
874 elisp, The Emacs Lisp Reference Manual}, for a list of character
875 classes.
876
877 To include a @samp{]} in a character set, you must make it the first
878 character. For example, @samp{[]a]} matches @samp{]} or @samp{a}. To
879 include a @samp{-}, write @samp{-} as the first or last character of the
880 set, or put it after a range. Thus, @samp{[]-]} matches both @samp{]}
881 and @samp{-}.
882
883 To include @samp{^} in a set, put it anywhere but at the beginning of
884 the set. (At the beginning, it complements the set---see below.)
885
886 When you use a range in case-insensitive search, you should write both
887 ends of the range in upper case, or both in lower case, or both should
888 be non-letters. The behavior of a mixed-case range such as @samp{A-z}
889 is somewhat ill-defined, and it may change in future Emacs versions.
890
891 @item @kbd{[^ @dots{} ]}
892 @samp{[^} begins a @dfn{complemented character set}, which matches any
893 character except the ones specified. Thus, @samp{[^a-z0-9A-Z]} matches
894 all characters @emph{except} @acronym{ASCII} letters and digits.
895
896 @samp{^} is not special in a character set unless it is the first
897 character. The character following the @samp{^} is treated as if it
898 were first (in other words, @samp{-} and @samp{]} are not special there).
899
900 A complemented character set can match a newline, unless newline is
901 mentioned as one of the characters not to match. This is in contrast to
902 the handling of regexps in programs such as @code{grep}.
903
904 @item @kbd{^}
905 is a special character that matches the empty string, but only at the
906 beginning of a line in the text being matched. Otherwise it fails to
907 match anything. Thus, @samp{^foo} matches a @samp{foo} that occurs at
908 the beginning of a line.
909
910 For historical compatibility reasons, @samp{^} can be used with this
911 meaning only at the beginning of the regular expression, or after
912 @samp{\(} or @samp{\|}.
913
914 @item @kbd{$}
915 is similar to @samp{^} but matches only at the end of a line. Thus,
916 @samp{x+$} matches a string of one @samp{x} or more at the end of a line.
917
918 For historical compatibility reasons, @samp{$} can be used with this
919 meaning only at the end of the regular expression, or before @samp{\)}
920 or @samp{\|}.
921
922 @item @kbd{\}
923 has two functions: it quotes the special characters (including
924 @samp{\}), and it introduces additional special constructs.
925
926 Because @samp{\} quotes special characters, @samp{\$} is a regular
927 expression that matches only @samp{$}, and @samp{\[} is a regular
928 expression that matches only @samp{[}, and so on.
929
930 See the following section for the special constructs that begin
931 with @samp{\}.
932 @end table
933
934 Note: for historical compatibility, special characters are treated as
935 ordinary ones if they are in contexts where their special meanings make no
936 sense. For example, @samp{*foo} treats @samp{*} as ordinary since there is
937 no preceding expression on which the @samp{*} can act. It is poor practice
938 to depend on this behavior; it is better to quote the special character anyway,
939 regardless of where it appears.
940
941 As a @samp{\} is not special inside a character alternative, it can
942 never remove the special meaning of @samp{-} or @samp{]}. So you
943 should not quote these characters when they have no special meaning
944 either. This would not clarify anything, since backslashes can
945 legitimately precede these characters where they @emph{have} special
946 meaning, as in @samp{[^\]} (@code{"[^\\]"} for Lisp string syntax),
947 which matches any single character except a backslash.
948
949 @node Regexp Backslash
950 @section Backslash in Regular Expressions
951
952 For the most part, @samp{\} followed by any character matches only
953 that character. However, there are several exceptions: two-character
954 sequences starting with @samp{\} that have special meanings. The
955 second character in the sequence is always an ordinary character when
956 used on its own. Here is a table of @samp{\} constructs.
957
958 @table @kbd
959 @item \|
960 specifies an alternative. Two regular expressions @var{a} and @var{b}
961 with @samp{\|} in between form an expression that matches some text if
962 either @var{a} matches it or @var{b} matches it. It works by trying to
963 match @var{a}, and if that fails, by trying to match @var{b}.
964
965 Thus, @samp{foo\|bar} matches either @samp{foo} or @samp{bar}
966 but no other string.
967
968 @samp{\|} applies to the largest possible surrounding expressions. Only a
969 surrounding @samp{\( @dots{} \)} grouping can limit the grouping power of
970 @samp{\|}.
971
972 Full backtracking capability exists to handle multiple uses of @samp{\|}.
973
974 @item \( @dots{} \)
975 is a grouping construct that serves three purposes:
976
977 @enumerate
978 @item
979 To enclose a set of @samp{\|} alternatives for other operations.
980 Thus, @samp{\(foo\|bar\)x} matches either @samp{foox} or @samp{barx}.
981
982 @item
983 To enclose a complicated expression for the postfix operators @samp{*},
984 @samp{+} and @samp{?} to operate on. Thus, @samp{ba\(na\)*} matches
985 @samp{bananana}, etc., with any (zero or more) number of @samp{na}
986 strings.
987
988 @item
989 To record a matched substring for future reference.
990 @end enumerate
991
992 This last application is not a consequence of the idea of a
993 parenthetical grouping; it is a separate feature that is assigned as a
994 second meaning to the same @samp{\( @dots{} \)} construct. In practice
995 there is usually no conflict between the two meanings; when there is
996 a conflict, you can use a shy group.
997
998 @item \(?: @dots{} \)
999 @cindex shy group, in regexp
1000 specifies a shy group that does not record the matched substring;
1001 you can't refer back to it with @samp{\@var{d}} (see below). This is
1002 useful in mechanically combining regular expressions, so that you can
1003 add groups for syntactic purposes without interfering with the
1004 numbering of the groups that are meant to be referred to.
1005
1006 @item \@var{d}
1007 @cindex back reference, in regexp
1008 matches the same text that matched the @var{d}th occurrence of a
1009 @samp{\( @dots{} \)} construct. This is called a @dfn{back
1010 reference}.
1011
1012 After the end of a @samp{\( @dots{} \)} construct, the matcher remembers
1013 the beginning and end of the text matched by that construct. Then,
1014 later on in the regular expression, you can use @samp{\} followed by the
1015 digit @var{d} to mean ``match the same text matched the @var{d}th time
1016 by the @samp{\( @dots{} \)} construct''.
1017
1018 The strings matching the first nine @samp{\( @dots{} \)} constructs
1019 appearing in a regular expression are assigned numbers 1 through 9 in
1020 the order that the open-parentheses appear in the regular expression.
1021 So you can use @samp{\1} through @samp{\9} to refer to the text matched
1022 by the corresponding @samp{\( @dots{} \)} constructs.
1023
1024 For example, @samp{\(.*\)\1} matches any newline-free string that is
1025 composed of two identical halves. The @samp{\(.*\)} matches the first
1026 half, which may be anything, but the @samp{\1} that follows must match
1027 the same exact text.
1028
1029 If a particular @samp{\( @dots{} \)} construct matches more than once
1030 (which can easily happen if it is followed by @samp{*}), only the last
1031 match is recorded.
1032
1033 @item \`
1034 matches the empty string, but only at the beginning of the string or
1035 buffer (or its accessible portion) being matched against.
1036
1037 @item \'
1038 matches the empty string, but only at the end of the string or buffer
1039 (or its accessible portion) being matched against.
1040
1041 @item \=
1042 matches the empty string, but only at point.
1043
1044 @item \b
1045 matches the empty string, but only at the beginning or
1046 end of a word. Thus, @samp{\bfoo\b} matches any occurrence of
1047 @samp{foo} as a separate word. @samp{\bballs?\b} matches
1048 @samp{ball} or @samp{balls} as a separate word.
1049
1050 @samp{\b} matches at the beginning or end of the buffer
1051 regardless of what text appears next to it.
1052
1053 @item \B
1054 matches the empty string, but @emph{not} at the beginning or
1055 end of a word.
1056
1057 @item \<
1058 matches the empty string, but only at the beginning of a word.
1059 @samp{\<} matches at the beginning of the buffer only if a
1060 word-constituent character follows.
1061
1062 @item \>
1063 matches the empty string, but only at the end of a word. @samp{\>}
1064 matches at the end of the buffer only if the contents end with a
1065 word-constituent character.
1066
1067 @item \w
1068 matches any word-constituent character. The syntax table determines
1069 which characters these are. @xref{Syntax Tables,, Syntax Tables,
1070 elisp, The Emacs Lisp Reference Manual}.
1071
1072 @item \W
1073 matches any character that is not a word-constituent.
1074
1075 @item \_<
1076 matches the empty string, but only at the beginning of a symbol.
1077 A symbol is a sequence of one or more symbol-constituent characters.
1078 A symbol-constituent character is a character whose syntax is either
1079 @samp{w} or @samp{_}. @samp{\_<} matches at the beginning of the
1080 buffer only if a symbol-constituent character follows.
1081
1082 @item \_>
1083 matches the empty string, but only at the end of a symbol. @samp{\_>}
1084 matches at the end of the buffer only if the contents end with a
1085 symbol-constituent character.
1086
1087 @item \s@var{c}
1088 matches any character whose syntax is @var{c}. Here @var{c} is a
1089 character that designates a particular syntax class: thus, @samp{w}
1090 for word constituent, @samp{-} or @samp{ } for whitespace, @samp{.}
1091 for ordinary punctuation, etc. @xref{Syntax Tables,, Syntax Tables,
1092 elisp, The Emacs Lisp Reference Manual}.
1093
1094 @item \S@var{c}
1095 matches any character whose syntax is not @var{c}.
1096
1097 @cindex categories of characters
1098 @cindex characters which belong to a specific language
1099 @findex describe-categories
1100 @item \c@var{c}
1101 matches any character that belongs to the category @var{c}. For
1102 example, @samp{\cc} matches Chinese characters, @samp{\cg} matches
1103 Greek characters, etc. For the description of the known categories,
1104 type @kbd{M-x describe-categories @key{RET}}.
1105
1106 @item \C@var{c}
1107 matches any character that does @emph{not} belong to category
1108 @var{c}.
1109 @end table
1110
1111 The constructs that pertain to words and syntax are controlled by
1112 the setting of the syntax table. @xref{Syntax Tables,, Syntax Tables,
1113 elisp, The Emacs Lisp Reference Manual}.
1114
1115 @node Regexp Example
1116 @section Regular Expression Example
1117
1118 Here is an example of a regexp---similar to the regexp that Emacs
1119 uses, by default, to recognize the end of a sentence, not including
1120 the following space (i.e., the variable @code{sentence-end-base}):
1121
1122 @example
1123 @verbatim
1124 [.?!][]\"')}]*
1125 @end verbatim
1126 @end example
1127
1128 @noindent
1129 This contains two parts in succession: a character set matching
1130 period, @samp{?}, or @samp{!}, and a character set matching
1131 close-brackets, quotes, or parentheses, repeated zero or more times.
1132
1133 @node Lax Search
1134 @section Lax Matching During Searching
1135
1136 @cindex lax search
1137 @cindex character equivalence in search
1138 Normally, you'd want search commands to disregard certain minor
1139 differences between the search string you types and the text being
1140 searched. For example, sequences of whitespace characters of
1141 different length are normally perceived as equivalent; letter-case
1142 differences usually don't matter; etc. This is known as
1143 @dfn{character equivalence}.
1144
1145 This section describes the Emacs lax search features, and how to
1146 tailor them to your needs.
1147
1148 @cindex lax space matching in search
1149 @kindex M-s SPC @r{(Incremental search)}
1150 @kindex SPC @r{(Incremental search)}
1151 @findex isearch-toggle-lax-whitespace
1152 @vindex search-whitespace-regexp
1153 By default, search commands perform @dfn{lax space matching}:
1154 each space, or sequence of spaces, matches any sequence of one or more
1155 whitespace characters in the text. (Incremental regexp search has a
1156 separate default; see @ref{Regexp Search}.) Hence, @samp{foo bar}
1157 matches @samp{foo bar}, @samp{foo@w{ }bar}, @samp{foo@w{ }bar}, and so
1158 on (but not @samp{foobar}). More precisely, Emacs matches each
1159 sequence of space characters in the search string to a regular
1160 expression specified by the variable @code{search-whitespace-regexp}.
1161 For example, to make spaces match sequences of newlines as well as
1162 spaces, set it to @samp{"[[:space:]\n]+"}. The default value of this
1163 variable depends on the buffer's major mode; most major modes classify
1164 spaces, tabs, and formfeed characters as whitespace.
1165
1166 If you want whitespace characters to match exactly, you can turn lax
1167 space matching by typing @kbd{M-s @key{SPC}}
1168 (@code{isearch-toggle-lax-whitespace}) within an incremental search.
1169 Another @kbd{M-s @key{SPC}} turns lax space matching back on. To
1170 disable this feature entirely, change @code{search-whitespace-regexp}
1171 to @code{nil}; then each space in the search string matches exactly
1172 one space.
1173
1174 @cindex case folding in search
1175 @cindex case-sensitivity and search
1176 Searches in Emacs normally ignore the case of the text they are
1177 searching through, if you specify the search string in lower case.
1178 Thus, if you specify searching for @samp{foo}, then @samp{Foo} and
1179 @samp{foo} also match. Regexps, and in particular character sets,
1180 behave likewise: @samp{[ab]} matches @samp{a} or @samp{A} or @samp{b}
1181 or @samp{B}. This feature is known as @dfn{case folding}, and it is
1182 supported in both incremental and non-incremental search modes.
1183
1184 @vindex search-upper-case
1185 An upper-case letter anywhere in the search string makes the search
1186 case-sensitive. Thus, searching for @samp{Foo} does not find
1187 @samp{foo} or @samp{FOO}. This applies to regular expression search
1188 as well as to string search. The effect ceases if you delete the
1189 upper-case letter from the search string. The variable
1190 @code{search-upper-case} controls this: if it is non-@code{nil} (the
1191 default), an upper-case character in the search string make the search
1192 case-sensitive; setting it to @code{nil} disables this effect of
1193 upper-case characters.
1194
1195 @vindex case-fold-search
1196 If you set the variable @code{case-fold-search} to @code{nil}, then
1197 all letters must match exactly, including case. This is a per-buffer
1198 variable; altering the variable normally affects only the current buffer,
1199 unless you change its default value. @xref{Locals}.
1200 This variable applies to nonincremental searches also, including those
1201 performed by the replace commands (@pxref{Replace}) and the minibuffer
1202 history matching commands (@pxref{Minibuffer History}).
1203
1204 @kindex M-c @r{(Incremental search)}
1205 @kindex M-s c @r{(Incremental search)}
1206 @findex isearch-toggle-case-fold
1207 Typing @kbd{M-c} or @kbd{M-s c} (@code{isearch-toggle-case-fold})
1208 within an incremental search toggles the case sensitivity of that
1209 search. The effect does not extend beyond the current incremental
1210 search to the next one, but it does override the effect of adding or
1211 removing an upper-case letter in the current search.
1212
1213 Several related variables control case-sensitivity of searching and
1214 matching for specific commands or activities. For instance,
1215 @code{tags-case-fold-search} controls case sensitivity for
1216 @code{find-tag}. To find these variables, do @kbd{M-x
1217 apropos-variable @key{RET} case-fold-search @key{RET}}.
1218
1219 @cindex character folding in search
1220 @cindex equivalent character sequences
1221 Case folding disregards case distinctions between characters, making
1222 upper-case characters match lower-case variants, and vice versa. A
1223 generalization of case folding is @dfn{character folding}, which
1224 disregards wider classes of distinctions between similar characters.
1225 For instance, under character folding the letter @code{a} will match
1226 all of its accented cousins like @code{@"a} and @code{@'a}, i.e., the
1227 match disregards the diacriticals that distinguish between these
1228 variants. In addition, @code{a} will match other characters that
1229 resemble it, or have it as part of their graphical representation,
1230 such as @sc{u+249c parenthesized latin small letter a} and @sc{u+2100
1231 account of} (which looks like a small @code{a} over @code{c}).
1232 Similarly, the @acronym{ASCII} double-quote character @code{"} will
1233 match all the other variants of double quotes defined by the Unicode
1234 standard.
1235 @ignore @c FIXME: This doesn't work. Should it?
1236 Finally, character folding can make a sequence of one or
1237 more characters match another sequence of a different length: for
1238 example, the sequence of two characters @code{ae} matches the ligature
1239 @code{@ae{}}.
1240 @end ignore
1241 Character sequences that match under character folding are called
1242 @dfn{equivalent}.
1243
1244 @kindex M-s ' @r{(Incremental Search)}
1245 @findex isearch-toggle-character-fold
1246 Searches in Emacs normally perform character folding, thus matching
1247 equivalent character sequences. You can disable this behavior by
1248 customizing the variable @code{search-default-regexp-mode} to the
1249 @code{nil} value. @xref{Search Customizations}. Within an
1250 incremental search, typing @kbd{M-s '}
1251 (@code{isearch-toggle-character-fold}) toggles character folding, but
1252 only for that search.
1253
1254 Like with case folding, typing an explicit variant of a character,
1255 such as @code{@"a}, as part of the search string disables character
1256 folding for that search. If you delete such a character from the
1257 search string, this effect ceases.
1258
1259 @node Replace
1260 @section Replacement Commands
1261 @cindex replacement
1262 @cindex search-and-replace commands
1263 @cindex string substitution
1264 @cindex global substitution
1265
1266 Emacs provides several commands for performing search-and-replace
1267 operations. In addition to the simple @kbd{M-x replace-string}
1268 command, there is @kbd{M-%} (@code{query-replace}), which presents
1269 each occurrence of the search pattern and asks you whether to replace
1270 it.
1271
1272 The replace commands normally operate on the text from point to the
1273 end of the buffer. When the region is active, they operate on it
1274 instead (@pxref{Mark}). The basic replace commands replace one
1275 @dfn{search string} (or regexp) with one @dfn{replacement string}. It
1276 is possible to perform several replacements in parallel, using the
1277 command @code{expand-region-abbrevs} (@pxref{Expanding Abbrevs}).
1278
1279 @menu
1280 * Unconditional Replace:: Replacing all matches for a string.
1281 * Regexp Replace:: Replacing all matches for a regexp.
1282 * Replacement and Lax Matches::
1283 Lax searching for text to replace.
1284 * Query Replace:: How to use querying.
1285 @end menu
1286
1287 @node Unconditional Replace
1288 @subsection Unconditional Replacement
1289 @findex replace-string
1290
1291 @table @kbd
1292 @item M-x replace-string @key{RET} @var{string} @key{RET} @var{newstring} @key{RET}
1293 Replace every occurrence of @var{string} with @var{newstring}.
1294 @end table
1295
1296 To replace every instance of @samp{foo} after point with @samp{bar},
1297 use the command @kbd{M-x replace-string} with the two arguments
1298 @samp{foo} and @samp{bar}. Replacement happens only in the text after
1299 point, so if you want to cover the whole buffer you must go to the
1300 beginning first. All occurrences up to the end of the buffer are
1301 replaced; to limit replacement to part of the buffer, activate the
1302 region around that part. When the region is active, replacement is
1303 limited to the region (@pxref{Mark}).
1304
1305 When @code{replace-string} exits, it leaves point at the last
1306 occurrence replaced. It adds the prior position of point (where the
1307 @code{replace-string} command was issued) to the mark ring, without
1308 activating the mark; use @kbd{C-u C-@key{SPC}} to move back there.
1309 @xref{Mark Ring}.
1310
1311 A prefix argument restricts replacement to matches that are
1312 surrounded by word boundaries.
1313
1314 @xref{Replacement and Lax Matches}, for details about
1315 case-sensitivity in replace commands.
1316
1317 @node Regexp Replace
1318 @subsection Regexp Replacement
1319 @findex replace-regexp
1320
1321 The @kbd{M-x replace-string} command replaces exact matches for a
1322 single string. The similar command @kbd{M-x replace-regexp} replaces
1323 any match for a specified regular expression pattern (@pxref{Regexps}).
1324
1325 @table @kbd
1326 @item M-x replace-regexp @key{RET} @var{regexp} @key{RET} @var{newstring} @key{RET}
1327 Replace every match for @var{regexp} with @var{newstring}.
1328 @end table
1329
1330 @cindex back reference, in regexp replacement
1331 In @code{replace-regexp}, the @var{newstring} need not be constant:
1332 it can refer to all or part of what is matched by the @var{regexp}.
1333 @samp{\&} in @var{newstring} stands for the entire match being
1334 replaced. @samp{\@var{d}} in @var{newstring}, where @var{d} is a
1335 digit, stands for whatever matched the @var{d}th parenthesized
1336 grouping in @var{regexp}. (This is called a ``back reference''.)
1337 @samp{\#} refers to the count of replacements already made in this
1338 command, as a decimal number. In the first replacement, @samp{\#}
1339 stands for @samp{0}; in the second, for @samp{1}; and so on. For
1340 example,
1341
1342 @example
1343 M-x replace-regexp @key{RET} c[ad]+r @key{RET} \&-safe @key{RET}
1344 @end example
1345
1346 @noindent
1347 replaces (for example) @samp{cadr} with @samp{cadr-safe} and @samp{cddr}
1348 with @samp{cddr-safe}.
1349
1350 @example
1351 M-x replace-regexp @key{RET} \(c[ad]+r\)-safe @key{RET} \1 @key{RET}
1352 @end example
1353
1354 @noindent
1355 performs the inverse transformation. To include a @samp{\} in the
1356 text to replace with, you must enter @samp{\\}.
1357
1358 If you want to enter part of the replacement string by hand each
1359 time, use @samp{\?} in the replacement string. Each replacement will
1360 ask you to edit the replacement string in the minibuffer, putting
1361 point where the @samp{\?} was.
1362
1363 The remainder of this subsection is intended for specialized tasks
1364 and requires knowledge of Lisp. Most readers can skip it.
1365
1366 You can use Lisp expressions to calculate parts of the
1367 replacement string. To do this, write @samp{\,} followed by the
1368 expression in the replacement string. Each replacement calculates the
1369 value of the expression and converts it to text without quoting (if
1370 it's a string, this means using the string's contents), and uses it in
1371 the replacement string in place of the expression itself. If the
1372 expression is a symbol, one space in the replacement string after the
1373 symbol name goes with the symbol name, so the value replaces them
1374 both.
1375
1376 Inside such an expression, you can use some special sequences.
1377 @samp{\&} and @samp{\@var{n}} refer here, as usual, to the entire
1378 match as a string, and to a submatch as a string. @var{n} may be
1379 multiple digits, and the value of @samp{\@var{n}} is @code{nil} if
1380 subexpression @var{n} did not match. You can also use @samp{\#&} and
1381 @samp{\#@var{n}} to refer to those matches as numbers (this is valid
1382 when the match or submatch has the form of a numeral). @samp{\#} here
1383 too stands for the number of already-completed replacements.
1384
1385 Repeating our example to exchange @samp{x} and @samp{y}, we can thus
1386 do it also this way:
1387
1388 @example
1389 M-x replace-regexp @key{RET} \(x\)\|y @key{RET}
1390 \,(if \1 "y" "x") @key{RET}
1391 @end example
1392
1393 For computing replacement strings for @samp{\,}, the @code{format}
1394 function is often useful (@pxref{Formatting Strings,,, elisp, The Emacs
1395 Lisp Reference Manual}). For example, to add consecutively numbered
1396 strings like @samp{ABC00042} to columns 73 @w{to 80} (unless they are
1397 already occupied), you can use
1398
1399 @example
1400 M-x replace-regexp @key{RET} ^.\@{0,72\@}$ @key{RET}
1401 \,(format "%-72sABC%05d" \& \#) @key{RET}
1402 @end example
1403
1404 @node Replacement and Lax Matches
1405 @subsection Replace Commands and Lax Matches
1406
1407 This subsection describes the behavior of replace commands with
1408 respect to lax matches (@pxref{Lax Search}) and how to customize it.
1409 In general, replace commands mostly default to stricter matching than
1410 their search counterparts.
1411
1412 @cindex lax space matching in replace commands
1413 @vindex replace-lax-whitespace
1414 Unlike incremental search, the replacement commands do not use lax
1415 space matching (@pxref{Lax Search, lax space matching}) by default.
1416 To enable lax space matching for replacement, change the variable
1417 @code{replace-lax-whitespace} to non-@code{nil}. (This only affects
1418 how Emacs finds the text to replace, not the replacement text.)
1419
1420 @vindex replace-regexp-lax-whitespace
1421 A companion variable @code{replace-regexp-lax-whitespace} controls
1422 whether @code{query-replace-regexp} uses lax whitespace matching when
1423 searching for patterns.
1424
1425 @cindex case folding in replace commands
1426 If the first argument of a replace command is all lower case, the
1427 command ignores case while searching for occurrences to
1428 replace---provided @code{case-fold-search} is non-@code{nil}. If
1429 @code{case-fold-search} is set to @code{nil}, case is always significant
1430 in all searches.
1431
1432 @vindex case-replace
1433 In addition, when the @var{newstring} argument is all or partly lower
1434 case, replacement commands try to preserve the case pattern of each
1435 occurrence. Thus, the command
1436
1437 @example
1438 M-x replace-string @key{RET} foo @key{RET} bar @key{RET}
1439 @end example
1440
1441 @noindent
1442 replaces a lower case @samp{foo} with a lower case @samp{bar}, an
1443 all-caps @samp{FOO} with @samp{BAR}, and a capitalized @samp{Foo} with
1444 @samp{Bar}. (These three alternatives---lower case, all caps, and
1445 capitalized, are the only ones that @code{replace-string} can
1446 distinguish.)
1447
1448 If upper-case letters are used in the replacement string, they remain
1449 upper case every time that text is inserted. If upper-case letters are
1450 used in the first argument, the second argument is always substituted
1451 exactly as given, with no case conversion. Likewise, if either
1452 @code{case-replace} or @code{case-fold-search} is set to @code{nil},
1453 replacement is done without case conversion.
1454
1455 @cindex character folding in replace commands
1456 The replacement commands by default do not use character folding
1457 (@pxref{Lax Search, character folding}) when looking for the text to
1458 replace. To enable character folding for matching, set the variable
1459 @code{replace-character-fold} to a non-@code{nil} value. (This
1460 setting does not affect the replacement text, only how Emacs finds the
1461 text to replace.)
1462
1463 @node Query Replace
1464 @subsection Query Replace
1465 @cindex query replace
1466
1467 @table @kbd
1468 @item M-% @var{string} @key{RET} @var{newstring} @key{RET}
1469 Replace some occurrences of @var{string} with @var{newstring}.
1470 @item C-M-% @var{regexp} @key{RET} @var{newstring} @key{RET}
1471 Replace some matches for @var{regexp} with @var{newstring}.
1472 @end table
1473
1474 @kindex M-%
1475 @findex query-replace
1476 If you want to change only some of the occurrences of @samp{foo} to
1477 @samp{bar}, not all of them, use @kbd{M-%} (@code{query-replace}).
1478 This command finds occurrences of @samp{foo} one by one, displays each
1479 occurrence and asks you whether to replace it. Aside from querying,
1480 @code{query-replace} works just like @code{replace-string}
1481 (@pxref{Unconditional Replace}). In particular, it preserves case
1482 provided @code{case-replace} is non-@code{nil}, as it normally is
1483 (@pxref{Replacement and Lax Matches}). A numeric argument means to
1484 consider only occurrences that are bounded by word-delimiter
1485 characters. A negative prefix argument replaces backward.
1486
1487 @kindex C-M-%
1488 @findex query-replace-regexp
1489 @kbd{C-M-%} performs regexp search and replace (@code{query-replace-regexp}).
1490 It works like @code{replace-regexp} except that it queries
1491 like @code{query-replace}.
1492
1493 @vindex query-replace-from-to-separator
1494 You can reuse earlier replacements with these commands. When
1495 @code{query-replace} or @code{query-replace-regexp} prompts for the
1496 search string, use @kbd{M-p} and @kbd{M-n} to show previous
1497 replacements in the form @samp{@var{from} -> @var{to}}, where
1498 @var{from} is the search pattern, @var{to} is its replacement, and the
1499 separator between them is determined by the value of the variable
1500 @code{query-replace-from-to-separator}. Type @key{RET} to select the
1501 desired replacement.
1502
1503 @cindex faces for highlighting query replace
1504 @cindex query-replace face
1505 @cindex lazy-highlight face, in replace
1506 @vindex query-replace-highlight
1507 @vindex query-replace-lazy-highlight
1508 @vindex query-replace-show-replacement
1509 These commands highlight the current match using the face
1510 @code{query-replace}. You can disable this highlight by setting the
1511 variable @code{query-replace-highlight} to @code{nil}. They highlight
1512 other matches using @code{lazy-highlight} just like incremental search
1513 (@pxref{Incremental Search}); this can be disabled by setting
1514 @code{query-replace-lazy-highlight} to @code{nil}. By default,
1515 @code{query-replace-regexp} will show the substituted replacement
1516 string for the current match in the minibuffer. If you want to keep
1517 special sequences @samp{\&} and @samp{\@var{n}} unexpanded, customize
1518 @code{query-replace-show-replacement} variable.
1519
1520 @vindex query-replace-skip-read-only
1521 The variable @code{query-replace-skip-read-only}, if set
1522 non-@code{nil}, will cause replacement commands to ignore matches in
1523 read-only text. The default is not to ignore them.
1524
1525 The characters you can type when you are shown a match for the string
1526 or regexp are:
1527
1528 @ignore @c Not worth it.
1529 @kindex SPC @r{(query-replace)}
1530 @kindex DEL @r{(query-replace)}
1531 @kindex , @r{(query-replace)}
1532 @kindex RET @r{(query-replace)}
1533 @kindex . @r{(query-replace)}
1534 @kindex ! @r{(query-replace)}
1535 @kindex ^ @r{(query-replace)}
1536 @kindex C-r @r{(query-replace)}
1537 @kindex C-w @r{(query-replace)}
1538 @kindex C-l @r{(query-replace)}
1539 @end ignore
1540
1541 @c WideCommands
1542 @table @kbd
1543 @item @key{SPC}
1544 @itemx y
1545 to replace the occurrence with @var{newstring}.
1546
1547 @item @key{DEL}
1548 @itemx @key{Delete}
1549 @itemx @key{BACKSPACE}
1550 @itemx n
1551 to skip to the next occurrence without replacing this one.
1552
1553 @item , @r{(Comma)}
1554 to replace this occurrence and display the result. You are then asked
1555 for another input character to say what to do next. Since the
1556 replacement has already been made, @key{DEL} and @key{SPC} are
1557 equivalent in this situation; both move to the next occurrence.
1558
1559 You can type @kbd{C-r} at this point (see below) to alter the replaced
1560 text. You can also type @kbd{C-x u} to undo the replacement; this exits
1561 the @code{query-replace}, so if you want to do further replacement you
1562 must use @kbd{C-x @key{ESC} @key{ESC} @key{RET}} to restart
1563 (@pxref{Repetition}).
1564
1565 @item @key{RET}
1566 @itemx q
1567 to exit without doing any more replacements.
1568
1569 @item .@: @r{(Period)}
1570 to replace this occurrence and then exit without searching for more
1571 occurrences.
1572
1573 @item !
1574 to replace all remaining occurrences without asking again.
1575
1576 @item ^
1577 to go back to the position of the previous occurrence (or what used to
1578 be an occurrence), in case you changed it by mistake or want to
1579 reexamine it.
1580
1581 @item C-r
1582 to enter a recursive editing level, in case the occurrence needs to be
1583 edited rather than just replaced with @var{newstring}. When you are
1584 done, exit the recursive editing level with @kbd{C-M-c} to proceed to
1585 the next occurrence. @xref{Recursive Edit}.
1586
1587 @item C-w
1588 to delete the occurrence, and then enter a recursive editing level as in
1589 @kbd{C-r}. Use the recursive edit to insert text to replace the deleted
1590 occurrence of @var{string}. When done, exit the recursive editing level
1591 with @kbd{C-M-c} to proceed to the next occurrence.
1592
1593 @item e
1594 to edit the replacement string in the minibuffer. When you exit the
1595 minibuffer by typing @key{RET}, the minibuffer contents replace the
1596 current occurrence of the pattern. They also become the new
1597 replacement string for any further occurrences.
1598
1599 @item C-l
1600 to redisplay the screen. Then you must type another character to
1601 specify what to do with this occurrence.
1602
1603 @item Y @r{(Upper-case)}
1604 to replace all remaining occurrences in all remaining buffers in
1605 multi-buffer replacements (like the Dired @key{Q} command that performs
1606 query replace on selected files). It answers this question and all
1607 subsequent questions in the series with ``yes'', without further
1608 user interaction.
1609
1610 @item N @r{(Upper-case)}
1611 to skip to the next buffer in multi-buffer replacements without
1612 replacing remaining occurrences in the current buffer. It answers
1613 this question ``no'', gives up on the questions for the current buffer,
1614 and continues to the next buffer in the sequence.
1615
1616 @item C-h
1617 @itemx ?
1618 @itemx @key{F1}
1619 to display a message summarizing these options. Then you must type
1620 another character to specify what to do with this occurrence.
1621 @end table
1622
1623 Aside from this, any other character exits the @code{query-replace},
1624 and is then reread as part of a key sequence. Thus, if you type
1625 @kbd{C-k}, it exits the @code{query-replace} and then kills to end of
1626 line. In particular, @kbd{C-g} simply exits the @code{query-replace}.
1627
1628 To restart a @code{query-replace} once it is exited, use @kbd{C-x
1629 @key{ESC} @key{ESC}}, which repeats the @code{query-replace} because it
1630 used the minibuffer to read its arguments. @xref{Repetition, C-x ESC
1631 ESC}.
1632
1633 @cindex invisible text, and query-replace
1634 The option @code{search-invisible} determines how @code{query-replace}
1635 treats invisible text. @xref{Outline Search}.
1636
1637 @xref{Operating on Files}, for the Dired @kbd{Q} command which
1638 performs query replace on selected files. See also @ref{Transforming
1639 File Names}, for Dired commands to rename, copy, or link files by
1640 replacing regexp matches in file names.
1641
1642 @node Other Repeating Search
1643 @section Other Search-and-Loop Commands
1644
1645 Here are some other commands that find matches for a regular
1646 expression. They all ignore case in matching, if the pattern contains
1647 no upper-case letters and @code{case-fold-search} is non-@code{nil}.
1648 Aside from @code{occur} and its variants, all operate on the text from
1649 point to the end of the buffer, or on the region if it is active.
1650
1651 @findex list-matching-lines
1652 @findex occur
1653 @findex multi-occur
1654 @findex multi-occur-in-matching-buffers
1655 @findex how-many
1656 @findex flush-lines
1657 @findex keep-lines
1658
1659 @table @kbd
1660 @item M-x multi-isearch-buffers
1661 Prompt for one or more buffer names, ending with @key{RET}; then,
1662 begin a multi-buffer incremental search in those buffers. (If the
1663 search fails in one buffer, the next @kbd{C-s} tries searching the
1664 next specified buffer, and so forth.) With a prefix argument, prompt
1665 for a regexp and begin a multi-buffer incremental search in buffers
1666 matching that regexp.
1667
1668 @item M-x multi-isearch-buffers-regexp
1669 This command is just like @code{multi-isearch-buffers}, except it
1670 performs an incremental regexp search.
1671
1672 @item M-x multi-isearch-files
1673 Prompt for one or more file names, ending with @key{RET}; then,
1674 begin a multi-file incremental search in those files. (If the
1675 search fails in one file, the next @kbd{C-s} tries searching the
1676 next specified file, and so forth.) With a prefix argument, prompt
1677 for a regexp and begin a multi-file incremental search in files
1678 matching that regexp.
1679
1680 @item M-x multi-isearch-files-regexp
1681 This command is just like @code{multi-isearch-files}, except it
1682 performs an incremental regexp search.
1683
1684 In some modes that set the buffer-local variable
1685 @code{multi-isearch-next-buffer-function} (e.g., in Change Log mode)
1686 a multi-file incremental search is activated automatically.
1687
1688 @cindex Occur mode
1689 @cindex mode, Occur
1690 @cindex match (face name)
1691 @vindex list-matching-lines-default-context-lines
1692 @item M-x occur
1693 Prompt for a regexp, and display a list showing each line in the
1694 buffer that contains a match for it. The text that matched is
1695 highlighted using the @code{match} face. To limit the search to part
1696 of the buffer, narrow to that part (@pxref{Narrowing}). A numeric
1697 argument @var{n} specifies that @var{n} lines of context are to be
1698 displayed before and after each matching line. The default number of
1699 context lines is specified by the variable
1700 @code{list-matching-lines-default-context-lines}.
1701
1702 @kindex RET @r{(Occur mode)}
1703 @kindex o @r{(Occur mode)}
1704 @kindex C-o @r{(Occur mode)}
1705 In the @file{*Occur*} buffer, you can click on each entry, or move
1706 point there and type @key{RET}, to visit the corresponding position in
1707 the buffer that was searched. @kbd{o} and @kbd{C-o} display the match
1708 in another window; @kbd{C-o} does not select it. Alternatively, you
1709 can use the @kbd{C-x `} (@code{next-error}) command to visit the
1710 occurrences one by one (@pxref{Compilation Mode}).
1711
1712 @cindex Occur Edit mode
1713 @cindex mode, Occur Edit
1714 Typing @kbd{e} in the @file{*Occur*} buffer switches to Occur Edit
1715 mode, in which edits made to the entries are also applied to the text
1716 in the originating buffer. Type @kbd{C-c C-c} to return to Occur
1717 mode.
1718
1719 The command @kbd{M-x list-matching-lines} is a synonym for @kbd{M-x
1720 occur}.
1721
1722 @kindex M-s o
1723 @item M-s o
1724 Run @code{occur} using the search string of the last incremental
1725 string search. You can also run @kbd{M-s o} when an incremental
1726 search is active; this uses the current search string.
1727
1728 @item M-x multi-occur
1729 This command is just like @code{occur}, except it is able to search
1730 through multiple buffers. It asks you to specify the buffer names one
1731 by one.
1732
1733 @item M-x multi-occur-in-matching-buffers
1734 This command is similar to @code{multi-occur}, except the buffers to
1735 search are specified by a regular expression that matches visited file
1736 names. With a prefix argument, it uses the regular expression to
1737 match buffer names instead.
1738
1739 @item M-x how-many
1740 Prompt for a regexp, and print the number of matches for it in the
1741 buffer after point. If the region is active, this operates on the
1742 region instead.
1743
1744 @item M-x flush-lines
1745 Prompt for a regexp, and delete each line that contains a match for
1746 it, operating on the text after point. This command deletes the
1747 current line if it contains a match starting after point. If the
1748 region is active, it operates on the region instead; if a line
1749 partially contained in the region contains a match entirely contained
1750 in the region, it is deleted.
1751
1752 If a match is split across lines, @code{flush-lines} deletes all those
1753 lines. It deletes the lines before starting to look for the next
1754 match; hence, it ignores a match starting on the same line at which
1755 another match ended.
1756
1757 @item M-x keep-lines
1758 Prompt for a regexp, and delete each line that @emph{does not} contain
1759 a match for it, operating on the text after point. If point is not at
1760 the beginning of a line, this command always keeps the current line.
1761 If the region is active, the command operates on the region instead;
1762 it never deletes lines that are only partially contained in the region
1763 (a newline that ends a line counts as part of that line).
1764
1765 If a match is split across lines, this command keeps all those lines.
1766 @end table
1767
1768 @node Search Customizations
1769 @section Tailoring Search to Your Needs
1770 @cindex search customizations
1771
1772 This section describes miscellaneous search-related customizations
1773 not described elsewhere.
1774
1775 @cindex default search mode
1776 @cindex search mode, default
1777 The default search mode for the incremental search is specified by
1778 the variable @code{search-default-regexp-mode}. It can be @code{nil},
1779 @code{t}, or a function. If it is @code{nil}, the default mode is to
1780 do literal searches without character folding, but with case folding
1781 and lax-whitespace matches as determined by @code{case-fold-search}
1782 and @code{search-whitespace-regexp}, respectively (@pxref{Lax
1783 Search}). If the value is @code{t}, incremental search defaults to
1784 regexp searches. The default value specifies a function that causes
1785 the default search mode to perform character folding in addition to
1786 case folding and lax-whitespace matching.
1787
1788 @vindex search-highlight
1789 The current match of an on-going incremental search is highlighted
1790 using the @code{isearch} face. This highlighting can be disabled by
1791 setting the variable @code{search-highlight} to a @code{nil} value.
1792
1793 @cindex lazy highlighting customizations
1794 @vindex isearch-lazy-highlight
1795 @cindex lazy-highlight face
1796 The other matches for the search string that are visible on display
1797 are highlighted using the @code{lazy-highlight} face. Setting the
1798 variable @code{isearch-lazy-highlight} to a @code{nil} value disables
1799 this highlighting. Here are some other variables that customize the
1800 lazy highlighting:
1801
1802 @table @code
1803 @item lazy-highlight-initial-delay
1804 Time in seconds to wait before highlighting visible matches.
1805
1806 @item lazy-highlight-interval
1807 Time in seconds between highlighting successive matches.
1808
1809 @item lazy-highlight-max-at-a-time
1810 The maximum number of matches to highlight before checking for input.
1811 A large number can take some time to highlight, so if you want to
1812 continue searching and type @kbd{C-s} or @kbd{C-r} during that time,
1813 Emacs will not respond until it finishes highlighting all those
1814 matches. Thus, smaller values make Emacs more responsive.
1815 @end table
1816
1817 @vindex search-nonincremental-instead
1818 Normally, entering @key{RET} within incremental search when the
1819 search string is empty launches a nonincremental search. (Actually,
1820 it lets you edit the search string, and the next @key{RET} does the
1821 search.) However, if you customize the variable
1822 @code{search-nonincremental-instead} to a @code{nil} value, typing
1823 @key{RET} will always exit the incremental search, even if the search
1824 string is empty.
1825
1826 @vindex isearch-hide-immediately
1827 By default, incremental search and query-replace commands match
1828 invisible text, but hide any such matches as soon as the current match
1829 moves off the invisible text. If you customize the variable
1830 @code{isearch-hide-immediately} to a @code{nil} value, any invisible
1831 text where matches were found stays on display until the search or the
1832 replace command exits.
1833
1834 @cindex search display on slow terminals
1835 @vindex search-slow-speed
1836 @vindex search-slow-window-lines
1837 Searching incrementally on slow terminals, such as displays
1838 connected to remote machines over slow connection, could be annoying
1839 due to the need to redraw large portions of the display as the search
1840 proceeds. Emacs provides a special display mode for slow terminals,
1841 whereby search pops up a separate small window and displays the text
1842 surrounding the match in that window. Small windows display faster,
1843 so the annoying effect of slow speed is alleviated. The variable
1844 @code{search-slow-speed} determines the baud rate threshold below
1845 which Emacs will use this display mode. The variable
1846 @code{search-slow-window-lines} controls the number of lines in the
1847 window Emacs pops up for displaying the search results; the default is
1848 1 line. Normally, this window will pop up at the bottom of the window
1849 that displays the buffer where you start searching, bit if the value
1850 of @code{search-slow-window-lines} is negative, that means to put the
1851 window at the top and give it the number of lines that is the absolute
1852 value of that value.