Update copyright year to 2016

[gnu-emacs] / doc / lispref / searching.texi
diff --git a/doc/lispref/searching.texi b/doc/lispref/searching.texi

index 6c9b09d29bff93e6d889fb7bd75b175ca9d8184a..5ff7ef1cf665082edd90d70d0aaaa83ba44bdaf1 100644 (file)
--- a/doc/lispref/searching.texi
+++ b/doc/lispref/searching.texi
@@ -1,7 +1,7 @@
  @c -*-texinfo-*-
  @c This is part of the GNU Emacs Lisp Reference Manual.
-@c Copyright (C) 1990-1995, 1998-1999, 2001-2012
-@c   Free Software Foundation, Inc.
+@c Copyright (C) 1990-1995, 1998-1999, 2001-2016 Free Software
+@c Foundation, Inc.
  @c See the file elisp.texi for copying conditions.
  @node Searching and Matching
  @chapter Searching and Matching
@@ -113,7 +113,7 @@ match.
  @end deffn
  
  @deffn Command word-search-forward string &optional limit noerror repeat
-This function searches forward from point for a ``word'' match for
+This function searches forward from point for a word match for
  @var{string}.  If it finds a match, it sets point to the end of the
  match found, and returns the new value of point.
  
@@ -137,7 +137,7 @@ the ball boy!"
  
  @group
  (word-search-forward "Please find the ball, boy.")
-     @result{} 36
+     @result{} 39
  
  ---------- Buffer: foo ----------
  He said "Please!  Find
@@ -160,16 +160,17 @@ If @var{repeat} is non-@code{nil}, then the search is repeated that many
  times.  Point is positioned at the end of the last match.
  
  @findex word-search-regexp
-Internal, @code{word-search-forward} and related functions use the
+Internally, @code{word-search-forward} and related functions use the
  function @code{word-search-regexp} to convert @var{string} to a
  regular expression that ignores punctuation.
  @end deffn
  
  @deffn Command word-search-forward-lax string &optional limit noerror repeat
  This command is identical to @code{word-search-forward}, except that
-the end of @var{string} need not match a word boundary, unless @var{string} ends
-in whitespace.  For instance, searching for @samp{ball boy} matches
-@samp{ball boyee}, but does not match @samp{aball boy}.
+the beginning or the end of @var{string} need not match a word
+boundary, unless @var{string} begins or ends in whitespace.
+For instance, searching for @samp{ball boy} matches @samp{ball boyee},
+but does not match @samp{balls boy}.
  @end deffn
  
  @deffn Command word-search-backward string &optional limit noerror repeat
@@ -181,8 +182,8 @@ beginning of the match.
  
  @deffn Command word-search-backward-lax string &optional limit noerror repeat
  This command is identical to @code{word-search-backward}, except that
-the end of @var{string} need not match a word boundary, unless @var{string} ends
-in whitespace.
+the beginning or the end of @var{string} need not match a word
+boundary, unless @var{string} begins or ends in whitespace.
  @end deffn
  
  @node Searching and Case
@@ -256,6 +257,8 @@ it easier to verify even very complex regexps.
  
  @node Syntax of Regexps
  @subsection Syntax of Regular Expressions
+@cindex regexp syntax
+@cindex syntax of regular expressions
  
    Regular expressions have a syntax in which a few characters are
  special constructs and the rest are @dfn{ordinary}.  An ordinary
@@ -273,12 +276,12 @@ expression is ordinary, unless a @samp{\} precedes it.
  therefore @samp{f} is a regular expression that matches the string
  @samp{f} and no other string.  (It does @emph{not} match the string
  @samp{fg}, but it does match a @emph{part} of that string.)  Likewise,
-@samp{o} is a regular expression that matches only @samp{o}.@refill
+@samp{o} is a regular expression that matches only @samp{o}.
  
    Any two regular expressions @var{a} and @var{b} can be concatenated.  The
  result is a regular expression that matches a string if @var{a} matches
  some amount of the beginning of that string and @var{b} matches the rest of
-the string.@refill
+the string.
  
    As a simple example, we can concatenate the regular expressions @samp{f}
  and @samp{o} to get the regular expression @samp{fo}, which matches only
@@ -293,6 +296,7 @@ need to use one of the special regular expression constructs.
  
  @node Regexp Special
  @subsubsection Special Characters in Regular Expressions
+@cindex regexp, special characters in
  
    Here is a list of the characters that are special in a regular
  expression.
@@ -304,7 +308,7 @@ expression.
  is a special character that matches any single character except a newline.
  Using concatenation, we can make regular expressions like @samp{a.b}, which
  matches any three-character string that begins with @samp{a} and ends with
-@samp{b}.@refill
+@samp{b}.
  
  @item @samp{*}
  @cindex @samp{*} in regexp
@@ -355,7 +359,7 @@ preceding expression either once or not at all.  For example,
  
  @item @samp{*?}, @samp{+?}, @samp{??}
  @cindex non-greedy repetition characters in regexp
-These are ``non-greedy'' variants of the operators @samp{*}, @samp{+}
+These are @dfn{non-greedy} variants of the operators @samp{*}, @samp{+}
  and @samp{?}.  Where those operators match the largest possible
  substring (consistent with matching the entire containing expression),
  the non-greedy variants match the smallest possible substring
@@ -488,7 +492,7 @@ example, the regular expression that matches the @samp{\} character is
  @samp{\\}.  To write a Lisp string that contains the characters
  @samp{\\}, Lisp syntax requires you to quote each @samp{\} with another
  @samp{\}.  Therefore, the read syntax for a regular expression matching
-@samp{\} is @code{"\\\\"}.@refill
+@samp{\} is @code{"\\\\"}.
  @end table
  
  @strong{Please note:} For historical compatibility, special characters
@@ -496,7 +500,7 @@ are treated as ordinary ones if they are in contexts where their special
  meanings make no sense.  For example, @samp{*foo} treats @samp{*} as
  ordinary since there is no preceding expression on which the @samp{*}
  can act.  It is poor practice to depend on this behavior; quote the
-special character anyway, regardless of where it appears.@refill
+special character anyway, regardless of where it appears.
  
  As a @samp{\} is not special inside a character alternative, it can
  never remove the special meaning of @samp{-} or @samp{]}.  So you
@@ -537,11 +541,15 @@ and what they mean:
  @item [:ascii:]
  This matches any @acronym{ASCII} character (codes 0--127).
  @item [:alnum:]
-This matches any letter or digit.  (At present, for multibyte
-characters, it matches anything that has word syntax.)
+This matches any letter or digit.  For multibyte characters, it
+matches characters whose Unicode @samp{general-category} property
+(@pxref{Character Properties}) indicates they are alphabetic or
+decimal number characters.
  @item [:alpha:]
-This matches any letter.  (At present, for multibyte characters, it
-matches anything that has word syntax.)
+This matches any letter.  For multibyte characters, it matches
+characters whose Unicode @samp{general-category} property
+(@pxref{Character Properties}) indicates they are alphabetic
+characters.
  @item [:blank:]
  This matches space and tab only.
  @item [:cntrl:]
@@ -550,8 +558,11 @@ This matches any @acronym{ASCII} control character.
  This matches @samp{0} through @samp{9}.  Thus, @samp{[-+[:digit:]]}
  matches any digit, as well as @samp{+} and @samp{-}.
  @item [:graph:]
-This matches graphic characters---everything except @acronym{ASCII} control
-characters, space, and the delete character.
+This matches graphic characters---everything except whitespace,
+@acronym{ASCII} and non-@acronym{ASCII} control characters,
+surrogates, and codepoints unassigned by Unicode, as indicated by the
+Unicode @samp{general-category} property (@pxref{Character
+Properties}).
  @item [:lower:]
  This matches any lower-case letter, as determined by the current case
  table (@pxref{Case Tables}).  If @code{case-fold-search} is
@@ -561,8 +572,8 @@ This matches any multibyte character (@pxref{Text Representations}).
  @item [:nonascii:]
  This matches any non-@acronym{ASCII} character.
  @item [:print:]
-This matches printing characters---everything except @acronym{ASCII} control
-characters and the delete character.
+This matches any printing character---either whitespace, or a graphic
+character matched by @samp{[:graph:]}.
  @item [:punct:]
  This matches any punctuation character.  (At present, for multibyte
  characters, it matches anything that has non-word syntax.)
@@ -589,10 +600,8 @@ through @samp{f} and @samp{A} through @samp{F}.
  
    For the most part, @samp{\} followed by any character matches only
  that character.  However, there are several exceptions: certain
-two-character sequences starting with @samp{\} that have special
-meanings.  (The character after the @samp{\} in such a sequence is
-always ordinary when used on its own.)  Here is a table of the special
-@samp{\} constructs.
+sequences starting with @samp{\} that have special meanings.  Here is
+a table of the special @samp{\} constructs.
  
  @table @samp
  @item \|
@@ -601,14 +610,14 @@ always ordinary when used on its own.)  Here is a table of the special
  specifies an alternative.
  Two regular expressions @var{a} and @var{b} with @samp{\|} in
  between form an expression that matches anything that either @var{a} or
-@var{b} matches.@refill
+@var{b} matches.
  
  Thus, @samp{foo\|bar} matches either @samp{foo} or @samp{bar}
-but no other string.@refill
+but no other string.
  
  @samp{\|} applies to the largest possible surrounding expressions.  Only a
  surrounding @samp{\( @dots{} \)} grouping can limit the grouping power of
-@samp{\|}.@refill
+@samp{\|}.
  
  If you need full backtracking capability to handle multiple uses of
  @samp{\|}, use the POSIX regular expression functions (@pxref{POSIX
@@ -787,7 +796,7 @@ matches the empty string, but only at point.
  matches the empty string, but only at the beginning or
  end of a word.  Thus, @samp{\bfoo\b} matches any occurrence of
  @samp{foo} as a separate word.  @samp{\bballs?\b} matches
-@samp{ball} or @samp{balls} as a separate word.@refill
+@samp{ball} or @samp{balls} as a separate word.
  
  @samp{\b} matches at the beginning or end of the buffer (or string)
  regardless of what text appears next to it.
@@ -895,6 +904,7 @@ beyond the minimum needed to end a sentence.
  
    These functions operate on regular expressions.
  
+@cindex quote special characters in regexp
  @defun regexp-quote string
  This function returns a regular expression whose only exact match is
  @var{string}.  Using this regular expression in @code{looking-at} will
@@ -925,6 +935,7 @@ whitespace:
  @end example
  @end defun
  
+@cindex optimize regexp
  @defun regexp-opt strings &optional paren
  This function returns an efficient regular expression that will match
  any of the strings in the list @var{strings}.  This is useful when you
@@ -984,10 +995,11 @@ list of characters @var{chars}.
  @cindex searching for regexp
  
    In GNU Emacs, you can search for the next match for a regular
-expression either incrementally or not.  For incremental search
-commands, see @ref{Regexp Search, , Regular Expression Search, emacs,
-The GNU Emacs Manual}.  Here we describe only the search functions
-useful in programs.  The principal one is @code{re-search-forward}.
+expression (@pxref{Syntax of Regexps}) either incrementally or not.
+For incremental search commands, see @ref{Regexp Search, , Regular
+Expression Search, emacs, The GNU Emacs Manual}.  Here we describe
+only the search functions useful in programs.  The principal one is
+@code{re-search-forward}.
  
    These search functions convert the regular expression to multibyte if
  the buffer is multibyte; they convert the regular expression to unibyte
@@ -1090,7 +1102,7 @@ For example,
  The index of the first character of the
  string is 0, the index of the second character is 1, and so on.
  
-After this function returns, the index of the first character beyond
+If this function finds a match, the index of the first character beyond
  the match is available as @code{(match-end 0)}.  @xref{Match Data}.
  
  @example
@@ -1148,13 +1160,7 @@ implemented by searching backwards from point for a match that ends at
  point.  That can be quite slow if it has to search a long distance.
  You can bound the time required by specifying @var{limit}, which says
  not to search before @var{limit}.  In this case, the match that is
-found must begin at or after @var{limit}.
-
-If @var{greedy} is non-@code{nil}, this function extends the match
-backwards as far as possible, stopping when a single additional
-previous character cannot be part of a match for regexp.  When the
-match is extended, its starting position is allowed to occur before
-@var{limit}.
+found must begin at or after @var{limit}.  Here's an example:
  
  @example
  @group
@@ -1170,6 +1176,12 @@ comes back" twice.
  @end group
  @end example
  
+If @var{greedy} is non-@code{nil}, this function extends the match
+backwards as far as possible, stopping when a single additional
+previous character cannot be part of a match for regexp.  When the
+match is extended, its starting position is allowed to occur before
+@var{limit}.
+
  @c http://debbugs.gnu.org/5689
  As a general recommendation, try to avoid using @code{looking-back}
  wherever possible, since it is slow.  For this reason, there are no
@@ -1285,7 +1297,7 @@ If you did the last search in a buffer, you should omit the
  the current buffer is the one in which you performed the last search.
  Then this function edits the buffer, replacing the matched text with
  @var{replacement}.  It leaves point at the end of the replacement
-text, and returns @code{t}.
+text.
  
  If you performed the last search on a string, pass the same string as
  @var{string}.  Then this function returns a new string, in which the
@@ -1325,7 +1337,7 @@ This stands for a single @samp{\} in the replacement text.
  
  @item @samp{\?}
  This stands for itself (for compatibility with @code{replace-regexp}
-and related commands; @pxref{Regexp Replacement,,, emacs, The GNU
+and related commands; @pxref{Regexp Replace,,, emacs, The GNU
  Emacs Manual}).
  @end table
  
@@ -1413,8 +1425,9 @@ has no text properties.
  @end defun
  
  @defun match-beginning count
-This function returns the position of the start of the text matched by the
-last regular expression searched for, or a subexpression of it.
+If the last regular expression search found a match, this function
+returns the position of the start of the matching text or of a
+subexpression of it.
  
  If @var{count} is zero, then the value is the position of the start of
  the entire match.  Otherwise, @var{count} specifies a subexpression in
@@ -1746,7 +1759,7 @@ in two ways:
  
  @itemize @bullet
  @item
-The ``key bindings'' are not commands, just symbols that are meaningful
+The key bindings are not commands, just symbols that are meaningful
  to the functions that use this map.
  
  @item
@@ -1757,7 +1770,7 @@ event and look it up ``by hand''.
  @end itemize
  @end defvar
  
-Here are the meaningful ``bindings'' for @code{query-replace-map}.
+Here are the meaningful bindings for @code{query-replace-map}.
  Several of them are meaningful only for @code{query-replace} and
  friends.
  
@@ -1774,7 +1787,7 @@ questions, assuming that the answers will be ``no''.
  
  @item exit-prefix
  Like @code{exit}, but add the key that was pressed to
-@code{unread-comment-events}.
+@code{unread-command-events} (@pxref{Event Input Misc}).
  
  @item act-and-exit
  Answer this question ``yes'', and give up on the entire series of
@@ -1822,7 +1835,7 @@ Display some help, then ask again.
  @defvar multi-query-replace-map
  This variable holds a keymap that extends @code{query-replace-map} by
  providing additional keybindings that are useful in multi-buffer
-replacements.  The additional ``bindings'' are:
+replacements.  The additional bindings are:
  
  @table @code
  @item automatic-all