Mercurial > emacs
annotate man/search.texi @ 37678:ebec0594dece
(compile-files): Redirect output of chmod to
/dev/null.
| author | Gerd Moellmann <gerd@gnu.org> |
|---|---|
| date | Fri, 11 May 2001 10:53:56 +0000 |
| parents | 2f80188cbbd6 |
| children | 51351cf9de04 |
| rev | line source |
|---|---|
| 25829 | 1 @c This is part of the Emacs manual. |
| 36058 | 2 @c Copyright (C) 1985, 86, 87, 93, 94, 95, 97, 2000, 2001 |
| 28126 | 3 @c Free Software Foundation, Inc. |
| 25829 | 4 @c See file emacs.texi for copying conditions. |
| 5 @node Search, Fixit, Display, Top | |
| 6 @chapter Searching and Replacement | |
| 7 @cindex searching | |
| 8 @cindex finding strings within text | |
| 9 | |
| 10 Like other editors, Emacs has commands for searching for occurrences of | |
| 11 a string. The principal search command is unusual in that it is | |
| 12 @dfn{incremental}; it begins to search before you have finished typing the | |
| 13 search string. There are also nonincremental search commands more like | |
| 14 those of other editors. | |
| 15 | |
| 16 Besides the usual @code{replace-string} command that finds all | |
| 17 occurrences of one string and replaces them with another, Emacs has a fancy | |
| 18 replacement command called @code{query-replace} which asks interactively | |
| 19 which occurrences to replace. | |
| 20 | |
| 21 @menu | |
| 22 * Incremental Search:: Search happens as you type the string. | |
| 23 * Nonincremental Search:: Specify entire string and then search. | |
| 24 * Word Search:: Search for sequence of words. | |
| 25 * Regexp Search:: Search for match for a regexp. | |
| 26 * Regexps:: Syntax of regular expressions. | |
| 27 * Search Case:: To ignore case while searching, or not. | |
| 28 * Replace:: Search, and replace some or all matches. | |
| 29 * Other Repeating Search:: Operating on all matches for some regexp. | |
| 30 @end menu | |
| 31 | |
| 32 @node Incremental Search, Nonincremental Search, Search, Search | |
| 33 @section Incremental Search | |
| 34 | |
| 35 @cindex incremental search | |
| 36 An incremental search begins searching as soon as you type the first | |
| 37 character of the search string. As you type in the search string, Emacs | |
| 38 shows you where the string (as you have typed it so far) would be | |
| 39 found. When you have typed enough characters to identify the place you | |
| 40 want, you can stop. Depending on what you plan to do next, you may or | |
| 41 may not need to terminate the search explicitly with @key{RET}. | |
| 42 | |
| 43 @c WideCommands | |
| 44 @table @kbd | |
| 45 @item C-s | |
| 46 Incremental search forward (@code{isearch-forward}). | |
| 47 @item C-r | |
| 48 Incremental search backward (@code{isearch-backward}). | |
| 49 @end table | |
| 50 | |
| 51 @kindex C-s | |
| 52 @findex isearch-forward | |
| 53 @kbd{C-s} starts an incremental search. @kbd{C-s} reads characters from | |
| 54 the keyboard and positions the cursor at the first occurrence of the | |
| 55 characters that you have typed. If you type @kbd{C-s} and then @kbd{F}, | |
| 56 the cursor moves right after the first @samp{F}. Type an @kbd{O}, and see | |
| 57 the cursor move to after the first @samp{FO}. After another @kbd{O}, the | |
| 58 cursor is after the first @samp{FOO} after the place where you started the | |
| 59 search. At each step, the buffer text that matches the search string is | |
| 60 highlighted, if the terminal can do that; at each step, the current search | |
|
36177
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
61 string is updated in the echo area. |
| 25829 | 62 |
| 63 If you make a mistake in typing the search string, you can cancel | |
| 64 characters with @key{DEL}. Each @key{DEL} cancels the last character of | |
| 65 search string. This does not happen until Emacs is ready to read another | |
| 66 input character; first it must either find, or fail to find, the character | |
| 67 you want to erase. If you do not want to wait for this to happen, use | |
| 68 @kbd{C-g} as described below. | |
| 69 | |
| 70 When you are satisfied with the place you have reached, you can type | |
| 71 @key{RET}, which stops searching, leaving the cursor where the search | |
| 72 brought it. Also, any command not specially meaningful in searches | |
|
37235
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
73 stops the searching and is then executed. Thus, typing @kbd{C-a} |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
74 would exit the search and then move to the beginning of the line. |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
75 @key{RET} is necessary only if the next command you want to type is a |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
76 printing character, @key{DEL}, @key{RET}, or another character that is |
| 25829 | 77 special within searches (@kbd{C-q}, @kbd{C-w}, @kbd{C-r}, @kbd{C-s}, |
|
37235
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
78 @kbd{C-y}, @kbd{M-y}, @kbd{M-r}, @kbd{M-s}, and some other |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
79 meta-characters). |
| 25829 | 80 |
| 81 Sometimes you search for @samp{FOO} and find it, but not the one you | |
|
36177
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
82 expected to find. There was a second @samp{FOO} that you forgot |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
83 about, before the one you were aiming for. In this event, type |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
84 another @kbd{C-s} to move to the next occurrence of the search string. |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
85 You can repeat this any number of times. If you overshoot, you can |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
86 cancel some @kbd{C-s} characters with @key{DEL}. |
| 25829 | 87 |
| 88 After you exit a search, you can search for the same string again by | |
| 89 typing just @kbd{C-s C-s}: the first @kbd{C-s} is the key that invokes | |
| 90 incremental search, and the second @kbd{C-s} means ``search again.'' | |
| 91 | |
| 92 To reuse earlier search strings, use the @dfn{search ring}. The | |
| 93 commands @kbd{M-p} and @kbd{M-n} move through the ring to pick a search | |
| 94 string to reuse. These commands leave the selected search ring element | |
| 95 in the minibuffer, where you can edit it. Type @kbd{C-s} or @kbd{C-r} | |
| 96 to terminate editing the string and search for it. | |
| 97 | |
| 98 If your string is not found at all, the echo area says @samp{Failing | |
| 99 I-Search}. The cursor is after the place where Emacs found as much of your | |
| 100 string as it could. Thus, if you search for @samp{FOOT}, and there is no | |
| 101 @samp{FOOT}, you might see the cursor after the @samp{FOO} in @samp{FOOL}. | |
| 102 At this point there are several things you can do. If your string was | |
| 103 mistyped, you can rub some of it out and correct it. If you like the place | |
| 104 you have found, you can type @key{RET} or some other Emacs command to | |
| 105 ``accept what the search offered.'' Or you can type @kbd{C-g}, which | |
| 106 removes from the search string the characters that could not be found (the | |
| 107 @samp{T} in @samp{FOOT}), leaving those that were found (the @samp{FOO} in | |
| 108 @samp{FOOT}). A second @kbd{C-g} at that point cancels the search | |
| 109 entirely, returning point to where it was when the search started. | |
| 110 | |
| 111 An upper-case letter in the search string makes the search | |
| 112 case-sensitive. If you delete the upper-case character from the search | |
| 113 string, it ceases to have this effect. @xref{Search Case}. | |
| 114 | |
|
36177
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
115 To search for a newline, type @kbd{C-j}. To search for another |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
116 control character, such as control-S or carriage return, you must quote |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
117 it by typing @kbd{C-q} first. This function of @kbd{C-q} is analogous |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
118 to its use for insertion (@pxref{Inserting Text}): it causes the |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
119 following character to be treated the way any ``ordinary'' character is |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
120 treated in the same context. You can also specify a character by its |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
121 octal code: enter @kbd{C-q} followed by a sequence of octal digits. |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
122 |
|
35904
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
123 @cindex searching for non-ASCII characters |
|
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
124 @cindex input method, during incremental search |
|
36177
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
125 To search for non-ASCII characters, you must use an input method |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
126 (@pxref{Input Methods}). If an input method is turned on in the |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
127 current buffer when you start the search, you can use it while you |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
128 type the search string also. Emacs indicates that by including the |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
129 input method mnemonic in its prompt, like this: |
|
35904
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
130 |
|
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
131 @example |
|
36177
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
132 I-search [@var{im}]: |
|
35904
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
133 @end example |
|
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
134 |
|
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
135 @noindent |
|
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
136 @findex isearch-toggle-input-method |
|
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
137 @findex isearch-toggle-specified-input-method |
|
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
138 where @var{im} is the mnemonic of the active input method. You can |
|
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
139 toggle (enable or disable) the input method while you type the search |
|
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
140 string with @kbd{C-\} (@code{isearch-toggle-input-method}). You can |
|
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
141 turn on a certain (non-default) input method with @kbd{C-^} |
|
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
142 (@code{isearch-toggle-specified-input-method}), which prompts for the |
|
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
143 name of the input method. Note that the input method you turn on |
|
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
144 during incremental search is turned on in the current buffer as well. |
|
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
145 |
| 25829 | 146 If a search is failing and you ask to repeat it by typing another |
|
36177
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
147 @kbd{C-s}, it starts again from the beginning of the buffer. |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
148 Repeating a failing reverse search with @kbd{C-r} starts again from |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
149 the end. This is called @dfn{wrapping around}, and @samp{Wrapped} |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
150 appears in the search prompt once this has happened. If you keep on |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
151 going past the original starting point of the search, it changes to |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
152 @samp{Overwrapped}, which means that you are revisiting matches that |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
153 you have already seen. |
| 25829 | 154 |
| 155 @cindex quitting (in search) | |
| 156 The @kbd{C-g} ``quit'' character does special things during searches; | |
| 157 just what it does depends on the status of the search. If the search has | |
| 158 found what you specified and is waiting for input, @kbd{C-g} cancels the | |
| 159 entire search. The cursor moves back to where you started the search. If | |
| 160 @kbd{C-g} is typed when there are characters in the search string that have | |
| 161 not been found---because Emacs is still searching for them, or because it | |
| 162 has failed to find them---then the search string characters which have not | |
| 163 been found are discarded from the search string. With them gone, the | |
| 164 search is now successful and waiting for more input, so a second @kbd{C-g} | |
| 165 will cancel the entire search. | |
| 166 | |
| 167 You can change to searching backwards with @kbd{C-r}. If a search fails | |
| 168 because the place you started was too late in the file, you should do this. | |
| 169 Repeated @kbd{C-r} keeps looking for more occurrences backwards. A | |
| 170 @kbd{C-s} starts going forwards again. @kbd{C-r} in a search can be canceled | |
| 171 with @key{DEL}. | |
| 172 | |
| 173 @kindex C-r | |
| 174 @findex isearch-backward | |
| 175 If you know initially that you want to search backwards, you can use | |
| 176 @kbd{C-r} instead of @kbd{C-s} to start the search, because @kbd{C-r} as | |
| 177 a key runs a command (@code{isearch-backward}) to search backward. A | |
| 178 backward search finds matches that are entirely before the starting | |
| 179 point, just as a forward search finds matches that begin after it. | |
| 180 | |
| 181 The characters @kbd{C-y} and @kbd{C-w} can be used in incremental | |
| 182 search to grab text from the buffer into the search string. This makes | |
| 183 it convenient to search for another occurrence of text at point. | |
| 184 @kbd{C-w} copies the word after point as part of the search string, | |
| 185 advancing point over that word. Another @kbd{C-s} to repeat the search | |
| 186 will then search for a string including that word. @kbd{C-y} is similar | |
| 187 to @kbd{C-w} but copies all the rest of the current line into the search | |
| 188 string. Both @kbd{C-y} and @kbd{C-w} convert the text they copy to | |
| 189 lower case if the search is currently not case-sensitive; this is so the | |
| 190 search remains case-insensitive. | |
| 191 | |
| 192 The character @kbd{M-y} copies text from the kill ring into the search | |
| 193 string. It uses the same text that @kbd{C-y} as a command would yank. | |
|
36177
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
194 @kbd{Mouse-2} in the echo area does the same. |
| 25829 | 195 @xref{Yanking}. |
| 196 | |
| 197 When you exit the incremental search, it sets the mark to where point | |
| 198 @emph{was}, before the search. That is convenient for moving back | |
| 199 there. In Transient Mark mode, incremental search sets the mark without | |
| 200 activating it, and does so only if the mark is not already active. | |
| 201 | |
| 27217 | 202 @cindex lazy search highlighting |
| 203 @vindex isearch-lazy-highlight | |
|
36177
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
204 When you pause for a little while during incremental search, it |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
205 highlights all other possible matches for the search string. This |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
206 makes it easier to anticipate where you can get to by typing @kbd{C-s} |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
207 or @kbd{C-r} to repeat the search. The short delay before highlighting |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
208 other matches helps indicate which match is the current one. |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
209 If you don't like this feature, you can turn it off by setting |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
210 @code{isearch-lazy-highlight} to @code{nil}. |
| 27217 | 211 |
|
31062
c503cf38b3e8
More details and index entries for isearch-lazy-highlight.
Eli Zaretskii <eliz@gnu.org>
parents:
31048
diff
changeset
|
212 @vindex isearch-lazy-highlight-face |
|
c503cf38b3e8
More details and index entries for isearch-lazy-highlight.
Eli Zaretskii <eliz@gnu.org>
parents:
31048
diff
changeset
|
213 @cindex faces for highlighting search matches |
|
c503cf38b3e8
More details and index entries for isearch-lazy-highlight.
Eli Zaretskii <eliz@gnu.org>
parents:
31048
diff
changeset
|
214 You can control how does the highlighting of matches look like by |
|
36177
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
215 customizing the faces @code{isearch} (used for the current match) and |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
216 @code{isearch-lazy-highlight-face} (used for the other matches). |
|
31062
c503cf38b3e8
More details and index entries for isearch-lazy-highlight.
Eli Zaretskii <eliz@gnu.org>
parents:
31048
diff
changeset
|
217 |
| 25829 | 218 @vindex isearch-mode-map |
| 219 To customize the special characters that incremental search understands, | |
| 220 alter their bindings in the keymap @code{isearch-mode-map}. For a list | |
| 221 of bindings, look at the documentation of @code{isearch-mode} with | |
| 222 @kbd{C-h f isearch-mode @key{RET}}. | |
| 223 | |
| 224 @subsection Slow Terminal Incremental Search | |
| 225 | |
| 226 Incremental search on a slow terminal uses a modified style of display | |
| 227 that is designed to take less time. Instead of redisplaying the buffer at | |
| 228 each place the search gets to, it creates a new single-line window and uses | |
| 229 that to display the line that the search has found. The single-line window | |
| 230 comes into play as soon as point gets outside of the text that is already | |
| 231 on the screen. | |
| 232 | |
| 233 When you terminate the search, the single-line window is removed. | |
| 234 Then Emacs redisplays the window in which the search was done, to show | |
| 235 its new position of point. | |
| 236 | |
| 237 @vindex search-slow-speed | |
| 238 The slow terminal style of display is used when the terminal baud rate is | |
| 239 less than or equal to the value of the variable @code{search-slow-speed}, | |
|
37235
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
240 initially 1200. See @code{baud-rate} in @ref{Display Custom}. |
| 25829 | 241 |
| 242 @vindex search-slow-window-lines | |
| 243 The number of lines to use in slow terminal search display is controlled | |
| 244 by the variable @code{search-slow-window-lines}. Its normal value is 1. | |
| 245 | |
| 246 @node Nonincremental Search, Word Search, Incremental Search, Search | |
| 247 @section Nonincremental Search | |
| 248 @cindex nonincremental search | |
| 249 | |
| 250 Emacs also has conventional nonincremental search commands, which require | |
| 251 you to type the entire search string before searching begins. | |
| 252 | |
| 253 @table @kbd | |
| 254 @item C-s @key{RET} @var{string} @key{RET} | |
| 255 Search for @var{string}. | |
| 256 @item C-r @key{RET} @var{string} @key{RET} | |
| 257 Search backward for @var{string}. | |
| 258 @end table | |
| 259 | |
| 260 To do a nonincremental search, first type @kbd{C-s @key{RET}}. This | |
| 261 enters the minibuffer to read the search string; terminate the string | |
| 262 with @key{RET}, and then the search takes place. If the string is not | |
| 263 found, the search command gets an error. | |
| 264 | |
| 265 The way @kbd{C-s @key{RET}} works is that the @kbd{C-s} invokes | |
| 266 incremental search, which is specially programmed to invoke nonincremental | |
| 267 search if the argument you give it is empty. (Such an empty argument would | |
| 268 otherwise be useless.) @kbd{C-r @key{RET}} also works this way. | |
| 269 | |
| 270 However, nonincremental searches performed using @kbd{C-s @key{RET}} do | |
| 271 not call @code{search-forward} right away. The first thing done is to see | |
| 272 if the next character is @kbd{C-w}, which requests a word search. | |
| 273 @ifinfo | |
| 274 @xref{Word Search}. | |
| 275 @end ifinfo | |
| 276 | |
| 277 @findex search-forward | |
| 278 @findex search-backward | |
| 279 Forward and backward nonincremental searches are implemented by the | |
| 280 commands @code{search-forward} and @code{search-backward}. These | |
| 281 commands may be bound to keys in the usual manner. The feature that you | |
| 282 can get to them via the incremental search commands exists for | |
| 283 historical reasons, and to avoid the need to find suitable key sequences | |
| 284 for them. | |
| 285 | |
| 286 @node Word Search, Regexp Search, Nonincremental Search, Search | |
| 287 @section Word Search | |
| 288 @cindex word search | |
| 289 | |
| 290 Word search searches for a sequence of words without regard to how the | |
| 291 words are separated. More precisely, you type a string of many words, | |
| 292 using single spaces to separate them, and the string can be found even if | |
| 293 there are multiple spaces, newlines or other punctuation between the words. | |
| 294 | |
| 295 Word search is useful for editing a printed document made with a text | |
| 296 formatter. If you edit while looking at the printed, formatted version, | |
| 297 you can't tell where the line breaks are in the source file. With word | |
| 298 search, you can search without having to know them. | |
| 299 | |
| 300 @table @kbd | |
| 301 @item C-s @key{RET} C-w @var{words} @key{RET} | |
| 302 Search for @var{words}, ignoring details of punctuation. | |
| 303 @item C-r @key{RET} C-w @var{words} @key{RET} | |
| 304 Search backward for @var{words}, ignoring details of punctuation. | |
| 305 @end table | |
| 306 | |
| 307 Word search is a special case of nonincremental search and is invoked | |
| 308 with @kbd{C-s @key{RET} C-w}. This is followed by the search string, | |
| 309 which must always be terminated with @key{RET}. Being nonincremental, | |
| 310 this search does not start until the argument is terminated. It works | |
| 311 by constructing a regular expression and searching for that; see | |
| 312 @ref{Regexp Search}. | |
| 313 | |
| 314 Use @kbd{C-r @key{RET} C-w} to do backward word search. | |
| 315 | |
| 316 @findex word-search-forward | |
| 317 @findex word-search-backward | |
| 318 Forward and backward word searches are implemented by the commands | |
| 319 @code{word-search-forward} and @code{word-search-backward}. These | |
| 320 commands may be bound to keys in the usual manner. The feature that you | |
| 321 can get to them via the incremental search commands exists for historical | |
| 322 reasons, and to avoid the need to find suitable key sequences for them. | |
| 323 | |
| 324 @node Regexp Search, Regexps, Word Search, Search | |
| 325 @section Regular Expression Search | |
| 326 @cindex regular expression | |
| 327 @cindex regexp | |
| 328 | |
| 329 A @dfn{regular expression} (@dfn{regexp}, for short) is a pattern that | |
| 330 denotes a class of alternative strings to match, possibly infinitely | |
| 331 many. In GNU Emacs, you can search for the next match for a regexp | |
| 332 either incrementally or not. | |
| 333 | |
| 334 @kindex C-M-s | |
| 335 @findex isearch-forward-regexp | |
| 336 @kindex C-M-r | |
| 337 @findex isearch-backward-regexp | |
| 338 Incremental search for a regexp is done by typing @kbd{C-M-s} | |
| 339 (@code{isearch-forward-regexp}). This command reads a search string | |
| 340 incrementally just like @kbd{C-s}, but it treats the search string as a | |
| 341 regexp rather than looking for an exact match against the text in the | |
| 342 buffer. Each time you add text to the search string, you make the | |
| 343 regexp longer, and the new regexp is searched for. Invoking @kbd{C-s} | |
| 344 with a prefix argument (its value does not matter) is another way to do | |
| 345 a forward incremental regexp search. To search backward for a regexp, | |
| 346 use @kbd{C-M-r} (@code{isearch-backward-regexp}), or @kbd{C-r} with a | |
| 347 prefix argument. | |
| 348 | |
| 349 All of the control characters that do special things within an | |
| 350 ordinary incremental search have the same function in incremental regexp | |
| 351 search. Typing @kbd{C-s} or @kbd{C-r} immediately after starting the | |
| 352 search retrieves the last incremental search regexp used; that is to | |
| 353 say, incremental regexp and non-regexp searches have independent | |
| 354 defaults. They also have separate search rings that you can access with | |
| 355 @kbd{M-p} and @kbd{M-n}. | |
| 356 | |
| 357 If you type @key{SPC} in incremental regexp search, it matches any | |
| 358 sequence of whitespace characters, including newlines. If you want | |
| 359 to match just a space, type @kbd{C-q @key{SPC}}. | |
| 360 | |
| 361 Note that adding characters to the regexp in an incremental regexp | |
| 362 search can make the cursor move back and start again. For example, if | |
| 363 you have searched for @samp{foo} and you add @samp{\|bar}, the cursor | |
| 364 backs up in case the first @samp{bar} precedes the first @samp{foo}. | |
| 365 | |
| 366 @findex re-search-forward | |
| 367 @findex re-search-backward | |
| 368 Nonincremental search for a regexp is done by the functions | |
| 369 @code{re-search-forward} and @code{re-search-backward}. You can invoke | |
| 370 these with @kbd{M-x}, or bind them to keys, or invoke them by way of | |
| 371 incremental regexp search with @kbd{C-M-s @key{RET}} and @kbd{C-M-r | |
| 372 @key{RET}}. | |
| 373 | |
| 374 If you use the incremental regexp search commands with a prefix | |
| 375 argument, they perform ordinary string search, like | |
| 376 @code{isearch-forward} and @code{isearch-backward}. @xref{Incremental | |
| 377 Search}. | |
| 378 | |
| 379 @node Regexps, Search Case, Regexp Search, Search | |
| 380 @section Syntax of Regular Expressions | |
|
36263
11db0318031d
Remove redundant index entries.
Eli Zaretskii <eliz@gnu.org>
parents:
36177
diff
changeset
|
381 @cindex syntax of regexps |
| 25829 | 382 |
| 383 Regular expressions have a syntax in which a few characters are | |
| 384 special constructs and the rest are @dfn{ordinary}. An ordinary | |
| 385 character is a simple regular expression which matches that same | |
| 386 character and nothing else. The special characters are @samp{$}, | |
| 387 @samp{^}, @samp{.}, @samp{*}, @samp{+}, @samp{?}, @samp{[}, @samp{]} and | |
| 388 @samp{\}. Any other character appearing in a regular expression is | |
|
37069
6dee6cc113a5
(Regexps): Say up front that backslashes must be doubled in a Lisp program.
Eli Zaretskii <eliz@gnu.org>
parents:
36383
diff
changeset
|
389 ordinary, unless a @samp{\} precedes it. (When you use regular |
|
6dee6cc113a5
(Regexps): Say up front that backslashes must be doubled in a Lisp program.
Eli Zaretskii <eliz@gnu.org>
parents:
36383
diff
changeset
|
390 expressions in a Lisp program, each @samp{\} must be doubled, see the |
|
6dee6cc113a5
(Regexps): Say up front that backslashes must be doubled in a Lisp program.
Eli Zaretskii <eliz@gnu.org>
parents:
36383
diff
changeset
|
391 example near the end of this section.) |
| 25829 | 392 |
| 393 For example, @samp{f} is not a special character, so it is ordinary, and | |
| 394 therefore @samp{f} is a regular expression that matches the string | |
| 395 @samp{f} and no other string. (It does @emph{not} match the string | |
| 396 @samp{ff}.) Likewise, @samp{o} is a regular expression that matches | |
| 397 only @samp{o}. (When case distinctions are being ignored, these regexps | |
| 398 also match @samp{F} and @samp{O}, but we consider this a generalization | |
| 399 of ``the same string,'' rather than an exception.) | |
| 400 | |
| 401 Any two regular expressions @var{a} and @var{b} can be concatenated. The | |
| 402 result is a regular expression which matches a string if @var{a} matches | |
| 403 some amount of the beginning of that string and @var{b} matches the rest of | |
| 404 the string.@refill | |
| 405 | |
| 406 As a simple example, we can concatenate the regular expressions @samp{f} | |
| 407 and @samp{o} to get the regular expression @samp{fo}, which matches only | |
| 408 the string @samp{fo}. Still trivial. To do something nontrivial, you | |
| 409 need to use one of the special characters. Here is a list of them. | |
| 410 | |
| 411 @table @kbd | |
| 412 @item .@: @r{(Period)} | |
| 413 is a special character that matches any single character except a newline. | |
| 414 Using concatenation, we can make regular expressions like @samp{a.b}, which | |
| 415 matches any three-character string that begins with @samp{a} and ends with | |
| 416 @samp{b}.@refill | |
| 417 | |
| 418 @item * | |
| 419 is not a construct by itself; it is a postfix operator that means to | |
| 420 match the preceding regular expression repetitively as many times as | |
| 421 possible. Thus, @samp{o*} matches any number of @samp{o}s (including no | |
| 422 @samp{o}s). | |
| 423 | |
| 424 @samp{*} always applies to the @emph{smallest} possible preceding | |
| 425 expression. Thus, @samp{fo*} has a repeating @samp{o}, not a repeating | |
| 426 @samp{fo}. It matches @samp{f}, @samp{fo}, @samp{foo}, and so on. | |
| 427 | |
| 428 The matcher processes a @samp{*} construct by matching, immediately, | |
| 429 as many repetitions as can be found. Then it continues with the rest | |
| 430 of the pattern. If that fails, backtracking occurs, discarding some | |
| 431 of the matches of the @samp{*}-modified construct in case that makes | |
| 432 it possible to match the rest of the pattern. For example, in matching | |
| 433 @samp{ca*ar} against the string @samp{caaar}, the @samp{a*} first | |
| 434 tries to match all three @samp{a}s; but the rest of the pattern is | |
| 435 @samp{ar} and there is only @samp{r} left to match, so this try fails. | |
| 436 The next alternative is for @samp{a*} to match only two @samp{a}s. | |
| 437 With this choice, the rest of the regexp matches successfully.@refill | |
| 438 | |
| 439 @item + | |
| 440 is a postfix operator, similar to @samp{*} except that it must match | |
| 441 the preceding expression at least once. So, for example, @samp{ca+r} | |
| 442 matches the strings @samp{car} and @samp{caaaar} but not the string | |
| 443 @samp{cr}, whereas @samp{ca*r} matches all three strings. | |
| 444 | |
| 445 @item ? | |
| 446 is a postfix operator, similar to @samp{*} except that it can match the | |
| 447 preceding expression either once or not at all. For example, | |
| 448 @samp{ca?r} matches @samp{car} or @samp{cr}; nothing else. | |
| 449 | |
|
27094
6500fd0a7d8e
*** empty log message ***
Stefan Monnier <monnier@iro.umontreal.ca>
parents:
25829
diff
changeset
|
450 @item *?, +?, ?? |
|
27139
a9508422287d
Improve markup for the description of non-greedy operators.
Eli Zaretskii <eliz@gnu.org>
parents:
27094
diff
changeset
|
451 @cindex non-greedy regexp matching |
|
27094
6500fd0a7d8e
*** empty log message ***
Stefan Monnier <monnier@iro.umontreal.ca>
parents:
25829
diff
changeset
|
452 are non-greedy variants of the operators above. The normal operators |
|
36177
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
453 @samp{*}, @samp{+}, @samp{?} are @dfn{greedy} in that they match as |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
454 much as they can, as long as the overall regexp can still match. With |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
455 a following @samp{?}, they are non-greedy: they will match as little |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
456 as possible. |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
457 |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
458 Thus, both @samp{ab*} and @samp{ab*?} can match the string @samp{a} |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
459 and the string @samp{abbbb}; but if you try to match them both against |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
460 the text @samp{abbb}, @samp{ab*} will match it all (the longest valid |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
461 match), while @samp{ab*?} will match just @samp{a} (the shortest |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
462 valid match). |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
463 |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
464 @item \@{@var{n}\@} |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
465 is a postfix operator that specifies repetition @var{n} times---that |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
466 is, the preceding regular expression must match exactly @var{n} times |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
467 in a row. For example, @samp{x\@{4\@}} matches the string @samp{xxxx} |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
468 and nothing else. |
|
27094
6500fd0a7d8e
*** empty log message ***
Stefan Monnier <monnier@iro.umontreal.ca>
parents:
25829
diff
changeset
|
469 |
|
28063
f1b33463506d
*** empty log message ***
Stefan Monnier <monnier@iro.umontreal.ca>
parents:
27694
diff
changeset
|
470 @item \@{@var{n},@var{m}\@} |
|
36177
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
471 is a postfix operator that specifies repetition between @var{n} and |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
472 @var{m} times---that is, the preceding regular expression must match |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
473 at least @var{n} times, but no more than @var{m} times. If @var{m} is |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
474 omitted, then there is no upper limit, but the preceding regular |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
475 expression must match at least @var{n} times.@* @samp{\@{0,1\@}} is |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
476 equivalent to @samp{?}. @* @samp{\@{0,\@}} is equivalent to |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
477 @samp{*}. @* @samp{\@{1,\@}} is equivalent to @samp{+}. |
|
27694
5c14849aee4c
documented \{n,m\} intervals
Stefan Monnier <monnier@iro.umontreal.ca>
parents:
27217
diff
changeset
|
478 |
| 25829 | 479 @item [ @dots{} ] |
| 480 is a @dfn{character set}, which begins with @samp{[} and is terminated | |
| 481 by @samp{]}. In the simplest case, the characters between the two | |
| 482 brackets are what this set can match. | |
| 483 | |
| 484 Thus, @samp{[ad]} matches either one @samp{a} or one @samp{d}, and | |
| 485 @samp{[ad]*} matches any string composed of just @samp{a}s and @samp{d}s | |
| 486 (including the empty string), from which it follows that @samp{c[ad]*r} | |
| 487 matches @samp{cr}, @samp{car}, @samp{cdr}, @samp{caddaar}, etc. | |
| 488 | |
| 489 You can also include character ranges in a character set, by writing the | |
| 490 starting and ending characters with a @samp{-} between them. Thus, | |
| 491 @samp{[a-z]} matches any lower-case ASCII letter. Ranges may be | |
| 492 intermixed freely with individual characters, as in @samp{[a-z$%.]}, | |
| 493 which matches any lower-case ASCII letter or @samp{$}, @samp{%} or | |
| 494 period. | |
| 495 | |
| 496 Note that the usual regexp special characters are not special inside a | |
| 497 character set. A completely different set of special characters exists | |
| 498 inside character sets: @samp{]}, @samp{-} and @samp{^}. | |
| 499 | |
| 500 To include a @samp{]} in a character set, you must make it the first | |
| 501 character. For example, @samp{[]a]} matches @samp{]} or @samp{a}. To | |
| 502 include a @samp{-}, write @samp{-} as the first or last character of the | |
| 503 set, or put it after a range. Thus, @samp{[]-]} matches both @samp{]} | |
| 504 and @samp{-}. | |
| 505 | |
| 506 To include @samp{^} in a set, put it anywhere but at the beginning of | |
|
37235
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
507 the set. (At the beginning, it complements the set---see below.) |
| 25829 | 508 |
| 509 When you use a range in case-insensitive search, you should write both | |
| 510 ends of the range in upper case, or both in lower case, or both should | |
| 511 be non-letters. The behavior of a mixed-case range such as @samp{A-z} | |
| 512 is somewhat ill-defined, and it may change in future Emacs versions. | |
| 513 | |
| 514 @item [^ @dots{} ] | |
| 515 @samp{[^} begins a @dfn{complemented character set}, which matches any | |
| 516 character except the ones specified. Thus, @samp{[^a-z0-9A-Z]} matches | |
|
37235
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
517 all characters @emph{except} ASCII letters and digits. |
| 25829 | 518 |
| 519 @samp{^} is not special in a character set unless it is the first | |
| 520 character. The character following the @samp{^} is treated as if it | |
| 521 were first (in other words, @samp{-} and @samp{]} are not special there). | |
| 522 | |
| 523 A complemented character set can match a newline, unless newline is | |
| 524 mentioned as one of the characters not to match. This is in contrast to | |
| 525 the handling of regexps in programs such as @code{grep}. | |
| 526 | |
| 527 @item ^ | |
| 528 is a special character that matches the empty string, but only at the | |
| 529 beginning of a line in the text being matched. Otherwise it fails to | |
| 530 match anything. Thus, @samp{^foo} matches a @samp{foo} that occurs at | |
| 531 the beginning of a line. | |
| 532 | |
| 533 @item $ | |
| 534 is similar to @samp{^} but matches only at the end of a line. Thus, | |
| 535 @samp{x+$} matches a string of one @samp{x} or more at the end of a line. | |
| 536 | |
| 537 @item \ | |
| 538 has two functions: it quotes the special characters (including | |
| 539 @samp{\}), and it introduces additional special constructs. | |
| 540 | |
| 541 Because @samp{\} quotes special characters, @samp{\$} is a regular | |
| 542 expression that matches only @samp{$}, and @samp{\[} is a regular | |
| 543 expression that matches only @samp{[}, and so on. | |
| 544 @end table | |
| 545 | |
| 546 Note: for historical compatibility, special characters are treated as | |
| 547 ordinary ones if they are in contexts where their special meanings make no | |
| 548 sense. For example, @samp{*foo} treats @samp{*} as ordinary since there is | |
| 549 no preceding expression on which the @samp{*} can act. It is poor practice | |
| 550 to depend on this behavior; it is better to quote the special character anyway, | |
| 551 regardless of where it appears.@refill | |
| 552 | |
| 553 For the most part, @samp{\} followed by any character matches only that | |
| 554 character. However, there are several exceptions: two-character | |
| 555 sequences starting with @samp{\} that have special meanings. The second | |
| 556 character in the sequence is always an ordinary character when used on | |
| 557 its own. Here is a table of @samp{\} constructs. | |
| 558 | |
| 559 @table @kbd | |
| 560 @item \| | |
| 561 specifies an alternative. Two regular expressions @var{a} and @var{b} | |
| 562 with @samp{\|} in between form an expression that matches some text if | |
| 563 either @var{a} matches it or @var{b} matches it. It works by trying to | |
| 564 match @var{a}, and if that fails, by trying to match @var{b}. | |
| 565 | |
| 566 Thus, @samp{foo\|bar} matches either @samp{foo} or @samp{bar} | |
| 567 but no other string.@refill | |
| 568 | |
| 569 @samp{\|} applies to the largest possible surrounding expressions. Only a | |
| 570 surrounding @samp{\( @dots{} \)} grouping can limit the grouping power of | |
| 571 @samp{\|}.@refill | |
| 572 | |
| 573 Full backtracking capability exists to handle multiple uses of @samp{\|}. | |
| 574 | |
| 575 @item \( @dots{} \) | |
| 576 is a grouping construct that serves three purposes: | |
| 577 | |
| 578 @enumerate | |
| 579 @item | |
| 580 To enclose a set of @samp{\|} alternatives for other operations. | |
| 581 Thus, @samp{\(foo\|bar\)x} matches either @samp{foox} or @samp{barx}. | |
| 582 | |
| 583 @item | |
| 584 To enclose a complicated expression for the postfix operators @samp{*}, | |
| 585 @samp{+} and @samp{?} to operate on. Thus, @samp{ba\(na\)*} matches | |
| 586 @samp{bananana}, etc., with any (zero or more) number of @samp{na} | |
| 587 strings.@refill | |
| 588 | |
| 589 @item | |
| 590 To record a matched substring for future reference. | |
| 591 @end enumerate | |
| 592 | |
| 593 This last application is not a consequence of the idea of a | |
| 594 parenthetical grouping; it is a separate feature that is assigned as a | |
| 595 second meaning to the same @samp{\( @dots{} \)} construct. In practice | |
|
36177
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
596 there is usually no conflict between the two meanings; when there is |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
597 a conflict, you can use a ``shy'' group. |
|
28063
f1b33463506d
*** empty log message ***
Stefan Monnier <monnier@iro.umontreal.ca>
parents:
27694
diff
changeset
|
598 |
|
f1b33463506d
*** empty log message ***
Stefan Monnier <monnier@iro.umontreal.ca>
parents:
27694
diff
changeset
|
599 @item \(?: @dots{} \) |
|
36177
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
600 @cindex shy group, in regexp |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
601 specifies a ``shy'' group that does not record the matched substring; |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
602 you can't refer back to it with @samp{\@var{d}}. This is useful |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
603 in mechanically combining regular expressions, so that you |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
604 can add groups for syntactic purposes without interfering with |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
605 the numbering of the groups that were written by the user. |
| 25829 | 606 |
| 607 @item \@var{d} | |
| 608 matches the same text that matched the @var{d}th occurrence of a | |
| 609 @samp{\( @dots{} \)} construct. | |
| 610 | |
| 611 After the end of a @samp{\( @dots{} \)} construct, the matcher remembers | |
| 612 the beginning and end of the text matched by that construct. Then, | |
| 613 later on in the regular expression, you can use @samp{\} followed by the | |
| 614 digit @var{d} to mean ``match the same text matched the @var{d}th time | |
| 615 by the @samp{\( @dots{} \)} construct.'' | |
| 616 | |
| 617 The strings matching the first nine @samp{\( @dots{} \)} constructs | |
| 618 appearing in a regular expression are assigned numbers 1 through 9 in | |
| 619 the order that the open-parentheses appear in the regular expression. | |
| 620 So you can use @samp{\1} through @samp{\9} to refer to the text matched | |
| 621 by the corresponding @samp{\( @dots{} \)} constructs. | |
| 622 | |
| 623 For example, @samp{\(.*\)\1} matches any newline-free string that is | |
| 624 composed of two identical halves. The @samp{\(.*\)} matches the first | |
| 625 half, which may be anything, but the @samp{\1} that follows must match | |
| 626 the same exact text. | |
| 627 | |
| 628 If a particular @samp{\( @dots{} \)} construct matches more than once | |
| 629 (which can easily happen if it is followed by @samp{*}), only the last | |
| 630 match is recorded. | |
| 631 | |
| 632 @item \` | |
| 633 matches the empty string, but only at the beginning | |
| 634 of the buffer or string being matched against. | |
| 635 | |
| 636 @item \' | |
| 637 matches the empty string, but only at the end of | |
| 638 the buffer or string being matched against. | |
| 639 | |
| 640 @item \= | |
| 641 matches the empty string, but only at point. | |
| 642 | |
| 643 @item \b | |
| 644 matches the empty string, but only at the beginning or | |
| 645 end of a word. Thus, @samp{\bfoo\b} matches any occurrence of | |
| 646 @samp{foo} as a separate word. @samp{\bballs?\b} matches | |
| 647 @samp{ball} or @samp{balls} as a separate word.@refill | |
| 648 | |
| 649 @samp{\b} matches at the beginning or end of the buffer | |
| 650 regardless of what text appears next to it. | |
| 651 | |
| 652 @item \B | |
| 653 matches the empty string, but @emph{not} at the beginning or | |
| 654 end of a word. | |
| 655 | |
| 656 @item \< | |
| 657 matches the empty string, but only at the beginning of a word. | |
| 658 @samp{\<} matches at the beginning of the buffer only if a | |
| 659 word-constituent character follows. | |
| 660 | |
| 661 @item \> | |
| 662 matches the empty string, but only at the end of a word. @samp{\>} | |
| 663 matches at the end of the buffer only if the contents end with a | |
| 664 word-constituent character. | |
| 665 | |
| 666 @item \w | |
| 667 matches any word-constituent character. The syntax table | |
| 668 determines which characters these are. @xref{Syntax}. | |
| 669 | |
| 670 @item \W | |
| 671 matches any character that is not a word-constituent. | |
| 672 | |
| 673 @item \s@var{c} | |
| 674 matches any character whose syntax is @var{c}. Here @var{c} is a | |
|
37235
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
675 character that designates a particular syntax class: thus, @samp{w} |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
676 for word constituent, @samp{-} or @samp{ } for whitespace, @samp{.} |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
677 for ordinary punctuation, etc. @xref{Syntax}. |
| 25829 | 678 |
| 679 @item \S@var{c} | |
| 680 matches any character whose syntax is not @var{c}. | |
|
35904
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
681 |
|
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
682 @cindex categories of characters |
|
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
683 @cindex characters which belong to a specific language |
|
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
684 @findex describe-categories |
|
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
685 @item \c@var{c} |
|
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
686 matches any character that belongs to the category @var{c}. For |
|
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
687 example, @samp{\cc} matches Chinese characters, @samp{\cg} matches |
|
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
688 Greek characters, etc. For the description of the known categories, |
|
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
689 type @kbd{M-x describe-categories @key{RET}}. |
|
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
690 |
|
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
691 @item \C@var{c} |
|
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
692 matches any character that does @emph{not} belong to category |
|
250691f20cec
(Incremental Search): Document C-\ and C-^.
Eli Zaretskii <eliz@gnu.org>
parents:
35876
diff
changeset
|
693 @var{c}. |
| 25829 | 694 @end table |
| 695 | |
| 696 The constructs that pertain to words and syntax are controlled by the | |
| 697 setting of the syntax table (@pxref{Syntax}). | |
| 698 | |
|
37235
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
699 Here is a complicated regexp, stored in @code{sentence-end} and used |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
700 by Emacs to recognize the end of a sentence together with any |
|
37607
2f80188cbbd6
Typo: "we show it Lisp syntax" --> ... its LIsp ..
Francesco Potort? <pot@gnu.org>
parents:
37235
diff
changeset
|
701 whitespace that follows. We show its Lisp syntax to distinguish the |
|
37235
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
702 spaces from the tab characters. In Lisp syntax, the string constant |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
703 begins and ends with a double-quote. @samp{\"} stands for a |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
704 double-quote as part of the regexp, @samp{\\} for a backslash as part |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
705 of the regexp, @samp{\t} for a tab, and @samp{\n} for a newline. |
| 25829 | 706 |
| 707 @example | |
|
37235
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
708 "[.?!][]\"')]*\\($\\| $\\|\t\\| \\)[ \t\n]*" |
| 25829 | 709 @end example |
| 710 | |
| 711 @noindent | |
|
37235
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
712 This contains four parts in succession: a character set matching |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
713 period, @samp{?}, or @samp{!}; a character set matching |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
714 close-brackets, quotes, or parentheses, repeated zero or more times; a |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
715 set of alternatives within backslash-parentheses that matches either |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
716 end-of-line, a space at the end of a line, a tab, or two spaces; and a |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
717 character set matching whitespace characters, repeated any number of |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
718 times. |
| 25829 | 719 |
| 720 To enter the same regexp interactively, you would type @key{TAB} to | |
| 721 enter a tab, and @kbd{C-j} to enter a newline. You would also type | |
| 722 single backslashes as themselves, instead of doubling them for Lisp syntax. | |
| 723 | |
|
36177
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
724 @ignore |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
725 @c I commented this out because it is missing vital information |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
726 @c and therefore useless. For instance, what do you do to *use* the |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
727 @c regular expression when it is finished? What jobs is this good for? |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
728 @c -- rms |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
729 |
| 31072 | 730 @findex re-builder |
| 731 @cindex authoring regular expressions | |
|
36177
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
732 For convenient interactive development of regular expressions, you |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
733 can use the @kbd{M-x re-builder} command. It provides a convenient |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
734 interface for creating regular expressions, by giving immediate visual |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
735 feedback. The buffer from which @code{re-builder} was invoked becomes |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
736 the target for the regexp editor, which pops in a separate window. At |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
737 all times, all the matches in the target buffer for the current |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
738 regular expression are highlighted. Each parenthesized sub-expression |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
739 of the regexp is shown in a distinct face, which makes it easier to |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
740 verify even very complex regexps. (On displays that don't support |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
741 colors, Emacs blinks the cursor around the matched text, as it does |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
742 for matching parens.) |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
743 @end ignore |
| 31072 | 744 |
| 25829 | 745 @node Search Case, Replace, Regexps, Search |
| 746 @section Searching and Case | |
| 747 | |
| 748 Incremental searches in Emacs normally ignore the case of the text | |
| 749 they are searching through, if you specify the text in lower case. | |
| 750 Thus, if you specify searching for @samp{foo}, then @samp{Foo} and | |
| 751 @samp{foo} are also considered a match. Regexps, and in particular | |
| 752 character sets, are included: @samp{[ab]} would match @samp{a} or | |
| 753 @samp{A} or @samp{b} or @samp{B}.@refill | |
| 754 | |
| 755 An upper-case letter anywhere in the incremental search string makes | |
| 756 the search case-sensitive. Thus, searching for @samp{Foo} does not find | |
| 757 @samp{foo} or @samp{FOO}. This applies to regular expression search as | |
| 758 well as to string search. The effect ceases if you delete the | |
| 759 upper-case letter from the search string. | |
| 760 | |
|
37235
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
761 Typing @kbd{M-c} within an incremental search toggles the case |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
762 sensitivity of that search. The effect does not extend beyond the |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
763 current incremental search to the next one, but it does override the |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
764 effect of including an upper-case letter in the current search. |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
765 |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
766 @vindex case-fold-search |
| 25829 | 767 If you set the variable @code{case-fold-search} to @code{nil}, then |
| 768 all letters must match exactly, including case. This is a per-buffer | |
| 769 variable; altering the variable affects only the current buffer, but | |
| 770 there is a default value which you can change as well. @xref{Locals}. | |
| 771 This variable applies to nonincremental searches also, including those | |
| 772 performed by the replace commands (@pxref{Replace}) and the minibuffer | |
| 773 history matching commands (@pxref{Minibuffer History}). | |
| 774 | |
| 775 @node Replace, Other Repeating Search, Search Case, Search | |
| 776 @section Replacement Commands | |
| 777 @cindex replacement | |
| 778 @cindex search-and-replace commands | |
| 779 @cindex string substitution | |
| 780 @cindex global substitution | |
| 781 | |
| 782 Global search-and-replace operations are not needed as often in Emacs | |
| 783 as they are in other editors@footnote{In some editors, | |
| 784 search-and-replace operations are the only convenient way to make a | |
| 785 single change in the text.}, but they are available. In addition to the | |
| 786 simple @kbd{M-x replace-string} command which is like that found in most | |
| 787 editors, there is a @kbd{M-x query-replace} command which asks you, for | |
| 788 each occurrence of the pattern, whether to replace it. | |
| 789 | |
| 790 The replace commands normally operate on the text from point to the | |
| 791 end of the buffer; however, in Transient Mark mode, when the mark is | |
| 792 active, they operate on the region. The replace commands all replace | |
| 793 one string (or regexp) with one replacement string. It is possible to | |
| 794 perform several replacements in parallel using the command | |
| 795 @code{expand-region-abbrevs} (@pxref{Expanding Abbrevs}). | |
| 796 | |
| 797 @menu | |
| 798 * Unconditional Replace:: Replacing all matches for a string. | |
| 799 * Regexp Replace:: Replacing all matches for a regexp. | |
| 800 * Replacement and Case:: How replacements preserve case of letters. | |
| 801 * Query Replace:: How to use querying. | |
| 802 @end menu | |
| 803 | |
| 804 @node Unconditional Replace, Regexp Replace, Replace, Replace | |
| 805 @subsection Unconditional Replacement | |
| 806 @findex replace-string | |
| 807 @findex replace-regexp | |
| 808 | |
| 809 @table @kbd | |
| 810 @item M-x replace-string @key{RET} @var{string} @key{RET} @var{newstring} @key{RET} | |
| 811 Replace every occurrence of @var{string} with @var{newstring}. | |
| 812 @item M-x replace-regexp @key{RET} @var{regexp} @key{RET} @var{newstring} @key{RET} | |
| 813 Replace every match for @var{regexp} with @var{newstring}. | |
| 814 @end table | |
| 815 | |
| 816 To replace every instance of @samp{foo} after point with @samp{bar}, | |
| 817 use the command @kbd{M-x replace-string} with the two arguments | |
| 818 @samp{foo} and @samp{bar}. Replacement happens only in the text after | |
| 819 point, so if you want to cover the whole buffer you must go to the | |
| 820 beginning first. All occurrences up to the end of the buffer are | |
| 821 replaced; to limit replacement to part of the buffer, narrow to that | |
| 822 part of the buffer before doing the replacement (@pxref{Narrowing}). | |
| 823 In Transient Mark mode, when the region is active, replacement is | |
| 824 limited to the region (@pxref{Transient Mark}). | |
| 825 | |
| 826 When @code{replace-string} exits, it leaves point at the last | |
| 827 occurrence replaced. It sets the mark to the prior position of point | |
| 828 (where the @code{replace-string} command was issued); use @kbd{C-u | |
| 829 C-@key{SPC}} to move back there. | |
| 830 | |
| 831 A numeric argument restricts replacement to matches that are surrounded | |
| 832 by word boundaries. The argument's value doesn't matter. | |
| 833 | |
| 834 @node Regexp Replace, Replacement and Case, Unconditional Replace, Replace | |
| 835 @subsection Regexp Replacement | |
| 836 | |
| 837 The @kbd{M-x replace-string} command replaces exact matches for a | |
| 838 single string. The similar command @kbd{M-x replace-regexp} replaces | |
| 839 any match for a specified pattern. | |
| 840 | |
| 841 In @code{replace-regexp}, the @var{newstring} need not be constant: it | |
| 842 can refer to all or part of what is matched by the @var{regexp}. | |
| 843 @samp{\&} in @var{newstring} stands for the entire match being replaced. | |
| 844 @samp{\@var{d}} in @var{newstring}, where @var{d} is a digit, stands for | |
| 845 whatever matched the @var{d}th parenthesized grouping in @var{regexp}. | |
| 846 To include a @samp{\} in the text to replace with, you must enter | |
| 847 @samp{\\}. For example, | |
| 848 | |
| 849 @example | |
| 850 M-x replace-regexp @key{RET} c[ad]+r @key{RET} \&-safe @key{RET} | |
| 851 @end example | |
| 852 | |
| 853 @noindent | |
| 854 replaces (for example) @samp{cadr} with @samp{cadr-safe} and @samp{cddr} | |
| 855 with @samp{cddr-safe}. | |
| 856 | |
| 857 @example | |
| 858 M-x replace-regexp @key{RET} \(c[ad]+r\)-safe @key{RET} \1 @key{RET} | |
| 859 @end example | |
| 860 | |
| 861 @noindent | |
| 862 performs the inverse transformation. | |
| 863 | |
| 864 @node Replacement and Case, Query Replace, Regexp Replace, Replace | |
| 865 @subsection Replace Commands and Case | |
| 866 | |
| 867 If the first argument of a replace command is all lower case, the | |
| 868 commands ignores case while searching for occurrences to | |
| 869 replace---provided @code{case-fold-search} is non-@code{nil}. If | |
| 870 @code{case-fold-search} is set to @code{nil}, case is always significant | |
| 871 in all searches. | |
| 872 | |
| 873 @vindex case-replace | |
| 874 In addition, when the @var{newstring} argument is all or partly lower | |
| 875 case, replacement commands try to preserve the case pattern of each | |
| 876 occurrence. Thus, the command | |
| 877 | |
| 878 @example | |
| 879 M-x replace-string @key{RET} foo @key{RET} bar @key{RET} | |
| 880 @end example | |
| 881 | |
| 882 @noindent | |
| 883 replaces a lower case @samp{foo} with a lower case @samp{bar}, an | |
| 884 all-caps @samp{FOO} with @samp{BAR}, and a capitalized @samp{Foo} with | |
| 885 @samp{Bar}. (These three alternatives---lower case, all caps, and | |
| 886 capitalized, are the only ones that @code{replace-string} can | |
| 887 distinguish.) | |
| 888 | |
| 889 If upper-case letters are used in the replacement string, they remain | |
| 890 upper case every time that text is inserted. If upper-case letters are | |
| 891 used in the first argument, the second argument is always substituted | |
| 892 exactly as given, with no case conversion. Likewise, if either | |
| 893 @code{case-replace} or @code{case-fold-search} is set to @code{nil}, | |
| 894 replacement is done without case conversion. | |
| 895 | |
| 896 @node Query Replace,, Replacement and Case, Replace | |
| 897 @subsection Query Replace | |
| 898 @cindex query replace | |
| 899 | |
| 900 @table @kbd | |
| 901 @item M-% @var{string} @key{RET} @var{newstring} @key{RET} | |
| 902 @itemx M-x query-replace @key{RET} @var{string} @key{RET} @var{newstring} @key{RET} | |
| 903 Replace some occurrences of @var{string} with @var{newstring}. | |
| 904 @item C-M-% @var{regexp} @key{RET} @var{newstring} @key{RET} | |
| 905 @itemx M-x query-replace-regexp @key{RET} @var{regexp} @key{RET} @var{newstring} @key{RET} | |
| 906 Replace some matches for @var{regexp} with @var{newstring}. | |
| 907 @end table | |
| 908 | |
| 909 @kindex M-% | |
| 910 @findex query-replace | |
| 911 If you want to change only some of the occurrences of @samp{foo} to | |
| 912 @samp{bar}, not all of them, then you cannot use an ordinary | |
| 913 @code{replace-string}. Instead, use @kbd{M-%} (@code{query-replace}). | |
| 914 This command finds occurrences of @samp{foo} one by one, displays each | |
|
37235
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
915 occurrence and asks you whether to replace it. Aside from querying, |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
916 @code{query-replace} works just like @code{replace-string}. It |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
917 preserves case, like @code{replace-string}, provided |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
918 @code{case-replace} is non-@code{nil}, as it normally is. A numeric |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
919 argument means consider only occurrences that are bounded by |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
920 word-delimiter characters. |
| 25829 | 921 |
| 922 @kindex C-M-% | |
| 923 @findex query-replace-regexp | |
|
37235
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
924 @kbd{C-M-%} performs regexp search and replace (@code{query-replace-regexp}). |
| 25829 | 925 |
|
37235
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
926 The characters you can type when you are shown a match for the string |
|
a60e4b9d5ba3
Mention there are other search-terminating Meta chars.
Richard M. Stallman <rms@gnu.org>
parents:
37069
diff
changeset
|
927 or regexp are: |
| 25829 | 928 |
| 929 @ignore @c Not worth it. | |
| 930 @kindex SPC @r{(query-replace)} | |
| 931 @kindex DEL @r{(query-replace)} | |
| 932 @kindex , @r{(query-replace)} | |
| 933 @kindex RET @r{(query-replace)} | |
| 934 @kindex . @r{(query-replace)} | |
| 935 @kindex ! @r{(query-replace)} | |
| 936 @kindex ^ @r{(query-replace)} | |
| 937 @kindex C-r @r{(query-replace)} | |
| 938 @kindex C-w @r{(query-replace)} | |
| 939 @kindex C-l @r{(query-replace)} | |
| 940 @end ignore | |
| 941 | |
| 942 @c WideCommands | |
| 943 @table @kbd | |
| 944 @item @key{SPC} | |
| 945 to replace the occurrence with @var{newstring}. | |
| 946 | |
| 947 @item @key{DEL} | |
| 948 to skip to the next occurrence without replacing this one. | |
| 949 | |
| 950 @item , @r{(Comma)} | |
| 951 to replace this occurrence and display the result. You are then asked | |
| 952 for another input character to say what to do next. Since the | |
| 953 replacement has already been made, @key{DEL} and @key{SPC} are | |
| 954 equivalent in this situation; both move to the next occurrence. | |
| 955 | |
| 956 You can type @kbd{C-r} at this point (see below) to alter the replaced | |
| 957 text. You can also type @kbd{C-x u} to undo the replacement; this exits | |
| 958 the @code{query-replace}, so if you want to do further replacement you | |
| 959 must use @kbd{C-x @key{ESC} @key{ESC} @key{RET}} to restart | |
| 960 (@pxref{Repetition}). | |
| 961 | |
| 962 @item @key{RET} | |
| 963 to exit without doing any more replacements. | |
| 964 | |
| 965 @item .@: @r{(Period)} | |
| 966 to replace this occurrence and then exit without searching for more | |
| 967 occurrences. | |
| 968 | |
| 969 @item ! | |
| 970 to replace all remaining occurrences without asking again. | |
| 971 | |
| 972 @item ^ | |
| 973 to go back to the position of the previous occurrence (or what used to | |
| 974 be an occurrence), in case you changed it by mistake. This works by | |
| 975 popping the mark ring. Only one @kbd{^} in a row is meaningful, because | |
| 976 only one previous replacement position is kept during @code{query-replace}. | |
| 977 | |
| 978 @item C-r | |
| 979 to enter a recursive editing level, in case the occurrence needs to be | |
| 980 edited rather than just replaced with @var{newstring}. When you are | |
| 981 done, exit the recursive editing level with @kbd{C-M-c} to proceed to | |
| 982 the next occurrence. @xref{Recursive Edit}. | |
| 983 | |
| 984 @item C-w | |
| 985 to delete the occurrence, and then enter a recursive editing level as in | |
| 986 @kbd{C-r}. Use the recursive edit to insert text to replace the deleted | |
| 987 occurrence of @var{string}. When done, exit the recursive editing level | |
| 988 with @kbd{C-M-c} to proceed to the next occurrence. | |
| 989 | |
|
36177
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
990 @item e |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
991 to edit the replacement string in the minibuffer. When you exit the |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
992 minibuffer by typing @key{RET}, the minibuffer contents replace the |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
993 current occurrence of the pattern. They also become the new |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
994 replacement string for any further occurrences. |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
995 |
| 25829 | 996 @item C-l |
| 997 to redisplay the screen. Then you must type another character to | |
| 998 specify what to do with this occurrence. | |
| 999 | |
| 1000 @item C-h | |
| 1001 to display a message summarizing these options. Then you must type | |
| 1002 another character to specify what to do with this occurrence. | |
| 1003 @end table | |
| 1004 | |
| 1005 Some other characters are aliases for the ones listed above: @kbd{y}, | |
| 1006 @kbd{n} and @kbd{q} are equivalent to @key{SPC}, @key{DEL} and | |
| 1007 @key{RET}. | |
| 1008 | |
| 1009 Aside from this, any other character exits the @code{query-replace}, | |
| 1010 and is then reread as part of a key sequence. Thus, if you type | |
| 1011 @kbd{C-k}, it exits the @code{query-replace} and then kills to end of | |
| 1012 line. | |
| 1013 | |
| 1014 To restart a @code{query-replace} once it is exited, use @kbd{C-x | |
| 1015 @key{ESC} @key{ESC}}, which repeats the @code{query-replace} because it | |
| 1016 used the minibuffer to read its arguments. @xref{Repetition, C-x ESC | |
| 1017 ESC}. | |
| 1018 | |
| 1019 See also @ref{Transforming File Names}, for Dired commands to rename, | |
| 1020 copy, or link files by replacing regexp matches in file names. | |
| 1021 | |
| 1022 @node Other Repeating Search,, Replace, Search | |
| 1023 @section Other Search-and-Loop Commands | |
| 1024 | |
| 1025 Here are some other commands that find matches for a regular | |
|
36177
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
1026 expression. They all ignore case in matching, if the pattern contains |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
1027 no upper-case letters and @code{case-fold-search} is non-@code{nil}. |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
1028 Aside from @code{occur}, all operate on the text from point to the end |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
1029 of the buffer, or on the active region in Transient Mark mode. |
| 25829 | 1030 |
| 1031 @findex list-matching-lines | |
| 1032 @findex occur | |
|
32035
396b07cf1395
(Other Repeating Search): Describe `how-many'
Gerd Moellmann <gerd@gnu.org>
parents:
31072
diff
changeset
|
1033 @findex how-many |
| 25829 | 1034 @findex delete-non-matching-lines |
| 1035 @findex delete-matching-lines | |
| 1036 @findex flush-lines | |
| 1037 @findex keep-lines | |
| 1038 | |
| 1039 @table @kbd | |
| 1040 @item M-x occur @key{RET} @var{regexp} @key{RET} | |
|
36177
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
1041 Display a list showing each line in the buffer that contains a match |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
1042 for @var{regexp}. To limit the search to part of the buffer, narrow |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
1043 to that part (@pxref{Narrowing}). A numeric argument @var{n} |
|
36383
d26ad6e1f13f
Change "filename" to "file name" or "file-name".
Eli Zaretskii <eliz@gnu.org>
parents:
36263
diff
changeset
|
1044 specifies that @var{n} lines of context are to be displayed before and |
|
d26ad6e1f13f
Change "filename" to "file name" or "file-name".
Eli Zaretskii <eliz@gnu.org>
parents:
36263
diff
changeset
|
1045 after each matching line. |
| 25829 | 1046 |
| 1047 @kindex RET @r{(Occur mode)} | |
| 1048 The buffer @samp{*Occur*} containing the output serves as a menu for | |
| 1049 finding the occurrences in their original context. Click @kbd{Mouse-2} | |
| 1050 on an occurrence listed in @samp{*Occur*}, or position point there and | |
| 1051 type @key{RET}; this switches to the buffer that was searched and | |
| 1052 moves point to the original of the chosen occurrence. | |
| 1053 | |
| 1054 @item M-x list-matching-lines | |
| 1055 Synonym for @kbd{M-x occur}. | |
| 1056 | |
|
32035
396b07cf1395
(Other Repeating Search): Describe `how-many'
Gerd Moellmann <gerd@gnu.org>
parents:
31072
diff
changeset
|
1057 @item M-x how-many @key{RET} @var{regexp} @key{RET} |
|
36177
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
1058 Print the number of matches for @var{regexp} that exist in the buffer |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
1059 after point. In Transient Mark mode, if the region is active, the |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
1060 command operates on the region instead. |
| 25829 | 1061 |
| 1062 @item M-x flush-lines @key{RET} @var{regexp} @key{RET} | |
|
36177
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
1063 Delete each line that contains a match for @var{regexp}, operating on |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
1064 the text after point. In Transient Mark mode, if the region is |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
1065 active, the command operates on the region instead. |
| 25829 | 1066 |
| 1067 @item M-x keep-lines @key{RET} @var{regexp} @key{RET} | |
|
36177
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
1068 Delete each line that @emph{does not} contain a match for |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
1069 @var{regexp}, operating on the text after point. In Transient Mark |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
1070 mode, if the region is active, the command operates on the region |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
1071 instead. |
| 25829 | 1072 @end table |
| 1073 | |
|
36177
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
1074 You can also search multiple files under control of a tags table |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
1075 (@pxref{Tags Search}) or through Dired @kbd{A} command |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
1076 (@pxref{Operating on Files}), or ask the @code{grep} program to do it |
|
548a3aac5d5d
Comment out description of re-builder.
Richard M. Stallman <rms@gnu.org>
parents:
36058
diff
changeset
|
1077 (@pxref{Grep Searching}). |
