emacs: src/coding.c comparison

comparison src/coding.c @ 20718:c600dea3b06b

Vselect_safe_coding_system_function): New variable. (coding_category_table): This variable deleted. (Vcoding_category_table): New variable. (coding_category_name): Add "coding-category-iso-7-tight". (detect_coding_iso2022): Check the mask CODING_FLAG_ISO_DESIGNATION in CODING->FLAGS. Check a new coding category coding-category-iso-7-tight. (DECODE_DESIGNATION): Decode only such designations that CODING can handle. (check_composing_code): New function. (decode_coding_iso2022): Decode only such characters that CODING can handle. (encode_coding_iso2022): Before and after encoding composite characters, reset designation and invocation status. (detect_coding_sjis): Delete unnecessary check. (detect_coding_big5): Likewise. (encode_designation_at_bol): Check the validity of requested designation register. (setup_coding_system): Set requested designation registers for non-supported charsets to CODING_SPEC_ISO_NO_REQUESTED_DESIGNATION. Set mask CODING_FLAG_ISO_DESIGNATION in CODING->FLAGS. Code tuned for no-conversion and undecided. (detect_coding): Adjusted for the new variable Vcoding_category_table. (syms_of_coding): Initialize Vcoding_category_table and staticpro it. Register select-safe-coding-system as a Lisp variable. (DECODE_CHARACTER_ASCII): Update coding->produced_char; (DECODE_CHARACTER_DIMENSION1): Likewise. (Qraw_text, Qcoding_category): New variables. (syms_of_coding): Intern and staticpro them. (coding_system_table): New variable. (CHARSET_OK, SHIFT_OUT_OK): New macros. (detect_coding_iso2022): Detection algorithm improved. (decode_coding_iso2022): Arg CONSUMED deleted, and the meaning of return value changed. Update members produced, produced_char, consumed, consumed_char of the struct *coding. Pay attention to CODING_MODE_INHIBIT_INCONSISTENT_EOL. (encode_coding_iso2022): Likewise. (decode_coding_sjis_big5, encode_coding_sjis_big5): Likewise. (decode_eol, encode_eol): Likewise. (ENCODE_ISO_CHARACTER): Update coding->consumed_char. (DECODE_SJIS_BIG5_CHARACTER): Update coding->produced_char. (ENCODE_SJIS_BIG5_CHARACTER): Update coding->consumed_char. (detect_coding(detect_coding(detect_ITIES and SKIP. (detect_coding): Adjusted for the change of detect_coding_mask. Update coding->heading_ascii. (detect_eol_type): New arg SKIP. (detect_eol): Adjusted for the change of detect_eol_type. (ccl_codign_driver): New function. (decode_coding): Arg CONSUMED deleted, and the meaning of return value changed. Update members produced, produced_char, consumed, consumed_char of the struct *coding. (encode_coding): Likewise. (shrink_decoding_region, shrink_encoding_region): New function. (code_convert_region, code_convert_string): Completely rewritten. (detect_coding_sy(detect_coding_sy(detect_coding_sy(detect_coding_sy(detect_codiT. (Fdetect_coding_string): New function. (Fdecode_coding_region, Fencode_coding_region): Adjusted for the change of code_convert_region. (Fdecode_coding_string, Fencode_coding_string): Adjusted for the change of code_convert_string. (Fupdate_iso_coding_systems): New function. (init_coding_once): Initialize coding_system_table.

author	Kenichi Handa <handa@m17n.org>
date	Thu, 22 Jan 1998 01:26:45 +0000
parents	ed9ed828415e
children	13d0a6194de7

comparison

equal deleted inserted replaced

-:19463997fbc6
+:c600dea3b06b
 If a user wants to read/write a text encoded in a coding system not
 listed above, he can supply a decoder and an encoder for it in CCL
 (Code Conversion Language) programs.  Emacs executes the CCL program
 while reading/writing.
-Emacs represents a coding-system by a Lisp symbol that has a property
+Emacs represents a coding system by a Lisp symbol that has a property
-`coding-system'.  But, before actually using the coding-system, the
+`coding-system'.  But, before actually using the coding system, the
 information about it is set in a structure of type `struct
 coding_system' for rapid processing.  See section 6 for more details.
 */
 /*** GENERAL NOTES on END-OF-LINE FORMAT ***
 How end-of-line of a text is encoded depends on a system.  For
 instance, Unix's format is just one byte of `line-feed' code,
 whereas DOS's format is two-byte sequence of `carriage-return' and
-`line-feed' codes.  MacOS's format is one byte of `carriage-return'.
+`line-feed' codes.  MacOS's format is usually one byte of
+`carriage-return'.
 Since text characters encoding and end-of-line encoding are
 independent, any coding system described above can take
 any format of end-of-line.  So, Emacs has information of format of
 end-of-line in each coding-system.  See section 6 for more details.
 /*** GENERAL NOTES on `decode_coding_XXX ()' functions ***
 These functions decode SRC_BYTES length text at SOURCE encoded in
 CODING to Emacs' internal format (emacs-mule).  The resulting text
-goes to a place pointed to by DESTINATION, the length of which should
+goes to a place pointed to by DESTINATION, the length of which
-not exceed DST_BYTES.  The number of bytes actually processed is
+should not exceed DST_BYTES.  These functions set the information of
-returned as *CONSUMED.  The return value is the length of the decoded
+original and decoded texts in the members produced, produced_char,
-text.  Below is a template of these functions.  */
+consumed, and consumed_char of the structure *CODING.
+The return value is an integer (CODING_FINISH_XXX) indicating how
+the decoding finished.
+DST_BYTES zero means that source area and destination area are
+overlapped, which means that we can produce a decoded text until it
+reaches at the head of not-yet-decoded source text.
+Below is a template of these functions.  */
 #if 0
-decode_coding_XXX (coding, source, destination, src_bytes, dst_bytes, consumed)
+decode_coding_XXX (coding, source, destination, src_bytes, dst_bytes)
 struct coding_system *coding;
 unsigned char *source, *destination;
 int src_bytes, dst_bytes;
-int *consumed;
 {
 ...
 }
 #endif
 /*** GENERAL NOTES on `encode_coding_XXX ()' functions ***
 These functions encode SRC_BYTES length text at SOURCE of Emacs'
 internal format (emacs-mule) to CODING.  The resulting text goes to
 a place pointed to by DESTINATION, the length of which should not
-exceed DST_BYTES.  The number of bytes actually processed is
+exceed DST_BYTES.  These functions set the information of
-returned as *CONSUMED.  The return value is the length of the
+original and encoded texts in the members produced, produced_char,
-encoded text.  Below is a template of these functions.  */
+consumed, and consumed_char of the structure *CODING.
+The return value is an integer (CODING_FINISH_XXX) indicating how
+the encoding finished.
+DST_BYTES zero means that source area and destination area are
+overlapped, which means that we can produce a decoded text until it
+reaches at the head of not-yet-decoded source text.
+Below is a template of these functions.  */
 #if 0
-encode_coding_XXX (coding, source, destination, src_bytes, dst_bytes, consumed)
+encode_coding_XXX (coding, source, destination, src_bytes, dst_bytes)
 struct coding_system *coding;
 unsigned char *source, *destination;
 int src_bytes, dst_bytes;
-int *consumed;
 {
 ...
 }
 #endif
 #define DECODE_CHARACTER_ASCII(c)				\
 do {								\
 if (COMPOSING_P (coding->composing))			\
 *dst++ = 0xA0, *dst++ = (c) | 0x80;			\
 else							\
-*dst++ = (c);						\
+{								\
+	*dst++ = (c);						\
+	coding->produced_char++;				\
+}								\
 } while (0)
 /* Decode one DIMENSION1 character whose charset is CHARSET and whose
 position-code is C.  */
 do {									\
 unsigned char leading_code = CHARSET_LEADING_CODE_BASE (charset);	\
 if (COMPOSING_P (coding->composing))				\
 *dst++ = leading_code + 0x20;					\
 else								\
-*dst++ = leading_code;						\
+{									\
+	*dst++ = leading_code;						\
+	coding->produced_char++;					\
+}									\
 if (leading_code = CHARSET_LEADING_CODE_EXT (charset))		\
 *dst++ = leading_code;						\
 *dst++ = (c) | 0x80;						\
 } while (0)
 extern Lisp_Object Qinsert_file_contents, Qwrite_region;
 Lisp_Object Qcall_process, Qcall_process_region, Qprocess_argument;
 Lisp_Object Qstart_process, Qopen_network_stream;
 Lisp_Object Qtarget_idx;
+Lisp_Object Vselect_safe_coding_system_function;
 /* Mnemonic character of each format of end-of-line.  */
 int eol_mnemonic_unix, eol_mnemonic_dos, eol_mnemonic_mac;
 /* Mnemonic character to indicate format of end-of-line is not yet
 decided.  */
 int eol_mnemonic_undecided;
 Lisp_Object Vcoding_system_list, Vcoding_system_alist;
 Lisp_Object Qcoding_system_p, Qcoding_system_error;
-/* Coding system emacs-mule is for converting only end-of-line format.  */
+/* Coding system emacs-mule and raw-text are for converting only
-Lisp_Object Qemacs_mule;
+end-of-line format.  */
+Lisp_Object Qemacs_mule, Qraw_text;
 /* Coding-systems are handed between Emacs Lisp programs and C internal
 routines by the following three variables.  */
 /* Coding-system for reading files and receiving data from process.  */
 Lisp_Object Vcoding_system_for_read;
 Lisp_Object Vprocess_coding_system_alist;
 Lisp_Object Vnetwork_coding_system_alist;
 #endif /* emacs */
-Lisp_Object Qcoding_category_index;
+Lisp_Object Qcoding_category, Qcoding_category_index;
 /* List of symbols `coding-category-xxx' ordered by priority.  */
 Lisp_Object Vcoding_category_list;
-/* Table of coding-systems currently assigned to each coding-category.  */
+/* Table of coding categories (Lisp symbols).  */
-Lisp_Object coding_category_table[CODING_CATEGORY_IDX_MAX];
+Lisp_Object Vcoding_category_table;
 /* Table of names of symbol for each coding-category.  */
 char *coding_category_name[CODING_CATEGORY_IDX_MAX] = {
 "coding-category-emacs-mule",
 "coding-category-sjis",
 "coding-category-iso-7",
+"coding-category-iso-7-tight",
 "coding-category-iso-8-1",
 "coding-category-iso-8-2",
 "coding-category-iso-7-else",
 "coding-category-iso-8-else",
 "coding-category-big5",
 "coding-category-raw-text",
 "coding-category-binary"
 };
+/* Table pointers to coding systems corresponding to each coding
+categories.  */
+struct coding_system *coding_system_table[CODING_CATEGORY_IDX_MAX];
 /* Flag to tell if we look up unification table on character code
 conversion.  */
 Lisp_Object Venable_character_unification;
 /* Standard unification table to look up on decoding (reading).  */
 return 0;		       	\
 } while (0)
 /* See the above "GENERAL NOTES on `detect_coding_XXX ()' functions".
 Check if a text is encoded in Emacs' internal format.  If it is,
-return CODING_CATEGORY_MASK_EMASC_MULE, else return 0.  */
+return CODING_CATEGORY_MASK_EMACS_MULE, else return 0.  */
 int
 detect_coding_emacs_mule (src, src_end)
 unsigned char *src, *src_end;
 {
 Since these are not standard escape sequences of any ISO, the use
 of them for these meaning is restricted to Emacs only.  */
 enum iso_code_class_type iso_code_class[256];
+#define CHARSET_OK(idx, charset)		\
+(CODING_SPEC_ISO_REQUESTED_DESIGNATION	\
+(coding_system_table[idx], charset)		\
+!= CODING_SPEC_ISO_NO_REQUESTED_DESIGNATION)
+#define SHIFT_OUT_OK(idx) \
+(CODING_SPEC_ISO_INITIAL_DESIGNATION (coding_system_table[idx], 1) >= 0)
 /* See the above "GENERAL NOTES on `detect_coding_XXX ()' functions".
 Check if a text is encoded in ISO2022.  If it is, returns an
 integer in which appropriate flag bits any of:
 	CODING_CATEGORY_MASK_ISO_7
+	CODING_CATEGORY_MASK_ISO_7_TIGHT
 	CODING_CATEGORY_MASK_ISO_8_1
 	CODING_CATEGORY_MASK_ISO_8_2
 	CODING_CATEGORY_MASK_ISO_7_ELSE
 	CODING_CATEGORY_MASK_ISO_8_ELSE
 are set.  If a code which should never appear in ISO2022 is found,
 int
 detect_coding_iso2022 (src, src_end)
 unsigned char *src, *src_end;
 {
-int mask = (CODING_CATEGORY_MASK_ISO_7
+int mask = CODING_CATEGORY_MASK_ISO;
-	      | CODING_CATEGORY_MASK_ISO_8_1
+int mask_found = 0;
-	      | CODING_CATEGORY_MASK_ISO_8_2
+int reg[4], shift_out = 0;
-	      | CODING_CATEGORY_MASK_ISO_7_ELSE
+int c, c1, i, charset;
-	      | CODING_CATEGORY_MASK_ISO_8_ELSE
-	      );
+reg[0] = CHARSET_ASCII, reg[1] = reg[2] = reg[3] = -1;
-int g1 = 0;			/* 1 iff designating to G1.  */
-int c, i;
-struct coding_system coding_iso_8_1, coding_iso_8_2;
-/* Coding systems of these categories may accept latin extra codes.  */
-setup_coding_system
-(XSYMBOL (coding_category_table[CODING_CATEGORY_IDX_ISO_8_1])->value,
-&coding_iso_8_1);
-setup_coding_system
-(XSYMBOL (coding_category_table[CODING_CATEGORY_IDX_ISO_8_2])->value,
-&coding_iso_8_2);
 while (mask && src < src_end)
 {
 c = *src++;
 switch (c)
 	{
 	case ISO_CODE_ESC:
 	  if (src >= src_end)
 	    break;
 	  c = *src++;
-	  if ((c >= '(' && c <= '/'))
+	  if (c >= '(' && c <= '/')
 	    {
 	      /* Designation sequence for a charset of dimension 1.  */
 	      if (src >= src_end)
 		break;
-	      c = *src++;
+	      c1 = *src++;
-	      if (c < ' ' || c >= 0x80)
+	      if (c1 < ' ' || c1 >= 0x80
-		/* Invalid designation sequence.  */
+		  || (charset = iso_charset_table[0][c >= ','][c1]) < 0)
-		return 0;
+		/* Invalid designation sequence.  Just ignore.  */
+		break;
+	      reg[(c - '(') % 4] = charset;
 	    }
 	  else if (c == '$')
 	    {
 	      /* Designation sequence for a charset of dimension 2.  */
 	      if (src >= src_end)
 		break;
 	      c = *src++;
 	      if (c >= '@' && c <= 'B')
 		/* Designation for JISX0208.1978, GB2312, or JISX0208.  */
-		;
+		reg[0] = charset = iso_charset_table[1][0][c];
 	      else if (c >= '(' && c <= '/')
 		{
 		  if (src >= src_end)
 		    break;
-		  c = *src++;
+		  c1 = *src++;
-		  if (c < ' ' || c >= 0x80)
+		  if (c1 < ' ' || c1 >= 0x80
-		    /* Invalid designation sequence.  */
+		      || (charset = iso_charset_table[1][c >= ','][c1]) < 0)
-		    return 0;
+		    /* Invalid designation sequence.  Just ignore.  */
+		    break;
+		  reg[(c - '(') % 4] = charset;
 		}
 	      else
-		/* Invalid designation sequence.  */
+		/* Invalid designation sequence.  Just ignore.  */
-		return 0;
+		break;
 	    }
-	  else if (c == 'N' || c == 'O' || c == 'n' || c == 'o')
+	  else if (c == 'N' || c == 'n')
-	    /* Locking shift.  */
+	    {
-	    mask &= (CODING_CATEGORY_MASK_ISO_7_ELSE
+	      if (shift_out == 0
-		     | CODING_CATEGORY_MASK_ISO_8_ELSE);
+		  && (reg[1] >= 0
+		      || SHIFT_OUT_OK (CODING_CATEGORY_IDX_ISO_7_ELSE)
+		      || SHIFT_OUT_OK (CODING_CATEGORY_IDX_ISO_8_ELSE)))
+		{
+		  /* Locking shift out.  */
+		  mask &= ~CODING_CATEGORY_MASK_ISO_7BIT;
+		  mask_found |= CODING_CATEGORY_MASK_ISO_SHIFT;
+		  shift_out = 1;
+		}
+	      break;
+	    }
+	  else if (c == 'O' || c == 'o')
+	    {
+	      if (shift_out == 1)
+		{
+		  /* Locking shift in.  */
+		  mask &= ~CODING_CATEGORY_MASK_ISO_7BIT;
+		  mask_found |= CODING_CATEGORY_MASK_ISO_SHIFT;
+		  shift_out = 0;
+		}
+	      break;
+	    }
 	  else if (c == '0' || c == '1' || c == '2')
-	    /* Start/end composition.  */
+	    /* Start/end composition.  Just ignore.  */
-	    ;
+	    break;
 	  else
-	    /* Invalid escape sequence.  */
+	    /* Invalid escape sequence.  Just ignore.  */
-	    return 0;
+	    break;
+	  /* We found a valid designation sequence for CHARSET.  */
+	  mask &= ~CODING_CATEGORY_MASK_ISO_8BIT;
+	  if (CHARSET_OK (CODING_CATEGORY_IDX_ISO_7, charset))
+	    mask_found |= CODING_CATEGORY_MASK_ISO_7;
+	  else
+	    mask &= ~CODING_CATEGORY_MASK_ISO_7;
+	  if (CHARSET_OK (CODING_CATEGORY_IDX_ISO_7_TIGHT, charset))
+	    mask_found |= CODING_CATEGORY_MASK_ISO_7_TIGHT;
+	  else
+	    mask &= ~CODING_CATEGORY_MASK_ISO_7_TIGHT;
+	  if (! CHARSET_OK (CODING_CATEGORY_IDX_ISO_7_ELSE, charset))
+	    mask &= ~CODING_CATEGORY_MASK_ISO_7_ELSE;
+	  if (! CHARSET_OK (CODING_CATEGORY_IDX_ISO_8_ELSE, charset))
+	    mask &= ~CODING_CATEGORY_MASK_ISO_8_ELSE;
 	  break;
 	case ISO_CODE_SO:
-	  mask &= (CODING_CATEGORY_MASK_ISO_7_ELSE
+	  if (shift_out == 0
-		   | CODING_CATEGORY_MASK_ISO_8_ELSE);
+	      && (reg[1] >= 0
+		  || SHIFT_OUT_OK (CODING_CATEGORY_IDX_ISO_7_ELSE)
+		  || SHIFT_OUT_OK (CODING_CATEGORY_IDX_ISO_8_ELSE)))
+	    {
+	      /* Locking shift out.  */
+	      mask &= ~CODING_CATEGORY_MASK_ISO_7BIT;
+	      mask_found |= CODING_CATEGORY_MASK_ISO_SHIFT;
+	    }
 	  break;
+	case ISO_CODE_SI:
+	  if (shift_out == 1)
+	    {
+	      /* Locking shift in.  */
+	      mask &= ~CODING_CATEGORY_MASK_ISO_7BIT;
+	      mask_found |= CODING_CATEGORY_MASK_ISO_SHIFT;
+	    }
+	  break;
 	case ISO_CODE_CSI:
 	case ISO_CODE_SS2:
 	case ISO_CODE_SS3:
 	  {
 	    int newmask = CODING_CATEGORY_MASK_ISO_8_ELSE;
 	    if (c != ISO_CODE_CSI)
 	      {
-		if (coding_iso_8_1.flags & CODING_FLAG_ISO_SINGLE_SHIFT)
+		if (coding_system_table[CODING_CATEGORY_IDX_ISO_8_1]->flags
+		    & CODING_FLAG_ISO_SINGLE_SHIFT)
 		  newmask |= CODING_CATEGORY_MASK_ISO_8_1;
-		if (coding_iso_8_2.flags & CODING_FLAG_ISO_SINGLE_SHIFT)
+		if (coding_system_table[CODING_CATEGORY_IDX_ISO_8_2]->flags
+		    & CODING_FLAG_ISO_SINGLE_SHIFT)
 		  newmask |= CODING_CATEGORY_MASK_ISO_8_2;
 	      }
 	    if (VECTORP (Vlatin_extra_code_table)
 		&& !NILP (XVECTOR (Vlatin_extra_code_table)->contents[c]))
 	      {
-		if (coding_iso_8_1.flags & CODING_FLAG_ISO_LATIN_EXTRA)
+		if (coding_system_table[CODING_CATEGORY_IDX_ISO_8_1]->flags
+		    & CODING_FLAG_ISO_LATIN_EXTRA)
 		  newmask |= CODING_CATEGORY_MASK_ISO_8_1;
-		if (coding_iso_8_2.flags & CODING_FLAG_ISO_LATIN_EXTRA)
+		if (coding_system_table[CODING_CATEGORY_IDX_ISO_8_2]->flags
+		    & CODING_FLAG_ISO_LATIN_EXTRA)
 		  newmask |= CODING_CATEGORY_MASK_ISO_8_2;
 	      }
 	    mask &= newmask;
+	    mask_found |= newmask;
 	  }
 	  break;
 	default:
 	  if (c < 0x80)
 	      if (VECTORP (Vlatin_extra_code_table)
 		  && !NILP (XVECTOR (Vlatin_extra_code_table)->contents[c]))
 		{
 		  int newmask = 0;
-		  if (coding_iso_8_1.flags & CODING_FLAG_ISO_LATIN_EXTRA)
+		  if (coding_system_table[CODING_CATEGORY_IDX_ISO_8_1]->flags
+		      & CODING_FLAG_ISO_LATIN_EXTRA)
 		    newmask |= CODING_CATEGORY_MASK_ISO_8_1;
-		  if (coding_iso_8_2.flags & CODING_FLAG_ISO_LATIN_EXTRA)
+		  if (coding_system_table[CODING_CATEGORY_IDX_ISO_8_2]->flags
+		      & CODING_FLAG_ISO_LATIN_EXTRA)
 		    newmask |= CODING_CATEGORY_MASK_ISO_8_2;
 		  mask &= newmask;
+		  mask_found |= newmask;
 		}
 	      else
 		return 0;
 	    }
 	  else
 	    {
 	      unsigned char *src_begin = src;
-	      mask &= ~(CODING_CATEGORY_MASK_ISO_7
+	      mask &= ~(CODING_CATEGORY_MASK_ISO_7BIT
 			| CODING_CATEGORY_MASK_ISO_7_ELSE);
+	      mask_found |= CODING_CATEGORY_MASK_ISO_8_1;
 	      while (src < src_end && *src >= 0xA0)
 		src++;
 	      if ((src - src_begin - 1) & 1 && src < src_end)
 		mask &= ~CODING_CATEGORY_MASK_ISO_8_2;
+	      else
+		mask_found |= CODING_CATEGORY_MASK_ISO_8_2;
 	    }
 	  break;
 	}
 }
-return mask;
+return (mask & mask_found);
 }
 /* Decode a character of which charset is CHARSET and the 1st position
 code is C1.  If dimension of CHARSET is 2, the 2nd position code is
 fetched from SRC and set to C2.  If CHARSET is negative, it means
 /* To tell a composition rule follows.  */			\
 coding->composing = COMPOSING_WITH_RULE_RULE;			\
 } while (0)
 /* Set designation state into CODING.  */
-#define DECODE_DESIGNATION(reg, dimension, chars, final_char)		\
+#define DECODE_DESIGNATION(reg, dimension, chars, final_char)		   \
-do {							      		\
+do {									   \
-int charset = ISO_CHARSET_TABLE (make_number (dimension),		\
+int charset = ISO_CHARSET_TABLE (make_number (dimension),		   \
-				     make_number (chars),		\
+				     make_number (chars),		   \
-				     make_number (final_char));		\
+				     make_number (final_char));		   \
-if (charset >= 0)					      		\
+if (charset >= 0							   \
-{					      				\
+	&& CODING_SPEC_ISO_REQUESTED_DESIGNATION (coding, charset) == reg) \
-if (coding->direction == 1					\
+{									   \
-	    && CHARSET_REVERSE_CHARSET (charset) >= 0)      		\
+	if (coding->spec.iso2022.last_invalid_designation_register == 0	   \
-charset = CHARSET_REVERSE_CHARSET (charset);      		\
+	    && reg == 0							   \
-CODING_SPEC_ISO_DESIGNATION (coding, reg) = charset;		\
+	    && charset == CHARSET_ASCII)				   \
-}						      			\
+	  {								   \
+	    /* We should insert this designation sequence as is so	   \
+that it is surely written back to a file.  */		   \
+	    coding->spec.iso2022.last_invalid_designation_register = -1;   \
+	    goto label_invalid_code;					   \
+	  }								   \
+	coding->spec.iso2022.last_invalid_designation_register = -1;	   \
+if ((coding->mode & CODING_MODE_DIRECTION)			   \
+	    && CHARSET_REVERSE_CHARSET (charset) >= 0)			   \
+charset = CHARSET_REVERSE_CHARSET (charset);			   \
+CODING_SPEC_ISO_DESIGNATION (coding, reg) = charset;		   \
+}									   \
+else								   \
+{									   \
+	coding->spec.iso2022.last_invalid_designation_register = reg;	   \
+	goto label_invalid_code;					   \
+}									   \
 } while (0)
+/* Check if the current composing sequence contains only valid codes.
+If the composing sequence doesn't end before SRC_END, return -1.
+Else, if it contains only valid codes, return 0.
+Else return the length of the composing sequence.  */
+int check_composing_code (coding, src, src_end)
+struct coding_system *coding;
+unsigned char *src, *src_end;
+{
+unsigned char *src_start = src;
+int invalid_code_found = 0;
+int charset, c, c1, dim;
+while (src < src_end)
+{
+if (*src++ != ISO_CODE_ESC) continue;
+if (src >= src_end) break;
+if ((c = *src++) == '1') /* end of compsition */
+	return (invalid_code_found ? src - src_start : 0);
+if (src + 2 >= src_end) break;
+if (!coding->flags & CODING_FLAG_ISO_DESIGNATION)
+	invalid_code_found = 1;
+else
+	{
+	  dim = 0;
+	  if (c == '$')
+	    {
+	      dim = 1;
+	      c = (*src >= '@' && *src <= 'B') ? '(' : *src++;
+	    }
+	  if (c >= '(' && c <= '/')
+	    {
+	      c1 = *src++;
+	      if ((c1 < ' ' || c1 >= 0x80)
+		  || (charset = iso_charset_table[dim][c >= ','][c1]) < 0
+		  || (CODING_SPEC_ISO_REQUESTED_DESIGNATION (coding, charset)
+		      == CODING_SPEC_ISO_NO_REQUESTED_DESIGNATION))
+		invalid_code_found = 1;
+	    }
+	  else
+	    invalid_code_found = 1;
+	}
+}
+return ((coding->mode & CODING_MODE_LAST_BLOCK) ? src_end - src_start : -1);
+}
 /* See the above "GENERAL NOTES on `decode_coding_XXX ()' functions".  */
 int
-decode_coding_iso2022 (coding, source, destination,
+decode_coding_iso2022 (coding, source, destination, src_bytes, dst_bytes)
-		       src_bytes, dst_bytes, consumed)
 struct coding_system *coding;
 unsigned char *source, *destination;
 int src_bytes, dst_bytes;
-int *consumed;
 {
 unsigned char *src = source;
 unsigned char *src_end = source + src_bytes;
 unsigned char *dst = destination;
 unsigned char *dst_end = destination + dst_bytes;
 int charset;
 /* Charsets invoked to graphic plane 0 and 1 respectively.  */
 int charset0 = CODING_SPEC_ISO_PLANE_CHARSET (coding, 0);
 int charset1 = CODING_SPEC_ISO_PLANE_CHARSET (coding, 1);
 Lisp_Object unification_table
 = coding->character_unification_table_for_decode;
+int result = CODING_FINISH_NORMAL;
 if (!NILP (Venable_character_unification) && NILP (unification_table))
 unification_table = Vstandard_character_unification_table_for_decode;
-while (src < src_end && dst < adjusted_dst_end)
+coding->produced_char = 0;
+while (src < src_end && (dst_bytes
+			   ? (dst < adjusted_dst_end)
+			   : (dst < src - 6)))
 {
 /* SRC_BASE remembers the start position in source in each loop.
 	 The loop will be exited when there's not enough source text
 	 to analyze long escape sequence or 2-byte code (within macros
 	 ONE_MORE_BYTE or TWO_MORE_BYTES).  In that case, SRC is reset
 	  if (!coding->composing
 	      && (charset0 < 0 || CHARSET_CHARS (charset0) == 94))
 	    {
 	      /* This is SPACE or DEL.  */
 	      *dst++ = c1;
+	      coding->produced_char++;
 	      break;
 	    }
 	  /* This is a graphic character, we fall down ...  */
 	case ISO_graphic_plane_0:
 	  else
 	    DECODE_ISO_CHARACTER (charset0, c1);
 	  break;
 	case ISO_0xA0_or_0xFF:
-	  if (charset1 < 0 || CHARSET_CHARS (charset1) == 94)
+	  if (charset1 < 0 || CHARSET_CHARS (charset1) == 94
+	      || coding->flags & CODING_FLAG_ISO_SEVEN_BITS)
 	    {
 	      /* Invalid code.  */
 	      *dst++ = c1;
+	      coding->produced_char++;
 	      break;
 	    }
 	  /* This is a graphic character, we fall down ... */
 	case ISO_graphic_plane_1:
-	  DECODE_ISO_CHARACTER (charset1, c1);
+	  if (coding->flags & CODING_FLAG_ISO_SEVEN_BITS)
+	    {
+	      /* Invalid code.  */
+	      *dst++ = c1;
+	      coding->produced_char++;
+	    }
+	  else
+	    DECODE_ISO_CHARACTER (charset1, c1);
 	  break;
 	case ISO_control_code:
 	  /* All ISO2022 control characters in this class have the
 same representation in Emacs internal format.  */
+	  if (c1 == '\n'
+	      && (coding->mode & CODING_MODE_INHIBIT_INCONSISTENT_EOL)
+	      && (coding->eol_type == CODING_EOL_CR
+		  || coding->eol_type == CODING_EOL_CRLF))
+	    {
+	      result = CODING_FINISH_INCONSISTENT_EOL;
+	      goto label_end_of_loop_2;
+	    }
 	  *dst++ = c1;
+	  coding->produced_char++;
 	  break;
 	case ISO_carriage_return:
 	  if (coding->eol_type == CODING_EOL_CR)
-	    {
+	    *dst++ = '\n';
-	      *dst++ = '\n';
-	    }
 	  else if (coding->eol_type == CODING_EOL_CRLF)
 	    {
 	      ONE_MORE_BYTE (c1);
 	      if (c1 == ISO_CODE_LF)
 		*dst++ = '\n';
 	      else
 		{
+		  if (coding->mode & CODING_MODE_INHIBIT_INCONSISTENT_EOL)
+		    {
+		      result = CODING_FINISH_INCONSISTENT_EOL;
+		      goto label_end_of_loop_2;
+		    }
 		  src--;
-		  *dst++ = c1;
+		  *dst++ = '\r';
 		}
 	    }
 	  else
-	    {
+	    *dst++ = c1;
-	      *dst++ = c1;
+	  coding->produced_char++;
-	    }
 	  break;
 	case ISO_shift_out:
-	  if (CODING_SPEC_ISO_DESIGNATION (coding, 1) < 0)
+	  if (! (coding->flags & CODING_FLAG_ISO_LOCKING_SHIFT)
-	    goto label_invalid_escape_sequence;
+	      || CODING_SPEC_ISO_DESIGNATION (coding, 1) < 0)
+	    goto label_invalid_code;
 	  CODING_SPEC_ISO_INVOCATION (coding, 0) = 1;
 	  charset0 = CODING_SPEC_ISO_PLANE_CHARSET (coding, 0);
 	  break;
 	case ISO_shift_in:
+	  if (! (coding->flags & CODING_FLAG_ISO_LOCKING_SHIFT))
+	    goto label_invalid_code;
 	  CODING_SPEC_ISO_INVOCATION (coding, 0) = 0;
 	  charset0 = CODING_SPEC_ISO_PLANE_CHARSET (coding, 0);
 	  break;
 	case ISO_single_shift_2_7:
 	case ISO_single_shift_2:
+	  if (! (coding->flags & CODING_FLAG_ISO_SINGLE_SHIFT))
+	    goto label_invalid_code;
 	  /* SS2 is handled as an escape sequence of ESC 'N' */
 	  c1 = 'N';
 	  goto label_escape_sequence;
 	case ISO_single_shift_3:
+	  if (! (coding->flags & CODING_FLAG_ISO_SINGLE_SHIFT))
+	    goto label_invalid_code;
 	  /* SS2 is handled as an escape sequence of ESC 'O' */
 	  c1 = 'O';
 	  goto label_escape_sequence;
 	case ISO_control_sequence_introducer:
 	  switch (c1)
 	    {
 	    case '&':		/* revision of following character set */
 	      ONE_MORE_BYTE (c1);
 	      if (!(c1 >= '@' && c1 <= '~'))
-		goto label_invalid_escape_sequence;
+		goto label_invalid_code;
 	      ONE_MORE_BYTE (c1);
 	      if (c1 != ISO_CODE_ESC)
-		goto label_invalid_escape_sequence;
+		goto label_invalid_code;
 	      ONE_MORE_BYTE (c1);
 	      goto label_escape_sequence;
 	    case '$':		/* designation of 2-byte character set */
+	      if (! (coding->flags & CODING_FLAG_ISO_DESIGNATION))
+		goto label_invalid_code;
 	      ONE_MORE_BYTE (c1);
 	      if (c1 >= '@' && c1 <= 'B')
 		{	/* designation of JISX0208.1978, GB2312.1980,
 				   or JISX0208.1980 */
 		  DECODE_DESIGNATION (0, 2, 94, c1);
 		{	/* designation of DIMENSION2_CHARS96 character set */
 		  ONE_MORE_BYTE (c2);
 		  DECODE_DESIGNATION (c1 - 0x2C, 2, 96, c2);
 		}
 	      else
-		goto label_invalid_escape_sequence;
+		goto label_invalid_code;
 	      break;
 	    case 'n':		/* invocation of locking-shift-2 */
-	      if (CODING_SPEC_ISO_DESIGNATION (coding, 2) < 0)
+	      if (! (coding->flags & CODING_FLAG_ISO_LOCKING_SHIFT)
-		goto label_invalid_escape_sequence;
+		  || CODING_SPEC_ISO_DESIGNATION (coding, 2) < 0)
+		goto label_invalid_code;
 	      CODING_SPEC_ISO_INVOCATION (coding, 0) = 2;
 	      charset0 = CODING_SPEC_ISO_PLANE_CHARSET (coding, 0);
 	      break;
 	    case 'o':		/* invocation of locking-shift-3 */
-	      if (CODING_SPEC_ISO_DESIGNATION (coding, 3) < 0)
+	      if (! (coding->flags & CODING_FLAG_ISO_LOCKING_SHIFT)
-		goto label_invalid_escape_sequence;
+		  || CODING_SPEC_ISO_DESIGNATION (coding, 3) < 0)
+		goto label_invalid_code;
 	      CODING_SPEC_ISO_INVOCATION (coding, 0) = 3;
 	      charset0 = CODING_SPEC_ISO_PLANE_CHARSET (coding, 0);
 	      break;
 	    case 'N':		/* invocation of single-shift-2 */
-	      if (CODING_SPEC_ISO_DESIGNATION (coding, 2) < 0)
+	      if (! (coding->flags & CODING_FLAG_ISO_SINGLE_SHIFT)
-		goto label_invalid_escape_sequence;
+		  || CODING_SPEC_ISO_DESIGNATION (coding, 2) < 0)
+		goto label_invalid_code;
 	      ONE_MORE_BYTE (c1);
 	      charset = CODING_SPEC_ISO_DESIGNATION (coding, 2);
 	      DECODE_ISO_CHARACTER (charset, c1);
 	      break;
 	    case 'O':		/* invocation of single-shift-3 */
-	      if (CODING_SPEC_ISO_DESIGNATION (coding, 3) < 0)
+	      if (! (coding->flags & CODING_FLAG_ISO_SINGLE_SHIFT)
-		goto label_invalid_escape_sequence;
+		  || CODING_SPEC_ISO_DESIGNATION (coding, 3) < 0)
+		goto label_invalid_code;
 	      ONE_MORE_BYTE (c1);
 	      charset = CODING_SPEC_ISO_DESIGNATION (coding, 3);
 	      DECODE_ISO_CHARACTER (charset, c1);
 	      break;
-	    case '0':		/* start composing without embeded rules */
+	    case '0': case '2':	/* start composing */
-	      coding->composing = COMPOSING_NO_RULE_HEAD;
+	      /* Before processing composing, we must be sure that all
+		 characters being composed are supported by CODING.
+		 If not, we must give up composing and insert the
+		 bunch of codes for composing as is without decoding.  */
+	      {
+		int result1;
+		result1 = check_composing_code (coding, src, src_end);
+		if (result1 == 0)
+		  coding->composing = (c1 == '0'
+				       ? COMPOSING_NO_RULE_HEAD
+				       : COMPOSING_WITH_RULE_HEAD);
+		else if (result1 > 0)
+		  {
+		    if (result1 + 2 < (dst_bytes ? dst_end : src_base) - dst)
+		      {
+			bcopy (src_base, dst, result1 + 2);
+			src += result1;
+			dst += result1 + 2;
+			coding->produced_char += result1 + 2;
+		      }
+		    else
+		      {
+			result = CODING_FINISH_INSUFFICIENT_DST;
+			goto label_end_of_loop_2;
+		      }
+		  }
+		else
+		  goto label_end_of_loop;
+	      }
 	      break;
 	    case '1':		/* end composing */
 	      coding->composing = COMPOSING_NO;
+	      coding->produced_char++;
 	      break;
-	    case '2':		/* start composing with embeded rules */
-	      coding->composing = COMPOSING_WITH_RULE_HEAD;
-	      break;
 	    case '[':		/* specification of direction */
+	      if (coding->flags & CODING_FLAG_ISO_NO_DIRECTION)
+		goto label_invalid_code;
 	      /* For the moment, nested direction is not supported.
-		 So, the value of `coding->direction' is 0 or 1: 0
+		 So, `coding->mode & CODING_MODE_DIRECTION' zero means
-		 means left-to-right, 1 means right-to-left.  */
+		 left-to-right, and nozero means right-to-left.  */
 	      ONE_MORE_BYTE (c1);
 	      switch (c1)
 		{
 		case ']':	/* end of the current direction */
-		  coding->direction = 0;
+		  coding->mode &= ~CODING_MODE_DIRECTION;
 		case '0':	/* end of the current direction */
 		case '1':	/* start of left-to-right direction */
 		  ONE_MORE_BYTE (c1);
 		  if (c1 == ']')
-		    coding->direction = 0;
+		    coding->mode &= ~CODING_MODE_DIRECTION;
 		  else
-		    goto label_invalid_escape_sequence;
+		    goto label_invalid_code;
 		  break;
 		case '2':	/* start of right-to-left direction */
 		  ONE_MORE_BYTE (c1);
 		  if (c1 == ']')
-		    coding->direction= 1;
+		    coding->mode |= CODING_MODE_DIRECTION;
 		  else
-		    goto label_invalid_escape_sequence;
+		    goto label_invalid_code;
 		  break;
 		default:
-		  goto label_invalid_escape_sequence;
+		  goto label_invalid_code;
 		}
 	      break;
 	    default:
+	      if (! (coding->flags & CODING_FLAG_ISO_DESIGNATION))
+		goto label_invalid_code;
 	      if (c1 >= 0x28 && c1 <= 0x2B)
 		{	/* designation of DIMENSION1_CHARS94 character set */
 		  ONE_MORE_BYTE (c2);
 		  DECODE_DESIGNATION (c1 - 0x28, 1, 94, c2);
 		}
 		  ONE_MORE_BYTE (c2);
 		  DECODE_DESIGNATION (c1 - 0x2C, 1, 96, c2);
 		}
 	      else
 		{
-		  goto label_invalid_escape_sequence;
+		  goto label_invalid_code;
 		}
 	    }
 	  /* We must update these variables now.  */
 	  charset0 = CODING_SPEC_ISO_PLANE_CHARSET (coding, 0);
 	  charset1 = CODING_SPEC_ISO_PLANE_CHARSET (coding, 1);
 	  break;
-	label_invalid_escape_sequence:
+	label_invalid_code:
-	  {
+	  coding->produced_char += src - src_base;
-	    int length = src - src_base;
+	  while (src_base < src)
+	    *dst++ = *src_base++;
-	    bcopy (src_base, dst, length);
-	    dst += length;
-	  }
 	}
 continue;
 label_end_of_loop:
-coding->carryover_size = src - src_base;
+result = CODING_FINISH_INSUFFICIENT_SRC;
-bcopy (src_base, coding->carryover, coding->carryover_size);
+label_end_of_loop_2:
 src = src_base;
 break;
 }
+if (result == CODING_FINISH_NORMAL
+&& src < src_end)
+result = CODING_FINISH_INSUFFICIENT_DST;
 /* If this is the last block of the text to be decoded, we had
 better just flush out all remaining codes in the text although
 they are not valid characters.  */
-if (coding->last_block)
+if (coding->mode & CODING_MODE_LAST_BLOCK)
 {
 bcopy (src, dst, src_end - src);
 dst += (src_end - src);
 src = src_end;
 }
-*consumed = src - source;
+coding->consumed = coding->consumed_char = src - source;
-return dst - destination;
+coding->produced = dst - destination;
+return result;
 }
 /* ISO2022 encoding stuff.  */
 /*
 It is not enough to say just "ISO2022" on encoding, we have to
-specify more details.  In Emacs, each coding-system of ISO2022
+specify more details.  In Emacs, each coding system of ISO2022
 variant has the following specifications:
 	1. Initial designation to G0 thru G3.
 	2. Allows short-form designation?
 	3. ASCII should be designated to G0 before control characters?
 	4. ASCII should be designated to G0 at end of line?
 charset_alt = charset;						  \
 if (CHARSET_DIMENSION (charset_alt) == 1)				  \
 ENCODE_ISO_CHARACTER_DIMENSION1 (charset_alt, c1);		  \
 else								  \
 ENCODE_ISO_CHARACTER_DIMENSION2 (charset_alt, c1, c2);		  \
+if (! COMPOSING_P (coding->composing))				  \
+coding->consumed_char++;						  \
 } while (0)
 /* Produce designation and invocation codes at a place pointed by DST
 to use CHARSET.  The element `spec.iso2022' of *CODING is updated.
 Return new DST.  */
 	ENCODE_DESIGNATION						    \
 	  (CODING_SPEC_ISO_INITIAL_DESIGNATION (coding, reg), reg, coding); \
 } while (0)
 /* Produce designation sequences of charsets in the line started from
-*SRC to a place pointed by DSTP.
+SRC to a place pointed by *DSTP, and update DSTP.
 If the current block ends before any end-of-line, we may fail to
-find all the necessary *designations.  */
+find all the necessary designations.  */
 encode_designation_at_bol (coding, table, src, src_end, dstp)
 struct coding_system *coding;
 Lisp_Object table;
 unsigned char *src, *src_end, **dstp;
 {
 	  if ((c_alt = unify_char (table, -1, charset, c1, c2)) >= 0)
 	    charset = CHAR_CHARSET (c_alt);
 	}
 reg = CODING_SPEC_ISO_REQUESTED_DESIGNATION (coding, charset);
-if (r[reg] < 0)
+if (reg != CODING_SPEC_ISO_NO_REQUESTED_DESIGNATION && r[reg] < 0)
 	{
 	  found++;
 	  r[reg] = charset;
 	}
 }
 /* See the above "GENERAL NOTES on `encode_coding_XXX ()' functions".  */
 int
-encode_coding_iso2022 (coding, source, destination,
+encode_coding_iso2022 (coding, source, destination, src_bytes, dst_bytes)
-		       src_bytes, dst_bytes, consumed)
 struct coding_system *coding;
 unsigned char *source, *destination;
 int src_bytes, dst_bytes;
-int *consumed;
 {
 unsigned char *src = source;
 unsigned char *src_end = source + src_bytes;
 unsigned char *dst = destination;
 unsigned char *dst_end = destination + dst_bytes;
 from DST_END to assure overflow checking is necessary only at the
 head of loop.  */
 unsigned char *adjusted_dst_end = dst_end - 19;
 Lisp_Object unification_table
 = coding->character_unification_table_for_encode;
+int result = CODING_FINISH_NORMAL;
 if (!NILP (Venable_character_unification) && NILP (unification_table))
 unification_table = Vstandard_character_unification_table_for_encode;
-while (src < src_end && dst < adjusted_dst_end)
+coding->consumed_char = 0;
+while (src < src_end && (dst_bytes
+			   ? (dst < adjusted_dst_end)
+			   : (dst < src - 19)))
 {
 /* SRC_BASE remembers the start position in source in each loop.
 	 The loop will be exited when there's not enough source text
 	 to analyze multi-byte codes (within macros ONE_MORE_BYTE,
 	 TWO_MORE_BYTES, and THREE_MORE_BYTES).  In that case, SRC is
 	  CODING_SPEC_ISO_BOL (coding) = 0;
 	}
 c1 = *src++;
 /* If we are seeing a component of a composite character, we are
-	 seeing a leading-code specially encoded for composition, or a
+	 seeing a leading-code encoded irregularly for composition, or
-	 composition rule if composing with rule.  We must set C1
+	 a composition rule if composing with rule.  We must set C1 to
-	 to a normal leading-code or an ASCII code.  If we are not at
+	 a normal leading-code or an ASCII code.  If we are not seeing
-	 a composed character, we must reset the composition state.  */
+	 a composite character, we must reset composition,
+	 designation, and invocation states.  */
 if (COMPOSING_P (coding->composing))
 	{
 	  if (c1 < 0xA0)
 	    {
 	      /* We are not in a composite character any longer.  */
 	      coding->composing = COMPOSING_NO;
+	      ENCODE_RESET_PLANE_AND_REGISTER;
 	      ENCODE_COMPOSITION_END;
 	    }
 	  else
 	    {
 	      if (coding->composing == COMPOSING_WITH_RULE_RULE)
 	case EMACS_control_code:
 	  if (coding->flags & CODING_FLAG_ISO_RESET_AT_CNTL)
 	    ENCODE_RESET_PLANE_AND_REGISTER;
 	  *dst++ = c1;
+	  coding->consumed_char++;
 	  break;
 	case EMACS_carriage_return_code:
-	  if (!coding->selective)
+	  if (! (coding->mode & CODING_MODE_SELECTIVE_DISPLAY))
 	    {
 	      if (coding->flags & CODING_FLAG_ISO_RESET_AT_CNTL)
 		ENCODE_RESET_PLANE_AND_REGISTER;
 	      *dst++ = c1;
+	      coding->consumed_char++;
 	      break;
 	    }
 	  /* fall down to treat '\r' as '\n' ...  */
 	case EMACS_linefeed_code:
 	  else if (coding->eol_type == CODING_EOL_CRLF)
 	    *dst++ = ISO_CODE_CR, *dst++ = ISO_CODE_LF;
 	  else
 	    *dst++ = ISO_CODE_CR;
 	  CODING_SPEC_ISO_BOL (coding) = 1;
+	  coding->consumed_char++;
 	  break;
 	case EMACS_leading_code_2:
 	  ONE_MORE_BYTE (c2);
 	  if (c2 < 0xA0)
 	    {
 	      /* invalid sequence */
 	      *dst++ = c1;
 	      *dst++ = c2;
+	      coding->consumed_char += 2;
 	    }
 	  else
 	    ENCODE_ISO_CHARACTER (c1, c2, /* dummy */ c3);
 	  break;
 	    {
 	      /* invalid sequence */
 	      *dst++ = c1;
 	      *dst++ = c2;
 	      *dst++ = c3;
+	      coding->consumed_char += 3;
 	    }
 	  else if (c1 < LEADING_CODE_PRIVATE_11)
 	    ENCODE_ISO_CHARACTER (c1, c2, c3);
 	  else
 	    ENCODE_ISO_CHARACTER (c2, c3, /* dummy */ c4);
 	      /* invalid sequence */
 	      *dst++ = c1;
 	      *dst++ = c2;
 	      *dst++ = c3;
 	      *dst++ = c4;
+	      coding->consumed_char += 4;
 	    }
 	  else
 	    ENCODE_ISO_CHARACTER (c2, c3, c4);
 	  break;
 	  if (c2 < 0xA0)
 	    {
 	      /* invalid sequence */
 	      *dst++ = c1;
 	      *dst++ = c2;
+	      coding->consumed_char += 2;
 	    }
 	  else if (c2 == 0xFF)
 	    {
+	      ENCODE_RESET_PLANE_AND_REGISTER;
 	      coding->composing = COMPOSING_WITH_RULE_HEAD;
 	      ENCODE_COMPOSITION_WITH_RULE_START;
+	      coding->consumed_char++;
 	    }
 	  else
 	    {
+	      ENCODE_RESET_PLANE_AND_REGISTER;
 	      /* Rewind one byte because it is a character code of
 composition elements.  */
 	      src--;
 	      coding->composing = COMPOSING_NO_RULE_HEAD;
 	      ENCODE_COMPOSITION_NO_RULE_START;
+	      coding->consumed_char++;
 	    }
 	  break;
 	case EMACS_invalid_code:
 	  *dst++ = c1;
+	  coding->consumed_char++;
 	  break;
 	}
 continue;
 label_end_of_loop:
-/* We reach here because the source date ends not at character
+result = CODING_FINISH_INSUFFICIENT_SRC;
-	 boundary.  */
+src = src_base;
-coding->carryover_size = src_end - src_base;
-bcopy (src_base, coding->carryover, coding->carryover_size);
-src = src_end;
 break;
 }
+if (result == CODING_FINISH_NORMAL
+&& src < src_end)
+result = CODING_FINISH_INSUFFICIENT_DST;
 /* If this is the last block of the text to be encoded, we must
-reset graphic planes and registers to the initial state.  */
+reset graphic planes and registers to the initial state, and
-if (src >= src_end && coding->last_block)
+flush out the carryover if any.  */
-{
+if (coding->mode & CODING_MODE_LAST_BLOCK)
 ENCODE_RESET_PLANE_AND_REGISTER;
-if (coding->carryover_size > 0
-	  && coding->carryover_size < (dst_end - dst))
+coding->consumed = src - source;
-	{
+coding->produced = coding->produced_char = dst - destination;
-	  bcopy (coding->carryover, dst, coding->carryover_size);
+return result;
-	  dst += coding->carryover_size;
-	  coding->carryover_size = 0;
-	}
-}
-*consumed = src - source;
-return dst - destination;
 }
 /*** 4. SJIS and BIG5 handlers ***/
 DECODE_CHARACTER_ASCII (c1);					\
 else if (CHARSET_DIMENSION (charset_alt) == 1)			\
 DECODE_CHARACTER_DIMENSION1 (charset_alt, c1);			\
 else								\
 DECODE_CHARACTER_DIMENSION2 (charset_alt, c1, c2);		\
+coding->produced_char++;						\
 } while (0)
 #define ENCODE_SJIS_BIG5_CHARACTER(charset, c1, c2)			  \
 do {									  \
 int c_alt, charset_alt;						  \
 	    *dst++ = b1, *dst++ = b2;					  \
 	  }								  \
 	else								  \
 	  *dst++ = charset_alt, *dst++ = c1, *dst++ = c2;		  \
 }									  \
+coding->consumed_char++;						  \
 } while (0);
 /* See the above "GENERAL NOTES on `detect_coding_XXX ()' functions".
 Check if a text is encoded in SJIS.  If it is, return
 CODING_CATEGORY_MASK_SJIS, else return 0.  */
 unsigned char c;
 while (src < src_end)
 {
 c = *src++;
-if (c == ISO_CODE_ESC || c == ISO_CODE_SI || c == ISO_CODE_SO)
-	return 0;
 if ((c >= 0x80 && c < 0xA0) || c >= 0xE0)
 	{
 	  if (src < src_end && *src++ < 0x40)
 	    return 0;
 	}
 unsigned char c;
 while (src < src_end)
 {
 c = *src++;
-if (c == ISO_CODE_ESC || c == ISO_CODE_SI || c == ISO_CODE_SO)
-	return 0;
 if (c >= 0xA1)
 	{
 	  if (src >= src_end)
 	    break;
 	  c = *src++;
 /* See the above "GENERAL NOTES on `decode_coding_XXX ()' functions".
 If SJIS_P is 1, decode SJIS text, else decode BIG5 test.  */
 int
 decode_coding_sjis_big5 (coding, source, destination,
-			 src_bytes, dst_bytes, consumed, sjis_p)
+			 src_bytes, dst_bytes, sjis_p)
 struct coding_system *coding;
 unsigned char *source, *destination;
 int src_bytes, dst_bytes;
-int *consumed;
 int sjis_p;
 {
 unsigned char *src = source;
 unsigned char *src_end = source + src_bytes;
 unsigned char *dst = destination;
 from DST_END to assure overflow checking is necessary only at the
 head of loop.  */
 unsigned char *adjusted_dst_end = dst_end - 3;
 Lisp_Object unification_table
 = coding->character_unification_table_for_decode;
+int result = CODING_FINISH_NORMAL;
 if (!NILP (Venable_character_unification) && NILP (unification_table))
 unification_table = Vstandard_character_unification_table_for_decode;
-while (src < src_end && dst < adjusted_dst_end)
+coding->produced_char = 0;
+while (src < src_end && (dst_bytes
+			   ? (dst < adjusted_dst_end)
+			   : (dst < src - 3)))
 {
 /* SRC_BASE remembers the start position in source in each loop.
 	 The loop will be exited when there's not enough source text
 	 to analyze two-byte character (within macro ONE_MORE_BYTE).
 	 In that case, SRC is reset to SRC_BASE before exiting.  */
 unsigned char *src_base = src;
 unsigned char c1 = *src++, c2, c3, c4;
-if (c1 == '\r')
+if (c1 < 0x20)
 	{
-	  if (coding->eol_type == CODING_EOL_CRLF)
+	  if (c1 == '\r')
 	    {
-	      ONE_MORE_BYTE (c2);
+	      if (coding->eol_type == CODING_EOL_CRLF)
-	      if (c2 == '\n')
+		{
-		*dst++ = c2;
+		  ONE_MORE_BYTE (c2);
+		  if (c2 == '\n')
+		    *dst++ = c2;
+		  else if (coding->mode & CODING_MODE_INHIBIT_INCONSISTENT_EOL)
+		    {
+		      result = CODING_FINISH_INCONSISTENT_EOL;
+		      goto label_end_of_loop_2;
+		    }
+		  else
+		    /* To process C2 again, SRC is subtracted by 1.  */
+		    *dst++ = c1, src--;
+		}
+	      else if (coding->eol_type == CODING_EOL_CR)
+		*dst++ = '\n';
 	      else
-		/* To process C2 again, SRC is subtracted by 1.  */
+		*dst++ = c1;
-		*dst++ = c1, src--;
 	    }
-	  else if (coding->eol_type == CODING_EOL_CR)
+	  else if (c1 == '\n'
-	    *dst++ = '\n';
+		   && (coding->mode & CODING_MODE_INHIBIT_INCONSISTENT_EOL)
+		   && (coding->eol_type == CODING_EOL_CR
+		       || coding->eol_type == CODING_EOL_CRLF))
+	    {
+	      result = CODING_FINISH_INCONSISTENT_EOL;
+	      goto label_end_of_loop_2;
+	    }
 	  else
 	    *dst++ = c1;
-	}
+	  coding->produced_char++;
-else if (c1 < 0x20)
+	}
-	*dst++ = c1;
 else if (c1 < 0x80)
 	DECODE_SJIS_BIG5_CHARACTER (charset_ascii, c1, /* dummy */ c2);
 else if (c1 < 0xA0 || c1 >= 0xE0)
 	{
 	  /* SJIS -> JISX0208, BIG5 -> Big5 (only if 0xE0 <= c1 < 0xFF) */
 	      ONE_MORE_BYTE (c2);
 	      DECODE_BIG5 (c1, c2, charset, c3, c4);
 	      DECODE_SJIS_BIG5_CHARACTER (charset, c3, c4);
 	    }
 	  else			/* Invalid code */
-	    *dst++ = c1;
+	    {
+	      *dst++ = c1;
+	      coding->produced_char++;
+	    }
 	}
 else
 	{
 	  /* SJIS -> JISX0201-Kana, BIG5 -> Big5 */
 	  if (sjis_p)
-	    DECODE_SJIS_BIG5_CHARACTER (charset_katakana_jisx0201, c1, /* dummy */ c2);
+	    DECODE_SJIS_BIG5_CHARACTER (charset_katakana_jisx0201, c1,
+					/* dummy */ c2);
 	  else
 	    {
 	      int charset;
 	      ONE_MORE_BYTE (c2);
 	    }
 	}
 continue;
 label_end_of_loop:
-coding->carryover_size = src - src_base;
+result = CODING_FINISH_INSUFFICIENT_SRC;
-bcopy (src_base, coding->carryover, coding->carryover_size);
+label_end_of_loop_2:
 src = src_base;
 break;
 }
-*consumed = src - source;
+if (result == CODING_FINISH_NORMAL
-return dst - destination;
+&& src < src_end)
+result = CODING_FINISH_INSUFFICIENT_DST;
+coding->consumed = coding->consumed_char = src - source;
+coding->produced = dst - destination;
+return result;
 }
 /* See the above "GENERAL NOTES on `encode_coding_XXX ()' functions".
 This function can encode `charset_ascii', `charset_katakana_jisx0201',
 `charset_jisx0208', `charset_big5_1', and `charset_big5-2'.  We are
 charsets are produced without any encoding.  If SJIS_P is 1, encode
 SJIS text, else encode BIG5 text.  */
 int
 encode_coding_sjis_big5 (coding, source, destination,
-			 src_bytes, dst_bytes, consumed, sjis_p)
+			 src_bytes, dst_bytes, sjis_p)
 struct coding_system *coding;
 unsigned char *source, *destination;
 int src_bytes, dst_bytes;
-int *consumed;
 int sjis_p;
 {
 unsigned char *src = source;
 unsigned char *src_end = source + src_bytes;
 unsigned char *dst = destination;
 from DST_END to assure overflow checking is necessary only at the
 head of loop.  */
 unsigned char *adjusted_dst_end = dst_end - 1;
 Lisp_Object unification_table
 = coding->character_unification_table_for_encode;
+int result = CODING_FINISH_NORMAL;
 if (!NILP (Venable_character_unification) && NILP (unification_table))
 unification_table = Vstandard_character_unification_table_for_encode;
-while (src < src_end && dst < adjusted_dst_end)
+coding->consumed_char = 0;
+while (src < src_end && (dst_bytes
+			   ? (dst < adjusted_dst_end)
+			   : (dst < src - 1)))
 {
 /* SRC_BASE remembers the start position in source in each loop.
 	 The loop will be exited when there's not enough source text
 	 to analyze multi-byte codes (within macros ONE_MORE_BYTE and
 	 TWO_MORE_BYTES).  In that case, SRC is reset to SRC_BASE
 	  ENCODE_SJIS_BIG5_CHARACTER (charset_ascii, c1, /* dummy */ c2);
 	  break;
 	case EMACS_control_code:
 	  *dst++ = c1;
+	  coding->consumed_char++;
 	  break;
 	case EMACS_carriage_return_code:
-	  if (!coding->selective)
+	  if (! (coding->mode & CODING_MODE_SELECTIVE_DISPLAY))
 	    {
 	      *dst++ = c1;
+	      coding->consumed_char++;
 	      break;
 	    }
 	  /* fall down to treat '\r' as '\n' ...  */
 	case EMACS_linefeed_code:
 	    *dst++ = '\n';
 	  else if (coding->eol_type == CODING_EOL_CRLF)
 	    *dst++ = '\r', *dst++ = '\n';
 	  else
 	    *dst++ = '\r';
+	  coding->consumed_char++;
 	  break;
 	case EMACS_leading_code_2:
 	  ONE_MORE_BYTE (c2);
 	  ENCODE_SJIS_BIG5_CHARACTER (c1, c2, /* dummy */ c3);
 	  coding->composing = 1;
 	  break;
 	default:		/* i.e. case EMACS_invalid_code: */
 	  *dst++ = c1;
+	  coding->consumed_char++;
 	}
 continue;
 label_end_of_loop:
-coding->carryover_size = src_end - src_base;
+result = CODING_FINISH_INSUFFICIENT_SRC;
-bcopy (src_base, coding->carryover, coding->carryover_size);
+src = src_base;
-src = src_end;
 break;
 }
-*consumed = src - source;
+if (result == CODING_FINISH_NORMAL
-return dst - destination;
+&& src < src_end)
+result = CODING_FINISH_INSUFFICIENT_DST;
+coding->consumed = src - source;
+coding->produced = coding->produced_char = dst - destination;
+return result;
 }
 /*** 5. End-of-line handlers ***/
 /* See the above "GENERAL NOTES on `decode_coding_XXX ()' functions".
 This function is called only when `coding->eol_type' is
 CODING_EOL_CRLF or CODING_EOL_CR.  */
-decode_eol (coding, source, destination, src_bytes, dst_bytes, consumed)
+decode_eol (coding, source, destination, src_bytes, dst_bytes)
 struct coding_system *coding;
 unsigned char *source, *destination;
 int src_bytes, dst_bytes;
-int *consumed;
 {
 unsigned char *src = source;
 unsigned char *src_end = source + src_bytes;
 unsigned char *dst = destination;
 unsigned char *dst_end = destination + dst_bytes;
-int produced;
+int result = CODING_FINISH_NORMAL;
+if (src_bytes <= 0)
+return result;
 switch (coding->eol_type)
 {
 case CODING_EOL_CRLF:
 {
 	/* Since the maximum bytes produced by each loop is 2, we
 	   subtract 1 from DST_END to assure overflow checking is
 	   necessary only at the head of loop.  */
 	unsigned char *adjusted_dst_end = dst_end - 1;
-	while (src < src_end && dst < adjusted_dst_end)
+	while (src < src_end && (dst_bytes
+				 ? (dst < adjusted_dst_end)
+				 : (dst < src - 1)))
 	  {
 	    unsigned char *src_base = src;
 	    unsigned char c = *src++;
 	    if (c == '\r')
 	      {
 		ONE_MORE_BYTE (c);
 		if (c != '\n')
-		  *dst++ = '\r';
+		  {
+		    if (coding->mode & CODING_MODE_INHIBIT_INCONSISTENT_EOL)
+		      {
+			result = CODING_FINISH_INCONSISTENT_EOL;
+			goto label_end_of_loop_2;
+		      }
+		    *dst++ = '\r';
+		  }
 		*dst++ = c;
+	      }
+	    else if (c == '\n'
+		     && (coding->mode & CODING_MODE_INHIBIT_INCONSISTENT_EOL))
+	      {
+		result = CODING_FINISH_INCONSISTENT_EOL;
+		goto label_end_of_loop_2;
 	      }
 	    else
 	      *dst++ = c;
 	    continue;
 	  label_end_of_loop:
-	    coding->carryover_size = src - src_base;
+	    result = CODING_FINISH_INSUFFICIENT_SRC;
-	    bcopy (src_base, coding->carryover, coding->carryover_size);
+	  label_end_of_loop_2:
 	    src = src_base;
 	    break;
 	  }
-	*consumed = src - source;
+	if (result == CODING_FINISH_NORMAL
-	produced = dst - destination;
+	    && src < src_end)
-	break;
+	  result = CODING_FINISH_INSUFFICIENT_DST;
 }
+break;
 case CODING_EOL_CR:
-produced = (src_bytes > dst_bytes) ? dst_bytes : src_bytes;
+if (coding->mode & CODING_MODE_INHIBIT_INCONSISTENT_EOL)
-bcopy (source, destination, produced);
+	{
-dst_end = destination + produced;
+	  while (src < src_end) if (*src++ == '\n') break;
-while (dst < dst_end)
+	  if (*--src == '\n')
-	if (*dst++ == '\r') dst[-1] = '\n';
+	    {
-*consumed = produced;
+	      src_bytes = src - source;
+	      result = CODING_FINISH_INCONSISTENT_EOL;
+	    }
+	}
+if (dst_bytes && src_bytes > dst_bytes)
+	{
+	  result = CODING_FINISH_INSUFFICIENT_DST;
+	  src_bytes = dst_bytes;
+	}
+if (dst_bytes)
+	bcopy (source, destination, src_bytes);
+else
+	safe_bcopy (source, destination, src_bytes);
+src = source + src_bytes;
+while (src_bytes--) if (*dst++ == '\r') dst[-1] = '\n';
 break;
 default:			/* i.e. case: CODING_EOL_LF */
-produced = (src_bytes > dst_bytes) ? dst_bytes : src_bytes;
+if (dst_bytes && src_bytes > dst_bytes)
-bcopy (source, destination, produced);
+	{
-*consumed = produced;
+	  result = CODING_FINISH_INSUFFICIENT_DST;
+	  src_bytes = dst_bytes;
+	}
+if (dst_bytes)
+	bcopy (source, destination, src_bytes);
+else
+	safe_bcopy (source, destination, src_bytes);
+src += src_bytes;
+dst += dst_bytes;
 break;
 }
-return produced;
+coding->consumed = coding->consumed_char = src - source;
+coding->produced = coding->produced_char = dst - destination;
+return result;
 }
 /* See "GENERAL NOTES about `encode_coding_XXX ()' functions".  Encode
 format of end-of-line according to `coding->eol_type'.  If
-`coding->selective' is 1, code '\r' in source text also means
+`coding->mode & CODING_MODE_SELECTIVE_DISPLAY' is nonzero, code
-end-of-line.  */
+'\r' in source text also means end-of-line.  */
-encode_eol (coding, source, destination, src_bytes, dst_bytes, consumed)
+encode_eol (coding, source, destination, src_bytes, dst_bytes)
 struct coding_system *coding;
 unsigned char *source, *destination;
 int src_bytes, dst_bytes;
-int *consumed;
 {
 unsigned char *src = source;
 unsigned char *dst = destination;
-int produced;
+int result = CODING_FINISH_NORMAL;
-if (src_bytes <= 0)
+if (coding->eol_type == CODING_EOL_CRLF)
-return 0;
+{
+unsigned char c;
-switch (coding->eol_type)
+unsigned char *src_end = source + src_bytes;
-{
+unsigned char *dst_end = destination + dst_bytes;
-case CODING_EOL_LF:
+/* Since the maximum bytes produced by each loop is 2, we
-case CODING_EOL_UNDECIDED:
+	 subtract 1 from DST_END to assure overflow checking is
-produced = (src_bytes > dst_bytes) ? dst_bytes : src_bytes;
+	 necessary only at the head of loop.  */
-bcopy (source, destination, produced);
+unsigned char *adjusted_dst_end = dst_end - 1;
-if (coding->selective)
-	{
+while (src < src_end && (dst_bytes
-	  int i = produced;
+			       ? (dst < adjusted_dst_end)
-	  while (i--)
+			       : (dst < src - 1)))
+	{
+	  c = *src++;
+	  if (c == '\n'
+	      || (c == '\r' && (coding->mode & CODING_MODE_SELECTIVE_DISPLAY)))
+	    *dst++ = '\r', *dst++ = '\n';
+	  else
+	    *dst++ = c;
+	}
+if (src < src_end)
+	result = CODING_FINISH_INSUFFICIENT_DST;
+}
+else
+{
+if (dst_bytes && src_bytes > dst_bytes)
+	{
+	  src_bytes = dst_bytes;
+	  result = CODING_FINISH_INSUFFICIENT_DST;
+	}
+if (dst_bytes)
+	bcopy (source, destination, src_bytes);
+else
+	safe_bcopy (source, destination, src_bytes);
+if (coding->eol_type == CODING_EOL_CRLF)
+	{
+	  while (src_bytes--)
+	    if (*dst++ == '\n') dst[-1] = '\r';
+	}
+else if (coding->mode & CODING_MODE_SELECTIVE_DISPLAY)
+	{
+	  while (src_bytes--)
 	    if (*dst++ == '\r') dst[-1] = '\n';
 	}
-*consumed = produced;
+src += src_bytes;
+dst += src_bytes;
-case CODING_EOL_CRLF:
+}
-{
-	unsigned char c;
+coding->consumed = coding->consumed_char = src - source;
-	unsigned char *src_end = source + src_bytes;
+coding->produced = coding->produced_char = dst - destination;
-	unsigned char *dst_end = destination + dst_bytes;
+return result;
-	/* Since the maximum bytes produced by each loop is 2, we
-	   subtract 1 from DST_END to assure overflow checking is
-	   necessary only at the head of loop.  */
-	unsigned char *adjusted_dst_end = dst_end - 1;
-	while (src < src_end && dst < adjusted_dst_end)
-	  {
-	    c = *src++;
-	    if (c == '\n' || (c == '\r' && coding->selective))
-	      *dst++ = '\r', *dst++ = '\n';
-	    else
-	      *dst++ = c;
-	  }
-	produced = dst - destination;
-	*consumed = src - source;
-	break;
-}
-default:			/* i.e. case CODING_EOL_CR: */
-produced = (src_bytes > dst_bytes) ? dst_bytes : src_bytes;
-bcopy (source, destination, produced);
-{
-	int i = produced;
-	while (i--)
-	  if (*dst++ == '\n') dst[-1] = '\r';
-}
-*consumed = produced;
-}
-return produced;
 }
 /*** 6. C library functions ***/
 int
 setup_coding_system (coding_system, coding)
 Lisp_Object coding_system;
 struct coding_system *coding;
 {
-Lisp_Object coding_spec, plist, type, eol_type;
+Lisp_Object coding_spec, coding_type, eol_type, plist;
 Lisp_Object val;
 int i;
-/* At first, set several fields to default values.  */
+/* Initialize some fields required for all kinds of coding systems.  */
-coding->last_block = 0;
+coding->symbol = coding_system;
-coding->selective = 0;
+coding->common_flags = 0;
-coding->composing = 0;
+coding->mode = 0;
-coding->direction = 0;
+coding->heading_ascii = -1;
-coding->carryover_size = 0;
 coding->post_read_conversion = coding->pre_write_conversion = Qnil;
-coding->character_unification_table_for_decode = Qnil;
-coding->character_unification_table_for_encode = Qnil;
-coding->symbol = coding_system;
-eol_type = Qnil;
-/* Get values of property `coding-system' and `eol-type'.
-Also get values of coding system properties:
-`post-read-conversion', `pre-write-conversion',
-`character-unification-table-for-decode',
-`character-unification-table-for-encode'.  */
 coding_spec = Fget (coding_system, Qcoding_system);
 if (!VECTORP (coding_spec)
 || XVECTOR (coding_spec)->size != 5
 || !CONSP (XVECTOR (coding_spec)->contents[3]))
 goto label_invalid_coding_system;
-if (!inhibit_eol_conversion)
-eol_type = Fget (coding_system, Qeol_type);
+eol_type = inhibit_eol_conversion ? Qnil : Fget (coding_system, Qeol_type);
+if (VECTORP (eol_type))
+{
+coding->eol_type = CODING_EOL_UNDECIDED;
+coding->common_flags = CODING_REQUIRE_DETECTION_MASK;
+}
+else if (XFASTINT (eol_type) == 1)
+{
+coding->eol_type = CODING_EOL_CRLF;
+coding->common_flags
+	= CODING_REQUIRE_DECODING_MASK | CODING_REQUIRE_ENCODING_MASK;
+}
+else if (XFASTINT (eol_type) == 2)
+{
+coding->eol_type = CODING_EOL_CR;
+coding->common_flags
+	= CODING_REQUIRE_DECODING_MASK | CODING_REQUIRE_ENCODING_MASK;
+}
+else
+coding->eol_type = CODING_EOL_LF;
+coding_type = XVECTOR (coding_spec)->contents[0];
+/* Try short cut.  */
+if (SYMBOLP (coding_type))
+{
+if (EQ (coding_type, Qt))
+	{
+	  coding->type = coding_type_undecided;
+	  coding->common_flags |= CODING_REQUIRE_DETECTION_MASK;
+	}
+else
+	coding->type = coding_type_no_conversion;
+return 0;
+}
+/* Initialize remaining fields.  */
+coding->composing = 0;
+coding->character_unification_table_for_decode = Qnil;
+coding->character_unification_table_for_encode = Qnil;
+/* Get values of coding system properties:
+`post-read-conversion', `pre-write-conversion',
+`character-unification-table-for-decode',
+`character-unification-table-for-encode'.  */
 plist = XVECTOR (coding_spec)->contents[3];
 coding->post_read_conversion = Fplist_get (plist, Qpost_read_conversion);
 coding->pre_write_conversion = Fplist_get (plist, Qpre_write_conversion);
 val = Fplist_get (plist, Qcharacter_unification_table_for_decode);
 if (SYMBOLP (val))
 val = Fplist_get (plist, Qcharacter_unification_table_for_encode);
 if (SYMBOLP (val))
 val = Fget (val, Qcharacter_unification_table_for_encode);
 coding->character_unification_table_for_encode
 = CHAR_TABLE_P (val) ? val : Qnil;
+val = Fplist_get (plist, Qcoding_category);
+if (!NILP (val))
+{
+val = Fget (val, Qcoding_category_index);
+if (INTEGERP (val))
+	coding->category_idx = XINT (val);
+else
+	goto label_invalid_coding_system;
+}
+else
+goto label_invalid_coding_system;
 val = Fplist_get (plist, Qsafe_charsets);
 if (EQ (val, Qt))
 {
 for (i = 0; i <= MAX_CHARSET; i++)
 	    coding->safe_charsets[i] = 1;
 	  val = XCONS (val)->cdr;
 	}
 }
-if (VECTORP (eol_type))
+switch (XFASTINT (coding_type))
-{
-coding->eol_type = CODING_EOL_UNDECIDED;
-coding->common_flags = CODING_REQUIRE_DETECTION_MASK;
-}
-else if (XFASTINT (eol_type) == 1)
-{
-coding->eol_type = CODING_EOL_CRLF;
-coding->common_flags
-	= CODING_REQUIRE_DECODING_MASK | CODING_REQUIRE_ENCODING_MASK;
-}
-else if (XFASTINT (eol_type) == 2)
-{
-coding->eol_type = CODING_EOL_CR;
-coding->common_flags
-	= CODING_REQUIRE_DECODING_MASK | CODING_REQUIRE_ENCODING_MASK;
-}
-else
-{
-coding->eol_type = CODING_EOL_LF;
-coding->common_flags = 0;
-}
-type = XVECTOR (coding_spec)->contents[0];
-switch (XFASTINT (type))
 {
 case 0:
 coding->type = coding_type_emacs_mule;
 if (!NILP (coding->post_read_conversion))
 	coding->common_flags |= CODING_REQUIRE_DECODING_MASK;
 coding->common_flags
 	|= CODING_REQUIRE_DECODING_MASK | CODING_REQUIRE_ENCODING_MASK;
 {
 	Lisp_Object val, temp;
 	Lisp_Object *flags;
-	int i, charset, default_reg_bits = 0;
+	int i, charset, reg_bits = 0;
 	val = XVECTOR (coding_spec)->contents[4];
 	if (!VECTORP (val) || XVECTOR (val)->size != 32)
 	  goto label_invalid_coding_system;
 		t: designate nothing to REG initially, but can be used
 		  by any charsets,
 		list of integer, nil, or t: designate the first
 		  element (if integer) to REG initially, the remaining
 		  elements (if integer) is designated to REG on request,
-		  if an element is t, REG can be used by any charset,
+		  if an element is t, REG can be used by any charsets,
 		nil: REG is never used.  */
 	for (charset = 0; charset <= MAX_CHARSET; charset++)
 	  CODING_SPEC_ISO_REQUESTED_DESIGNATION (coding, charset)
 	    = CODING_SPEC_ISO_NO_REQUESTED_DESIGNATION;
 	for (i = 0; i < 4; i++)
 		CODING_SPEC_ISO_REQUESTED_DESIGNATION (coding, charset) = i;
 	      }
 	    else if (EQ (flags[i], Qt))
 	      {
 		CODING_SPEC_ISO_INITIAL_DESIGNATION (coding, i) = -1;
-		default_reg_bits |= 1 << i;
+		reg_bits |= 1 << i;
+		coding->flags |= CODING_FLAG_ISO_DESIGNATION;
 	      }
 	    else if (CONSP (flags[i]))
 	      {
 		Lisp_Object tail = flags[i];
+		coding->flags |= CODING_FLAG_ISO_DESIGNATION;
 		if (INTEGERP (XCONS (tail)->car)
 		    && (charset = XINT (XCONS (tail)->car),
 			CHARSET_VALID_P (charset))
 		    || (charset = get_charset_id (XCONS (tail)->car)) >= 0)
 		  {
 			    CHARSET_VALID_P (charset))
 			|| (charset = get_charset_id (XCONS (tail)->car)) >= 0)
 		      CODING_SPEC_ISO_REQUESTED_DESIGNATION (coding, charset)
 			= i;
 		    else if (EQ (XCONS (tail)->car, Qt))
-		      default_reg_bits |= 1 << i;
+		      reg_bits |= 1 << i;
 		    tail = XCONS (tail)->cdr;
 		  }
 	      }
 	    else
 	      CODING_SPEC_ISO_INITIAL_DESIGNATION (coding, i) = -1;
 	    CODING_SPEC_ISO_DESIGNATION (coding, i)
 	      = CODING_SPEC_ISO_INITIAL_DESIGNATION (coding, i);
 	  }
-	if (! (coding->flags & CODING_FLAG_ISO_LOCKING_SHIFT))
+	if (reg_bits && ! (coding->flags & CODING_FLAG_ISO_LOCKING_SHIFT))
 	  {
 	    /* REG 1 can be used only by locking shift in 7-bit env.  */
 	    if (coding->flags & CODING_FLAG_ISO_SEVEN_BITS)
-	      default_reg_bits &= ~2;
+	      reg_bits &= ~2;
 	    if (! (coding->flags & CODING_FLAG_ISO_SINGLE_SHIFT))
 	      /* Without any shifting, only REG 0 and 1 can be used.  */
-	      default_reg_bits &= 3;
+	      reg_bits &= 3;
 	  }
-	for (charset = 0; charset <= MAX_CHARSET; charset++)
+	if (reg_bits)
-	  if (CHARSET_VALID_P (charset)
+	  for (charset = 0; charset <= MAX_CHARSET; charset++)
-	      && (CODING_SPEC_ISO_REQUESTED_DESIGNATION (coding, charset)
-		  == CODING_SPEC_ISO_NO_REQUESTED_DESIGNATION))
 	    {
-	      /* We have not yet decided where to designate CHARSET.  */
+	      if (CHARSET_VALID_P (charset))
-	      int reg_bits = default_reg_bits;
+		{
+		  /* There exist some default graphic registers to be
-	      if (CHARSET_CHARS (charset) == 96)
+		     used CHARSET.  */
-		/* A charset of CHARS96 can't be designated to REG 0.  */
-		reg_bits &= ~1;
+		  /* We had better avoid designating a charset of
+		     CHARS96 to REG 0 as far as possible.  */
-	      if (reg_bits)
+		  if (CHARSET_CHARS (charset) == 96)
-		/* There exist some default graphic register.  */
+		    CODING_SPEC_ISO_REQUESTED_DESIGNATION (coding, charset)
-		CODING_SPEC_ISO_REQUESTED_DESIGNATION (coding, charset)
+		      = (reg_bits & 2
-		  = (reg_bits & 1
+			 ? 1 : (reg_bits & 4 ? 2 : (reg_bits & 8 ? 3 : 0)));
-		     ? 0 : (reg_bits & 2 ? 1 : (reg_bits & 4 ? 2 : 3)));
+		  else
-	      else
+		    CODING_SPEC_ISO_REQUESTED_DESIGNATION (coding, charset)
-		/* We anyway have to designate CHARSET to somewhere.  */
+		      = (reg_bits & 1
-		CODING_SPEC_ISO_REQUESTED_DESIGNATION (coding, charset)
+			 ? 0 : (reg_bits & 2 ? 1 : (reg_bits & 4 ? 2 : 3)));
-		  = (CHARSET_CHARS (charset) == 94
+		}
-		     ? 0
-		     : ((coding->flags & CODING_FLAG_ISO_LOCKING_SHIFT
-			 || ! coding->flags & CODING_FLAG_ISO_SEVEN_BITS)
-			? 1
-			: (coding->flags & CODING_FLAG_ISO_SINGLE_SHIFT
-			   ? 2 : 0)));
 	    }
 }
 coding->common_flags |= CODING_REQUIRE_FLUSHING_MASK;
+coding->spec.iso2022.last_invalid_designation_register = -1;
 break;
 case 3:
 coding->type = coding_type_big5;
 coding->common_flags
 case 5:
 coding->type = coding_type_raw_text;
 break;
 default:
-if (EQ (type, Qt))
+goto label_invalid_coding_system;
-	{
-	  coding->type = coding_type_undecided;
-	  coding->common_flags |= CODING_REQUIRE_DETECTION_MASK;
-	}
-else
-	coding->type = coding_type_no_conversion;
-break;
 }
 return 0;
 label_invalid_coding_system:
 coding->type = coding_type_no_conversion;
+coding->category_idx = CODING_CATEGORY_IDX_BINARY;
 coding->common_flags = 0;
 coding->eol_type = CODING_EOL_LF;
-coding->symbol = coding->pre_write_conversion = coding->post_read_conversion
+coding->pre_write_conversion = coding->post_read_conversion = Qnil;
-= Qnil;
 return -1;
 }
 /* Emacs has a mechanism to automatically detect a coding system if it
 is one of Emacs' internal format, ISO2022, SJIS, and BIG5.  But,
 o coding-category-iso-7
 	The category for a coding system which has the same code range
 	as ISO2022 of 7-bit environment.  This doesn't use any locking
-	shift and single shift functions.  Assigned the coding-system
+	shift and single shift functions.  This can encode/decode all
-	(Lisp symbol) `iso-2022-7bit' by default.
+	charsets.  Assigned the coding-system (Lisp symbol)
+	`iso-2022-7bit' by default.
+o coding-category-iso-7-tight
+	Same as coding-category-iso-7 except that this can
+	encode/decode only the specified charsets.
 o coding-category-iso-8-1
 	The category for a coding system which has the same code range
 	as ISO2022 of 8-bit environment and graphic plane 1 used only
 highest priority.  Priorities of categories are also specified by a
 user in a Lisp variable `coding-category-list'.
 */
-/* Detect how a text of length SRC_BYTES pointed by SRC is encoded.
+/* Detect how a text of length SRC_BYTES pointed by SOURCE is encoded.
 If it detects possible coding systems, return an integer in which
 appropriate flag bits are set.  Flag bits are defined by macros
-CODING_CATEGORY_MASK_XXX in `coding.h'.  */
+CODING_CATEGORY_MASK_XXX in `coding.h'.
-int
+How many ASCII characters are at the head is returned as *SKIP.  */
-detect_coding_mask (src, src_bytes)
-unsigned char *src;
+static int
-int src_bytes;
+detect_coding_mask (source, src_bytes, priorities, skip)
+unsigned char *source;
+int src_bytes, *priorities, *skip;
 {
 register unsigned char c;
-unsigned char *src_end = src + src_bytes;
+unsigned char *src = source, *src_end = source + src_bytes;
-int mask;
+unsigned int mask = (CODING_CATEGORY_MASK_ISO_7BIT
+		       | CODING_CATEGORY_MASK_ISO_SHIFT);
+int i;
 /* At first, skip all ASCII characters and control characters except
 for three ISO2022 specific control characters.  */
 label_loop_detect_coding:
 while (src < src_end)
 {
 c = *src;
 if (c >= 0x80
-	  || (c == ISO_CODE_ESC || c == ISO_CODE_SI || c == ISO_CODE_SO))
+	  || ((mask & CODING_CATEGORY_MASK_ISO_7BIT)
+	      && c == ISO_CODE_ESC)
+	  || ((mask & CODING_CATEGORY_MASK_ISO_SHIFT)
+	      && (c == ISO_CODE_SI || c == ISO_CODE_SO)))
 	break;
 src++;
 }
+*skip = src - source;
 if (src >= src_end)
 /* We found nothing other than ASCII.  There's nothing to do.  */
-return CODING_CATEGORY_MASK_ANY;
+return 0;
 /* The text seems to be encoded in some multilingual coding system.
 Now, try to find in which coding system the text is encoded.  */
 if (c < 0x80)
 {
 /* i.e. (c == ISO_CODE_ESC || c == ISO_CODE_SI || c == ISO_CODE_SO) */
 /* C is an ISO2022 specific control code of C0.  */
 mask = detect_coding_iso2022 (src, src_end);
-src++;
 if (mask == 0)
-	/* No valid ISO2022 code follows C.  Try again.  */
+	{
-	goto label_loop_detect_coding;
+	  /* No valid ISO2022 code follows C.  Try again.  */
-mask |= CODING_CATEGORY_MASK_RAW_TEXT;
+	  src++;
-}
+	  mask = (c != ISO_CODE_ESC
-else if (c < 0xA0)
+		  ? CODING_CATEGORY_MASK_ISO_7BIT
-{
+		  : CODING_CATEGORY_MASK_ISO_SHIFT);
-/* If C is a special latin extra code,
+	  goto label_loop_detect_coding;
-	 or is an ISO2022 specific control code of C1 (SS2 or SS3),
+	}
-	 or is an ISO2022 control-sequence-introducer (CSI),
+if (priorities)
-	 we should also consider the possibility of ISO2022 codings.  */
+	goto label_return_highest_only;
-if ((VECTORP (Vlatin_extra_code_table)
+}
-	   && !NILP (XVECTOR (Vlatin_extra_code_table)->contents[c]))
+else
-	  || (c == ISO_CODE_SS2 || c == ISO_CODE_SS3)
+{
-	  || (c == ISO_CODE_CSI
+int try;
-	      && (src < src_end
-		  && (*src == ']'
+if (c < 0xA0)
-		      || (src + 1 < src_end
+	{
-			  && src[1] == ']'
+	  /* C is the first byte of SJIS character code,
-			  && (*src == '0' || *src == '1' || *src == '2'))))))
+	     or a leading-code of Emacs' internal format (emacs-mule).  */
-	mask = (detect_coding_iso2022 (src, src_end)
+	  try = CODING_CATEGORY_MASK_SJIS | CODING_CATEGORY_MASK_EMACS_MULE;
-		| detect_coding_sjis (src, src_end)
-		| detect_coding_emacs_mule (src, src_end)
+	  /* Or, if C is a special latin extra code,
-		| CODING_CATEGORY_MASK_RAW_TEXT);
+	     or is an ISO2022 specific control code of C1 (SS2 or SS3),
+	     or is an ISO2022 control-sequence-introducer (CSI),
+	     we should also consider the possibility of ISO2022 codings.  */
+	  if ((VECTORP (Vlatin_extra_code_table)
+	       && !NILP (XVECTOR (Vlatin_extra_code_table)->contents[c]))
+	      || (c == ISO_CODE_SS2 || c == ISO_CODE_SS3)
+	      || (c == ISO_CODE_CSI
+		  && (src < src_end
+		      && (*src == ']'
+			  || ((*src == '0' || *src == '1' || *src == '2')
+			      && src + 1 < src_end
+			      && src[1] == ']')))))
+	    try |= (CODING_CATEGORY_MASK_ISO_8_ELSE
+		     | CODING_CATEGORY_MASK_ISO_8BIT);
+	}
 else
-	/* C is the first byte of SJIS character code,
+	/* C is a character of ISO2022 in graphic plane right,
-	   or a leading-code of Emacs' internal format (emacs-mule).  */
+	   or a SJIS's 1-byte character code (i.e. JISX0201),
-	mask = (detect_coding_sjis (src, src_end)
+	   or the first byte of BIG5's 2-byte code.  */
-		| detect_coding_emacs_mule (src, src_end)
+	try = (CODING_CATEGORY_MASK_ISO_8_ELSE
-		| CODING_CATEGORY_MASK_RAW_TEXT);
+		| CODING_CATEGORY_MASK_ISO_8BIT
-}
+		| CODING_CATEGORY_MASK_SJIS
-else
+		| CODING_CATEGORY_MASK_BIG5);
-/* C is a character of ISO2022 in graphic plane right,
-or a SJIS's 1-byte character code (i.e. JISX0201),
+mask = 0;
-or the first byte of BIG5's 2-byte code.  */
+if (priorities)
-mask = (detect_coding_iso2022 (src, src_end)
+	{
-	    | detect_coding_sjis (src, src_end)
+	  for (i = 0; i < CODING_CATEGORY_IDX_MAX; i++)
-	    | detect_coding_big5 (src, src_end)
+	    {
-	    | CODING_CATEGORY_MASK_RAW_TEXT);
+	      priorities[i] &= try;
+	      if (priorities[i] & CODING_CATEGORY_MASK_ISO)
-return mask;
+		mask = detect_coding_iso2022 (src, src_end);
+	      else if (priorities[i] & CODING_CATEGORY_MASK_SJIS)
+		mask = detect_coding_sjis (src, src_end);
+	      else if (priorities[i] & CODING_CATEGORY_MASK_BIG5)
+		mask = detect_coding_big5 (src, src_end);
+	      else if (priorities[i] & CODING_CATEGORY_MASK_EMACS_MULE)
+		mask = detect_coding_emacs_mule (src, src_end);
+	      if (mask)
+		goto label_return_highest_only;
+	    }
+	  return CODING_CATEGORY_MASK_RAW_TEXT;
+	}
+if (try & CODING_CATEGORY_MASK_ISO)
+	mask |= detect_coding_iso2022 (src, src_end);
+if (try & CODING_CATEGORY_MASK_SJIS)
+	mask |= detect_coding_sjis (src, src_end);
+if (try & CODING_CATEGORY_MASK_BIG5)
+	mask |= detect_coding_big5 (src, src_end);
+if (try & CODING_CATEGORY_MASK_EMACS_MULE)
+	mask |= detect_coding_emacs_mule (src, src_end);
+}
+return (mask | CODING_CATEGORY_MASK_RAW_TEXT);
+label_return_highest_only:
+for (i = 0; i < CODING_CATEGORY_IDX_MAX; i++)
+{
+if (mask & priorities[i])
+	return priorities[i];
+}
+return CODING_CATEGORY_MASK_RAW_TEXT;
 }
 /* Detect how a text of length SRC_BYTES pointed by SRC is encoded.
 The information of the detected coding system is set in CODING.  */
 detect_coding (coding, src, src_bytes)
 struct coding_system *coding;
 unsigned char *src;
 int src_bytes;
 {
-int mask = detect_coding_mask (src, src_bytes);
+unsigned int idx;
-int idx;
+int skip, mask, i;
+int priorities[CODING_CATEGORY_IDX_MAX];
 Lisp_Object val = Vcoding_category_list;
-if (mask == CODING_CATEGORY_MASK_ANY)
+i = 0;
-/* We found nothing other than ASCII.  There's nothing to do.  */
+while (CONSP (val) && i < CODING_CATEGORY_IDX_MAX)
-return;
+{
+if (! SYMBOLP (XCONS (val)->car))
-/* We found some plausible coding systems.  Let's use a coding
+	break;
-system of the highest priority.  */
+idx = XFASTINT (Fget (XCONS (val)->car, Qcoding_category_index));
+if (idx >= CODING_CATEGORY_IDX_MAX)
-if (CONSP (val))
+	break;
-while (!NILP (val))
+priorities[i++] = (1 << idx);
-{
+val = XCONS (val)->cdr;
-	idx = XFASTINT (Fget (XCONS (val)->car, Qcoding_category_index));
+}
-	if ((idx < CODING_CATEGORY_IDX_MAX) && (mask & (1 << idx)))
+/* If coding-category-list is valid and contains all coding
-	  break;
+categories, `i' should be CODING_CATEGORY_IDX_MAX now.  If not,
-	val = XCONS (val)->cdr;
+the following code saves Emacs from craching.  */
-}
+while (i < CODING_CATEGORY_IDX_MAX)
-else
+priorities[i++] = CODING_CATEGORY_MASK_RAW_TEXT;
-val = Qnil;
+mask = detect_coding_mask (src, src_bytes, priorities, &skip);
-if (NILP (val))
+coding->heading_ascii = skip;
-{
-/* For unknown reason, `Vcoding_category_list' contains none of
+if (!mask) return;
-	 found categories.  Let's use any of them.  */
-for (idx = 0; idx < CODING_CATEGORY_IDX_MAX; idx++)
+/* We found a single coding system of the highest priority in MASK.  */
-	if (mask & (1 << idx))
+idx = 0;
-	  break;
+while (mask && ! (mask & 1)) mask >>= 1, idx++;
-}
+if (! mask)
-setup_coding_system (XSYMBOL (coding_category_table[idx])->value, coding);
+idx = CODING_CATEGORY_IDX_RAW_TEXT;
-}
+val = XSYMBOL (XVECTOR (Vcoding_category_table)->contents[idx])->value;
-/* Detect how end-of-line of a text of length SRC_BYTES pointed by SRC
-is encoded.  Return one of CODING_EOL_LF, CODING_EOL_CRLF,
+if (coding->eol_type != CODING_EOL_UNDECIDED)
-CODING_EOL_CR, and CODING_EOL_UNDECIDED.  */
+{
+Lisp_Object tmp = Fget (val, Qeol_type);
+if (VECTORP (tmp))
+	val = XVECTOR (tmp)->contents[coding->eol_type];
+}
+setup_coding_system (val, coding);
+/* Set this again because setup_coding_system reset this member.  */
+coding->heading_ascii = skip;
+}
+/* Detect how end-of-line of a text of length SRC_BYTES pointed by
+SOURCE is encoded.  Return one of CODING_EOL_LF, CODING_EOL_CRLF,
+CODING_EOL_CR, and CODING_EOL_UNDECIDED.
+How many non-eol characters are at the head is returned as *SKIP.  */
 #define MAX_EOL_CHECK_COUNT 3
-int
+static int
-detect_eol_type (src, src_bytes)
+detect_eol_type (source, src_bytes, skip)
-unsigned char *src;
+unsigned char *source;
-int src_bytes;
+int src_bytes, *skip;
 {
-unsigned char *src_end = src + src_bytes;
+unsigned char *src = source, *src_end = src + src_bytes;
 unsigned char c;
 int total = 0;		/* How many end-of-lines are found so far.  */
 int eol_type = CODING_EOL_UNDECIDED;
 int this_eol_type;
+*skip = 0;
 while (src < src_end && total < MAX_EOL_CHECK_COUNT)
 {
 c = *src++;
 if (c == '\n' || c == '\r')
 	{
+	  if (*skip == 0)
+	    *skip = src - 1 - source;
 	  total++;
 	  if (c == '\n')
 	    this_eol_type = CODING_EOL_LF;
 	  else if (src >= src_end || *src != '\n')
 	    this_eol_type = CODING_EOL_CR;
 	  if (eol_type == CODING_EOL_UNDECIDED)
 	    /* This is the first end-of-line.  */
 	    eol_type = this_eol_type;
 	  else if (eol_type != this_eol_type)
-	    /* The found type is different from what found before.
+	    {
-	       Let's notice the caller about this inconsistency.  */
+	      /* The found type is different from what found before.  */
-	    return CODING_EOL_INCONSISTENT;
+	      eol_type = CODING_EOL_INCONSISTENT;
-	}
+	      break;
-}
+	    }
+	}
+}
+if (*skip == 0)
+*skip = src_end - source;
 return eol_type;
 }
 /* Detect how end-of-line of a text of length SRC_BYTES pointed by SRC
 is encoded.  If it detects an appropriate format of end-of-line, it
 struct coding_system *coding;
 unsigned char *src;
 int src_bytes;
 {
 Lisp_Object val;
-int eol_type = detect_eol_type (src, src_bytes);
+int skip;
+int eol_type = detect_eol_type (src, src_bytes, &skip);
+if (coding->heading_ascii > skip)
+coding->heading_ascii = skip;
+else
+skip = coding->heading_ascii;
 if (eol_type == CODING_EOL_UNDECIDED)
-/*  We found no end-of-line in the source text.  */
 return;
 if (eol_type == CODING_EOL_INCONSISTENT)
 {
 #if 0
 /* This code is suppressed until we find a better way to
 	 distinguish raw text file and binary file.  */
 eol_type = CODING_EOL_LF;
 }
 val = Fget (coding->symbol, Qeol_type);
 if (VECTORP (val) && XVECTOR (val)->size == 3)
-setup_coding_system (XVECTOR (val)->contents[eol_type], coding);
+{
+setup_coding_system (XVECTOR (val)->contents[eol_type], coding);
+coding->heading_ascii = skip;
+}
+}
+#define CONVERSION_BUFFER_EXTRA_ROOM 256
+#define DECODING_BUFFER_MAG(coding)					     \
+(coding->type == coding_type_iso2022					     \
+? 3									     \
+: ((coding->type == coding_type_sjis || coding->type == coding_type_big5) \
+? 2								     \
+: (coding->type == coding_type_raw_text				     \
+	 ? 1								     \
+	 : (coding->type == coding_type_ccl				     \
+	    ? coding->spec.ccl.decoder.buf_magnification		     \
+	    : 2))))
+/* Return maximum size (bytes) of a buffer enough for decoding
+SRC_BYTES of text encoded in CODING.  */
+int
+decoding_buffer_size (coding, src_bytes)
+struct coding_system *coding;
+int src_bytes;
+{
+return (src_bytes * DECODING_BUFFER_MAG (coding)
+	  + CONVERSION_BUFFER_EXTRA_ROOM);
+}
+/* Return maximum size (bytes) of a buffer enough for encoding
+SRC_BYTES of text to CODING.  */
+int
+encoding_buffer_size (coding, src_bytes)
+struct coding_system *coding;
+int src_bytes;
+{
+int magnification;
+if (coding->type == coding_type_ccl)
+magnification = coding->spec.ccl.encoder.buf_magnification;
+else
+magnification = 3;
+return (src_bytes * magnification + CONVERSION_BUFFER_EXTRA_ROOM);
+}
+#ifndef MINIMUM_CONVERSION_BUFFER_SIZE
+#define MINIMUM_CONVERSION_BUFFER_SIZE 1024
+#endif
+char *conversion_buffer;
+int conversion_buffer_size;
+/* Return a pointer to a SIZE bytes of buffer to be used for encoding
+or decoding.  Sufficient memory is allocated automatically.  If we
+run out of memory, return NULL.  */
+char *
+get_conversion_buffer (size)
+int size;
+{
+if (size > conversion_buffer_size)
+{
+char *buf;
+int real_size = conversion_buffer_size * 2;
+while (real_size < size) real_size *= 2;
+buf = (char *) xmalloc (real_size);
+xfree (conversion_buffer);
+conversion_buffer = buf;
+conversion_buffer_size = real_size;
+}
+return conversion_buffer;
+}
+int
+ccl_coding_driver (coding, source, destination, src_bytes, dst_bytes, encodep)
+struct coding_system *coding;
+unsigned char *source, *destination;
+int src_bytes, dst_bytes, encodep;
+{
+struct ccl_program *ccl
+= encodep ? &coding->spec.ccl.encoder : &coding->spec.ccl.decoder;
+int result;
+coding->produced = ccl_driver (ccl, source, destination,
+				 src_bytes, dst_bytes, &(coding->consumed));
+if (encodep)
+{
+coding->produced_char = coding->produced;
+coding->consumed_char
+	= multibyte_chars_in_text (source, coding->consumed);
+}
+else
+{
+coding->produced_char
+	= multibyte_chars_in_text (destination, coding->produced);
+coding->consumed_char = coding->consumed;
+}
+switch (ccl->status)
+{
+case CCL_STAT_SUSPEND_BY_SRC:
+result = CODING_FINISH_INSUFFICIENT_SRC;
+break;
+case CCL_STAT_SUSPEND_BY_DST:
+result = CODING_FINISH_INSUFFICIENT_DST;
+break;
+default:
+result = CODING_FINISH_NORMAL;
+break;
+}
+return result;
 }
 /* See "GENERAL NOTES about `decode_coding_XXX ()' functions".  Before
 decoding, it may detect coding system and format of end-of-line if
 those are not yet decided.  */
 int
-decode_coding (coding, source, destination, src_bytes, dst_bytes, consumed)
+decode_coding (coding, source, destination, src_bytes, dst_bytes)
 struct coding_system *coding;
 unsigned char *source, *destination;
 int src_bytes, dst_bytes;
-int *consumed;
+{
-{
+int result;
-int produced;
 if (src_bytes <= 0)
 {
-*consumed = 0;
+coding->produced = coding->produced_char = 0;
-return 0;
+coding->consumed = coding->consumed_char = 0;
+return CODING_FINISH_NORMAL;
 }
 if (coding->type == coding_type_undecided)
 detect_coding (coding, source, src_bytes);
 if (coding->eol_type == CODING_EOL_UNDECIDED)
 detect_eol (coding, source, src_bytes);
-coding->carryover_size = 0;
 switch (coding->type)
 {
-case coding_type_no_conversion:
-label_no_conversion:
-produced = (src_bytes > dst_bytes) ? dst_bytes : src_bytes;
-bcopy (source, destination, produced);
-*consumed = produced;
-break;
 case coding_type_emacs_mule:
 case coding_type_undecided:
 case coding_type_raw_text:
 if (coding->eol_type == CODING_EOL_LF
 	  ||  coding->eol_type == CODING_EOL_UNDECIDED)
 	goto label_no_conversion;
-produced = decode_eol (coding, source, destination,
+result = decode_eol (coding, source, destination, src_bytes, dst_bytes);
-			     src_bytes, dst_bytes, consumed);
 break;
 case coding_type_sjis:
-produced = decode_coding_sjis_big5 (coding, source, destination,
+result = decode_coding_sjis_big5 (coding, source, destination,
-					  src_bytes, dst_bytes, consumed,
+					src_bytes, dst_bytes, 1);
-					  1);
 break;
 case coding_type_iso2022:
-produced = decode_coding_iso2022 (coding, source, destination,
+result = decode_coding_iso2022 (coding, source, destination,
-					src_bytes, dst_bytes, consumed);
+				      src_bytes, dst_bytes);
 break;
 case coding_type_big5:
-produced = decode_coding_sjis_big5 (coding, source, destination,
+result = decode_coding_sjis_big5 (coding, source, destination,
-					  src_bytes, dst_bytes, consumed,
+					src_bytes, dst_bytes, 0);
-					  0);
 break;
 case coding_type_ccl:
-produced = ccl_driver (&coding->spec.ccl.decoder, source, destination,
+result = ccl_coding_driver (coding, source, destination,
-			     src_bytes, dst_bytes, consumed);
+				  src_bytes, dst_bytes, 0);
 break;
-}
+default:			/* i.e. case coding_type_no_conversion: */
-return produced;
+label_no_conversion:
+if (dst_bytes && src_bytes > dst_bytes)
+	{
+	  coding->produced = dst_bytes;
+	  result = CODING_FINISH_INSUFFICIENT_DST;
+	}
+else
+	{
+	  coding->produced = src_bytes;
+	  result = CODING_FINISH_NORMAL;
+	}
+if (dst_bytes)
+	bcopy (source, destination, coding->produced);
+else
+	safe_bcopy (source, destination, coding->produced);
+coding->consumed
+	= coding->consumed_char = coding->produced_char = coding->produced;
+break;
+}
+return result;
 }
 /* See "GENERAL NOTES about `encode_coding_XXX ()' functions".  */
 int
-encode_coding (coding, source, destination, src_bytes, dst_bytes, consumed)
+encode_coding (coding, source, destination, src_bytes, dst_bytes)
 struct coding_system *coding;
 unsigned char *source, *destination;
 int src_bytes, dst_bytes;
-int *consumed;
+{
-{
+int result;
-int produced;
+if (src_bytes <= 0)
+{
+coding->produced = coding->produced_char = 0;
+coding->consumed = coding->consumed_char = 0;
+return CODING_FINISH_NORMAL;
+}
 switch (coding->type)
 {
-case coding_type_no_conversion:
-label_no_conversion:
-produced = (src_bytes > dst_bytes) ? dst_bytes : src_bytes;
-if (produced > 0)
-	{
-	  bcopy (source, destination, produced);
-	  if (coding->selective)
-	    {
-	      unsigned char *p = destination, *pend = destination + produced;
-	      while (p < pend)
-		if (*p++ == '\015') p[-1] = '\n';
-	    }
-	}
-*consumed = produced;
-break;
 case coding_type_emacs_mule:
 case coding_type_undecided:
 case coding_type_raw_text:
 if (coding->eol_type == CODING_EOL_LF
 	  ||  coding->eol_type == CODING_EOL_UNDECIDED)
 	goto label_no_conversion;
-produced = encode_eol (coding, source, destination,
+result = encode_eol (coding, source, destination, src_bytes, dst_bytes);
-			     src_bytes, dst_bytes, consumed);
 break;
 case coding_type_sjis:
-produced = encode_coding_sjis_big5 (coding, source, destination,
+result = encode_coding_sjis_big5 (coding, source, destination,
-					  src_bytes, dst_bytes, consumed,
+					src_bytes, dst_bytes, 1);
-					  1);
 break;
 case coding_type_iso2022:
-produced = encode_coding_iso2022 (coding, source, destination,
+result = encode_coding_iso2022 (coding, source, destination,
-					src_bytes, dst_bytes, consumed);
+				      src_bytes, dst_bytes);
 break;
 case coding_type_big5:
-produced = encode_coding_sjis_big5 (coding, source, destination,
+result = encode_coding_sjis_big5 (coding, source, destination,
-					  src_bytes, dst_bytes, consumed,
+					src_bytes, dst_bytes, 0);
-					  0);
 break;
 case coding_type_ccl:
-produced = ccl_driver (&coding->spec.ccl.encoder, source, destination,
+result = ccl_coding_driver (coding, source, destination,
-			     src_bytes, dst_bytes, consumed);
+				  src_bytes, dst_bytes, 1);
 break;
-}
+default:			/* i.e. case coding_type_no_conversion: */
-return produced;
+label_no_conversion:
-}
+if (dst_bytes && src_bytes > dst_bytes)
+	{
-#define CONVERSION_BUFFER_EXTRA_ROOM 256
+	  coding->produced = dst_bytes;
+	  result = CODING_FINISH_INSUFFICIENT_DST;
-/* Return maximum size (bytes) of a buffer enough for decoding
+	}
-SRC_BYTES of text encoded in CODING.  */
+else
+	{
+	  coding->produced = src_bytes;
+	  result = CODING_FINISH_NORMAL;
+	}
+if (dst_bytes)
+	bcopy (source, destination, coding->produced);
+else
+	safe_bcopy (source, destination, coding->produced);
+if (coding->mode & CODING_MODE_SELECTIVE_DISPLAY)
+	{
+	  unsigned char *p = destination, *pend = p + coding->produced;
+	  while (p < pend)
+	    if (*p++ == '\015') p[-1] = '\n';
+	}
+coding->consumed
+	= coding->consumed_char = coding->produced_char = coding->produced;
+break;
+}
+return result;
+}
+/* Scan text in the region between *BEG and *END, skip characters
+which we don't have to decode by coding system CODING at the head
+and tail, then set *BEG and *END to the region of the text we
+actually have to convert.
+If STR is not NULL, *BEG and *END are indices into STR.  */
+static void
+shrink_decoding_region (beg, end, coding, str)
+int *beg, *end;
+struct coding_system *coding;
+unsigned char *str;
+{
+unsigned char *begp_orig, *begp, *endp_orig, *endp;
+int eol_conversion;
+if (coding->type == coding_type_ccl
+|| coding->type == coding_type_undecided
+|| !NILP (coding->post_read_conversion))
+{
+/* We can't skip any data.  */
+return;
+}
+else if (coding->type == coding_type_no_conversion)
+{
+/* We need no conversion.  */
+*beg = *end;
+return;
+}
+if (coding->heading_ascii >= 0)
+/* Detection routine has already found how much we can skip at the
+head.  */
+*beg += coding->heading_ascii;
+if (str)
+{
+begp_orig = begp = str + *beg;
+endp_orig = endp = str + *end;
+}
+else
+{
+move_gap (*beg);
+begp_orig = begp = GAP_END_ADDR;
+endp_orig = endp = begp + *end - *beg;
+}
+eol_conversion = (coding->eol_type != CODING_EOL_LF);
+switch (coding->type)
+{
+case coding_type_emacs_mule:
+case coding_type_raw_text:
+if (eol_conversion)
+	{
+	  if (coding->heading_ascii < 0)
+	    while (begp < endp && *begp != '\r') begp++;
+	  while (begp < endp && *(endp - 1) != '\r') endp--;
+	}
+else
+	begp = endp;
+break;
+case coding_type_sjis:
+case coding_type_big5:
+/* We can skip all ASCII characters at the head.  */
+if (coding->heading_ascii < 0)
+	{
+	  if (eol_conversion)
+	    while (begp < endp && *begp < 0x80 && *begp != '\n') begp++;
+	  else
+	    while (begp < endp && *begp < 0x80) begp++;
+	}
+/* We can skip all ASCII characters at the tail except for the
+	 second byte of SJIS or BIG5 code.  */
+if (eol_conversion)
+	while (begp < endp && endp[-1] < 0x80 && endp[-1] != '\n') endp--;
+else
+	while (begp < endp && endp[-1] < 0x80) endp--;
+if (begp < endp && endp < endp_orig && endp[-1] >= 0x80)
+	endp++;
+break;
+default:		/* i.e. case coding_type_iso2022: */
+if (coding->heading_ascii < 0)
+	{
+	  unsigned char c;
+	  /* We can skip all ASCII characters at the head except for a
+	     few control codes.  */
+	  while (begp < endp && (c = *begp) < 0x80
+		 && c != ISO_CODE_CR && c != ISO_CODE_SO
+		 && c != ISO_CODE_SI && c != ISO_CODE_ESC
+		 && (!eol_conversion || c != ISO_CODE_LF))
+	    begp++;
+	}
+switch (coding->category_idx)
+	{
+	case CODING_CATEGORY_IDX_ISO_8_1:
+	case CODING_CATEGORY_IDX_ISO_8_2:
+	  /* We can skip all ASCII characters at the tail.  */
+	  if (eol_conversion)
+	    while (begp < endp && endp[-1] < 0x80 && endp[-1] != '\n') endp--;
+	  else
+	    while (begp < endp && endp[-1] < 0x80) endp--;
+	  break;
+	case CODING_CATEGORY_IDX_ISO_7:
+	case CODING_CATEGORY_IDX_ISO_7_TIGHT:
+	  /* We can skip all charactes at the tail except for ESC and
+the following 2-byte at the tail.  */
+	  if (eol_conversion)
+	    while (begp < endp && endp[-1] != ISO_CODE_ESC && endp[-1] != '\n')
+	      endp--;
+	  else
+	    while (begp < endp && endp[-1] != ISO_CODE_ESC)
+	      endp--;
+	  if (begp < endp && endp[-1] == ISO_CODE_ESC)
+	    {
+	      if (endp + 1 < endp_orig && end[0] == '(' && end[1] == 'B')
+		/* This is an ASCII designation sequence.  We can
+surely skip the tail.  */
+		endp += 2;
+	      else
+		/* Hmmm, we can't skip the tail.  */
+		endp = endp_orig;
+	    }
+	}
+}
+*beg += begp - begp_orig;
+*end += endp - endp_orig;
+return;
+}
+/* Like shrink_decoding_region but for encoding.  */
+static void
+shrink_encoding_region (beg, end, coding, str)
+int *beg, *end;
+struct coding_system *coding;
+unsigned char *str;
+{
+unsigned char *begp_orig, *begp, *endp_orig, *endp;
+int eol_conversion;
+if (coding->type == coding_type_ccl)
+/* We can't skip any data.  */
+return;
+else if (coding->type == coding_type_no_conversion)
+{
+/* We need no conversion.  */
+*beg = *end;
+return;
+}
+if (str)
+{
+begp_orig = begp = str + *beg;
+endp_orig = endp = str + *end;
+}
+else
+{
+move_gap (*beg);
+begp_orig = begp = GAP_END_ADDR;
+endp_orig = endp = begp + *end - *beg;
+}
+eol_conversion = (coding->eol_type == CODING_EOL_CR
+		    || coding->eol_type == CODING_EOL_CRLF);
+/* Here, we don't have to check coding->pre_write_conversion because
+the caller is expected to have handled it already.  */
+switch (coding->type)
+{
+case coding_type_undecided:
+case coding_type_emacs_mule:
+case coding_type_raw_text:
+if (eol_conversion)
+	{
+	  while (begp < endp && *begp != '\n') begp++;
+	  while (begp < endp && endp[-1] != '\n') endp--;
+	}
+else
+	begp = endp;
+break;
+case coding_type_iso2022:
+if (coding->flags & CODING_FLAG_ISO_DESIGNATE_AT_BOL)
+	{
+	  unsigned char *bol = begp;
+	  while (begp < endp && *begp < 0x80)
+	    {
+	      begp++;
+	      if (begp[-1] == '\n')
+		bol = begp;
+	    }
+	  begp = bol;
+	  goto label_skip_tail;
+	}
+/* fall down ... */
+default:
+/* We can skip all ASCII characters at the head and tail.  */
+if (eol_conversion)
+	while (begp < endp && *begp < 0x80 && *begp != '\n') begp++;
+else
+	while (begp < endp && *begp < 0x80) begp++;
+label_skip_tail:
+if (eol_conversion)
+	while (begp < endp && endp[-1] < 0x80 && endp[-1] != '\n') endp--;
+else
+	while (begp < endp && *(endp - 1) < 0x80) endp--;
+break;
+}
+*beg += begp - begp_orig;
+*end += endp - endp_orig;
+return;
+}
+/* Decode (if ENCODEP is zero) or encode (if ENCODEP is nonzero) the
+text from FROM to TO by coding system CODING, and return number of
+characters in the resulting text.
+If ADJUST is nonzero, we do various things as if the original text
+is deleted and a new text is inserted.  See the comments in
+replace_range (insdel.c) to know what we are doing.
+ADJUST nonzero also means that post-read-conversion or
+pre-write-conversion functions (if any) should be processed.  */
 int
-decoding_buffer_size (coding, src_bytes)
+code_convert_region (from, to, coding, encodep, adjust)
+int from, to, encodep, adjust;
 struct coding_system *coding;
-int src_bytes;
+{
-{
+int len = to - from, require, inserted, inserted_byte;
-int magnification;
+int from_byte, to_byte, len_byte;
+int from_byte_orig, to_byte_orig;
-if (coding->type == coding_type_iso2022)
+Lisp_Object saved_coding_symbol = Qnil;
-magnification = 3;
-else if (coding->type == coding_type_ccl)
+if (adjust)
-magnification = coding->spec.ccl.decoder.buf_magnification;
+{
+prepare_to_modify_buffer (from, to, &from);
+to = from + len;
+}
+from_byte = CHAR_TO_BYTE (from); to_byte = CHAR_TO_BYTE (to);
+len_byte = from_byte - to_byte;
+if (! encodep && CODING_REQUIRE_DETECTION (coding))
+{
+/* We must detect encoding of text and eol.  Even if detection
+routines can't decide the encoding, we should not let them
+undecided because the deeper decoding routine (decode_coding)
+tries to detect the encodings in vain in that case.  */
+if (from < GPT && to > GPT)
+	move_gap_both (from, from_byte);
+if (coding->type == coding_type_undecided)
+	{
+	  detect_coding (coding, BYTE_POS_ADDR (from), len);
+	  if (coding->type == coding_type_undecided)
+	    coding->type = coding_type_emacs_mule;
+	}
+if (coding->eol_type == CODING_EOL_UNDECIDED)
+	{
+	  saved_coding_symbol = coding->symbol;
+	  detect_eol (coding, BYTE_POS_ADDR (from_byte), len_byte);
+	  if (coding->eol_type == CODING_EOL_UNDECIDED)
+	    coding->eol_type = CODING_EOL_LF;
+	  /* We had better recover the original eol format if we
+	     encounter an inconsitent eol format while decoding.  */
+	  coding->mode |= CODING_MODE_INHIBIT_INCONSISTENT_EOL;
+	}
+}
+if (encodep
+? ! CODING_REQUIRE_ENCODING (coding)
+: ! CODING_REQUIRE_DECODING (coding))
+return len;
+/* Now we convert the text.  */
+/* For encoding, we must process pre-write-conversion in advance.  */
+if (encodep
+&& adjust
+&& ! NILP (coding->pre_write_conversion)
+&& SYMBOLP (coding->pre_write_conversion)
+&& ! NILP (Ffboundp (coding->pre_write_conversion)))
+{
+/* The function in pre-write-conversion put a new text in a new
+buffer.  */
+struct buffer *prev = current_buffer, *new;
+call2 (coding->pre_write_conversion, from, to);
+if (current_buffer != prev)
+	{
+	  len = ZV - BEGV;
+	  new = current_buffer;
+	  set_buffer_internal_1 (prev);
+	  del_range (from, to);
+	  insert_from_buffer (new, BEG, len, 0);
+	  to = from + len;
+	  to_byte = CHAR_TO_BYTE (to);
+	  len_byte = to_byte - from_byte;
+	}
+}
+/* Try to skip the heading and tailing ASCIIs.  */
+from_byte_orig = from_byte; to_byte_orig = to_byte;
+if (encodep)
+shrink_encoding_region (&from_byte, &to_byte, coding, NULL);
 else
-magnification = 2;
+shrink_decoding_region (&from_byte, &to_byte, coding, NULL);
+if (from_byte == to_byte)
-return (src_bytes * magnification + CONVERSION_BUFFER_EXTRA_ROOM);
+return len;
-}
+/* Here, the excluded region by shrinking contains only ASCIIs.  */
+from += (from_byte - from_byte_orig);
-/* Return maximum size (bytes) of a buffer enough for encoding
+to += (to_byte - to_byte_orig);
-SRC_BYTES of text to CODING.  */
+len = to - from;
+len_byte = to_byte - from_byte;
-int
-encoding_buffer_size (coding, src_bytes)
+/* For converion, we must put the gap before the text to be decoded
+in addition to make the gap larger for efficient decoding.  The
+required gap size starts from 2000 which is the magic number used
+in make_gap.  But, after one batch of conversion, it will be
+incremented if we find that it is not enough .  */
+require = 2000;
+if (GAP_SIZE  < require)
+make_gap (require - GAP_SIZE);
+move_gap_both (from, from_byte);
+if (adjust)
+adjust_before_replace (from, from_byte, to, to_byte);
+if (GPT - BEG < beg_unchanged)
+beg_unchanged = GPT - BEG;
+if (Z - GPT < end_unchanged)
+end_unchanged = Z - GPT;
+inserted = inserted_byte = 0;
+for (;;)
+{
+int result, diff_char, diff_byte;
+/* The buffer memory is changed from:
+	 +--------+converted-text+------------+-----original-text-----+---+
+	 |<-from->|<--inserted-->|<-GAP_SIZE->|<---------len--------->|---|  */
+if (encodep)
+	result = encode_coding (coding, GAP_END_ADDR, GPT_ADDR, len_byte, 0);
+else
+	result = decode_coding (coding, GAP_END_ADDR, GPT_ADDR, len_byte, 0);
+/* to:
+	 +--------+-------converted-text--------+--+---original-text--+---+
+	 |<-from->|<----(inserted+produced)---->|--|<-(len-consumed)->|---|  */
+diff_char = coding->produced_char - coding->consumed_char;
+diff_byte = coding->produced - coding->consumed;
+GAP_SIZE -= diff_byte;
+ZV += diff_char; ZV_BYTE += diff_byte;
+Z += diff_char; Z_BYTE += diff_byte;
+GPT += coding->produced_char; GPT_BYTE += coding->produced;
+inserted += coding->produced_char;
+inserted_byte += coding->produced;
+len -= coding->consumed_char;
+len_byte -= coding->consumed;
+if (! encodep && result == CODING_FINISH_INCONSISTENT_EOL)
+	{
+	  unsigned char *p = GPT_ADDR - inserted_byte, *pend = GPT_ADDR;
+	  /* Encode LFs back to the original eol format (CR or CRLF).  */
+	  if (coding->eol_type == CODING_EOL_CR)
+	    {
+	      while (p < pend) if (*p++ == '\n') p[-1] = '\r';
+	    }
+	  else
+	    {
+	      unsigned char *p2 = p;
+	      int count = 0;
+	      while (p2 < pend) if (*p2++ == '\n') count++;
+	      if (GAP_SIZE < count)
+		make_gap (count - GAP_SIZE);
+	      p2 = GPT_ADDR + count;
+	      while (p < pend)
+		{
+		  *--p2 = *--pend;
+		  if (*pend == '\n') *--p2 = '\r';
+		}
+	      GPT += count; GAP_SIZE -= count; ZV += count; Z += count;
+	      ZV_BYTE += count; Z_BYTE += count;
+	      coding->produced += count;
+	      coding->produced_char += count;
+	      inserted += count;
+	      inserted_byte += count;
+	    }
+	  /* Suppress eol-format conversion in the further conversion.  */
+	  coding->eol_type = CODING_EOL_LF;
+	  /* Restore the original symbol.  */
+	  coding->symbol = saved_coding_symbol;
+	}
+if (len_byte <= 0)
+	break;
+if (result == CODING_FINISH_INSUFFICIENT_SRC)
+	{
+	  /* The source text ends in invalid codes.  Let's just
+	     make them valid buffer contents, and finish conversion.  */
+	  inserted += len;
+	  inserted_byte += len_byte;
+	  break;
+	}
+if (inserted == coding->produced_char)
+	/* We have just done the first batch of conversion.  Let's
+	   reconsider the required gap size now.
+	   We have converted CONSUMED bytes into PRODUCED bytes.  To
+	   convert the remaining LEN bytes, we may need REQUIRE bytes
+	   of gap, where:
+	       REQUIRE + LEN = (LEN * PRODUCED / CONSUMED)
+	       REQUIRE = LEN * (PRODUCED - CONSUMED) / CONSUMED
+		       = LEN * DIFF / CONSUMED
+	   Here, we are sure that DIFF is positive.  */
+	require = len_byte * diff_byte / coding->consumed;
+if (GAP_SIZE  < require)
+	make_gap (require - GAP_SIZE);
+}
+if (GAP_SIZE > 0) *GPT_ADDR = 0; /* Put an anchor.  */
+if (adjust)
+{
+adjust_after_replace (from, from_byte, to, to_byte,
+			    inserted, inserted_byte);
+if (! encodep && ! NILP (coding->post_read_conversion))
+	{
+	  Lisp_Object val;
+	  int orig_inserted = inserted, pos = PT;
+	  temp_set_point_both (current_buffer, from, from_byte);
+	  val = call1 (coding->post_read_conversion, make_number (inserted));
+	  if (! NILP (val))
+	    {
+	      CHECK_NUMBER (val, 0);
+	      inserted = XFASTINT (val);
+	    }
+	  if (pos >= from + orig_inserted)
+	    temp_set_point (current_buffer, pos + (inserted - orig_inserted));
+	}
+}
+return ((from_byte - from_byte_orig) + inserted + (to_byte_orig - to_byte));
+}
+Lisp_Object
+code_convert_string (str, coding, encodep, nocopy)
+Lisp_Object str;
 struct coding_system *coding;
-int src_bytes;
+int encodep, nocopy;
 {
-int magnification;
+int len;
+char *buf;
-if (coding->type == coding_type_ccl)
+int from = 0, to = XSTRING (str)->size, to_byte = XSTRING (str)->size_byte;
-magnification = coding->spec.ccl.encoder.buf_magnification;
+struct gcpro gcpro1;
+Lisp_Object saved_coding_symbol = Qnil;
+int result;
+if (encodep && !NILP (coding->pre_write_conversion)
+|| !encodep && !NILP (coding->post_read_conversion))
+{
+/* Since we have to call Lisp functions which assume target text
+is in a buffer, after setting a temporary buffer, call
+code_convert_region.  */
+int count = specpdl_ptr - specpdl;
+struct buffer *prev = current_buffer;
+record_unwind_protect (Fset_buffer, Fcurrent_buffer ());
+temp_output_buffer_setup (" *code-converting-work*");
+set_buffer_internal (XBUFFER (Vstandard_output));
+if (encodep)
+	insert_from_string (str, 0, 0, to, to_byte, 0);
+else
+	{
+	  /* We must insert the contents of STR as is without
+unibyte<->multibyte conversion.  */
+	  current_buffer->enable_multibyte_characters = Qnil;
+	  insert_from_string (str, 0, 0, to_byte, to_byte, 0);
+	  current_buffer->enable_multibyte_characters = Qt;
+	}
+code_convert_region (BEGV, ZV, coding, encodep, 1);
+if (encodep)
+	/* We must return the buffer contents as unibyte string.  */
+	current_buffer->enable_multibyte_characters = Qnil;
+str = make_buffer_string (BEGV, ZV, 0);
+set_buffer_internal (prev);
+return unbind_to (count, str);
+}
+if (! encodep && CODING_REQUIRE_DETECTION (coding))
+{
+/* See the comments in code_convert_region.  */
+if (coding->type == coding_type_undecided)
+	{
+	  detect_coding (coding, XSTRING (str)->data, to_byte);
+	  if (coding->type == coding_type_undecided)
+	    coding->type = coding_type_emacs_mule;
+	}
+if (coding->eol_type == CODING_EOL_UNDECIDED)
+	{
+	  saved_coding_symbol = coding->symbol;
+	  detect_eol (coding, XSTRING (str)->data, to_byte);
+	  if (coding->eol_type == CODING_EOL_UNDECIDED)
+	    coding->eol_type = CODING_EOL_LF;
+	  /* We had better recover the original eol format if we
+	     encounter an inconsitent eol format while decoding.  */
+	  coding->mode |= CODING_MODE_INHIBIT_INCONSISTENT_EOL;
+	}
+}
+if (encodep
+? ! CODING_REQUIRE_ENCODING (coding)
+: ! CODING_REQUIRE_DECODING (coding))
+from = to_byte;
 else
-magnification = 3;
+{
+/* Try to skip the heading and tailing ASCIIs.  */
-return (src_bytes * magnification + CONVERSION_BUFFER_EXTRA_ROOM);
+if (encodep)
-}
+	shrink_encoding_region (&from, &to_byte, coding, XSTRING (str)->data);
+else
-#ifndef MINIMUM_CONVERSION_BUFFER_SIZE
+	shrink_decoding_region (&from, &to_byte, coding, XSTRING (str)->data);
-#define MINIMUM_CONVERSION_BUFFER_SIZE 1024
+}
-#endif
+if (from == to_byte)
+return (nocopy ? str : Fcopy_sequence (str));
-char *conversion_buffer;
-int conversion_buffer_size;
+if (encodep)
+len = encoding_buffer_size (coding, to_byte - from);
-/* Return a pointer to a SIZE bytes of buffer to be used for encoding
+else
-or decoding.  Sufficient memory is allocated automatically.  If we
+len = decoding_buffer_size (coding, to_byte - from);
-run out of memory, return NULL.  */
+len += from + XSTRING (str)->size_byte - to_byte;
+GCPRO1 (str);
-char *
+buf = get_conversion_buffer (len);
-get_conversion_buffer (size)
+UNGCPRO;
-int size;
-{
+if (from > 0)
-if (size > conversion_buffer_size)
+bcopy (XSTRING (str)->data, buf, from);
-{
+result = (encodep
-char *buf;
+	    ? encode_coding (coding, XSTRING (str)->data + from,
-int real_size = conversion_buffer_size * 2;
+			     buf + from, to_byte - from, len)
+	    : decode_coding (coding, XSTRING (str)->data + from,
-while (real_size < size) real_size *= 2;
+			     buf + from, to - from, len));
-buf = (char *) xmalloc (real_size);
+if (! encodep && result == CODING_FINISH_INCONSISTENT_EOL)
-xfree (conversion_buffer);
+{
-conversion_buffer = buf;
+/* We simple try to decode the whole string again but without
-conversion_buffer_size = real_size;
+eol-conversion this time.  */
-}
+coding->eol_type = CODING_EOL_LF;
-return conversion_buffer;
+coding->symbol = saved_coding_symbol;
+return code_convert_string (str, coding, encodep, nocopy);
+}
+bcopy (XSTRING (str)->data + to_byte, buf + from + coding->produced,
+	 XSTRING (str)->size_byte - to_byte);
+len = from + XSTRING (str)->size_byte - to_byte;
+if (encodep)
+str = make_unibyte_string (buf, len + coding->produced);
+else
+str = make_multibyte_string (buf, len + coding->produced_char,
+				 len + coding->produced);
+return str;
 }
 #ifdef emacs
 /*** 7. Emacs Lisp library functions ***/
 return coding_system;
 while (1)
 Fsignal (Qcoding_system_error, Fcons (coding_system, Qnil));
 }
-DEFUN ("detect-coding-region", Fdetect_coding_region, Sdetect_coding_region,
+Lisp_Object
-2, 2, 0,
+detect_coding_system (src, src_bytes, highest)
-"Detect coding system of the text in the region between START and END.\n\
+unsigned char *src;
-Return a list of possible coding systems ordered by priority.\n\
+int src_bytes, highest;
-If only ASCII characters are found, it returns `undecided'\n\
-or its subsidiary coding system according to a detected end-of-line format.")
-(b, e)
-Lisp_Object b, e;
 {
 int coding_mask, eol_type;
-Lisp_Object val;
+Lisp_Object val, tmp;
-int beg, end;
+int dummy;
-int beg_byte, end_byte;
+coding_mask = detect_coding_mask (src, src_bytes, NULL, &dummy);
-validate_region (&b, &e);
+eol_type  = detect_eol_type (src, src_bytes, &dummy);
-beg = XINT (b), end = XINT (e);
+if (eol_type == CODING_EOL_INCONSISTENT)
-beg_byte = CHAR_TO_BYTE (beg);
+eol_type == CODING_EOL_UNDECIDED;
-end_byte = CHAR_TO_BYTE (end);
+if (!coding_mask)
-if (beg < GPT && end >= GPT)
-move_gap_both (end, end_byte);
-coding_mask = detect_coding_mask (BYTE_POS_ADDR (beg_byte),
-				    end_byte - beg_byte);
-eol_type = detect_eol_type (BYTE_POS_ADDR (beg_byte), end_byte - beg_byte);
-if (coding_mask == CODING_CATEGORY_MASK_ANY)
 {
 val = Qundecided;
-if (eol_type != CODING_EOL_UNDECIDED
+if (eol_type != CODING_EOL_UNDECIDED)
-	  && eol_type != CODING_EOL_INCONSISTENT)
 	{
 	  Lisp_Object val2;
 	  val2 = Fget (Qundecided, Qeol_type);
 	  if (VECTORP (val2))
 	    val = XVECTOR (val2)->contents[eol_type];
 	}
-}
+return val;
-else
+}
-{
-Lisp_Object val2;
+/* At first, gather possible coding systems in VAL.  */
+val = Qnil;
-/* At first, gather possible coding-systems in VAL in a reverse
+for (tmp = Vcoding_category_list; !NILP (tmp); tmp = XCONS (tmp)->cdr)
-	 order.  */
+{
-val = Qnil;
+int idx
-for (val2 = Vcoding_category_list;
+	= XFASTINT (Fget (XCONS (tmp)->car, Qcoding_category_index));
-	   !NILP (val2);
+if (coding_mask & (1 << idx))
-	   val2 = XCONS (val2)->cdr)
+	{
-	{
+	  val = Fcons (Fsymbol_value (XCONS (tmp)->car), val);
-	  int idx
+	  if (highest)
-	    = XFASTINT (Fget (XCONS (val2)->car, Qcoding_category_index));
+	    break;
-	  if (coding_mask & (1 << idx))
+	}
-	    {
+}
-#if 0
+if (!highest)
-	      /* This code is suppressed until we find a better way to
+val = Fnreverse (val);
-		 distinguish raw text file and binary file.  */
+/* Then, substitute the elements by subsidiary coding systems.  */
-	      if (idx == CODING_CATEGORY_IDX_RAW_TEXT
+for (tmp = val; !NILP (tmp); tmp = XCONS (tmp)->cdr)
-		  && eol_type == CODING_EOL_INCONSISTENT)
+{
-		val = Fcons (Qno_conversion, val);
+if (eol_type != CODING_EOL_UNDECIDED)
-	      else
+	{
-#endif /* 0 */
+	  Lisp_Object eol;
-		val = Fcons (Fsymbol_value (XCONS (val2)->car), val);
+	  eol = Fget (XCONS (tmp)->car, Qeol_type);
-	    }
+	  if (VECTORP (eol))
-	}
+	    XCONS (tmp)->car = XVECTOR (eol)->contents[eol_type];
+	}
-/* Then, change the order of the list, while getting subsidiary
+}
-	 coding-systems.  */
+return (highest ? XCONS (val)->car : val);
-val2 = val;
+}
-val = Qnil;
-if (eol_type == CODING_EOL_INCONSISTENT)
+DEFUN ("detect-coding-region", Fdetect_coding_region, Sdetect_coding_region,
-	eol_type == CODING_EOL_UNDECIDED;
+2, 3, 0,
-for (; !NILP (val2); val2 = XCONS (val2)->cdr)
+"Detect coding system of the text in the region between START and END.\n\
-	{
+Return a list of possible coding systems ordered by priority.\n\
-	  if (eol_type == CODING_EOL_UNDECIDED)
+\n\
-	    val = Fcons (XCONS (val2)->car, val);
+If only ASCII characters are found, it returns `undecided'\n\
-	  else
+or its subsidiary coding system according to a detected end-of-line format.\n\
-	    {
+\n\
-	      Lisp_Object val3;
+If optional argument HIGHEST is non-nil, return the coding system of\n\
-	      val3 = Fget (XCONS (val2)->car, Qeol_type);
+highest priority.")
-	      if (VECTORP (val3))
+(start, end, highest)
-		val = Fcons (XVECTOR (val3)->contents[eol_type], val);
+Lisp_Object start, end, highest;
-	      else
+{
-		val = Fcons (XCONS (val2)->car, val);
+int from, to;
-	    }
+int from_byte, to_byte;
-	}
-}
+CHECK_NUMBER_COERCE_MARKER (start, 0);
+CHECK_NUMBER_COERCE_MARKER (end, 1);
-return val;
-}
+validate_region (&start, &end);
+from = XINT (start), to = XINT (end);
-/* Scan text in the region between *BEGP and *ENDP, skip characters
+from_byte = CHAR_TO_BYTE (from);
-which we never have to encode to (iff ENCODEP is 1) or decode from
+to_byte = CHAR_TO_BYTE (to);
-coding system CODING at the head and tail, then set BEGP and ENDP
-to the addresses of start and end of the text we actually convert.  */
+if (from < GPT && to >= GPT)
+move_gap_both (to, to_byte);
-void
-shrink_conversion_area (begp, endp, coding, encodep)
+return detect_coding_system (BYTE_POS_ADDR (from_byte),
-unsigned char **begp, **endp;
+			       to_byte - from_byte,
-struct coding_system *coding;
+			       !NILP (highest));
-int encodep;
+}
-{
-register unsigned char *beg_addr = *begp, *end_addr = *endp;
+DEFUN ("detect-coding-string", Fdetect_coding_string, Sdetect_coding_string,
+1, 2, 0,
-if (coding->eol_type != CODING_EOL_LF
+"Detect coding system of the text in STRING.\n\
-&& coding->eol_type != CODING_EOL_UNDECIDED)
+Return a list of possible coding systems ordered by priority.\n\
-/* Since we anyway have to convert end-of-line format, it is not
+\n\
-worth skipping at most 100 bytes or so.  */
+If only ASCII characters are found, it returns `undecided'\n\
-return;
+or its subsidiary coding system according to a detected end-of-line format.\n\
+\n\
-if (encodep)			/* for encoding */
+If optional argument HIGHEST is non-nil, return the coding system of\n\
-{
+highest priority.")
-switch (coding->type)
+(string, highest)
-	{
+Lisp_Object string, highest;
-	case coding_type_no_conversion:
+{
-	case coding_type_emacs_mule:
+CHECK_STRING (string, 0);
-	case coding_type_undecided:
-	case coding_type_raw_text:
+return detect_coding_system (XSTRING (string)->data,
-	  /* We need no conversion.  */
+			       XSTRING (string)->size_byte,
-	  *begp = *endp;
+			       !NILP (highest));
-	  return;
-	case coding_type_ccl:
-	  /* We can't skip any data.  */
-	  return;
-	case coding_type_iso2022:
-	  if (coding->flags & CODING_FLAG_ISO_DESIGNATE_AT_BOL)
-	    {
-	      unsigned char *bol = beg_addr;
-	      while (beg_addr < end_addr && *beg_addr < 0x80)
-		{
-		  beg_addr++;
-		  if (*(beg_addr - 1) == '\n')
-		    bol = beg_addr;
-		}
-	      beg_addr = bol;
-	      goto label_skip_tail;
-	    }
-	  /* fall down ... */
-	default:
-	  /* We can skip all ASCII characters at the head and tail.  */
-	  while (beg_addr < end_addr && *beg_addr < 0x80) beg_addr++;
-	label_skip_tail:
-	  while (beg_addr < end_addr && *(end_addr - 1) < 0x80) end_addr--;
-	  break;
-	}
-}
-else				/* for decoding */
-{
-switch (coding->type)
-	{
-	case coding_type_no_conversion:
-	  /* We need no conversion.  */
-	  *begp = *endp;
-	  return;
-	case coding_type_emacs_mule:
-	case coding_type_raw_text:
-	  if (coding->eol_type == CODING_EOL_LF)
-	    {
-	      /* We need no conversion.  */
-	      *begp = *endp;
-	      return;
-	    }
-	  /* We can skip all but carriage-return.  */
-	  while (beg_addr < end_addr && *beg_addr != '\r') beg_addr++;
-	  while (beg_addr < end_addr && *(end_addr - 1) != '\r') end_addr--;
-	  break;
-	case coding_type_sjis:
-	case coding_type_big5:
-	  /* We can skip all ASCII characters at the head.  */
-	  while (beg_addr < end_addr && *beg_addr < 0x80) beg_addr++;
-	  /* We can skip all ASCII characters at the tail except for
-	     the second byte of SJIS or BIG5 code.  */
-	  while (beg_addr < end_addr && *(end_addr - 1) < 0x80) end_addr--;
-	  if (end_addr != *endp)
-	    end_addr++;
-	  break;
-	case coding_type_ccl:
-	  /* We can't skip any data.  */
-	  return;
-	default:		/* i.e. case coding_type_iso2022: */
-	  {
-	    unsigned char c;
-	    /* We can skip all ASCII characters except for a few
-	       control codes at the head.  */
-	    while (beg_addr < end_addr && (c = *beg_addr) < 0x80
-		   && c != ISO_CODE_CR && c != ISO_CODE_SO
-		   && c != ISO_CODE_SI && c != ISO_CODE_ESC)
-	      beg_addr++;
-	  }
-	  break;
-	}
-}
-*begp = beg_addr;
-*endp = end_addr;
-return;
-}
-/* Encode into or decode from (according to ENCODEP) coding system CODING
-the text between char positions B and E.  */
-Lisp_Object
-code_convert_region (b, e, coding, encodep)
-Lisp_Object b, e;
-struct coding_system *coding;
-int encodep;
-{
-int beg, end, len, consumed, produced;
-char *buf;
-unsigned char *begp, *endp;
-int opoint = PT, opoint_byte = PT_BYTE;
-int beg_byte, end_byte, len_byte;
-int zv_before = ZV;
-int zv_byte_before = ZV_BYTE;
-validate_region (&b, &e);
-beg = XINT (b), end = XINT (e);
-beg_byte = CHAR_TO_BYTE (beg);
-end_byte = CHAR_TO_BYTE (end);
-if (beg < GPT && end >= GPT)
-move_gap_both (end, end_byte);
-if (encodep && !NILP (coding->pre_write_conversion))
-{
-/* We must call a pre-conversion function which may put a new
-	 text to be converted in a new buffer.  */
-struct buffer *old = current_buffer, *new;
-TEMP_SET_PT_BOTH (beg, beg_byte);
-call2 (coding->pre_write_conversion, b, e);
-if (old != current_buffer)
-	{
-	  /* Replace the original text by the text just generated.  */
-	  len = ZV - BEGV;
-	  len_byte = ZV_BYTE - BEGV_BYTE;
-	  new = current_buffer;
-	  set_buffer_internal (old);
-	  del_range_both (beg, end, beg_byte, end_byte, 1);
-	  insert_from_buffer (new, 1, len, 0);
-	  end = beg + len;
-	  end_byte = len_byte;
-	}
-}
-/* We may be able to shrink the conversion region.  */
-begp = BYTE_POS_ADDR (beg_byte);
-endp = begp + (end_byte - beg_byte);
-shrink_conversion_area (&begp, &endp, coding, encodep);
-if (begp == endp)
-/* We need no conversion.  */
-len = end - beg;
-else
-{
-int shrunk_beg_byte, shrunk_end_byte;
-int shrunk_beg;
-int shrunk_len_byte;
-int new_len_byte;
-int buflen;
-shrunk_beg_byte = PTR_BYTE_POS (begp);
-shrunk_beg = BYTE_TO_CHAR (shrunk_beg_byte);
-shrunk_end_byte = PTR_BYTE_POS (endp);
-shrunk_len_byte = shrunk_end_byte - shrunk_beg_byte;
-if (encodep)
-	buflen = encoding_buffer_size (coding, shrunk_len_byte);
-else
-	buflen = decoding_buffer_size (coding, shrunk_len_byte);
-buf = get_conversion_buffer (buflen);
-coding->last_block = 1;
-produced = (encodep
-		  ? encode_coding (coding, begp, buf, shrunk_len_byte, buflen,
-				   &consumed)
-		  : decode_coding (coding, begp, buf, shrunk_len_byte, buflen,
-				   &consumed));
-TEMP_SET_PT_BOTH (shrunk_beg, shrunk_beg_byte);
-/* We let the number of characters in the result
-	 be computed in accord with enable-multilibyte-characters
-	 even when encoding.  Otherwise the buffer contents
-	 will be inconsistent.  */
-insert (buf, produced);
-del_range_byte (PT_BYTE, PT_BYTE + shrunk_len_byte, 1);
-if (opoint >= end)
-	{
-	  opoint += ZV - zv_before;
-	  opoint_byte += ZV_BYTE - zv_byte_before;
-	}
-else if (opoint > beg)
-	{
-	  opoint = beg;
-	  opoint_byte = beg_byte;
-	}
-TEMP_SET_PT_BOTH (opoint, opoint_byte);
-end += ZV - zv_before;
-}
-if (!encodep && !NILP (coding->post_read_conversion))
-{
-Lisp_Object insval;
-/* We must call a post-conversion function which may alter
-	 the text just converted.  */
-zv_before = ZV;
-zv_byte_before = ZV_BYTE;
-TEMP_SET_PT_BOTH (beg, beg_byte);
-insval = call1 (coding->post_read_conversion, make_number (end - beg));
-CHECK_NUMBER (insval, 0);
-if (opoint >= beg + ZV - zv_before)
-	{
-	  opoint += ZV - zv_before;
-	  opoint_byte += ZV_BYTE - zv_byte_before;
-	}
-else if (opoint > beg)
-	{
-	  opoint = beg;
-	  opoint_byte = beg_byte;
-	}
-TEMP_SET_PT_BOTH (opoint, opoint_byte);
-len = XINT (insval);
-}
-return make_number (len);
 }
 DEFUN ("decode-coding-region", Fdecode_coding_region, Sdecode_coding_region,
 3, 3, "r\nzCoding system: ",
-"Decode current region by specified coding system.\n\
+"Decode the current region by specified coding system.\n\
 When called from a program, takes three arguments:\n\
-START, END, and CODING-SYSTEM.  START END are buffer positions.\n\
+START, END, and CODING-SYSTEM.  START and END are buffer positions.\n\
 Return length of decoded text.")
-(b, e, coding_system)
+(start, end, coding_system)
-Lisp_Object b, e, coding_system;
+Lisp_Object start, end, coding_system;
 {
 struct coding_system coding;
+int from, to;
-CHECK_NUMBER_COERCE_MARKER (b, 0);
-CHECK_NUMBER_COERCE_MARKER (e, 1);
+CHECK_NUMBER_COERCE_MARKER (start, 0);
+CHECK_NUMBER_COERCE_MARKER (end, 1);
 CHECK_SYMBOL (coding_system, 2);
+validate_region (&start, &end);
+from = XFASTINT (start);
+to = XFASTINT (end);
 if (NILP (coding_system))
-return make_number (XFASTINT (e) - XFASTINT (b));
+return make_number (to - from);
 if (setup_coding_system (Fcheck_coding_system (coding_system), &coding) < 0)
-error ("Invalid coding-system: %s", XSYMBOL (coding_system)->name->data);
+error ("Invalid coding system: %s", XSYMBOL (coding_system)->name->data);
-return code_convert_region (b, e, &coding, 0);
+coding.mode |= CODING_MODE_LAST_BLOCK;
+return code_convert_region (from, to, &coding, 0, 1);
 }
 DEFUN ("encode-coding-region", Fencode_coding_region, Sencode_coding_region,
 3, 3, "r\nzCoding system: ",
-"Encode current region by specified coding system.\n\
+"Encode the current region by specified coding system.\n\
 When called from a program, takes three arguments:\n\
-START, END, and CODING-SYSTEM.  START END are buffer positions.\n\
+START, END, and CODING-SYSTEM.  START and END are buffer positions.\n\
 Return length of encoded text.")
-(b, e, coding_system)
+(start, end, coding_system)
-Lisp_Object b, e, coding_system;
+Lisp_Object start, end, coding_system;
 {
 struct coding_system coding;
+int from, to;
-CHECK_NUMBER_COERCE_MARKER (b, 0);
-CHECK_NUMBER_COERCE_MARKER (e, 1);
+CHECK_NUMBER_COERCE_MARKER (start, 0);
+CHECK_NUMBER_COERCE_MARKER (end, 1);
 CHECK_SYMBOL (coding_system, 2);
+validate_region (&start, &end);
+from = XFASTINT (start);
+to = XFASTINT (end);
 if (NILP (coding_system))
-return make_number (XFASTINT (e) - XFASTINT (b));
+return make_number (to - from);
 if (setup_coding_system (Fcheck_coding_system (coding_system), &coding) < 0)
-error ("Invalid coding-system: %s", XSYMBOL (coding_system)->name->data);
+error ("Invalid coding system: %s", XSYMBOL (coding_system)->name->data);
-return code_convert_region (b, e, &coding, 1);
+coding.mode |= CODING_MODE_LAST_BLOCK;
-}
+return code_convert_region (from, to, &coding, 1, 1);
-/* Encode or decode (according to ENCODEP) the text of string STR
-using coding CODING.  If NOCOPY is nil, we never return STR
-itself, but always a copy.  If NOCOPY is non-nil, we return STR
-if no change is needed.  */
-Lisp_Object
-code_convert_string (str, coding, encodep, nocopy)
-Lisp_Object str, nocopy;
-struct coding_system *coding;
-int encodep;
-{
-int len, consumed, produced;
-char *buf;
-unsigned char *begp, *endp;
-int head_skip, tail_skip;
-struct gcpro gcpro1;
-if (encodep && !NILP (coding->pre_write_conversion)
-|| !encodep && !NILP (coding->post_read_conversion))
-{
-/* Since we have to call Lisp functions which assume target text
-is in a buffer, after setting a temporary buffer, call
-code_convert_region.  */
-int count = specpdl_ptr - specpdl;
-int len = XSTRING (str)->size_byte;
-Lisp_Object result;
-struct buffer *old = current_buffer;
-record_unwind_protect (Fset_buffer, Fcurrent_buffer ());
-temp_output_buffer_setup (" *code-converting-work*");
-set_buffer_internal (XBUFFER (Vstandard_output));
-insert_from_string (str, 0, 0, XSTRING (str)->size, len, 0);
-code_convert_region (make_number (BEGV), make_number (ZV),
-			   coding, encodep);
-result = make_buffer_string (BEGV, ZV, 0);
-set_buffer_internal (old);
-return unbind_to (count, result);
-}
-/* We may be able to shrink the conversion region.  */
-begp = XSTRING (str)->data;
-endp = begp + XSTRING (str)->size_byte;
-shrink_conversion_area (&begp, &endp, coding, encodep);
-if (begp == endp)
-/* We need no conversion.  */
-return (NILP (nocopy) ? Fcopy_sequence (str) : str);
-/* We assume that head_skip and tail_skip count single-byte characters.  */
-head_skip = begp - XSTRING (str)->data;
-tail_skip = XSTRING (str)->size_byte - head_skip - (endp - begp);
-GCPRO1 (str);
-if (encodep)
-len = encoding_buffer_size (coding, endp - begp);
-else
-len = decoding_buffer_size (coding, endp - begp);
-buf = get_conversion_buffer (len + head_skip + tail_skip);
-bcopy (XSTRING (str)->data, buf, head_skip);
-coding->last_block = 1;
-produced = (encodep
-	      ? encode_coding (coding, XSTRING (str)->data + head_skip,
-			       buf + head_skip, endp - begp, len, &consumed)
-	      : decode_coding (coding, XSTRING (str)->data + head_skip,
-			       buf + head_skip, endp - begp, len, &consumed));
-bcopy (XSTRING (str)->data + head_skip + (endp - begp),
-	 buf + head_skip + produced,
-	 tail_skip);
-UNGCPRO;
-if (encodep)
-/* When encoding, the result is all single-byte characters.  */
-return make_unibyte_string (buf, head_skip + produced + tail_skip);
-/* When decoding, count properly the number of chars in the string.  */
-return make_string (buf, head_skip + produced + tail_skip);
 }
 DEFUN ("decode-coding-string", Fdecode_coding_string, Sdecode_coding_string,
 2, 3, 0,
 "Decode STRING which is encoded in CODING-SYSTEM, and return the result.\n\
 CHECK_STRING (string, 0);
 CHECK_SYMBOL (coding_system, 1);
 if (NILP (coding_system))
 return (NILP (nocopy) ? Fcopy_sequence (string) : string);
 if (setup_coding_system (Fcheck_coding_system (coding_system), &coding) < 0)
-error ("Invalid coding-system: %s", XSYMBOL (coding_system)->name->data);
+error ("Invalid coding system: %s", XSYMBOL (coding_system)->name->data);
-return code_convert_string (string, &coding, 0, nocopy);
+coding.mode |= CODING_MODE_LAST_BLOCK;
+return code_convert_string (string, &coding, 0, !NILP (nocopy));
 }
 DEFUN ("encode-coding-string", Fencode_coding_string, Sencode_coding_string,
 2, 3, 0,
 "Encode STRING to CODING-SYSTEM, and return the result.\n\
 CHECK_STRING (string, 0);
 CHECK_SYMBOL (coding_system, 1);
 if (NILP (coding_system))
 return (NILP (nocopy) ? Fcopy_sequence (string) : string);
 if (setup_coding_system (Fcheck_coding_system (coding_system), &coding) < 0)
-error ("Invalid coding-system: %s", XSYMBOL (coding_system)->name->data);
+error ("Invalid coding system: %s", XSYMBOL (coding_system)->name->data);
-return code_convert_string (string, &coding, 1, nocopy);
+coding.mode |= CODING_MODE_LAST_BLOCK;
+return code_convert_string (string, &coding, 1, !NILP (nocopy));
 }
 DEFUN ("decode-sjis-char", Fdecode_sjis_char, Sdecode_sjis_char, 1, 1, 0,
 "Decode a JISX0208 character of shift-jis encoding.\n\
 CODE is the character code in SJIS.\n\
 XSETFASTINT (val, MAKE_NON_ASCII_CHAR (charset_jisx0208, c1, c2));
 return val;
 }
 DEFUN ("encode-sjis-char", Fencode_sjis_char, Sencode_sjis_char, 1, 1, 0,
-"Encode a JISX0208 character CHAR to SJIS coding-system.\n\
+"Encode a JISX0208 character CHAR to SJIS coding system.\n\
 Return the corresponding character code in SJIS.")
 (ch)
 Lisp_Object ch;
 {
 int charset, c1, c2, s1, s2;
 XSETFASTINT (val, 0);
 return val;
 }
 DEFUN ("decode-big5-char", Fdecode_big5_char, Sdecode_big5_char, 1, 1, 0,
-"Decode a Big5 character CODE of BIG5 coding-system.\n\
+"Decode a Big5 character CODE of BIG5 coding system.\n\
 CODE is the character code in BIG5.\n\
 Return the corresponding character.")
 (code)
 Lisp_Object code;
 {
 XSETFASTINT (val, MAKE_NON_ASCII_CHAR (charset, c1, c2));
 return val;
 }
 DEFUN ("encode-big5-char", Fencode_big5_char, Sencode_big5_char, 1, 1, 0,
-"Encode the Big5 character CHAR to BIG5 coding-system.\n\
+"Encode the Big5 character CHAR to BIG5 coding system.\n\
 Return the corresponding character code in Big5.")
 (ch)
 Lisp_Object ch;
 {
 int charset, c1, c2, b1, b2;
 	}
 }
 return Qnil;
 }
+DEFUN ("update-iso-coding-systems", Fupdate_iso_coding_systems,
+Supdate_iso_coding_systems, 0, 0, 0,
+"Update internal database for ISO2022 based coding systems.\n\
+When values of the following coding categories are changed, you must\n\
+call this function:\n\
+coding-category-iso-7, coding-category-iso-7-tight,\n\
+coding-category-iso-8-1, coding-category-iso-8-2,\n\
+coding-category-iso-7-else, coding-category-iso-8-else")
+()
+{
+int i;
+for (i = CODING_CATEGORY_IDX_ISO_7; i <= CODING_CATEGORY_IDX_ISO_8_ELSE;
+i++)
+{
+if (! coding_system_table[i])
+	coding_system_table[i]
+	  = (struct coding_system *) xmalloc (sizeof (struct coding_system));
+setup_coding_system
+	(XSYMBOL (XVECTOR (Vcoding_category_table)->contents[i])->value,
+	 coding_system_table[i]);
+}
+return Qnil;
+}
 #endif /* emacs */
 /*** 8. Post-amble ***/
 setup_coding_system (Qnil, &keyboard_coding);
 setup_coding_system (Qnil, &terminal_coding);
 setup_coding_system (Qnil, &safe_terminal_coding);
+bzero (coding_system_table, sizeof coding_system_table);
 #if defined (MSDOS) || defined (WINDOWSNT)
 system_eol_type = CODING_EOL_CRLF;
 #else
 system_eol_type = CODING_EOL_LF;
 #endif
 Fput (Qcoding_system_error, Qerror_conditions,
 	Fcons (Qcoding_system_error, Fcons (Qerror, Qnil)));
 Fput (Qcoding_system_error, Qerror_message,
 	build_string ("Invalid coding system"));
+Qcoding_category = intern ("coding-category");
+staticpro (&Qcoding_category);
 Qcoding_category_index = intern ("coding-category-index");
 staticpro (&Qcoding_category_index);
+Vcoding_category_table
+= Fmake_vector (make_number (CODING_CATEGORY_IDX_MAX), Qnil);
+staticpro (&Vcoding_category_table);
 {
 int i;
 for (i = 0; i < CODING_CATEGORY_IDX_MAX; i++)
 {
-	coding_category_table[i] = intern (coding_category_name[i]);
+	XVECTOR (Vcoding_category_table)->contents[i]
-	staticpro (&coding_category_table[i]);
+	  = intern (coding_category_name[i]);
-	Fput (coding_category_table[i], Qcoding_category_index,
+	Fput (XVECTOR (Vcoding_category_table)->contents[i],
-	      make_number (i));
+	      Qcoding_category_index, make_number (i));
 }
 }
 Qcharacter_unification_table = intern ("character-unification-table");
 staticpro (&Qcharacter_unification_table);
 Qsafe_charsets = intern ("safe-charsets");
 staticpro (&Qsafe_charsets);
 Qemacs_mule = intern ("emacs-mule");
 staticpro (&Qemacs_mule);
+Qraw_text = intern ("raw-text");
+staticpro (&Qraw_text);
 defsubr (&Scoding_system_p);
 defsubr (&Sread_coding_system);
 defsubr (&Sread_non_nil_coding_system);
 defsubr (&Scheck_coding_system);
 defsubr (&Sdetect_coding_region);
+defsubr (&Sdetect_coding_string);
 defsubr (&Sdecode_coding_region);
 defsubr (&Sencode_coding_region);
 defsubr (&Sdecode_coding_string);
 defsubr (&Sencode_coding_string);
 defsubr (&Sdecode_sjis_char);
 defsubr (&Sset_safe_terminal_coding_system_internal);
 defsubr (&Sterminal_coding_system);
 defsubr (&Sset_keyboard_coding_system_internal);
 defsubr (&Skeyboard_coding_system);
 defsubr (&Sfind_operation_coding_system);
+defsubr (&Supdate_iso_coding_systems);
 DEFVAR_LISP ("coding-system-list", &Vcoding_system_list,
 "List of coding systems.\n\
 \n\
 Do not alter the value of this variable manually.  This variable should be\n\
 int i;
 Vcoding_category_list = Qnil;
 for (i = CODING_CATEGORY_IDX_MAX - 1; i >= 0; i--)
 Vcoding_category_list
-	= Fcons (coding_category_table[i], Vcoding_category_list);
+	= Fcons (XVECTOR (Vcoding_category_table)->contents[i],
+		 Vcoding_category_list);
 }
 DEFVAR_LISP ("coding-system-for-read", &Vcoding_system_for_read,
 "Specify the coding system for read operations.\n\
 It is useful to bind this variable with `let', but do not set it globally.\n\
 a coding system of ISO 2022 variant which has a flag\n\
 `accept-latin-extra-code' t (e.g. iso-latin-1) on reading a file\n\
 or reading output of a subprocess.\n\
 Only 128th through 159th elements has a meaning.");
 Vlatin_extra_code_table = Fmake_vector (make_number (256), Qnil);
+DEFVAR_LISP ("select-safe-coding-system-function",
+	       &Vselect_safe_coding_system_function,
+"Function to call to select safe coding system for encoding a text.\n\
+\n\
+If set, this function is called to force a user to select a proper\n\
+coding system which can encode the text in the case that a default\n\
+coding system used in each operation can't encode the text.\n\
+\n\
+The default value is `select-safe-codign-system' (which see).");
+Vselect_safe_coding_system_function = Qnil;
 }
 #endif /* emacs */

Mercurial > emacs

comparison src/coding.c @ 20718:c600dea3b06b