Home | History | Annotate | Line # | Download | only in dist
FIXES revision 1.4
      1 /****************************************************************
      2 Copyright (C) Lucent Technologies 1997
      3 All Rights Reserved
      4 
      5 Permission to use, copy, modify, and distribute this software and
      6 its documentation for any purpose and without fee is hereby
      7 granted, provided that the above copyright notice appear in all
      8 copies and that both that the copyright notice and this
      9 permission notice and warranty disclaimer appear in supporting
     10 documentation, and that the name Lucent Technologies or any of
     11 its entities not be used in advertising or publicity pertaining
     12 to distribution of the software without specific, written prior
     13 permission.
     14 
     15 LUCENT DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE,
     16 INCLUDING ALL IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS.
     17 IN NO EVENT SHALL LUCENT OR ANY OF ITS ENTITIES BE LIABLE FOR ANY
     18 SPECIAL, INDIRECT OR CONSEQUENTIAL DAMAGES OR ANY DAMAGES
     19 WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS, WHETHER
     20 IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION,
     21 ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF
     22 THIS SOFTWARE.
     23 ****************************************************************/
     24 
     25 This file lists all bug fixes, changes, etc., made since the
     26 second edition of the AWK book was published in September 2023.
     27 
     28 Jul 28, 2024
     29 	Fixed readcsvrec resize segfault when reading csv records longer
     30 	than 8k. Thanks to Ozan Yigit.
     31 	mktime() added to bsd-features branch. Thanks to Todd Miller.
     32 
     33 Jun 23, 2024
     34 	Fix signal for system-status test. Thanks to Tim van der Molen.
     35 	Rewrite if-else chain as switch. Thanks to Andrew Sukach.
     36 
     37 May 27, 2024
     38 	Spelling fixes and removal of unneeded prototypes and extern.
     39 	Thanks to Jonathan Gray.
     40 
     41 May 4, 2024
     42 	Fixed a use-after-free bug with ARGV for "delete ARGV".
     43 	Also ENVtab is no longer global. Thanks to Benjamin Sturz
     44 	for spotting the ARGV issue and	Todd Miller for the fix.
     45 
     46 May 3, 2024:
     47 	Remove warnings when compiling with g++. Thanks to Arnold Robbins.
     48 
     49 Apr 22, 2024:
     50 	Fixed regex engine gototab reallocation issue that was
     51 	Introduced during the Nov 24 rewrite. Thanks to Arnold Robbins.
     52 	Fixed a scan bug in split in the case the separator is a single
     53 	character. Thanks to Oguz Ismail for spotting the issue.
     54 
     55 Mar 10, 2024:
     56 	Fixed use-after-free bug in fnematch due to adjbuf invalidating
     57 	the pointers to buf. Thanks to github user caffe3 for spotting
     58 	the issue and providing a fix, and to Miguel Pineiro Jr.
     59 	for the alternative fix.
     60 	MAX_UTF_BYTES in fnematch has been replaced with awk_mb_cur_max.
     61 	thanks to Miguel Pineiro Jr.
     62 
     63 Jan 22, 2024:
     64 	Restore the ability to compile with g++. Thanks to
     65 	Arnold Robbins.
     66 
     67 Dec 24, 2023:
     68 	Matchop dereference after free problem fix when the first
     69 	argument is a function call. Thanks to Oguz Ismail Uysal.
     70 	Fix inconsistent handling of --csv and FS set in the
     71 	command line. Thanks to Wilbert van der Poel.
     72 	Casting changes to int for is* functions.
     73 
     74 Nov 27, 2023:
     75 	Fix exit status of system on MacOS. Update to REGRESS.
     76 	Thanks to Arnold Robbins.
     77 	Fix inconsistent handling of -F and --csv, and loss of csv
     78 	mode when FS is set.
     79 
     80 Nov 24, 2023:
     81         Fix issue #199: gototab improvements to dynamically resize the
     82         table, qsort and bsearch to improve the lookup speed as the
     83         table gets larger for multibyte input. Thanks to Arnold Robbins.
     84 
     85 Nov 23, 2023:
     86 	Fix Issue #169, related to escape sequences in strings.
     87 	Thanks to Github user rajeevvp.
     88 	Fix Issue #147, reported by Github user drawkula, and fixed
     89 	by Miguel Pineiro Jr.
     90 
     91 Nov 20, 2023:
     92 	Rewrite of fnematch to fix a number of issues, including
     93 	extraneous output, out-of-bounds access, number of bytes
     94 	to push back after a failed match etc.
     95 	Thanks to Miguel Pineiro Jr.
     96 
     97 Nov 15, 2023:
     98 	Man page edit, regression test fixes. Thanks to Arnold Robbins
     99 	Consolidation of sub and gsub into dosub, removing duplicate
    100 	code. Thanks to Miguel Pineiro Jr.
    101 	gcc replaced with cc everywhere.
    102 
    103 Oct 30, 2023:
    104 	Multiple fixes and a minor code cleanup.
    105 	Disabled utf-8 for non-multibyte locales, such as C or POSIX.
    106 	Fixed a bad char * cast that causes incorrect results on big-endian
    107 	systems. Also fixed an out-of-bounds read for empty CCL.
    108 	Fixed a buffer overflow in substr with utf-8 strings.
    109 	Many thanks to Todd C Miller.
    110 
    111 Sep 24, 2023:
    112 	fnematch and getrune have been overhauled to solve issues around
    113 	unicode FS and RS. Also fixed gsub null match issue with unicode.
    114 	Big thanks to Arnold Robbins.
    115 
    116 Sep 12, 2023:
    117 	Fixed a length error in u8_byte2char that set RSTART to
    118 	incorrect (cannot happen) value for EOL match(str, /$/).
    119 
    120 
    121 -----------------------------------------------------------------
    122 
    123 [This entry is a summary, not a precise list of changes.]
    124 
    125 	Added --csv option to enable processing of comma-separated
    126 	values inputs.  When --csv is enabled, fields are separated
    127 	by commas, fields may be quoted with " double quotes, fields
    128 	may contain embedded newlines.
    129 
    130 	If no explicit separator argument is provided, split() uses
    131 	the setting of --csv to determine how fields are split.
    132 
    133 	Strings may now contain UTF-8 code points (not necessarily
    134 	characters).  Functions that operate on characters, like
    135 	length, substr, index, match, etc., use UTF-8, so the length
    136 	of a string of 3 emojis is 3, not 12 as it would be if bytes
    137 	were counted.
    138 
    139 	Regular expressions are processed as UTF-8.
    140 
    141 	Unicode literals can be written as \u followed by one
    142 	to eight hexadecimal digits.  These may appear in strings and
    143 	regular expressions.
    144