Source-Changes-HG archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]
[src/netbsd-9]: src/external/historical/nawk/dist Pull up following revision(...
details: https://anonhg.NetBSD.org/src/rev/8e6c2aab062c
branches: netbsd-9
changeset: 462289:8e6c2aab062c
user: martin <martin%NetBSD.org@localhost>
date: Sun Aug 04 19:19:31 2019 +0000
description:
Pull up following revision(s) (requested by christos in ticket #11):
external/historical/nawk/dist/awk.h: revision 1.3
external/historical/nawk/dist/run.c: revision 1.10
external/historical/nawk/dist/FIXES: revision 1.2
external/historical/nawk/dist/b.c: revision 1.7
external/historical/nawk/dist/b.c: revision 1.8
external/historical/nawk/dist/lex.c: revision 1.5
external/historical/nawk/dist/main.c: revision 1.9
external/historical/nawk/dist/proto.h: revision 1.8
external/historical/nawk/dist/tran.c: revision 1.10
external/historical/nawk/dist/proto.h: revision 1.9
external/historical/nawk/dist/awk.1: revision 1.2
external/historical/nawk/dist/lib.c: revision 1.9
external/historical/nawk/dist/ytab.c: revision 1.2
external/historical/nawk/dist/tran.c: revision 1.9
remove trailing whitespace.
PR/54424: Martijn Dekker: awk: broken character classes in UTF-8 locale:
only the first matches
Pick up some of the fixes from upstream:
- posix paren matching
- print \v \a
- some more fatal handling
- init all the character range.
remove ### error output accidentally committed.
Add translators for \v and \a per posix.
diffstat:
external/historical/nawk/dist/FIXES | 34 +++++++++---------
external/historical/nawk/dist/awk.1 | 12 +++---
external/historical/nawk/dist/awk.h | 4 +-
external/historical/nawk/dist/b.c | 63 +++++++++++++++++++++++++----------
external/historical/nawk/dist/lex.c | 22 ++++++------
external/historical/nawk/dist/lib.c | 2 +-
external/historical/nawk/dist/main.c | 6 +-
external/historical/nawk/dist/run.c | 4 +-
external/historical/nawk/dist/tran.c | 14 ++++---
external/historical/nawk/dist/ytab.c | 10 ++--
10 files changed, 99 insertions(+), 72 deletions(-)
diffs (truncated from 657 to 300 lines):
diff -r eb4c3084d363 -r 8e6c2aab062c external/historical/nawk/dist/FIXES
--- a/external/historical/nawk/dist/FIXES Sun Aug 04 19:09:16 2019 +0000
+++ b/external/historical/nawk/dist/FIXES Sun Aug 04 19:19:31 2019 +0000
@@ -52,10 +52,10 @@
/pat/, \n /pat/ {...} is now legal, though bad style to use.
added checks to new -v code that permits -vnospace; thanks to
- ruslan ermilov for spotting this and providing the patch.
+ ruslan ermilov for spotting this and providing the patch.
removed fixed limit on number of open files; thanks to aleksey
- cheusov and christos zoulos.
+ cheusov and christos zoulos.
fixed day 1 bug that resurrected deleted elements of ARGV when
used as filenames (in lib.c).
@@ -73,10 +73,10 @@
and arnold robbins, changed srand() to return the previous
seed (which is 1 on the first call of srand). the seed is
an Awkfloat internally though converted to unsigned int to
- pass to the library srand(). thanks, everyone.
+ pass to the library srand(). thanks, everyone.
fixed a subtle (and i hope low-probability) overflow error
- in fldbld, by adding space for one extra \0. thanks to
+ in fldbld, by adding space for one extra \0. thanks to
robert bassett for spotting this one and providing a fix.
removed the files related to compilation on windows. i no
@@ -113,7 +113,7 @@
Oct 23, 2007:
minor fix in lib.c: increase inputFS to 100, change malloc
- for fields to n+1.
+ for fields to n+1.
fixed memory fault caused by out of order test in setsval.
@@ -160,7 +160,7 @@
core dump on linux with BEGIN {nextfile}, now fixed.
- removed some #ifdef's in run.c and lex.c that appear to no
+ removed some #ifdef's in run.c and lex.c that appear to no
longer be necessary.
Apr 24, 2005:
@@ -174,8 +174,8 @@
rethinking it.
Dec 31, 2004:
- prevent overflow of -f array in main, head off potential error in
- call of SYNTAX(), test malloc return in lib.c, all with thanks to
+ prevent overflow of -f array in main, head off potential error in
+ call of SYNTAX(), test malloc return in lib.c, all with thanks to
todd miller.
Dec 22, 2004:
@@ -203,8 +203,8 @@
code known to man.
fixed a storage leak in call() that appears to have been there since
- 1983 or so -- a function without an explicit return that assigns a
- string to a parameter leaked a Cell. thanks to moinak ghosh for
+ 1983 or so -- a function without an explicit return that assigns a
+ string to a parameter leaked a Cell. thanks to moinak ghosh for
spotting this very subtle one.
Jul 31, 2003:
@@ -226,7 +226,7 @@
radix character in programs and command line arguments regardless of
the locale; otherwise, the locale should prevail for input and output
of numbers. so it's intended to work that way.
-
+
i have rescinded the attempt to use strcoll in expanding shorthands in
regular expressions (cclenter). its properties are much too
surprising; for example [a-c] matches aAbBc in locale en_US but abBcC
@@ -290,7 +290,7 @@
Jun 28, 2002:
modified run/format() and tran/getsval() to do a slightly better
job on using OFMT for output from print and CONVFMT for other
- number->string conversions, as promised by posix and done by
+ number->string conversions, as promised by posix and done by
gawk and mawk. there are still places where it doesn't work
right if CONVFMT is changed; by then the STR attribute of the
variable has been irrevocably set. thanks to arnold robbins for
@@ -322,7 +322,7 @@
Jan 1, 2002:
fflush() or fflush("") flushes all files and pipes.
- length(arrayname) returns number of elements; thanks to
+ length(arrayname) returns number of elements; thanks to
arnold robbins for suggestion.
added a makefile.win to make it easier to build on windows.
@@ -372,7 +372,7 @@
May 25, 2000:
yet another attempt at making 8-bit input work, with another
- band-aid in b.c (member()), and some (uschar) casts to head
+ band-aid in b.c (member()), and some (uschar) casts to head
off potential errors in subscripts (like isdigit). also
changed HAT to NCHARS-2. thanks again to santiago vila.
@@ -419,7 +419,7 @@
the test case.)
Apr 16, 1999:
- with code kindly provided by Bruce Lilly, awk now parses
+ with code kindly provided by Bruce Lilly, awk now parses
/=/ and similar constructs more sensibly in more places.
Bruce also provided some helpful test cases.
@@ -476,7 +476,7 @@
Oct 19, 1998:
fixed a couple of bugs in getrec: could fail to update $0
- after a getline var; because inputFS wasn't initialized,
+ after a getline var; because inputFS wasn't initialized,
could split $0 on every character, a misleading diversion.
fixed caching bug in makedfa: LRU was actually removing
@@ -622,7 +622,7 @@
input file. (thanks to arnold robbins for inspiration and code).
small fixes to regexpr code: can now handle []], [[], and
- variants; [] is now a syntax error, rather than matching
+ variants; [] is now a syntax error, rather than matching
everything; [z-a] is now empty, not z. far from complete
or correct, however. (thanks to jeffrey friedl for pointing out
some awful behaviors.)
diff -r eb4c3084d363 -r 8e6c2aab062c external/historical/nawk/dist/awk.1
--- a/external/historical/nawk/dist/awk.1 Sun Aug 04 19:09:16 2019 +0000
+++ b/external/historical/nawk/dist/awk.1 Sun Aug 04 19:19:31 2019 +0000
@@ -49,7 +49,7 @@
Each line is matched against the
pattern portion of every pattern-action statement;
the associated action is performed for each matched pattern.
-The file name
+The file name
.B \-
means the standard input.
Any
@@ -91,7 +91,7 @@
.IP
.IB pattern " { " action " }
.PP
-A missing
+A missing
.BI { " action " }
means print the line;
a missing pattern always matches.
@@ -195,7 +195,7 @@
.BR sin ,
.BR cos ,
and
-.BR atan2
+.BR atan2
are built in.
Other built-in functions:
.TF length
@@ -224,7 +224,7 @@
substring of
.I s
that begins at position
-.IR m
+.IR m
counted from 1.
.TP
.BI index( s , " t" )
@@ -352,7 +352,7 @@
of regular expressions and
relational expressions.
Regular expressions are as in
-.IR egrep ;
+.IR egrep ;
see
.IR grep (1).
Isolated regular expressions
@@ -512,7 +512,7 @@
.fi
.EE
.SH SEE ALSO
-.IR lex (1),
+.IR lex (1),
.IR sed (1)
.br
A. V. Aho, B. W. Kernighan, P. J. Weinberger,
diff -r eb4c3084d363 -r 8e6c2aab062c external/historical/nawk/dist/awk.h
--- a/external/historical/nawk/dist/awk.h Sun Aug 04 19:09:16 2019 +0000
+++ b/external/historical/nawk/dist/awk.h Sun Aug 04 19:19:31 2019 +0000
@@ -32,7 +32,7 @@
#define xfree(a) { if ((a) != NULL) { free((void *) (a)); (a) = NULL; } }
-#define NN(p) ((p) ? (p) : "(null)") /* guaranteed non-null for dprintf
+#define NN(p) ((p) ? (p) : "(null)") /* guaranteed non-null for dprintf
*/
#define DEBUG
#ifdef DEBUG
@@ -155,7 +155,7 @@
#define CCOPY 6
#define CCON 5
#define CTEMP 4
-#define CNAME 3
+#define CNAME 3
#define CVAR 2
#define CFLD 1
#define CUNK 0
diff -r eb4c3084d363 -r 8e6c2aab062c external/historical/nawk/dist/b.c
--- a/external/historical/nawk/dist/b.c Sun Aug 04 19:09:16 2019 +0000
+++ b/external/historical/nawk/dist/b.c Sun Aug 04 19:19:31 2019 +0000
@@ -31,6 +31,7 @@
#define DEBUG
#include <ctype.h>
+#include <limits.h>
#include <stdio.h>
#include <string.h>
#include <stdlib.h>
@@ -220,7 +221,7 @@
f->curstat = 2;
f->out[2] = 0;
k = *(f->re[0].lfollow);
- xfree(f->posns[2]);
+ xfree(f->posns[2]);
if ((f->posns[2] = calloc(1, (k+1)*sizeof(int))) == NULL)
overflo("out of space in makeinit");
for (i=0; i <= k; i++) {
@@ -333,6 +334,10 @@
c = '\r';
else if (c == 'b')
c = '\b';
+ else if (c == 'v')
+ c = '\v';
+ else if (c == 'a')
+ c = '\a';
else if (c == '\\')
c = '\\';
else if (c == 'x') { /* hexadecimal goo follows */
@@ -649,9 +654,9 @@
* RETURN VALUES
* 0 No match found.
* 1 Match found.
- */
+ */
-int fnematch(fa *pfa, FILE *f, uschar **pbuf, int *pbufsize, int quantum)
+int fnematch(fa *pfa, FILE *f, uschar **pbuf, int *pbufsize, int quantum)
{
uschar *buf = *pbuf;
int bufsize = *pbufsize;
@@ -676,7 +681,7 @@
if (++j == k) {
if (k == bufsize)
if (!adjbuf(&buf, &bufsize, bufsize+1, quantum, 0, "fnematch"))
- FATAL("stream '%.30s...' too long", buf);
+ FATAL("stream '%.30s...' too long", buf);
buf[k++] = (c = getc(f)) != EOF ? c : 0;
}
c = buf[j];
@@ -716,7 +721,7 @@
*/
do
if (buf[--k] && ungetc(buf[k], f) == EOF)
- FATAL("unable to ungetc '%c'", buf[k]);
+ FATAL("unable to ungetc '%c'", buf[k]);
while (k > i + patlen);
buf[k] = 0;
return 1;
@@ -905,8 +910,8 @@
uschar *buf = 0;
int ret = 1;
int init_q = (firstnum==0); /* first added char will be ? */
- int n_q_reps = secondnum-firstnum; /* m>n, so reduce until {1,m-n} left */
- int prefix_length = reptok - basestr; /* prefix includes first rep */
+ int n_q_reps = secondnum-firstnum; /* m>n, so reduce until {1,m-n} left */
+ int prefix_length = reptok - basestr; /* prefix includes first rep */
int suffix_length = strlen(reptok) - reptoklen; /* string after rep specifier */
int size = prefix_length + suffix_length;
@@ -924,7 +929,7 @@
}
if ((buf = (uschar *) malloc(size+1)) == NULL)
FATAL("out of space in reg expr %.10s..", lastre);
- memcpy(buf, basestr, prefix_length); /* copy prefix */
+ memcpy(buf, basestr, prefix_length); /* copy prefix */
j = prefix_length;
if (special_case == REPEAT_ZERO) {
j -= atomlen;
@@ -978,26 +983,28 @@
if (secondnum < 0) { /* means {n,} -> repeat n-1 times followed by PLUS */
if (firstnum < 2) {
/* 0 or 1: should be handled before you get here */
+ FATAL("internal error");
} else {
- return replace_repeat(reptok, reptoklen, atom, atomlen,
+ return replace_repeat(reptok, reptoklen, atom, atomlen,
firstnum, secondnum, REPEAT_PLUS_APPENDED);
Home |
Main Index |
Thread Index |
Old Index