pkgsrc-Changes archive
CVS commit: pkgsrc/textproc/R-data.table
Module Name: pkgsrc
Committed By: mef
Date: Sun Jun 6 13:32:51 UTC 2021
Modified Files:
pkgsrc/textproc/R-data.table: Makefile distinfo
Log Message:
(textproc/R-data.table) updated 1.12.2 to 1.14.0
NEWS.md from 1.12.2 to 1.14.0 has over 1,000 lines; see the following
URL for the full text:
https://github.com/Rdatatable/data.table/blob/master/NEWS.md
Only the 'POTENTIALLY BREAKING CHANGES' section for 1.14.0 is quoted here:
# data.table [v1.14.0](https://github.com/Rdatatable/data.table/milestone/23?closed=1)
## POTENTIALLY BREAKING CHANGES
1. In v1.13.0 (July 2020) native parsing of datetime was added to
`fread` by Michael Chirico which dramatically improved
performance. Before then datetime was read as type character by
default which was slow. Since v1.13.0, UTC-marked datetime
(e.g. `2020-07-24T10:11:12.134Z` where the final `Z` is present) has
been read automatically as POSIXct and quickly. We provided the
migration option `datatable.old.fread.datetime.character` to revert to
the previous slow character behavior. We also added the `tz=` argument
to control unmarked datetime; i.e. where the `Z` (or equivalent UTC
postfix) is missing in the data. The default `tz=""` reads unmarked
datetime as character as before, slowly. We gave you the ability to
set `tz="UTC"` to turn on the new behavior and read unmarked datetime
as UTC, quickly. R sessions that are running in UTC by setting the TZ
environment variable, as is good practice and common in production,
have also been reading unmarked datetime as UTC since v1.13.0, much
faster. Note 1 of v1.13.0 (below in this file) ended `In addition to
convenience, fread is now significantly faster in the presence of
dates, UTC-marked datetimes, and unmarked datetime when tz="UTC" is
provided.`.
At `rstudio::global(2021)`, Neal Richardson, Director of
Engineering at Ursa Labs, compared Arrow CSV performance to
`data.table` CSV performance, [Bigger Data With Ease Using Apache
Arrow](https://rstudio.com/resources/rstudioglobal-2021/bigger-data-with-ease-using-apache-arrow/). He
opened by comparing to `data.table` as his main point. Arrow was
presented as 3 times faster than `data.table`. He talked at length
about this result. However, no reproducible code was provided and
we were not contacted in advance in case we had any comments. He
mentioned New York Taxi data in his talk which is a dataset known
to us as containing unmarked
datetime. [Rebuttal](https://twitter.com/MattDowle/status/1360073970498875394).
`tz=`'s default is now changed from `""` to `"UTC"`. If you have
been using `tz=` explicitly then there should be no change. The
change to read UTC-marked datetime as POSIXct rather than
character already happened in v1.13.0. The change now is that
unmarked datetimes are now read as UTC too by default without
needing to set `tz="UTC"`. None of the 1,017 CRAN packages
directly using `data.table` are affected. As before, the migration
option `datatable.old.fread.datetime.character` can still be set
to TRUE to revert to the old character behavior. This migration
option is temporary and will be removed in the near future.
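
The behavior described above can be checked with a minimal sketch, assuming data.table 1.14.0 or later is installed. The sample rows and the column name `pickup` are hypothetical and stand in for any CSV with unmarked datetimes; only `fread`, its `tz=` argument, and the `datatable.old.fread.datetime.character` option come from the NEWS text.

```r
library(data.table)

# Hypothetical sample: two unmarked datetimes (no trailing "Z" or UTC offset).
csv_lines <- c(
  "id,pickup",
  "1,2020-07-24 10:11:12",
  "2,2020-07-25 11:12:13"
)

# tz="UTC": unmarked datetimes are parsed natively as UTC POSIXct.
# From v1.14.0 on, this is also what the default does.
dt_utc <- fread(text = csv_lines, sep = ",", tz = "UTC")

# tz="": unmarked datetimes stay character (the default before v1.14.0).
dt_chr <- fread(text = csv_lines, sep = ",", tz = "")

# Migration option: intended to restore the old character behavior.
# It is documented as temporary, so a newer release may ignore it.
old <- options(datatable.old.fread.datetime.character = TRUE)
dt_old <- fread(text = csv_lines, sep = ",")
options(old)  # restore the previous setting

# Show the column class obtained in each case.
print(sapply(list(utc = dt_utc, chr = dt_chr, old = dt_old),
             function(d) class(d$pickup)[1]))
```

Code that relied on the old default and expects a character column can either pass `tz=""` explicitly or convert the column after reading; the option is only a stopgap.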
To generate a diff of this commit:
cvs rdiff -u -r1.2 -r1.3 pkgsrc/textproc/R-data.table/Makefile
cvs rdiff -u -r1.1 -r1.2 pkgsrc/textproc/R-data.table/distinfo
Please note that diffs are not public domain; they are subject to the
copyright notices on the relevant files.
Modified files:
Index: pkgsrc/textproc/R-data.table/Makefile
diff -u pkgsrc/textproc/R-data.table/Makefile:1.2 pkgsrc/textproc/R-data.table/Makefile:1.3
--- pkgsrc/textproc/R-data.table/Makefile:1.2 Thu Aug 8 19:53:56 2019
+++ pkgsrc/textproc/R-data.table/Makefile Sun Jun 6 13:32:51 2021
@@ -1,13 +1,18 @@
-# $NetBSD: Makefile,v 1.2 2019/08/08 19:53:56 brook Exp $
+# $NetBSD: Makefile,v 1.3 2021/06/06 13:32:51 mef Exp $
R_PKGNAME= data.table
-R_PKGVER= 1.12.2
+R_PKGVER= 1.14.0
CATEGORIES= textproc
MAINTAINER= bacon%NetBSD.org@localhost
COMMENT= Extension of 'data.frame'
LICENSE= mpl-2.0
+TEST_DEPENDS+= R-bit64-[0-9]*:../../devel/R-bit64
+TEST_DEPENDS+= R-bit-[0-9]*:../../devel/R-bit
+TEST_DEPENDS+= R-xts-[0-9]*:../../math/R-xts
+TEST_DEPENDS+= R-zoo-[0-9]*:../../math/R-zoo
+
USE_LANGUAGES= c
.include "../../math/R/Makefile.extension"
Index: pkgsrc/textproc/R-data.table/distinfo
diff -u pkgsrc/textproc/R-data.table/distinfo:1.1 pkgsrc/textproc/R-data.table/distinfo:1.2
--- pkgsrc/textproc/R-data.table/distinfo:1.1 Wed Jul 31 13:43:56 2019
+++ pkgsrc/textproc/R-data.table/distinfo Sun Jun 6 13:32:51 2021
@@ -1,6 +1,6 @@
-$NetBSD: distinfo,v 1.1 2019/07/31 13:43:56 brook Exp $
+$NetBSD: distinfo,v 1.2 2021/06/06 13:32:51 mef Exp $
-SHA1 (R/data.table_1.12.2.tar.gz) = 3583fc54515c8419925098061c7a2add9ecccd4c
-RMD160 (R/data.table_1.12.2.tar.gz) = 5f75feea4718479f1d87eb9cededcf292783e000
-SHA512 (R/data.table_1.12.2.tar.gz) = addfddd432cdda8c37959ed08b30ad0d030938d1ea531e299f8398c895d1da00ba409dcc88ae0677c71e060dcd665e484f9562c791b406e02bc8f68bb2de9e2c
-Size (R/data.table_1.12.2.tar.gz) = 3343698 bytes
+SHA1 (R/data.table_1.14.0.tar.gz) = 6ca317410f49abd751905572e943fb3f09790eb0
+RMD160 (R/data.table_1.14.0.tar.gz) = b724e1a468411d678fbe29806b921e8251fdcdbe
+SHA512 (R/data.table_1.14.0.tar.gz) = db90998904270f42cc654faa0b333f8e720e4044c7f2c82f52efbb69e1ed30c5b2c1b034f7a5baff4543151eda7fb33aa454817889ac95a5d17d6aba364396df
+Size (R/data.table_1.14.0.tar.gz) = 5303849 bytes