pkgsrc-WIP-changes archive
apache-arrow: Update to 15.0.0.0
Module Name: pkgsrc-wip
Committed By: Matthew Danielson <matthewd%fastmail.us@localhost>
Pushed By: matthewd
Date: Tue Jan 30 06:38:31 2024 -0800
Changeset: bdc4ce78c8a179a9ed1a2929fa0accfd6897b2d2
Modified Files:
apache-arrow/PLIST
apache-arrow/distinfo
apache-arrow/version.mk
Log Message:
apache-arrow: Update to 15.0.0.0
Apache Arrow 15.0.0 (2024-01-21 08:00:00)
Bug Fixes
GH-15192 - [C++] Bring back case_when tests for union types (#39308)
GH-32570 - [C++] Fix the issue of ExecBatchBuilder when appending consecutive tail rows with the same id may exceed buffer boundary (#39234)
GH-32662 - [C#] Make dictionaries in file and memory implementations work correctly and support integration tests (#39146)
GH-33475 - [Java] Add parameter binding for Prepared Statements in JDBC driver (#38404)
GH-34532 - [Java][FlightSQL] Change JDBC to handle multi-endpoints (#38521)
GH-34610 - [Java] Fix valueCount and field name when loading/transferring NullVector (#38973)
GH-34890 - [C++][Python] Add a no-op kernel for dictionary_encode(dictionary) (#38349)
GH-35497 - [C++] Use the latest tagged version of flatbuffers (#38192)
GH-36588 - [C#] Support blank column names and enable more integration tests. (#39167)
GH-36594 - [C++] Don’t use MSVC_VERSION to determine -fms-compatibility-version (#36595)
GH-36912 - [Java] JDBC driver stops consuming roots if it sees an empty root (#38590)
GH-37055 - [C++] Optimize hash kernels for Dictionary ChunkedArrays (#38394)
GH-37657 - [JS] Run bin scripts with ts-node (#38500)
GH-37726 - [Swift][FlightSQL] Update behavior to be similar to existing impls (#37764)
GH-37751 - [C++][Gandiva] Avoid registering exported functions multiple times in gandiva (#37752)
GH-37796 - [C++][Acero] Fix race condition caused by straggling input in the as-of-join node (#37839)
GH-37884 - [Swift] allow reading of unaligned FlatBuffers buffers (#38635)
GH-37969 - [C++][Parquet] add more closed file checks for ParquetFileWriter (#38390)
GH-38096 - [Java] FlightStream with metadata can cause error when closing (#38110)
GH-38198 - [Go] Fix AuthenticateBasicToken to be reliable behind proxies (#38199)
GH-38210 - [C++][FlightRPC] Add missing app_metadata arguments (#38231)
GH-38216 - [R] open_dataset(format = "json") not documented (#38258)
GH-38242 - [Java] Fix incorrect internal struct accounting for DenseUnionVector#getBufferSizeFor (#38305)
GH-38254 - [Java] Add reusable buffer getters to char/binary vectors (#38266)
GH-38268 - [Java] Disable flaky TestFlightSqlStreams (#38319)
GH-38281 - [Go] Ensure CData imported arrays are freed on release (#38314)
GH-38297 - [C#] Fix build for .NET 4.7.2 (#38299)
GH-38304 - [C++][Parquet] Fix Valgrind memory leak in arrow-dataset-file-parquet-encryption-test (#38306)
GH-38307 - [CI] Remove gemfury_clean.rb (#38308)
GH-38318 - [Java][FlightRPC] Enable tests that leaked (#38719)
GH-38323 - [CI][Python] Use system gdb on test-conda-python (#38324)
GH-38363 - [Release][CI] Omit tests for main/maintenance branches on RC branch (#38365)
GH-38366 - [Java] Fix Murmur hash on buffers less than 4 bytes (#38368)
GH-38378 - [C++][Parquet] Don’t initialize OpenSSL explicitly with OpenSSL 1.1 (#38379)
GH-38382 - [R] Explicitly clean up arrow_duck_connection() on exit (#38495)
GH-38387 - [Java] Fix JDK8 compilation issue with TestAllTypes (#38388)
GH-38395 - [Go] fix rounding errors in decimal256 string functions (#38426)
GH-38399 - [Go][Parquet] DeltaBitPack decoder reset usedFirst after SetData (#38413)
GH-38401 - [C++] Re-generate flatbuffers C++ for Skyhook (#38405)
GH-38436 - [R] Test segfault on reading CSVs with non-UTF-8 encoding
GH-38439 - [Java][CI] Use Eclipse Temurin for all Java CI linux jobs (#38440)
GH-38447 - [CI][Release] Don’t use “ {exit,continue}” (#38486)
GH-38458 - [Go] Add ValueLen to BinaryLike interface (#39242)
GH-38470 - [CI][Integration] Install jpype and build JNI c-data to run integration tests (#39502)
GH-38477 - [Go] Fixing decimal 128 rounding issue (#38478)
GH-38479 - [C++] Avoid passing null pointer to LZ4 frame decompressor (#39125)
GH-38503 - [Go][Parquet] Make the arrow column writer internal (#38727)
GH-38503 - [Go][Parquet] Style improvement for using ArrowColumnWriter (#38581)
GH-38516 - [Go][Parquet] Increment the number of rows written when appending a new row group (#38517)
GH-38535 - [Python] Fix S3FileSystem equals None segfault (#39276)
GH-38554 - [Release][Website] post-03-website.sh doesn’t quote current.date (#38555)
GH-38556 - [C++] Add missing explicit size_t cast for i386 (#38557)
GH-38594 - [Docs][C++][Gandiva] Document how to register Gandiva external functions (#38763)
GH-38599 - [Docs] Update Headers (#38696)
GH-38614 - [Java] Add VarBinary and VarCharWriter helper methods to more writers (#38631)
GH-38624 - [C++] Fix: add TestingEqualOptions for gtest functions. (#38642)
GH-38630 - [MATLAB] arrow.array.BooleanArray’s toMATLAB method does not take slice offsets into account (#38636)
GH-38653 - [Packaging][Java][Python][Ruby] Raise the minimum macOS version to 10.15 catalina to allow using new APIs in C++17 (#38677)
GH-38683 - [Python][Docs] Update docstrings for Time32Type and Time64Type (#39059)
GH-38684 - [Integration] Try to strengthen C Data Interface testing (#38846)
GH-38697 - [C++][Gandiva] Use arrow io util to replace std::filesystem::path in gandiva (#38698)
GH-38709 - [C++] Protect against PREALLOCATE preprocessor defined on macOS (#38760)
GH-38711 - [CI] Rollback aws-cli for preview documentation (#38723)
GH-38725 - [Java] decompression in Lz4CompressionCodec.java does not set writer index (#38840)
GH-38728 - [Go] ipc: put lz4 decompression buffers back into sync.Pool (#38729)
GH-38737 - [Java] Fix JDBC caching of SqlInfo values (#38739)
GH-38738 - [C++] Check variadic buffer counts in bounds (#38740)
GH-38745 - [Integration] Fix huge integration test (#38746)
GH-38762 - [R] Versions of R and RTools in CI config are no longer current
GH-38764 - [Java] Clarify warning about --add-opens=java.base/java.nio=ALL-UNNAMED (#38765)
GH-38782 - [C++][FS][Azure] Do nothing for CreateDir("/container", true) (#38783)
GH-38795 - [Go] Fix race GetToTimeFunc for Timestamp (#38797)
GH-38811 - [R] Actually use fetched cmake on macos (#39453)
GH-38816 - [C#] Fix IArrowRecord implementation on StructArray (#38827)
GH-38823 - Fix TestArrowReaderAdHoc.ReadFloat16Files to use new uncompressed files (#38825)
GH-38832 - [Java] Avoid building twice in ci/scripts/java_build.sh (#38829)
GH-38844 - [C++] S3FileSystem export s3 sdk config “use_virtual_addressing” to arrow::fs::S3Options (#38858)
GH-38851 - Website: Contributing link is not working
GH-38879 - [C++][Gandiva] Fix Gandiva to_date function’s validation for suppress errors parameter (#38987)
GH-38883 - [Docs] Fix struct example to show hiding a child’s entry (#38898)
GH-38906 - [R] Improve Windows CI configuration (#38927)
GH-38921 - [CI] Fix spelling (#38922)
GH-38925 - [CI] Fix spelling (#38926)
GH-38928 - [R] Fix spelling (#38929)
GH-38930 - [Java] Fix spelling (#38931)
GH-38932 - [GO] Fix spelling (#38933)
GH-38938 - [FlightRPC] Fix spelling (#38939)
GH-38940 - [Ruby] Fix spelling (#38941)
GH-38942 - [C#] Fix spelling (#38943)
GH-38944 - [Python] Fix spelling (#38945)
GH-38946 - [MATLAB] Fix spelling (#38947)
GH-38948 - [Swift] Fix spelling (#38949)
GH-38950 - [Docs] Fix spelling (#38951)
GH-38952 - [Format] Fix spelling (#38953)
GH-38954 - [Dev] Fix spelling (#38955)
GH-38956 - [Gandiva] Fix spelling (#38957)
GH-38960 - [C++] Fix spelling (acero) (#38961)
GH-38964 - [C++] Fix spelling (compute) (#38965)
GH-38966 - [C++] Fix spelling (util) (#38967)
GH-38968 - [C++] Fix spelling (dataset) (#38969)
GH-38971 - [C++] Fix spelling (filesystem) (#38972)
GH-38975 - [Dev] Fix spelling (#38976)
GH-38977 - [C++] Fix spelling (#38978)
GH-38979 - [C++] Fix spelling (#38980)
GH-38981 - [R][Release] Don’t update version.json on compatible version release (#38982)
GH-39014 - [Java] Add default truststore along with KeychainStore when on Mac system (#39235)
GH-39031 - [Docs] Remove misspelled rule from contrib css (#39032)
GH-39045 - [C++][Acero] union node output batches should be unordered (#39046)
GH-39113 - [Integration][Flight][Java] Fix occasional failure starting Java server (#39115)
GH-39116 - [Go] Fix CI Staticcheck (#39117)
GH-39126 - [C++][CI] Fix Valgrind failures (#39127)
GH-39130 - [CI][GLib][Windows] Use old Ruby as workaround for load error (#39168)
GH-39136 - [C++] Remove needless system Protobuf dependency with -DARROW_HDFS=ON (#39137)
GH-39138 - [R] Fix implicit conversion warnings (#39250)
GH-39156 - [C++][Compute] Fix negative duration division (#39158)
GH-39163 - [C++] Add missing data copy in StreamDecoder::Consume(data) (#39164)
GH-39185 - [C++] Remove compiler warnings with -Wconversion -Wno-sign-conversion in public headers (#39186)
GH-39191 - [R] throw error when string_replace is passed vector of values in pattern (#39219)
GH-39238 - [Go] PATCH Prevents empty record to be appended to empty resultset (#39239)
GH-39288 - [Java][FlightSQL] Update Apache Avatica to version 1.24.0 (#39325)
GH-39306 - [C++][Benchmarking] Remove hardcoded min times (#39307)
GH-39327 - [Java] define assemble descriptor for new custom maven plugin project (#39331)
GH-39333 - [C++] Don’t use “if constexpr” in lambda (#39334)
GH-39359 - [CI][C++] Remove MinGW MINGW32 C++ job (#39376)
GH-39384 - [C++] Disable -Werror=attributes for Azure SDK’s identity.hpp (#39448)
GH-39387 - [C++] Fix compile warning (#39389)
GH-39421 - [CI][Ruby] Update to using Ubuntu 22.04 on test-ruby and test-c-glib nightly jobs (#39422)
GH-39423 - [CI][JS] TypeScript: Compilation failed on yarn build for several CI jobs
GH-39425 - [CI] Fix import to match new substrait repo structure (#39426)
GH-39433 - [Ruby] Add support for Table.load(format: json) options (#39464)
GH-39437 - [CI][Python] Update pandas tests failing on pandas nightly CI build (#39498)
GH-39468 - [Java] Fix site build for docs (#39471)
GH-39469 - [CI][JS] Force node 20 on JS build on arm64 to fix build issues (#39499)
GH-39488 - [Ruby] Add support for ChunkedArray in Ractor (#39490)
GH-39517 - [C++] Disable parallelism for jemalloc external project (#39522)
GH-39562 - [C++][Parquet] Fix crash in test_parquet_dataset_lazy_filtering (#39632)
GH-39564 - [CI][Java] Set correct version on Java BOM (#39580)
GH-39584 - [R] fallback to source gracefully (#39587)
GH-39588 - [CI][Go] Add CGO_ENABLED=1 to cdata_integration build to fix macOS build with conda (#39589)
GH-39598 - [C#] Fix verification script (#39605)
GH-39604 - [JS] Do not use resizable buffers yet (#39607)
GH-39628 - [C++] Disable parallelism for all `make`-based externalProjects when CMake >= 3.28 is used
New Features and Improvements
GH-14936 - [Java] Remove netty dependency from arrow-vector (#38493)
GH-28994 - [C++][JSON] Change the max rows to Unlimited(int_32) (#38582)
GH-30117 - [C++][Python] Add “Z” to the end of timestamp print string when tz defined (#39272)
GH-30717 - [C#] Add ToString() methods to Arrow classes (#36566)
GH-31303 - [Python] Remove the legacy ParquetDataset custom python-based implementation (#39112)
GH-31579 - [C#] : Remove out-of-support versions of .NET and update C# README (#39165)
GH-33500 - [Python] add Table.to/from_struct_array (#38520)
GH-33984 - [C++][Python] DLPack implementation for Arrow Arrays (producer) (#38472)
GH-34316 - [Python] FixedSizeListArray.from_arrays supports mask parameter (#39396)
GH-34569 - [C++] Diffing of Run-End Encoded arrays (#35003)
GH-34636 - [C#] Reduce allocations when using ArrayPool (#39166)
GH-35260 - [C++][Python][R] Allow users to adjust S3 log level by environment variable (#38267)
GH-35331 - [Python] Expose Parquet sorting metadata (#37665)
GH-35344 - [C++][Format] Implementation of the LIST_VIEW and LARGE_LIST_VIEW array formats (#35345)
GH-35560 - [C++] Use Cast() instead of CastTo() for Scalar in test (#39044)
GH-36036 - [C++][Python][Parquet] Implement Float16 logical type (#36073)
GH-36044 - [Python][Docs] Added ParquetFileFragment to the API reference docs (#38277)
GH-36099 - [C++] Add Utf8View and BinaryView to the c ABI (#38443)
GH-36441 - [Python] Make CacheOptions configurable from Python (#36627)
GH-36760 - [Go] Add Avro OCF reader (#37115)
GH-36815 - [C#] : Enable net472 tests under Windows (#36818)
GH-36898 - [CI] Hashpin Sensitive GitHub Actions (#37676)
GH-37002 - [C++][Parquet] Add api to get RecordReader from RowGroupReader (#37003)
GH-37061 - [Docs][Format] Clarify semantics of GetSchema in FSQL (#38549)
GH-37199 - [C++] Expose a span converter for Buffer and ArraySpan (#38027)
GH-37242 - [Python][Parquet] Parquet Support write and validate Page CRC (#38360)
GH-37312 - [Python][Docs] Update Python docstrings to reflect new parquet encoding option (#38070)
GH-37359 - [C#] Add ToList() to Decimal128Array and Decimal256Array (#37383)
GH-37378 - [C++] Add A Dictionary Compaction Function For DictionaryArray (#37418)
GH-37429 - [C++] Add arrow::ipc::StreamDecoder::Reset() (#37970)
GH-37511 - [C++] Implement file reads for Azure filesystem (#38269)
GH-37582 - [Go][Parquet] Implement Float16 logical type (#37599)
GH-37592 - [MATLAB] Add NumRows property to arrow.tabular.RecordBatch (#38215)
GH-37710 - [C++][Integration] Add C++ Utf8View implementation (#37792)
GH-37753 - [C++][Gandiva] Add external function registry support (#38116)
GH-37812 - [MATLAB] Add arrow.type.ListType MATLAB class (#38189)
GH-37815 - [MATLAB] Add arrow.array.ListArray MATLAB class (#38357)
GH-37848 - [C++][Gandiva] Migrate LLVM JIT engine from MCJIT to ORC v2/LLJIT (#39098)
GH-37857 - [Python][Dataset] Expose file size to python dataset (#37868)
GH-37889 - [Java][Doc] Improve JDBC driver documentation (#38469)
GH-37895 - [C++] Feature: support concatenate recordbatches. (#37896)
GH-37910 - [Java][Integration] Implement C Data Interface integration testing (#38248)
GH-37943 - [Java] Add parquet file with all supported types (#38249)
GH-37979 - [C++] Add support for specifying custom Array opening and closing delimiters to arrow::PrettyPrintDelimiters (#38187)
GH-38022 - [Java][FlightRPC] Expose app_metadata on FlightInfo and FlightEndpoint (#38331)
GH-38024 - [Java][FlightRPC] Expose appMetadata through JDBC ResultSet (#38781)
GH-38033 - [R] Allow code() to return package name prefix. (#38144)
GH-38042 - [C++][Benchmark] Add non-stream Codec Compression/Decompression (#38067)
GH-38117 - [C++][Parquet] Change DictEncoder dtor checking to warning log (#38118)
GH-38131 - [Swift][CI] Add linting and fix linting errors (#38133)
GH-38153 - [C#] expose ArrayDataConcatenator.Concatenate (#38154)
GH-38164 - [MATLAB] Rename Length property on arrow.array.Array and arrow.array.ChunkedArray to NumElements (#38190)
GH-38166 - [MATLAB] Improve tabular object display (#38482)
GH-38246 - [JAVA] added new getTransferPair() function that takes in a Field type for Complex Type Vectors (#38261)
GH-38264 - [Java][Packaging] Add BOM file (#38336)
GH-38271 - [C++][Parquet] Support reading parquet files with multiple gzip members (#38272)
GH-38300 - [Dev][Docs] Update dev/README.md for the current merge_arrow_pr.py (#38301)
GH-38310 - [MATLAB] Create the testing guideline document for testing MATLAB interface (#38459)
GH-38316 - [C#] Implement interval types (#39043)
GH-38326 - [C++][Parquet] check the decompressed page size same as size in page header (#38327)
GH-38330 - [C++][Azure] Use properties for input stream metadata (#38524)
GH-38333 - [C++][FS][Azure] Implement file writes (#38780)
GH-38335 - [C++] Implement GetFileInfo for a single file in Azure filesystem (#38505)
GH-38339 - [C++][CMake] Use transitive dependency for system GoogleTest (#38340)
GH-38341 - [Python] Remove usage of pandas internals DatetimeTZBlock (#38321)
GH-38346 - [C++][Parquet] Use new encrypted files for page index encryption test (#38347)
GH-38348 - [C#] Make PrimitiveArray support IReadOnlyList<T?> (#38680)
GH-38351 - [C#] Add SqlDecimal support to Decimal128Array (#38481)
GH-38354 - [MATLAB] Implement fromMATLAB method for arrow.array.ListArray (#38561)
GH-38361 - Add validation logic for offsets and values to arrow.array.ListArray.fromArrays (#38531)
GH-38376 - [R] : Add dimnames method to Dataset class (#38377)
GH-38381 - [C++][Acero] Create a sorted merge node (#38380)
GH-38398 - [MATLAB] Improve array display (#38400)
GH-38402 - [CI][Integration] Provide wrapper scripts for integration testing (#38403)
GH-38415 - [MATLAB] Add indexing “slice” method to C++ Array Proxy class (#38674)
GH-38417 - [MATLAB] Implement a TableTypeValidator class that validates a MATLAB cell array contains only tables that share the same schema (#38551)
GH-38418 - [MATLAB] Add method for extracting one row of an arrow.tabular.Table as a string (#38463)
GH-38419 - [MATLAB] Implement a ClassTypeValidator class that validates a MATLAB cell array contains only values of the same class type. (#38530)
GH-38420 - [MATLAB] Implement a DatetimeValidator class that validates a MATLAB cell array contains only values of zoned or unzoned datetimes (#38533)
GH-38424 - [CI][C++] Use Fedora 38 instead of 35 (#38425)
GH-38452 - [C++][Benchmark] Adding benchmark for LZ4/Snappy Compression (#38453)
GH-38457 - [C++] Support LogicalNullCount for DictionaryArray (#38681)
GH-38460 - [Java][FlightRPC] Add mTLS support for Flight SQL JDBC driver (#38461)
GH-38462 - [Go][Parquet] Handle Boolean RLE encoding/decoding (#38367)
GH-38483 - [C#] Add support for more decimal conversions (#38508)
GH-38506 - [Go][Parquet] Add NumRows and RowGroupNumRows to pqarrow.FileWriter (#38507)
GH-38511 - [Java] Add getTransferPair(Field, BufferAllocator, CallBack) for StructVector and MapVector (#38512)
GH-38528 - [Python][Compute] Describe strptime format semantics (#38665)
GH-38537 - [Java] upgrade to netty 4.1.100.Final (#38538)
GH-38541 - [MATLAB] Add remaining tests for arrow tabular object display (#38564)
GH-38542 - [C++][Parquet] Faster scalar BYTE_STREAM_SPLIT (#38529)
GH-38562 - [Packaging] Add support for Ubuntu 23.10 (#38563)
GH-38576 - [Java] Change JDBC driver to optionally preserve cookies and auth tokens when getting streams (#38580)
GH-38578 - [Java][FlightSQL] Remove joda usage from flight-sql library (#38579)
GH-38589 - [C++][Gandiva] Support registering external C functions (#38632)
GH-38597 - [C++] Implement GetFileInfo(selector) for Azure filesystem (#39009)
GH-38602 - [R] Add missing prod for summarize (#38601)
GH-38627 - [Java][FlightRPC] Handle null parameter values (#38628)
GH-38648 - [Java] Regenerate Flatbuffers (#38650)
GH-38652 - [MATLAB] Add tests about time precision preservation when converting MATLAB duration to arrow.array.Time32Array and arrow.array.Time64Array (#38661)
GH-38662 - [Java] Add comparators (#38669)
GH-38699 - [C++][FS][Azure] Implement CreateDir() (#38708)
GH-38700 - [C++][FS][Azure] Implement DeleteDir() (#38793)
GH-38701 - [C++][FS][Azure] Implement DeleteDirContents() (#38888)
GH-38702 - [C++] : Implement AzureFileSystem::DeleteRootDirContents (#39151)
GH-38705 - [C++][FS][Azure] Implement CopyFile() (#39058)
GH-38712 - [Python] Remove dead code in _reconstruct_block (#38714)
GH-38718 - [Go][Format][Integration] Add StringView/BinaryView to Go implementation (#35769)
GH-38732 - [Java][FlightRPC] Add support for Array parameter binding in JDBC (#38733)
GH-38751 - [C++][Go][Parquet] Add tests for reading Float16 files in parquet-testing (#38753)
GH-38757 - [C#] Implement common interfaces for structure arrays and record batches (#38759)
GH-38758 - [C++][FS][Azure] Rename AzurePath to AzureLocation (#38773)
GH-38772 - [C++] Implement directory semantics even when the storage account doesn’t support HNS (#39361)
GH-38798 - [Integration] Enable C Data Interface integration testing on Rust (#38799)
GH-38814 - [C++][Parquet] Update parquet.thrift to sync with 2.10.0 (#38815)
GH-38824 - [Go] Enable GC checks (#38826)
GH-38836 - [Go] Add Size() for ArrayData (#38839)
GH-38852 - [C++] Replace “#ifdef ARROW_WITH_GZIP” in dataset test to ARROW_WITH_ZLIB (#38853)
GH-38857 - [Python] Fix append mode for cython 2 (#39027)
GH-38857 - [Python] Add append mode for pyarrow.OsFile (#38820)
GH-38860 - [C++][Parquet] Using length to optimize bloom filter read (#38863)
GH-38870 - [Documentation] Add List View and Large List View to status.rst (#38871)
GH-38874 - [C++][Parquet] Minor: making parquet TypedComparator operation as const method (#38875)
GH-38884 - [C++] DatasetWriter release rows_in_flight_throttle when allocate writing failed (#38885)
GH-38887 - [C++][Parquet] Move EstimatedBufferedValueBytes from TypedColumnWriter to ColumnWriter (#39055)
GH-38907 - [C++] Stop installing internal bpacking_simd* headers (#38908)
GH-38909 - [Packaging] Drop support for Ubuntu 23.04 (#38910)
GH-38918 - [Go] Avoid schema.Fields allocations in some places (#38919)
GH-38920 - [C++][Gandiva] Refactor function holder to return arrow Result (#38873)
GH-38958 - [C++][Parquet] Fix spelling (#38959)
GH-38990 - [Java] Upgrade to flatc version 23.5.26 (#38991)
GH-38996 - [Java] Update dependencies and plugins for JPMS modules (#38994)
GH-39006 - [Python] Extract libparquet requirements out of libarrow_python.so to new libarrow_python_parquet_encryption.so (#39316)
GH-39013 - [Go][Integration] Support cABI import/export of StringView (#39019)
GH-39020 - [CI][Release][JS] Use Node.js 18 instead of 16 (#39021)
GH-39028 - [Python][CI] Fix dask integration build by temporarily skipping test_categorize_info (#39029)
GH-39037 - [Java] Remove (Contrib/Experimental) mention in Flight SQL (#39040)
GH-39049 - [C++] Use Cast() instead of CastTo() for Dictionary Scalar in test (#39362)
GH-39050 - [C++] Use Cast() instead of CastTo() for Timestamp Scalar in test (#39060)
GH-39051 - [C++] Use Cast() instead of CastTo() for List Scalar in test (#39353)
GH-39064 - [C++][Parquet] Support row group filtering for nested paths for struct fields (#39065)
GH-39088 - [Dev][Java] Add Dependabot configuration for Java (#39089)
GH-39096 - [Python] Release GIL in .nbytes (#39097)
GH-39119 - [C++] Refactor the Azure FS tests and filesystem class instantiation (#39207)
GH-39122 - [C++][Parquet] Optimize FLBA record reader (#39124)
GH-39134 - Create module info compiler plugin (#39135)
GH-39159 - [C++] : Try to make Buffer::device_type_ non-optional (#39150)
GH-39170 - [Java] Improve error message explaining why TestTls might fail (#39171)
GH-39189 - [Java] Bump com.h2database:h2 from 1.4.196 to 2.2.224 in /java (#39188)
GH-39196 - [Python][Docs] Document the Arrow PyCapsule protocol in the ‘extending pyarrow’ section of the Python docs (#39199)
GH-39208 - [C++][Parquet] Remove deprecated AppendRowGroup(int64_t num_rows) (#39209)
GH-39210 - [C++][Parquet] Avoid WriteRecordBatch from produce zero-sized RowGroup (#39211)
GH-39217 - [Python] RecordBatchReader.from_stream constructor for objects implementing the Arrow PyCapsule protocol (#39218)
GH-39223 - [C#] Support IReadOnlyList<T?> on remaining scalar types (#39224)
GH-39225 - [GLib] Use Cast() instead of CastTo (#39228)
GH-39232 - [C++] Support binary to fixed_size_binary cast (#39236)
GH-39243 - [R][CI] Remove r-arrow conda nightlies (#39244)
GH-39246 - [CI][GLib][Ruby] Use Ubuntu 22.04 not 20.04 (#39247)
GH-39262 - [C++][Azure][FS] Add default credential auth configuration (#39263)
GH-39265 - [Java] Make it run well with the netty newest version 4.1.104 (#39266)
GH-39268 - [C++] Don’t install bundled Azure SDK for C++ with CMake 3.28+ (#39269)
GH-39292 - [C++][FS] : Remove the AzureBackend enum and add more flexible connection options (#39293)
GH-39297 - [C++][FS] : Inform caller of container not-existing when checking for HNS support (#39298)
GH-39299 - [Java] Upgrade to Avro 1.11.3 (#39300)
GH-39303 - [Archery][Benchmarking] Allow setting C++ repetition min time (#39324)
GH-39318 - [C++][FS][Azure] Add workload identity auth configuration (#39319)
GH-39320 - [C++][FS][Azure] Add managed identity auth configuration (#39321)
GH-39322 - [C++] Forward arguments to ExceptionToStatus all the way to Status::FromArgs (#39323)
GH-39326 - [C++] Flaky DatasetWriterTestFixture.MaxRowsOneWriteBackpresure test (#39379)
GH-39328 - [Java] Make default getConsumer public (#39329)
GH-39335 - [C#] Support creating FlightClient with Grpc.Core.Channel (#39348)
GH-39339 - [C++] Add ForceCachedHierarchicalNamespaceSupport to help with testing (#39340)
GH-39341 - [C#] Support Utf8View, BinaryView and ListView (#39342)
GH-39343 - [C++][FS][Azure] Add client secret auth configuration (#39346)
GH-39355 - [Java] Improve JdbcConsumer exceptions (#39356)
GH-39357 - [C++] Reduce function.h includes (#39312)
GH-39363 - [C++] Use Cast() instead of CastTo() for Parquet (#39364)
GH-39413 - [C++][Parquet] Vectorize decode plain on FLBA (#39414)
GH-39419 - [C++][Parquet] Style: Using arrow::Buffer data_as api rather than reinterpret_cast (#39420)
GH-39430 - [C++][ORC] Upgrade ORC to 1.9.2 (#39431)
GH-39449 - [C++] Use default Azure credentials implicitly and support anonymous credentials explicitly (#39450)
GH-39484 - [Java] Support 256 bit decimals in JdbcToArrowUtils (#39485)
GH-39500 - [Docs] Pin pydata-sphinx-theme to 0.14 (#39501)
GH-39515 - [Python] Pass in type to MapType.from_arrays (#39516)
GH-39531 - [Python][CI] Skip failing dask tests: test_describe_empty and test_view (#39534)
GH-39533 - [Python] NumPy 2.0 compat: remove usage of np.core (#39535)
GH-39537 - [Packaging][Python] Add a numpy<2 pin to the install requirements for the 15.x release branch (#39538)
GH-39601 - [R] Don’t download cmake when TEST_OFFLINE_BUILD=true (#39602)
GH-39624 - [R][CI] Add CMake to docker file and update envvars (#39625)
GH-39626 - [Docs][R] Update NEWS.md for 15.0.0
PARQUET-2411 - [C++][Parquet] Allow reading dictionary without reading data via ByteArrayDictionaryRecordReader (#39153)
To see a diff of this commit:
https://wip.pkgsrc.org/cgi-bin/gitweb.cgi?p=pkgsrc-wip.git;a=commitdiff;h=bdc4ce78c8a179a9ed1a2929fa0accfd6897b2d2
Please note that diffs are not public domain; they are subject to the
copyright notices on the relevant files.
diffstat:
 apache-arrow/PLIST      | 34 ++++++++++++++++++++--------------
 apache-arrow/distinfo   |  6 +++---
 apache-arrow/version.mk |  2 +-
 3 files changed, 24 insertions(+), 18 deletions(-)
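Most of the PLIST churn in the diff below is the shared-library version bump. The two versions in this commit (14.0.1 giving libarrow.so.1400 and libarrow.so.1400.1.0, 15.0.0 giving libarrow.so.1500 and libarrow.so.1500.0.0) suggest a simple mapping from the release version to the library suffixes. The following Python sketch reproduces that mapping. The rule is inferred from these two data points only, and the function names arrow_so_versions and plist_entries are hypothetical helpers, not part of pkgsrc or Arrow.

```python
def arrow_so_versions(version: str) -> tuple[str, str]:
    """Map 'MAJOR.MINOR.PATCH' to (so_version, full_so_version).

    Assumed rule, inferred from the PLIST diff in this commit:
    so_version is MAJOR * 100 + MINOR, and the full version
    appends PATCH and a trailing 0.
    """
    major, minor, patch = (int(part) for part in version.split("."))
    so_version = str(major * 100 + minor)
    full_so_version = f"{so_version}.{patch}.0"
    return so_version, full_so_version


def plist_entries(library: str, version: str) -> list[str]:
    """Build the three shared-object PLIST lines for one library."""
    so_version, full_so_version = arrow_so_versions(version)
    return [
        f"lib/{library}.so",
        f"lib/{library}.so.{so_version}",
        f"lib/{library}.so.{full_so_version}",
    ]


if __name__ == "__main__":
    for name in ("libarrow", "libarrow_acero", "libarrow_dataset", "libparquet"):
        print("\n".join(plist_entries(name, "15.0.0")))
```

A helper like this is only a convenience for eyeballing an update; the PLIST itself should still be regenerated from an actual build.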
diffs:
diff --git a/apache-arrow/PLIST b/apache-arrow/PLIST
index b1af66deaa..7484b9f584 100644
--- a/apache-arrow/PLIST
+++ b/apache-arrow/PLIST
@@ -8,6 +8,7 @@ include/arrow/acero/accumulation_queue.h
include/arrow/acero/aggregate_node.h
include/arrow/acero/api.h
include/arrow/acero/asof_join_node.h
+include/arrow/acero/backpressure_handler.h
include/arrow/acero/benchmark_util.h
include/arrow/acero/bloom_filter.h
include/arrow/acero/exec_plan.h
@@ -23,8 +24,10 @@ include/arrow/acero/query_context.h
include/arrow/acero/schema_util.h
include/arrow/acero/task_util.h
include/arrow/acero/test_nodes.h
+include/arrow/acero/time_series_util.h
include/arrow/acero/tpch_node.h
include/arrow/acero/type_fwd.h
+include/arrow/acero/unmaterialized_table.h
include/arrow/acero/util.h
include/arrow/acero/visibility.h
include/arrow/api.h
@@ -56,6 +59,8 @@ include/arrow/buffer_builder.h
include/arrow/builder.h
include/arrow/c/abi.h
include/arrow/c/bridge.h
+include/arrow/c/dlpack.h
+include/arrow/c/dlpack_abi.h
include/arrow/c/helpers.h
include/arrow/chunk_resolver.h
include/arrow/chunked_array.h
@@ -68,6 +73,7 @@ include/arrow/compute/cast.h
include/arrow/compute/exec.h
include/arrow/compute/expression.h
include/arrow/compute/function.h
+include/arrow/compute/function_options.h
include/arrow/compute/kernel.h
include/arrow/compute/key_hash.h
include/arrow/compute/key_map.h
@@ -204,6 +210,7 @@ include/arrow/util/async_util.h
include/arrow/util/base64.h
include/arrow/util/basic_decimal.h
include/arrow/util/benchmark_util.h
+include/arrow/util/binary_view_util.h
include/arrow/util/bit_block_counter.h
include/arrow/util/bit_run_reader.h
include/arrow/util/bit_stream_utils.h
@@ -222,11 +229,7 @@ include/arrow/util/bpacking_avx2.h
include/arrow/util/bpacking_avx512.h
include/arrow/util/bpacking_default.h
include/arrow/util/bpacking_neon.h
-include/arrow/util/bpacking_simd128_generated.h
-include/arrow/util/bpacking_simd256_generated.h
-include/arrow/util/bpacking_simd512_generated.h
include/arrow/util/byte_size.h
-include/arrow/util/byte_stream_split.h
include/arrow/util/cancel.h
include/arrow/util/checked_cast.h
include/arrow/util/compare.h
@@ -240,9 +243,11 @@ include/arrow/util/crc32.h
include/arrow/util/debug.h
include/arrow/util/decimal.h
include/arrow/util/delimiting.h
+include/arrow/util/dict_util.h
include/arrow/util/dispatch.h
include/arrow/util/double_conversion.h
include/arrow/util/endian.h
+include/arrow/util/float16.h
include/arrow/util/formatting.h
include/arrow/util/functional.h
include/arrow/util/future.h
@@ -254,6 +259,7 @@ include/arrow/util/io_util.h
include/arrow/util/iterator.h
include/arrow/util/key_value_metadata.h
include/arrow/util/launder.h
+include/arrow/util/list_util.h
include/arrow/util/logging.h
include/arrow/util/macros.h
include/arrow/util/map.h
@@ -415,21 +421,21 @@ lib/cmake/Parquet/ParquetTargets-release.cmake
lib/cmake/Parquet/ParquetTargets.cmake
lib/libarrow.a
lib/libarrow.so
-lib/libarrow.so.1400
-lib/libarrow.so.1400.1.0
+lib/libarrow.so.1500
+lib/libarrow.so.1500.0.0
lib/libarrow_acero.a
lib/libarrow_acero.so
-lib/libarrow_acero.so.1400
-lib/libarrow_acero.so.1400.1.0
+lib/libarrow_acero.so.1500
+lib/libarrow_acero.so.1500.0.0
lib/libarrow_bundled_dependencies.a
lib/libarrow_dataset.a
lib/libarrow_dataset.so
-lib/libarrow_dataset.so.1400
-lib/libarrow_dataset.so.1400.1.0
+lib/libarrow_dataset.so.1500
+lib/libarrow_dataset.so.1500.0.0
lib/libparquet.a
lib/libparquet.so
-lib/libparquet.so.1400
-lib/libparquet.so.1400.1.0
+lib/libparquet.so.1500
+lib/libparquet.so.1500.0.0
lib/pkgconfig/arrow-acero.pc
lib/pkgconfig/arrow-compute.pc
lib/pkgconfig/arrow-csv.pc
@@ -439,8 +445,8 @@ lib/pkgconfig/arrow-json.pc
lib/pkgconfig/arrow.pc
lib/pkgconfig/parquet.pc
share/arrow/gdb/gdb_arrow.py
-share/arrow/gdb/libarrow.so.1400.1.0-gdb.py
+share/arrow/gdb/libarrow.so.1500.0.0-gdb.py
share/doc/arrow/LICENSE.txt
share/doc/arrow/NOTICE.txt
share/doc/arrow/README.md
-@pkgdir share/gdb/auto-load/home/matthew/pkgsrc/install.20231028/lib
+@pkgdir share/gdb/auto-load/home/matthew/pkgsrc/install.20231221/lib
diff --git a/apache-arrow/distinfo b/apache-arrow/distinfo
index 3087fcb15e..a82ecb704b 100644
--- a/apache-arrow/distinfo
+++ b/apache-arrow/distinfo
@@ -3,9 +3,9 @@ $NetBSD$
BLAKE2s (9.0.1.tar.gz) = a785e1ad5fd5df76c95e7cf9a6eadeb86ffbc46ea4342f49f19381434bd0f78c
SHA512 (9.0.1.tar.gz) = ed56287f608ccdf5bc5d5fc2918e313e7c4cecdd9ef2c9993a72ea900d9ff662c57ac5326c7a809eb11505c6f39d4599f3f161b97b6e03c65783b824b8d700d2
Size (9.0.1.tar.gz) = 215065 bytes
-BLAKE2s (apache-arrow-14.0.1.tar.gz) = d841ca7299f8ea118e9b55826c34a353ee995936c6a88da001de149992274381
-SHA512 (apache-arrow-14.0.1.tar.gz) = 31d19f0ca80349f63db74bae813256b47907f85725a9bf01ef6f32406e79828ebb4701faedb52696b6a5b3bb89ad4e136485fd5eb35d396dd42147c11d4d2713
-Size (apache-arrow-14.0.1.tar.gz) = 20968461 bytes
+BLAKE2s (apache-arrow-15.0.0.tar.gz) = 04d54ce9da23d76b9cfc650e0c39af3b85340c9092368b08587c99c92b9c7eff
+SHA512 (apache-arrow-15.0.0.tar.gz) = d5dccaa0907b0e6f2a460e32ae75091942dcb70b51db4aefe2767ee8d99882694607b723a9c06898dda3938d8eb498258d7f9aad11054665b6ea9c2fbaeafa74
+Size (apache-arrow-15.0.0.tar.gz) = 21491996 bytes
BLAKE2s (jemalloc-5.3.0.tar.bz2) = 285e6145b9d3b575b1ec5cfdae8af40b461149085f001839d64685c0d56e2689
SHA512 (jemalloc-5.3.0.tar.bz2) = 22907bb052096e2caffb6e4e23548aecc5cc9283dce476896a2b1127eee64170e3562fa2e7db9571298814a7a2c7df6e8d1fbe152bd3f3b0c1abec22a2de34b1
Size (jemalloc-5.3.0.tar.bz2) = 736023 bytes
diff --git a/apache-arrow/version.mk b/apache-arrow/version.mk
index 28687b59be..0baf4349fd 100644
--- a/apache-arrow/version.mk
+++ b/apache-arrow/version.mk
@@ -1,2 +1,2 @@
# $NetBSD$
-APACHE_ARROW_VERSION= 14.0.1
+APACHE_ARROW_VERSION= 15.0.0
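The distinfo hunk above records three values per distfile: a BLAKE2s digest, a SHA512 digest and the size in bytes. As a rough illustration of what those lines assert, here is a minimal Python sketch that parses distinfo-style text and checks a local file against it. The helper names parse_distinfo and verify_distfile are hypothetical, and the sketch assumes the BLAKE2s value is the default 32-byte digest, which matches the 64 hex characters shown in the diff. It reads the whole file into memory, which is acceptable for a tarball of roughly 21 MB.

```python
import hashlib
import re
from pathlib import Path

# Matches lines such as "SHA512 (apache-arrow-15.0.0.tar.gz) = d5dc..."
DISTINFO_LINE = re.compile(r"^(\w+) \((.+)\) = (.+)$")


def parse_distinfo(text: str) -> dict[str, dict[str, str]]:
    """Return {filename: {kind: value}}; lines like "$NetBSD$" are skipped."""
    entries: dict[str, dict[str, str]] = {}
    for line in text.splitlines():
        match = DISTINFO_LINE.match(line.strip())
        if match:
            kind, filename, value = match.groups()
            entries.setdefault(filename, {})[kind] = value
    return entries


def verify_distfile(path: Path, expected: dict[str, str]) -> bool:
    """Check a local file against the recorded values for that file."""
    data = path.read_bytes()
    actual = {
        "BLAKE2s": hashlib.blake2s(data).hexdigest(),
        "SHA512": hashlib.sha512(data).hexdigest(),
        "Size": f"{len(data)} bytes",
    }
    # An unknown kind in `expected` has no computed value and fails the check.
    return all(actual.get(kind) == value for kind, value in expected.items())
```

Typical use would be verify_distfile(Path("apache-arrow-15.0.0.tar.gz"), parse_distinfo(text)["apache-arrow-15.0.0.tar.gz"]). In practice pkgsrc's own "make checksum" target performs this verification; the sketch only shows what is being compared.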