
python3-adlfs-2023.12.0-1.lbn36.noarch

Package Attributes
RPM python3-adlfs-2023.12.0-1.lbn36.noarch.rpm
Architecture noarch
Size 354330
Created 2024/01/21 13:50:02 UTC
Package Specification
Summary Access Azure Datalake Gen1 with fsspec and dask
Group Unspecified
License ZPL
Home Page https://pypi.org/project/adlfs
Description

Filesystem interface to Azure-Datalake Gen1 and Gen2 Storage

Quickstart

This package can be installed using:

    pip install adlfs

or

    conda install -c conda-forge adlfs

The adl:// and abfs:// protocols are included in fsspec's known_implementations registry in fsspec > 0.6.1; otherwise, users must explicitly inform fsspec about the supported adlfs protocols.

To use the Gen1 filesystem:

    import dask.dataframe as dd

    storage_options = {'tenant_id': TENANT_ID, 'client_id': CLIENT_ID, 'client_secret': CLIENT_SECRET}

    dd.read_csv(f'adl://{STORE_NAME}/{FOLDER}/*.csv', storage_options=storage_options)
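
The registry note above only matters for old fsspec releases. As a rough sketch of what "explicitly informing fsspec" might look like, the snippet below adds any missing adlfs protocols to fsspec.registry.known_implementations. The helper names (ADLFS_PROTOCOLS, missing_protocols, register_adlfs_protocols) are hypothetical and not part of adlfs or fsspec, and mutating the registry dict directly is an assumption about how old releases were handled; recent fsspec versions already ship these entries.

```python
# Hypothetical helper, not part of adlfs or fsspec.
# Protocol -> import path of the adlfs filesystem class.
ADLFS_PROTOCOLS = {
    "adl": "adlfs.AzureDatalakeFileSystem",   # Gen1
    "abfs": "adlfs.AzureBlobFileSystem",      # Gen2
    "az": "adlfs.AzureBlobFileSystem",        # Gen2 alias
}


def missing_protocols(known):
    """Return the adlfs protocols that are absent from a registry mapping."""
    return sorted(p for p in ADLFS_PROTOCOLS if p not in known)


def register_adlfs_protocols():
    """Add missing adlfs protocols to fsspec's registry.

    Returns the list of protocols that were added. Returns an empty
    list when fsspec is not installed or nothing was missing.
    """
    try:
        from fsspec.registry import known_implementations
    except ImportError:
        return []

    added = missing_protocols(known_implementations)
    for protocol in added:
        # Assumption: registry entries are dicts with "class" and "err" keys.
        known_implementations[protocol] = {
            "class": ADLFS_PROTOCOLS[protocol],
            "err": "Install adlfs to access Azure Datalake Gen1/Gen2 storage",
        }
    return added
```

Calling register_adlfs_protocols() once before the first dd.read_csv call should be enough; on a current fsspec it is expected to be a no-op.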

To use the Gen2 filesystem you can use the protocol abfs or az:

    import dask.dataframe as dd

    storage_options = {'account_name': ACCOUNT_NAME, 'account_key': ACCOUNT_KEY}

    ddf = dd.read_csv(f'abfs://{CONTAINER}/{FOLDER}/*.csv', storage_options=storage_options)
    ddf = dd.read_parquet(f'az://{CONTAINER}/folder.parquet', storage_options=storage_options)
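
The Gen2 calls above combine two pieces: a URL of the form protocol://container/path and a storage_options dict carrying the account credentials. The sketch below wraps both in small helpers. The function names (gen2_url, account_options, read_gen2_csv) are hypothetical and only illustrate the pattern; dask is imported lazily, so the URL helper can be used without it.

```python
# Hypothetical helpers, not part of adlfs; they only wrap the pattern above.
GEN2_PROTOCOLS = ("abfs", "az")


def gen2_url(container, path, protocol="abfs"):
    """Build a Gen2 URL such as 'abfs://container/folder/*.csv'."""
    if protocol not in GEN2_PROTOCOLS:
        raise ValueError(f"protocol must be one of {GEN2_PROTOCOLS}, got {protocol!r}")
    # Strip a leading slash so the result never contains 'container//path'.
    return f"{protocol}://{container}/{path.lstrip('/')}"


def account_options(account_name, account_key):
    """Build the storage_options dict for account-key authentication."""
    return {"account_name": account_name, "account_key": account_key}


def read_gen2_csv(container, folder, account_name, account_key, protocol="abfs"):
    """Lazily read every CSV file in a folder into a dask DataFrame."""
    import dask.dataframe as dd  # imported here so the helpers above work without dask

    return dd.read_csv(
        gen2_url(container, f"{folder}/*.csv", protocol),
        storage_options=account_options(account_name, account_key),
    )
```

Keeping URL construction separate from the dask call makes it easy to check the generated paths before any request is sent to the storage account.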

Accepted protocol / uri formats include: 'PROTOCOL://container

Requires
(python3.10dist(azure-datalake-store) < 0.1~~ with python3.10dist(azure-datalake-store) >= 0.0.46)  
(python3.10dist(azure-core) < 2~~ with python3.10dist(azure-core) >= 1.23.1)  
rpmlib(PayloadFilesHavePrefix)  
rpmlib(FileDigests)  
rpmlib(CompressedFileNames)  
rpmlib(PayloadIsZstd)  
rpmlib(RichDependencies)  
rpmlib(PartialHardlinkSets)  
rpmlib(TildeInVersions)  
Provides
python-adlfs
python3-adlfs
python3.10-adlfs
python3.10dist(adlfs)
python3dist(adlfs)
Obsoletes
python-adlfs
