`typeddfs.abs_dfs`

Defines a low-level DataFrame subclass. It overrides a lot of methods to auto-change the type back to cls.

Module Contents

class typeddfs.abs_dfs.AbsDf

classmethod _check(cls, df) → None: Should raise an typeddfs.df_errors.InvalidDfError or subclass for issues.

classmethod can_read(cls) → Set[typeddfs.file_formats.FileFormat]: Returns all formats that can be read using read_file. Some depend on the availability of optional packages. The lines format (.txt, .lines, etc.) is only included if this DataFrame can support only 1 column+index. See typeddfs.file_formats.FileFormat.can_read().

classmethod can_write(cls) → Set[typeddfs.file_formats.FileFormat]: Returns all formats that can be written to using write_file. Some depend on the availability of optional packages. The lines format (.txt, .lines, etc.) is only included if this DataFrame type can support only 1 column+index. See typeddfs.file_formats.FileFormat.can_write().

classmethod from_records(cls, *args, **kwargs) → __qualname__

classmethod read_file(cls, path: Union[pathlib.Path, str], *, file_hash: Optional[bool] = None, dir_hash: Optional[bool] = None, hex_hash: Optional[str] = None, attrs: Optional[bool] = None) → __qualname__

Reads from a file (or possibly URL), guessing the format from the filename extension. Delegates to the read_* functions of this class.

You can always write and then read back to get the same dataframe. .. code-block:

# df is any DataFrame from typeddfs
# path can use any suffix
df.write_file(path))
df.read_file(path)

Text files always allow encoding with .gz, .zip, .bz2, or .xz.

Supports:

.csv, .tsv, or .tab
.json
.xml
.feather
.parquet or .snappy
.h5 or .hdf
.xlsx, .xls, .odf, etc.
.toml
.properties
.ini
.fxf (fixed-width)
.flexwf (fixed-but-unspecified-width with an optional delimiter)
.txt, .lines, or .list

typeddfs.abs_dfs

Module Contents

`typeddfs.abs_dfs`