Skip to content

Text

src.dx.datatypes.text.generate_text_series(num_rows)

Generate a series of random long str values. (Requires faker to be installed)

Parameters:

Name Type Description Default
num_rows int

Number of rows to generate

required
Source code in src/dx/datatypes/text.py
def generate_text_series(num_rows: int) -> pd.Series:
    """
    Generate a series of random long `str` values. (Requires `faker` to be installed)

    Parameters
    ----------
    num_rows: int
        Number of rows to generate
    """
    if not FAKER_INSTALLED:
        logger.warning("faker is not installed, skipping text_column")
        return np.nan

    return pd.Series([fake.text() for _ in range(num_rows)])

src.dx.datatypes.text.generate_keyword_series(num_rows, num_letters=2)

Generate a series of random short str values.

Parameters:

Name Type Description Default
num_rows int

Number of rows to generate

required
num_letters int

Number of letters to use in each keyword

2
Source code in src/dx/datatypes/text.py
def generate_keyword_series(num_rows: int, num_letters: int = 2) -> pd.Series:
    """
    Generate a series of random short `str` values.

    Parameters
    ----------
    num_rows: int
        Number of rows to generate
    num_letters: int
        Number of letters to use in each keyword
    """
    return pd.Series(
        ["".join(random.sample(string.ascii_uppercase, num_letters)) for _ in range(num_rows)]
    )