[logging] Add centralized logging - Bump-up cache loads to warnings #538
Conversation
Nice addition !
```diff
@@ -1 +1 @@
{"write_array2d": 0.07093274600629229, "read_unformated after write_array2d": 0.03530075500020757, "read_formatted_as_numpy after write_array2d": 0.10929270699853078, "read_batch_unformated after write_array2d": 0.03727920600795187, "read_batch_formatted_as_numpy after write_array2d": 0.018853643006877974, "read_col_unformated after write_array2d": 0.05644163000397384, "read_col_formatted_as_numpy after write_array2d": 0.011610292000113986, "write_nested_sequence": 1.6535991109994939, "read_unformated after write_nested_sequence": 0.3739209540071897, "read_formatted_as_numpy after write_nested_sequence": 0.40762836500653066, "read_batch_unformated after write_nested_sequence": 0.3337586460111197, "read_batch_formatted_as_numpy after write_nested_sequence": 0.054717567007173784, "read_col_unformated after write_nested_sequence": 0.3173944180016406, "read_col_formatted_as_numpy after write_nested_sequence": 0.004956340009812266, "write_flattened_sequence": 1.4975415869994322, "read_unformated after write_flattened_sequence": 0.26713552299770527, "read_formatted_as_numpy after write_flattened_sequence": 0.07673935199272819, "read_batch_unformated after write_flattened_sequence": 0.25450974798877724, "read_batch_formatted_as_numpy after write_flattened_sequence": 0.009374254994327202, "read_col_unformated after write_flattened_sequence": 0.25912448299641255, "read_col_formatted_as_numpy after write_flattened_sequence": 0.004277604995877482}
```
are these files supposed to be part of the PR ?
we don't care that much I guess but let me remove them indeed
src/nlp/arrow_dataset.py (Outdated)
```diff
 if os.path.exists(indices_cache_file_name) and load_from_cache_file:
     if verbose:
-        logger.info("Loading cached shuffled indices for dataset at %s", indices_cache_file_name)
+        logger.warn("Loading cached shuffled indices for dataset at %s", indices_cache_file_name)
```
use logger.warning instead ? iirc warn is deprecated
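The reviewer is right: in the standard library, `Logger.warn` is a deprecated alias of `Logger.warning` that emits a `DeprecationWarning` when called. A quick stdlib-only check (the logger name and message here are illustrative, not from the PR):

```python
import logging
import warnings

logger = logging.getLogger("demo")
logger.addHandler(logging.NullHandler())  # avoid "no handler" noise on stderr

# Logger.warn is kept only for backwards compatibility; calling it
# emits a DeprecationWarning before delegating to Logger.warning.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    logger.warn("old spelling")

deprecated = any(issubclass(w.category, DeprecationWarning) for w in caught)

# Logger.warning is the supported method, with the same %-style signature
# used throughout this PR.
logger.warning("Loading cached shuffled indices for dataset at %s", "/path/to/cache")
```

Since the two methods share a signature, the fix is a pure rename.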
src/nlp/arrow_dataset.py
Outdated
| train_indices_cache_file_name, | ||
| test_indices_cache_file_name, | ||
| ) | ||
| logger.warn( |
same here
src/nlp/arrow_dataset.py (Outdated)
```diff
 if os.path.exists(indices_cache_file_name) and load_from_cache_file:
     if verbose:
-        logger.info("Loading cached sorted indices for dataset at %s", indices_cache_file_name)
+        logger.warn("Loading cached sorted indices for dataset at %s", indices_cache_file_name)
```
same here
src/nlp/arrow_dataset.py (Outdated)
```diff
 if os.path.exists(cache_file_name) and load_from_cache_file:
     if verbose:
-        logger.info("Loading cached processed dataset at %s", cache_file_name)
+        logger.warn("Loading cached processed dataset at %s", cache_file_name)
```
same here
src/nlp/utils/logging.py (Outdated)
```diff
+def enable_propagation() -> None:
+    """Enable propagation of the library log outputs.
+    Please disable the HuggingFace Transformers's default handler to prevent double logging if the root logger has
```
what would be the issue with transformers exactly ?
copy-paste error
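The docstring's warning about double logging is a general `logging` behavior, independent of which library's name ends up in the text. A stdlib-only sketch (the logger name `mylib` and the list-collecting handler are illustrative, not the actual `nlp` code): if the library logger has its own handler and propagation to the root logger is enabled, each record is emitted twice.

```python
import logging

records = []

class ListHandler(logging.Handler):
    """Collect emitted messages in a list so the duplication is visible."""
    def emit(self, record):
        records.append(record.getMessage())

# Root logger with a handler, as an application typically configures.
root = logging.getLogger()
root.addHandler(ListHandler())

# Library logger with its own default handler.
lib_logger = logging.getLogger("mylib")
lib_logger.addHandler(ListHandler())
lib_logger.setLevel(logging.INFO)

# Roughly what an enable_propagation() helper would flip on.
lib_logger.propagate = True
lib_logger.info("hello")

# The record hits the library handler, then propagates to the root handler,
# so it is logged twice unless the library's default handler is removed.
print(records.count("hello"))  # -> 2
```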
Add a `nlp.logging` module to set the global logging level easily. The verbosity level also controls the tqdm bars (disabled when the level is set higher than INFO). You can use:

And use the levels:
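A minimal sketch of what such a centralized verbosity helper can look like, built only on the standard library `logging` module; the function names (`set_verbosity`, `get_verbosity`) and the `"nlp"` logger name are assumptions based on the description above, so see the PR diff for the real API:

```python
import logging

# Hypothetical sketch of a centralized logging module for a library
# whose loggers all live under the "nlp" namespace.
_LIB_ROOT = "nlp"

def _get_library_root_logger() -> logging.Logger:
    # All loggers created via logging.getLogger("nlp.<submodule>")
    # inherit their effective level from this root.
    return logging.getLogger(_LIB_ROOT)

def set_verbosity(level: int) -> None:
    """Set the verbosity level for the whole library at once."""
    _get_library_root_logger().setLevel(level)

def get_verbosity() -> int:
    return _get_library_root_logger().getEffectiveLevel()

# Usage: keep only warnings and above (the level this PR bumps
# cache-load messages to, so they stay visible by default).
set_verbosity(logging.WARNING)
```

Because child loggers inherit the root's level, a single `setLevel` call controls every module in the library.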