
Debug mode for ignite #1992

Open

sdesrozis opened this issue May 7, 2021 · 15 comments · May be fixed by #2851

Comments

@sdesrozis
Contributor

sdesrozis commented May 7, 2021

🚀 Feature

The idea is to provide a debug mode as discussed here: #1989 (comment).

All the possible options can be discussed here. We could add a helper to track the workflow of handlers from the engine run. On the other hand, some specific debug tools are needed to track the values of PyTorch objects (LR, loss, grads, etc.). Note that the preferred way to extend Ignite is through handler design, so let's create a new set of tools for debugging.
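
For illustration, a handler-based debug tool could look roughly like the sketch below (the class name and logger name are made up for the example, not an existing Ignite API):

import logging
from typing import Optional

from ignite.engine import Engine, Events

class DebugEventLogger:
    """Illustrative handler-style debug tool: logs each traced event the engine fires."""

    _TRACED_EVENTS = (
        Events.EPOCH_STARTED,
        Events.ITERATION_STARTED,
        Events.ITERATION_COMPLETED,
        Events.EPOCH_COMPLETED,
    )

    def __init__(self, logger: Optional[logging.Logger] = None):
        self.logger = logger or logging.getLogger("ignite.debug")

    def attach(self, engine: Engine) -> None:
        # Register one callback per traced event, using the usual handler mechanism.
        for event in self._TRACED_EVENTS:
            engine.add_event_handler(event, self._log_event, event)

    def _log_event(self, engine: Engine, event: Events) -> None:
        self.logger.debug(
            "epoch=%d iteration=%d fired %s",
            engine.state.epoch, engine.state.iteration, event,
        )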

@vfdev-5 vfdev-5 added this to To do in 0.6.0 via automation May 8, 2021
@vfdev-5
Collaborator

vfdev-5 commented May 8, 2021

We could also think of different levels of debugging:

  • LR, loss and grads will be essential for low-level training debugging.
  • Dataflow debugging (a sketch of such a handler is shown after this list):
    • printing input shapes, dtypes, devices, data stats
    • saving input data as tensors, or as images for vision tasks, etc.
  • usage of PyTorch built-in tools
  • usage of fake data for data I/O profiling
  • ...
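
For the dataflow item, a minimal handler sketch (it assumes the default (x, y) batch layout; the function name is illustrative):

from ignite.engine import Engine, Events

def log_batch_stats(engine: Engine) -> None:
    # Assumes engine.state.batch is an (x, y) pair of tensors; adapt for custom batches.
    x, y = engine.state.batch
    engine.logger.debug(
        "iter=%d x: shape=%s dtype=%s device=%s mean=%.4f std=%.4f | y: shape=%s",
        engine.state.iteration,
        tuple(x.shape), x.dtype, x.device,
        x.float().mean().item(), x.float().std().item(),
        tuple(y.shape),
    )

# trainer.add_event_handler(Events.ITERATION_STARTED, log_batch_stats)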

@Ishan-Kumar2
Contributor

Hi @sdesrozis and @vfdev-5, I think this is a great idea and a much-needed tool. I am interested in implementing it.

I agree with @vfdev-5's idea of having it in levels. We could pass the level in engine.run. So level 1 could be dataflow/workflow debugging, that is, running the engine and logging all the events and handlers.
Something like

ITERATION STARTED
   Input Shape - 
   Handler 1 called
   ....
ITERATION COMPLETED
   Handler 2 called
...

Doing this is straightforward; we just need to set the logger level in engine.run.
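
For example, with the engine's existing logger this could be enabled with standard logging calls (a sketch assuming an existing trainer engine and train_loader):

import logging

from ignite.utils import setup_logger

# Replace the engine's logger with a DEBUG-level one (or call
# trainer.logger.setLevel(logging.DEBUG) on the existing logger).
trainer.logger = setup_logger("trainer", level=logging.DEBUG)

trainer.run(train_loader, max_epochs=5)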

Level 2 would be logging information during training, like loss, LR and the scheduler. For this we would need to make changes to the training_step the user provides; I am not sure how we can go about that.

Then level 3 would be gradients (we can use PyTorch hooks).
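
For the gradient level, standard per-parameter PyTorch hooks could be enough; a rough sketch (the function name is made up):

import logging

import torch

def attach_grad_logging(model: torch.nn.Module, logger: logging.Logger) -> None:
    # Tensor.register_hook fires when the parameter's gradient is computed
    # during backward(), so we can log its norm without touching the training step.
    for name, param in model.named_parameters():
        if param.requires_grad:
            param.register_hook(
                lambda grad, name=name: logger.debug(
                    "grad %s: norm=%.4e", name, grad.norm().item()
                )
            )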

What do you think?

@sdesrozis
Contributor Author

sdesrozis commented Oct 9, 2021

@Ishan-Kumar2 I totally agree. Doing it by level is a smart way to provide a good API to users and schedule the team's work.

Yesterday, we discussed with @Priyansi another debug feature which could be great. We thought we could provide a visual description of the workflow, generated statically by the engine, similar to what is being considered for the first level. We could generate a dot file describing the links between the handlers through what they add to the state.
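
As a rough sketch of the dot-file idea (it peeks at the engine's internal _event_handlers mapping, which is not a public API and may change):

from ignite.engine import Engine

def engine_to_dot(engine: Engine) -> str:
    # _event_handlers maps each registered event to a list of
    # (handler, args, kwargs) tuples (internal Ignite detail).
    lines = ["digraph workflow {"]
    for event, handlers in engine._event_handlers.items():
        event_node = str(event)
        for handler, _, _ in handlers:
            name = getattr(handler, "__qualname__", repr(handler))
            lines.append(f'    "{event_node}" -> "{name}";')
    lines.append("}")
    return "\n".join(lines)

# with open("workflow.dot", "w") as f:
#     f.write(engine_to_dot(trainer))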

@Ishan-Kumar2 I will be busy very soon because of a job change, so I can't contribute the way I would like to. If you are interested in contributing to this feature, I would be happy to discuss it with you and help in any way I can.

@vfdev-5 vfdev-5 added the PyDataGlobal PyData Global 2020 Sprint label Oct 21, 2021
@DevPranjal
Contributor

Hello, @sdesrozis @vfdev-5. Is this feature request still on the cards? I would love to discuss further and contribute.

@vfdev-5 vfdev-5 removed Hacktoberfest PyDataGlobal PyData Global 2020 Sprint labels Mar 21, 2022
@vfdev-5
Collaborator

vfdev-5 commented Mar 21, 2022

@DevPranjal yes, it is still an open issue that we would like to have addressed.

@DevPranjal
Contributor

DevPranjal commented Mar 22, 2022

Great, so to compile the previous discussions, we need level-wise debugging modes:

  • Level 1: simple event and handler prints, output saving and visualization
  • Level 2: tracking loss, lr, scheduler calls
  • Level 3: tracking gradients, updates

@vfdev-5 Will the debug level API be accessed through engine.run(...)?

@vfdev-5
Collaborator

vfdev-5 commented Mar 22, 2022

I was thinking about engine.debug(enabled=True, level=level) and engine.debug(enabled=False).

@DevPranjal
Contributor

DevPranjal commented Mar 22, 2022

Ok. Can we just have a default level = 0 instead of the enabled arg? Also, are we expecting the intermediate output to be the same as mentioned previously in this comment?

ITERATION STARTED
   Input Shape - 
   Handler 1 called
   ....
ITERATION COMPLETED
   Handler 2 called
...

@vfdev-5
Collaborator

vfdev-5 commented Mar 22, 2022

Yes, you are right. Maybe a binary flag would be the better option:

# to enable
engine.debug(level=Engine.DEBUG_EVENTS | Engine.DEBUG_OUTPUT | Engine.DEBUG_GRADS)

# to disable
engine.debug(level=0)

The flag names provided are just examples and can be renamed if better naming is found.
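
One way such flags could be implemented is with enum.IntFlag; a sketch only, using the example names from above (not an existing Ignite API):

from enum import IntFlag, auto

class DebugFlags(IntFlag):
    NONE = 0
    EVENTS = auto()   # trace fired events and handlers
    OUTPUT = auto()   # log loss / output values
    GRADS = auto()    # log gradient statistics

def debug_enabled(debug_level: DebugFlags, flag: DebugFlags) -> bool:
    # A flag is active when its bit is set in the configured level.
    return bool(debug_level & flag)

level = DebugFlags.EVENTS | DebugFlags.GRADS   # engine.debug(level=...) would store this
assert debug_enabled(level, DebugFlags.GRADS)
assert not debug_enabled(level, DebugFlags.OUTPUT)
level = DebugFlags.NONE                        # equivalent to engine.debug(level=0)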

@DevPranjal
Contributor

DevPranjal commented Mar 22, 2022

Sure.
For a start, I tried the following, within the Engine class:

def debug(self, level: int = 0):
    # level 0 disables debug output; higher values enable more verbose output
    self.debug_level = level

def _print_debug_mode(self, string: str, level: int):
    # print only messages whose level is within the configured debug level
    if 0 < level <= self.debug_level:
        print(string)

def _fire_event(...):
    ...
    self._print_debug_mode(f"firing handler: {func.__name__}", level=1)
    ...

@vfdev-5
Collaborator

vfdev-5 commented Mar 22, 2022

@DevPranjal there is also self.logger

self.logger = logging.getLogger(__name__ + "." + self.__class__.__name__)

@DevPranjal
Contributor

Yes.

Can I change the approach to have custom log levels, using logging.addLevelName for all three levels?

or do I simply change the above to:

def _log_debug_mode(self, string: str, level: int):
    # log only messages whose level is within the configured debug level
    if 0 < level <= self.debug_level:
        self.logger.debug(string)

or anything else?
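
For reference, the addLevelName variant could look roughly like this (a sketch only; the numeric values are arbitrary choices below logging.DEBUG):

import logging

# Lower numeric value = more verbose, so setting the logger to DEBUG_GRADS
# also enables DEBUG_OUTPUT and DEBUG_EVENTS messages.
DEBUG_EVENTS = 9
DEBUG_OUTPUT = 8
DEBUG_GRADS = 7

logging.addLevelName(DEBUG_EVENTS, "DEBUG_EVENTS")
logging.addLevelName(DEBUG_OUTPUT, "DEBUG_OUTPUT")
logging.addLevelName(DEBUG_GRADS, "DEBUG_GRADS")

logger = logging.getLogger(__name__)
logger.setLevel(DEBUG_GRADS)  # enable all three custom levels
logger.log(DEBUG_EVENTS, "firing handler: %s", "my_handler")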

puhuk added a commit to puhuk/ignite that referenced this issue Feb 6, 2023
@puhuk puhuk linked a pull request Feb 6, 2023 that will close this issue
@sawradip

Does @puhuk's PR solve this issue, or is it still open?

@puhuk
Contributor

puhuk commented Feb 19, 2023

Hi @sawradip, I'm working on it :)

@vfdev-5
Collaborator

vfdev-5 commented Feb 19, 2023

@puhuk @sawradip the fact that someone is working on this issue does not mean that the produced PR will certainly be merged. It can happen that a contributor stops working on the PR and it gets stalled, or the PR's code does not meet the requirements or has poor quality...
Anyway, @sawradip, you can also try to propose your own version of a solution. Otherwise, pick another issue from the list.

Also, you can both collaborate and send 2 PRs to solve the issue.
