Retrieve hook.stored tensor for Model Parallelism

Hello, I have searched, but I think I might be the first person trying to use the PyTorch RPC module to get Model Parallelism with FastAI :sweat_smile:

At work I’m writing a Python module to simplify data and model parallelism with PyTorch, and since some of my coworkers use FastAI, I thought “Ok, let’s try to include FastAI too”.

So far, only a small change in SequentialEx is needed to be able to split the model for Model Parallelism (Add x_orig param in SequentialEx to allow split models by Patataman · Pull Request #4042 · fastai/fastai · GitHub). However, while finishing my tests I discovered that hooks are another obstacle.

Until now I have been testing with a Unet because it’s the model we usually train. For example, using the same code as in the PR:

import torch
from fastai.vision.all import *  # SequentialEx, unet_learner, resnet34, ...

class SimplifiedModel(torch.nn.Module):
    "Split the DynamicUnet layers into three partitions, one per node."
    def __init__(self, model):
        super().__init__()
        layers = list(model.layers)
        m_len = len(layers)
        self.layer1 = SequentialEx(*layers[:m_len//3])
        self.layer2 = SequentialEx(*layers[m_len//3:m_len//3*2])
        self.layer3 = SequentialEx(*layers[m_len//3*2:])

    def forward(self, x):
        # x_orig (added in the PR above) passes the original input to the
        # later partitions, which SequentialEx normally tracks internally
        _x = self.layer1(x)
        _x = self.layer2(_x, x_orig=x)
        return self.layer3(_x, x_orig=x)

model = resnet34
learn = unet_learner(dls, model, loss_func=loss_func, [...])
newmodel = SimplifiedModel(learn.model)

I have noticed that only the first 4 layers in the Unet (the Sequentials with BasicBlocks, a BatchNorm2d, a ReLU, and another Sequential with Conv2d and ReLU) trigger the hooks; their stored outputs are eventually consumed in the first UnetBlock, and after that the hooks are no longer triggered.
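A quick way to see this for yourself (assuming fastai’s DynamicUnet, which keeps its encoder hooks in model.sfs, and that each UnetBlock later reads one of them through self.hook.stored):

import torch

learn.model.eval()
with torch.no_grad():
    # dummy batch; input size assumed to match the dls the Unet was built for
    _ = learn.model(torch.randn(1, 3, 224, 224))

for h in learn.model.sfs:
    print(h.stored.shape)  # one captured activation per hooked encoder layer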

Therefore, if I split the model as:

  • Node1: Layer1
  • Node2: Layer2, Layer3

The hooks are only triggered in Layer1, and if I send that layer to another node (a physically separate machine), they are never updated on the node holding Layer2; therefore the data retrieved from hook.stored in the first UnetBlock is wrong (link to code)

I have tried to somehow get the data stored in the hooks before entering the first UnetBlock, but even though the hooks are triggered, I found it impossible to retrieve anything using hook_output or similar functions.
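The closest thing I have come up with is this untested sketch: keep a reference to the DynamicUnet’s Hooks collection (sfs) next to the first partition, so the node running it can read the stored tensors right after its local forward pass (Partition1 is just a hypothetical name):

# Untested sketch: Node1 keeps the encoder partition together with the
# Hooks collection, reads hook.stored after its local forward pass, and
# returns both so they can be shipped to the next node.
class Partition1(torch.nn.Module):
    def __init__(self, layer1, sfs):
        super().__init__()
        self.layer1, self.sfs = layer1, sfs

    def forward(self, x):
        out = self.layer1(x)
        stored = [h.stored for h in self.sfs]  # what the UnetBlocks will need
        return out, stored

part1 = Partition1(newmodel.layer1, learn.model.sfs)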

TL;DR
Is there a way to get the tensors stored in the hooks, so that I can send them to the other node and (somehow, I still need to figure this part out) update the stored tensors in that node’s hooks before continuing?
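For what it’s worth, Hook.stored is a plain attribute, so on the receiving node I imagine the inverse operation would look roughly like this untested sketch (restore_hook_stored and received are hypothetical names; it also assumes only the UnetBlocks expose a hook attribute and that module iteration order matches the order the tensors were collected in):

# Untested sketch for Node2: overwrite each UnetBlock's Hook with the
# tensors received from Node1 before running the decoder partitions.
def restore_hook_stored(module, tensors):
    it = iter(tensors)
    for m in module.modules():
        if hasattr(m, 'hook'):      # fastai's UnetBlock keeps its Hook here
            m.hook.stored = next(it)

# Usage on Node2 (x, x1_out and received arrive from Node1 over RPC;
# x must travel too because the later partitions need x_orig):
#   restore_hook_stored(newmodel, received)
#   _x  = newmodel.layer2(x1_out, x_orig=x)
#   out = newmodel.layer3(_x, x_orig=x)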

PS: There is a workaround, but I don’t like it because it doesn’t solve the real problem. In this very specific case it is to split the model as follows (see the sketch after the list):

  • Node1: Layer1, Layer2 (this way, the tensor stored in the hook is available where it is needed)
  • Node2: Layer3
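
For reference, a minimal sketch of that workaround partition, following the same pattern as SimplifiedModel above (WorkaroundModel is just an illustrative name):

# Workaround split: the encoder and the UnetBlocks that read its hooks
# stay in the same process, so hook.stored is always up to date there.
class WorkaroundModel(torch.nn.Module):
    def __init__(self, model):
        super().__init__()
        layers = list(model.layers)
        m_len = len(layers)
        self.layer1 = SequentialEx(*layers[:m_len//3*2])  # Node1: Layer1+Layer2
        self.layer2 = SequentialEx(*layers[m_len//3*2:])  # Node2: Layer3

    def forward(self, x):
        _x = self.layer1(x)
        return self.layer2(_x, x_orig=x)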