当前位置：首页 > news >正文

网站程序更换知乎问答网站开发教程

news 2025/12/7 8:11:35

网站程序更换,知乎问答网站开发教程,设计好英文网站,让网站会员做产品标签确认FCOS 中有计算特征图#xff08;Feature map中的每个特征点到gt_box的左、上、右、下的距离#xff09; 1、特征点到gt_box框的左、上、右、下距离计算 x coords[:, 0] # h*w#xff0c;2 即第一列y coords[:, 1] l_off x[None, :, None] - gt_boxes[..., 0][:, No… FCOS 中有计算特征图Feature map中的每个特征点到gt_box的左、上、右、下的距离 1、特征点到gt_box框的左、上、右、下距离计算 x coords[:, 0] # h*w2 即第一列y coords[:, 1] l_off x[None, :, None] - gt_boxes[..., 0][:, None, :] # [1,h*w,1]-[batch_size,1,m]--[batch_size,h*w,m]t_off y[None, :, None] - gt_boxes[..., 1][:, None, :]r_off gt_boxes[..., 2][:, None, :] - x[None, :, None]b_off gt_boxes[..., 3][:, None, :] - y[None, :, None]ltrb_off torch.stack([l_off, t_off, r_off, b_off], dim-1) # [batch_size,h*w,m,4]areas (ltrb_off[..., 0] ltrb_off[..., 2]) * (ltrb_off[..., 1] ltrb_off[..., 3]) # [batch_size,h*w,m]off_min torch.min(ltrb_off, dim-1)[0] # [batch_size,h*w,m]off_max torch.max(ltrb_off, dim-1)[0] # [batch_size,h*w,m]根据上边的画的图可以看出假设对应的 feature map 大小为 2x2stride4原始图片为8x8。将特征图中的每个特征点映射回去可以得到相应的 4个h*w个坐标。对应图中的红色a绿色b黄色c和蓝色d的点。 print(x,\n,y,x.shape)tensor([2., 6., 2., 6.]) tensor([2., 2., 6., 6.]) torch.Size([4]) print(x[None,:,None]) # [1,4,1]tensor([[[2.],[6.],[2.],[6.]]]) print(gt_boxes # [1,2,4] batch1, 两个框每个框左上角和右下角坐标tensor([[[5, 4, 7, 6],[1, 1, 4, 6]]]) print(gt_boxes[...,0],gt_boxes[...,0][:,None,:])tensor([[5, 1]]) tensor([[[5, 1]]])l_off [2,2]-[5,1][-3,1] 以此类推print(l_off,\n, l_off.shape) **第一列代表所有的点abcd横坐标与第一个框的左边偏移量。第二列代表到第二个框的偏移量** tensor([[[-3., 1.],[ 1., 5.],[-3., 1.],[ 1., 5.]]]) torch.Size([1, 4, 2])print(ltrb_off)第一列代表所有的投影点abcd,到两个框的左边偏移量。第一行第二行分别代表两个框。 tensor([[[[-3., -2., 5., 4.], # a 点到第一个框的左边、上边、右边、下边的偏移[ 1., 1., 2., 4.]], # a 点到第二框的左边、上边、右边、下边的偏移[[ 1., -2., 1., 4.], # b 点到第一个框的左边、上边、右边、下边的偏移[ 5., 1., -2., 4.]],[[-3., 2., 5., 0.],[ 1., 5., 2., 0.]],[[ 1., 2., 1., 0.],[ 5., 5., -2., 0.]]]]) torch.Size([1, 4, 2, 4]) #[batch_size,h*w,m,4] print(ltrb_off[...,0]) tensor([[[-3., 1.],[ 1., 5.],[-3., 1.],[ 1., 5.]]]) torch.Size([1, 4, 2]) print(areas)areas: tensor([[[ 4., 15.],[ 4., 15.],[ 4., 15.],[ 4., 15.]]]) torch.return_types.min( valuestensor([[[-3., 1.],[-2., -2.],[-3., 0.],[ 0., -2.]]]), indicestensor([[[0, 0],[1, 2],[0, 3],[3, 2]]])) torch.return_types.max( valuestensor([[[5., 4.],[4., 5.],[5., 5.],[2., 5.]]]), indicestensor([[[2, 3],[3, 0],[2, 1],[1, 0]]])) 2、确定该特征点在哪一个框内是否在该FPN特征层进行尺寸判断并进行后续预测 off_min torch.min(ltrb_off, dim-1)[0] # [batch_size,h*w,m] # off_min 找出所有特征点到每个框的四条边最小的距离 off_max torch.max(ltrb_off, dim-1)[0] # [batch_size,h*w,m] #off_max 找出所有特征点到每个框的四条边最大的距离mask_in_gtboxes off_min 0 mask_in_level (off_max limit_range[0]) (off_max limit_range[1]) # 锁定在这个limit range上的所有的特征的点 print(ltrf_off,ltrb_off) print(off_min,off_min,\n,off_max,off_max) print(mask_in_gtboxes--,mask_in_gtboxes) print(mask_in_level--,mask_in_level) ltrf_off tensor([[[[-3., -2., 5., 4.], # a 点到第一个框的左边、上边、右边、下边的偏移[ 1., 1., 2., 4.]], # a 点到第二个框的左边、上边、右边、下边的偏移[[ 1., -2., 1., 4.], # b 点到第一个框的左边、上边、右边、下边的偏移[ 5., 1., -2., 4.]],[[-3., 2., 5., 0.],[ 1., 5., 2., 0.]],[[ 1., 2., 1., 0.],[ 5., 5., -2., 0.]]]])off_min tensor([[[-3., 1.], # a点到第一个框最小距离-3 a点到第二个框的最小偏移距离 1[-2., -2.], #b点到第一个框最小距离-2 b点到第二个框的最小偏移距离 -2[-3., 0.], # c点到第一个框最小距离-3 a点到第二个框的最小偏移距离 0[ 0., -2.]]]) # d点到第一个框最小距离0 a点到第二个框的最小偏移距离 -2off_max tensor([[[5., 4.],[4., 5.],[5., 5.],[2., 5.]]])mask_in_gtboxes-- # 判断了特征点是否在框内 tensor([[[False, True], # a点到第一个框四边最小偏移距离小于0所以a点不属于第一个框为false以此类推。[False, False],[False, False],[False, False]]]) # [batch,h*w,m]mask_in_level-- # 锁定在这个limit range上的所有的特征的点 tensor([[[True, True], # 锁定了a 在这个level中[True, True], # 锁定了b[True, True], # 锁定了c[True, True]]])# 锁定了d 都在这个FPN级别上 [batch,h*w,m]3、特征点是否在框中心的范围内用来判断是否为正样本 radiu stride * sample_radiu_ratio # 4*1.15 4.6gt_center_x (gt_boxes[..., 0] gt_boxes[..., 2]) / 2gt_center_y (gt_boxes[..., 1] gt_boxes[..., 3]) / 2c_l_off x[None, :, None] - gt_center_x[:, None, :] # [1,h*w,1]-[batch_size,1,m]--[batch_size,h*w,m]c_t_off y[None, :, None] - gt_center_y[:, None, :]c_r_off gt_center_x[:, None, :] - x[None, :, None]c_b_off gt_center_y[:, None, :] - y[None, :, None]c_ltrb_off torch.stack([c_l_off, c_t_off, c_r_off, c_b_off], dim-1) # [batch_size,h*w,m,4]c_off_max torch.max(c_ltrb_off, dim-1)[0]mask_center c_off_max radiuprint(c_ltrb_off,c_ltrb_off) print(c_off_max,c_off_max) print(mask_center,mask_center) c_ltrb_off tensor([[[[-4.0000, -3.0000, 4.0000, 3.0000], # 同上边一样a到第一个框小的中心框四边的距离[-0.5000, -1.5000, 0.5000, 1.5000]], # a到第二个框小的中心框四边的距离[[ 0.0000, -3.0000, 0.0000, 3.0000], # b到第一个框小的中心框四边的距离[ 3.5000, -1.5000, -3.5000, 1.5000]], # # b到第二个框小的中心框四边的距离[[-4.0000, 1.0000, 4.0000, -1.0000],[-0.5000, 2.5000, 0.5000, -2.5000]],[[ 0.0000, 1.0000, 0.0000, -1.0000],[ 3.5000, 2.5000, -3.5000, -2.5000]]]])c_off_max tensor([[[4.0000, 1.5000], # 找到a特征点到第一个框中心框和第二个框的中心框的最大距离[3.0000, 3.5000],[4.0000, 2.5000],[1.0000, 3.5000]]]) # [batch,h*w,m] 4个特征点(a,b,c,d) x 框的个数2个第一个框第二个框mask_center tensor([[[True, True], # 判断是否在这个框里中心点里边正样本[True, True],[True, True],[True, True]]]) ## [batch,h*w,m]3、制定mask,根据上边的 gt_box、fpn_level、mask_center ‘’’ mask_pos 是三个约束条件的交集分别是特征点在gt中特征点在level中以及特征点距离Gt中的center小于指定的范围 ‘’’ mask_pos mask_in_gtboxes mask_in_level mask_center # [batch_size,h*w,m]areas[~mask_pos] 99999999 areas_min_ind torch.min(areas, dim-1)[1] # [batch_size,h*w]mask_pos mask_in_gtboxes mask_in_level mask_center # [batch_size,h*w,m] print(pre_areas:,areas) areas[~mask_pos] 99999999 areas_min_ind torch.min(areas, dim-1)[1] # [batch_size,h*w] # 返回索引注意和上边的区别上边返回值比大小 # torch.max() or torch.min() dim0 找列dim1 找行 print(mask_pos--,mask_pos) print(post_ares,areas) print(areas_min_ind,areas_min_ind) mask_in_gtboxes-- tensor([[[False, True],[False, False],[False, False],[False, False]]]) mask_in_level-- tensor([[[True, True],[True, True],[True, True],[True, True]]]) mask_center tensor([[[True, True],[True, True],[True, True],[True, True]]])mask_pos-- tensor([[[False, True], # 只有a点在第二个框在这个fpn这个level, 同时满足这三个条件[False, False],[False, False],[False, False]]])post_ares tensor([[[1.0000e08, 1.5000e01],[1.0000e08, 1.0000e08],[1.0000e08, 1.0000e08],[1.0000e08, 1.0000e08]]]) # #[batch_size,h*w,m] 将满足要求的保持面积不面其他设置为很大的值areas_min_ind tensor([[1, 0, 0, 0]]) # [batch_size,h*w] min[1]返回的是对应的indices 找到最小的面积返回索引。 4、回归的目标给a,b,c,d四个特征点确定偏移量。用于后续回归 ![在这里插入图片描述](https://img-blog.csdnimg.cn/3e504ea230ff47c097ba9eb6caddca55.png reg_targets ltrb_off[torch.zeros_like(areas, dtypetorch.bool) .scatter_(-1, areas_min_ind.unsqueeze(dim-1), 1)] # [batch_size*h*w,4] reg_targets torch.reshape(reg_targets, (batch_size, -1, 4)) # [batch_size,h*w,4]scatter_的用法参考 https://blog.csdn.net/weixin_43496455/article/details/103870889 scatter(dim, index, src)将src中数据根据index中的索引按照dim的方向进行填充。dim0areas: tensor([[[ 4., 15.],[ 4., 15.],[ 4., 15.],[ 4., 15.]]]) [1,4,2] 扩展维度之后 [1,4] -- torch.Size([1, 4, 1]) [[[1,0,0,0]]] torch.zeros_like(areas, dtypetorch.bool) tensor([[[False, False],[False, False],[False, False],[False, False]]])after scatter_-- tensor([[[False, True],[ True, False],[ True, False],[ True, False]]]) # [1,4,2]ltrf_off tensor([[[[-3., -2., 5., 4.], # a 点到第一个框的左边、上边、右边、下边的偏移[ 1., 1., 2., 4.]], # a 点到第二个框的左边、上边、右边、下边的偏移[[ 1., -2., 1., 4.], # b 点到第一个框的左边、上边、右边、下边的偏移[ 5., 1., -2., 4.]],[[-3., 2., 5., 0.],[ 1., 5., 2., 0.]],[[ 1., 2., 1., 0.],[ 5., 5., -2., 0.]]]])reg_targets1 tensor([[ 1., 1., 2., 4.], # a 点第二个框[ 1., -2., 1., 4.], # b 点第一个框[-3., 2., 5., 0.], # c 点第一个框[ 1., 2., 1., 0.]])# d 点第一个框# torch.Size([4, 4])reg_targets2 tensor([[[ 1., 1., 2., 4.],[ 1., -2., 1., 4.],[-3., 2., 5., 0.],[ 1., 2., 1., 0.]]]) # torch.Size([1, 4, 4])5、目标的分类给a,b,c,d四个点分配类别 # 设置classes torch.tensor([[100,200]]) classes torch.broadcast_tensors(classes[:, None, :], areas.long())[0] # [batch_size,h*w,m] cls_targets classes[torch.zeros_like(areas, dtypetorch.bool).scatter_(-1, areas_min_ind.unsqueeze(dim-1), 1) ] cls_targets torch.reshape(cls_targets, (batch_size, -1, 1)) # [batch_size,h*w,1]pythonclasses torch.broadcast_tensors(classes[:, None, :], areas.long())[0] # [batch_size,h*w,m]print(class,classes,classes.shape)cls_targets classes[torch.zeros_like(areas, dtypetorch.bool).scatter_(-1, areas_min_ind.unsqueeze(dim-1), 1)]print(torch.zeros_like(areas, dtypetorch.bool).scatter_(-1, areas_min_ind.unsqueeze(dim-1), 1))print(cls_targets1,cls_targets,cls_targets.shape)cls_targets torch.reshape(cls_targets, (1, -1, 1)) # [batch_size,h*w,1]print(cls_targets,cls_targets,cls_targets.shape)class tensor([[[100, 200],[100, 200],[100, 200],[100, 200]]]) torch.Size([1, 4, 2])cls_targets1 tensor([200, 100, 100, 100]) torch.Size([4])cls_targets tensor([[[200], # a点属于第二个框的类别200[100], # b点属于第一个框的类别100[100], # c点。。。。。。[100]]]) torch.Size([1, 4, 1]) [batch_size,h*w,1] 6、 center-ness 越靠边cneterness越小。 left_right_min torch.min(reg_targets[..., 0], reg_targets[..., 2]) # [batch_size,h*w]print(reg_trargets[...,0],reg_targets[..., 0],reg_targets[..., 2])left_right_max torch.max(reg_targets[..., 0], reg_targets[..., 2])top_bottom_min torch.min(reg_targets[..., 1], reg_targets[..., 3])top_bottom_max torch.max(reg_targets[..., 1], reg_targets[..., 3]) cnt_targets (((left_right_min * top_bottom_min) / (left_right_max * top_bottom_max 1e-10)).sqrt().unsqueeze(dim-1)) # [batch_size,h*w,1]reg_targets a,b,c,d四个点与对应框的偏移量。 tensor([[[ 1., 1., 2., 4.],[ 1., -2., 1., 4.],[-3., 2., 5., 0.],[ 1., 2., 1., 0.]]]) # torch.Size([1, 4, 4])tensor([[ 1., 1., -3., 1.]]) tensor([[2., 1., 5., 1.]]) left_right_min tensor([[ 1., 1., -3., 1.]]) left_right_max tensor([[2., 1., 5., 1.]]) top_bottom_min tensor([[ 1., -2., 0., 0.]]) top_bottom_max tensor([[4., 4., 2., 2.]])cnt_targets: tensor([[[0.3536],[ nan],[-0.0000],[0.0000]]]) torch.Size([1, 4, 1]) 7、确定最后的结果 assert reg_targets.shape (batch_size, h_mul_w, 4)assert cls_targets.shape (batch_size, h_mul_w, 1)assert cnt_targets.shape (batch_size, h_mul_w, 1)# process neg coordsmask_pos_2 mask_pos.long().sum(dim-1) # [batch_size,h*w]# num_posmask_pos_2.sum(dim-1)# assert num_pos.shape(batch_size,)mask_pos_2 mask_pos_2 1assert mask_pos_2.shape (batch_size, h_mul_w)cls_targets[~mask_pos_2] 0 # [batch_size,h*w,1]cnt_targets[~mask_pos_2] -1reg_targets[~mask_pos_2] -1return cls_targets, cnt_targets, reg_targetsmask_pos-- tensor([[[False, True], # 只有a点在第二个框在这个fpn这个level, 同时满足这三个条件[False, False],[False, False],[False, False]]])mask_pos2-- # mask_pos_2 mask_pos.long().sum(dim-1) # [batch_size,h*w] tensor([[1, 0, 0, 0]]) torch.Size([1, 4]) # 按照列相加第一列第二列 # mask_pos_2 mask_pos_2 1 mask_pos_2-- # 看特征点是否同时存在于多个满足条件 tensor([[ True, False, False, False]]) # 【14】print(cls_targets) print(cnt_targets) print(reg_targets)#将非 positive location的地方设置成 cls设置成0 ,reg_targets 设置成对应的-1cls_targets[~mask_pos_2] 0 # [batch_size,h*w,1] cnt_targets[~mask_pos_2] -1 reg_targets[~mask_pos_2] -1cls_targets tensor([[[200], # a点属于第二个框的类别200[100], # b点属于第一个框的类别100[100], # c点。。。。。。[100]]]) torch.Size([1, 4, 1]) [batch_size,h*w,1] tensor([[[200],[ 0],[ 0],[ 0]]])cnt_targets: tensor([[[0.3536],[ nan],[-0.0000],[0.0000]]]) torch.Size([1, 4, 1]) tensor([[[ 0.3536],[-1.0000],[-1.0000],[-1.0000]]])reg_targets2 tensor([[[ 1., 1., 2., 4.],[ 1., -2., 1., 4.],[-3., 2., 5., 0.],[ 1., 2., 1., 0.]]]) # torch.Size([1, 4, 4]) tensor([[[ 1., 1., 2., 4.],[-1., -1., -1., -1.],[-1., -1., -1., -1.],[-1., -1., -1., -1.]]]) 至此呢经过层层限制能够找到符合三个条件的特征点例如本案例为 a点符合第二个框。并给出a点对应的类别-200cnt值-0。3536偏移量【1124】值的注意的是满足三个条件为正样本cnt是计算所有正样本的 centerness值后续为了抑制低质量的正样本根据cnterness的值计算了一个loss。

查看全文

http://www.sadfv.cn/news/170618/