高端网站的特点,软件开发商网站,北京外包推广,中国十大门户网站排行Depth-Wise Emergence of Prediction-Centric Geometry in Large Language Models Authors: Shahar Haim, Daniel C McNamee Deep-Dive Summary: 论文总结#xff1a;ControlNet - 为文本到图像扩散模型添加条件控制 这篇文章介绍了一种名为 ControlNet 的神经网络架构 Θ ) Z ( F ( x Z ( c ; Θ z 1 ) ; Θ c ) ; Θ z 2 ) y \mathcal{F}(x; \Theta) \mathcal{Z}(\mathcal{F}(x \mathcal{Z}(c; \Theta_{z1}); \Theta_c); \Theta_{z2})yF(x;Θ)Z(F(xZ(c;Θz1​);Θc​);Θz2​)其中Z ( ⋅ ) \mathcal{Z}(\cdot)Z(⋅)表示零卷积操作c cc是条件向量。这种设计使得模型在训练初期能够保持输出与原模型一致避免了随机噪声对预训练权重的干扰。3. 模型集成与条件处理ControlNet 被应用于 Stable Diffusion 的编码器部分。通过这种方式它能够提取各种图像特征如 Canny 边缘、HED 边缘、人体骨架点等并将其转化为引导信息。作者探讨了在不同数据规模和计算资源下的训练稳定性。即使在计算资源受限的情况下如单张消费级 GPUControlNet 也能展现出强大的学习能力。4. 实验与功能展示论文展示了 ControlNet 支持的多种控制模式包括但不限于Canny 边缘引导基于图像轮廓生成。OpenPose 姿态引导通过人体骨架控制生成角色的动作。深度图与法线贴图保留场景的三维结构信息。通过对比实验ControlNet 在控制精度和图像质量上显著优于此前的基准模型。无论是复杂的线条还是精细的纹理ControlNet 都能在保持文本一致性的同时严谨地遵循空间条件约束。5. 结论与未来影响ControlNet 为大型生成模型提供了一种高效、稳健的微调方案。它不仅推动了图像生成技术在专业艺术创作、工业设计等领域的应用也为后续多模态生成研究奠定了基础。Original Abstract:We show that decoder-only large language models exhibit a depth-wise transition from context-processing to prediction-forming phases of computation accompanied by a reorganization of representational geometry. Using a unified framework combining geometric analysis with mechanistic intervention, we demonstrate that late-layer representations implement a structured geometric code that enables selective causal control over token prediction. Specifically, angular organization of the representation geometry parametrizes prediction distributional similarity, while representation norms encode context-specific information that does not determine prediction. Together, these results provide a mechanistic-geometric account of the dynamics of transforming context into predictions in LLMs.PDF Link:2602.04931v1部分平台可能图片显示异常请以我的博客内容为准