Close Menu
    Facebook X (Twitter) Instagram
    Facebook Instagram YouTube
    Crypto Go Lore News
    Subscribe
    Monday, June 9
    • Home
    • Market Analysis
    • Latest
      • Bitcoin News
      • Ethereum News
      • Altcoin News
      • Blockchain News
      • NFT News
      • Market Analysis
      • Mining News
      • Technology
      • Videos
    • Trending Cryptos
    • AI News
    • Market Cap List
    • Mining
    • Trading
    • Contact
    Crypto Go Lore News
    Home»AI News»Microsoft AI Researchers Developed a New Improved Framework ResLoRA for Low-Rank Adaptation (LoRA)
    AI News

    Microsoft AI Researchers Developed a New Improved Framework ResLoRA for Low-Rank Adaptation (LoRA)

    CryptoExpertBy CryptoExpertMarch 6, 2024No Comments3 Mins Read
    Share Facebook Twitter Pinterest Copy Link LinkedIn Tumblr Email VKontakte Telegram
    Microsoft AI Researchers Developed a New Improved Framework ResLoRA for Low-Rank Adaptation (LoRA)
    Share
    Facebook Twitter Pinterest Email Copy Link
    Changelly


    Large language models (LLMs) with hundreds of billions of parameters have significantly improved performance on various tasks. Fine-tuning LLMs on specific datasets enhances performance compared to prompting during inference but incurs high costs due to parameter volume. Low-rank adaptation (LoRA) is a popular parameter-efficient fine-tuning method for LLMs, yet updating LoRA block weights efficiently is challenging due to the model’s long calculation path.

    Various parameter-efficient fine-tuning (PEFT) methods have been proposed to address this issue. PEFT methods freeze all parameters in the original model and only tune a few in the newly added modules. Among them, one of the most popular PEFT methods is LoRA. LoRA freezes most parameters in the original model and only updates a few in added modules. It employs low-rank adaptation, merging matrices parallel to frozen linear layers during inference. However, LoRA’s long backward path poses challenges. Integrating LoRA with ResNet and Transformers introduces design complexities, impacting gradient flow during training.

    Researchers from the School of Computer Science and Engineering, Beihang University, Beijing, China, and Microsoft have introduced ResLoRA, Β an improved framework of LoRA. ResLoRA mainly consists of two parts: ResLoRA blocks and merging approaches. ResLoRA blocks add residual paths to LoRA blocks during training, while merging approaches convert ResLoRA to LoRA blocks during inference. Researchers also claimed that, to their knowledge, ResLoRA is the first work that combines the residual path with LoRA.Β 

    They designed three blocks inspired by ResNet: input-shortcut, block-shortcut, and middle-shortcut, adding residual paths to LoRA blocks. These structures aim to optimize gradient flow during training and are essential for efficient parameter tuning. An important issue arises as ResLoRA introduces a non-plain structure, unlike LoRA, which seamlessly merges with linear layers. To address this issue, they have designed a merging approach. For block-shortcut structures, merging relies on previous block weights. The precision of scaling factors, determined using Frobenius norms, ensures accurate model merging. Two approaches, based on input and block weights, facilitate seamless integration, minimizing latency in inference.

    Binance

    In extensive experiments spanning natural language generation (NLG) and understanding (NLU), ResLoRA outperforms LoRA variants such as AdaLoRA, LoHA, and LoKr. ResLoRA and ResLoRAbs consistently surpass LoRA across NLG and NLU benchmarks, showcasing improvements in accuracy ranging from 10.98% to 36.85%. ResLoRA also demonstrated faster training and superior image generation quality than LoRA in the text-to-image task.

    To conclude, researchers from the School of Computer Science and Engineering, Beihang University, Beijing, China, and Microsoft have introduced ResLoRA, an enhanced framework for LoRA. ResLoRA introduces residual paths during training and employs merging approaches for path removal during inference. It outperforms original LoRA and other baseline methods across NLG, NLU, and text-to-image tasks. The results confirm ResLoRA’s effectiveness, achieving superior outcomes with fewer training steps and no additional trainable parameters.

    Check out theΒ Paper.Β All credit for this research goes to the researchers of this project. Also,Β don’t forget to follow us onΒ TwitterΒ andΒ Google News.Β JoinΒ our 38k+ ML SubReddit,Β 41k+ Facebook Community,Β Discord Channel, andΒ LinkedIn Group.

    If you like our work, you will love ourΒ newsletter..

    Don’t Forget to join ourΒ Telegram Channel

    You may also like ourΒ FREE AI Courses….

    Asjad is an intern consultant at Marktechpost. He is persuing B.Tech in mechanical engineering at the Indian Institute of Technology, Kharagpur. Asjad is a Machine learning and deep learning enthusiast who is always researching the applications of machine learning in healthcare.

    🐝 Join the Fastest Growing AI Research Newsletter Read by Researchers from Google + NVIDIA + Meta + Stanford + MIT + Microsoft and many others…



    Source link

    Phemex
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Telegram Copy Link
    CryptoExpert
    • Website

    Related Posts

    AI News

    Learn CSS Easily with AI _ Step-by-Step Guide for Beginners _ai _aitools _css _aicoding#viral#shorts

    June 8, 2025
    AI News

    Privacy is the most fundamental aspect of human rights! #ai #ainews #chatgpt #openai #technews

    June 7, 2025
    AI News

    Test your AI knowledge | Fun AI Quiz for beginners & Developers

    June 6, 2025
    AI News

    Struggling with One Part? Let AI Guide You, Not Replace You #ai #shorts #homework

    June 5, 2025
    AI News

    Nude photo dikhai parliament me #news #nude #ai #parliament #newsupdate #foryou #shortsvideo #short

    June 4, 2025
    AI News

    Top 10 AI Tools in 2025 πŸ”₯ | Life-Changing Tools for Beginners | AI Use at 55 Story

    June 3, 2025
    Add A Comment
    Leave A Reply Cancel Reply

    Recommended
    Editors Picks

    Circle rejected Ripple’s $5 billion buyout β€” now valued at over $20 billion after NYSE debut

    June 8, 2025

    Bitcoin at $104K, but falling MVRV ratio hints at short-term correction

    June 8, 2025

    Learn CSS Easily with AI _ Step-by-Step Guide for Beginners _ai _aitools _css _aicoding#viral#shorts

    June 8, 2025

    Crypto News 77 #ZKJ #BINANCE #LA #SXT #SOPH #NXPC #HUMA #ZRC #BNB #BTC #XRP #USDC #ONDO #anime #XNXX

    June 8, 2025
    Latest Posts

    We are a leading platform dedicated to delivering authoritative insights, news, and resources on cryptocurrencies and blockchain technology. At Crypto Go Lore News, our mission is to empower individuals and businesses with reliable, actionable, and up-to-date information about the cryptocurrency ecosystem. We aim to bridge the gap between complex blockchain technology and practical understanding, fostering a more informed global community.

    Latest Posts

    Circle rejected Ripple’s $5 billion buyout β€” now valued at over $20 billion after NYSE debut

    June 8, 2025

    Bitcoin at $104K, but falling MVRV ratio hints at short-term correction

    June 8, 2025

    Learn CSS Easily with AI _ Step-by-Step Guide for Beginners _ai _aitools _css _aicoding#viral#shorts

    June 8, 2025
    Newsletter

    Subscribe to Updates

    Get the latest Crypto news from Crypto Golore News about crypto around the world.

    Facebook Instagram YouTube
    • Contact
    • Privacy Policy
    • Terms Of Service
    • Social Media Disclaimer
    • DMCA Compliance
    • Anti-Spam Policy
    © 2025 CryptoGoLoreNews. All rights reserved by CryptoGoLoreNews.

    Type above and press Enter to search. Press Esc to cancel.

    bitcoin
    Bitcoin (BTC) $ 105,845.36
    ethereum
    Ethereum (ETH) $ 2,508.95
    tether
    Tether (USDT) $ 1.00
    xrp
    XRP (XRP) $ 2.26
    bnb
    BNB (BNB) $ 652.62
    solana
    Solana (SOL) $ 152.61
    usd-coin
    USDC (USDC) $ 1.00
    dogecoin
    Dogecoin (DOGE) $ 0.184104
    tron
    TRON (TRX) $ 0.282043
    cardano
    Cardano (ADA) $ 0.669553