{"id":229837,"date":"2024-06-08T08:58:29","date_gmt":"2024-06-08T08:58:29","guid":{"rendered":"https:\/\/michigandigitalnews.com\/index.php\/2024\/06\/08\/ibm-and-red-hat-introduce-instructlab-for-collaborative-llm-customization\/"},"modified":"2025-06-25T17:17:34","modified_gmt":"2025-06-25T17:17:34","slug":"ibm-and-red-hat-introduce-instructlab-for-collaborative-llm-customization","status":"publish","type":"post","link":"https:\/\/michigandigitalnews.com\/index.php\/2024\/06\/08\/ibm-and-red-hat-introduce-instructlab-for-collaborative-llm-customization\/","title":{"rendered":"IBM and Red Hat Introduce InstructLab for Collaborative LLM Customization"},"content":{"rendered":"<p> [ad_1]<br \/>\n<\/p>\n<div>\n<figure class=\"figure mt-2\">&#13;<br \/>\n                        <a href=\"https:\/\/blockchainstock.blob.core.windows.net:443\/features\/C3C126EBB32A98D3B1066F2AA8C12A350FE26155D7FFE608CC2469EBE42F114D.jpg\" data-glightbox=\"\" data-gallery=\"image-popup\">&#13;<br \/>\n                            <img decoding=\"async\" class=\"rounded\" src=\"https:\/\/blockchainstock.blob.core.windows.net:443\/features\/C3C126EBB32A98D3B1066F2AA8C12A350FE26155D7FFE608CC2469EBE42F114D.jpg\" alt=\"IBM and Red Hat Introduce InstructLab for Collaborative LLM Customization\"\/>&#13;<br \/>\n&#13;<br \/>\n                        <\/a>&#13;<br \/>\n                    <\/figure>\n<p><img decoding=\"async\" src=\"https:\/\/d1bqvdqmynqyrb.cloudfront.net\/_next\/image?url=https%3A%2F%2Fresearch-website-prod-cms-uploads.s3.us.cloud-object-storage.appdomain.cloud%2FAI_Cube_Loop_32b79bec2e.gif&amp;w=1200&amp;q=75\" alt=\"image?url=https%3A%2F%2Fresearch-website-prod-cms-uploads.s3.us.cloud-object-storage.appdomain.cloud%2FAI_Cube_Loop_32b79bec2e.gif&amp;w=1200&amp;q=75\"\/><\/p>\n<p>IBM Research, in collaboration with Red Hat, has launched InstructLab, an innovative open-source project designed to facilitate the collaborative customization of large language models (LLMs) without necessitating full retraining. This initiative aims to streamline the integration of community contributions into base models, significantly reducing the time and effort traditionally required.<\/p>\n<h2>InstructLab&#8217;s Mechanism<\/h2>\n<p>InstructLab operates by augmenting human-curated data with high-quality examples generated by an LLM, thereby lowering the cost of data creation. This data can then be used to enhance the base model without requiring it to be retrained from scratch, which is a substantial cost-saving measure. IBM Research has already utilized InstructLab to generate synthetic data for improving its open-source Granite models for language and code.<\/p>\n<p>\u201cThere\u2019s no good way to combine all of that innovation into a coherent whole,\u201d said David Cox, vice president for AI models at IBM Research.<\/p>\n<h2>Recent Applications<\/h2>\n<p>Researchers recently used InstructLab to refine an IBM 20B Granite code model, transforming it into an expert for modernizing software written for IBM Z mainframes. This process demonstrated both speed and effectiveness, which led to IBM forming a strategic partnership with Red Hat.<\/p>\n<p>IBM\u2019s current solution for mainframe modernization, the <a rel=\"nofollow\" href=\"https:\/\/www.ibm.com\/products\/watsonx-code-assistant-z\">watsonx Code Assistant for Z<\/a>, was fine-tuned on paired COBOL-Java programs. These were amplified through traditional rules-based synthetic generators and enhanced further using InstructLab&#8217;s capabilities.<\/p>\n<p>\u201cThe most exciting part of InstructLab is its ability to generate new data from traditional knowledge sources,\u201d noted Ruchir Puri, chief scientist at IBM Research. An updated version of WCA for Z is expected to be released soon.<\/p>\n<h2>How InstructLab Works<\/h2>\n<p>InstructLab features a command-line interface (CLI) that enables users to add and merge new alignment data to their target model via a GitHub workflow. This CLI acts as a test kitchen for trying out new \u201crecipes\u201d for generating synthetic data to teach an LLM new knowledge and skills.<\/p>\n<p>The backend of InstructLab is powered by IBM Research\u2019s synthetic data generation and phased-training method known as Large-Scale Alignment for ChatBots (LAB). This method uses a taxonomy-driven approach to create high-quality data for specific tasks, ensuring that new information can be assimilated without overwriting previously learned data.<\/p>\n<p>\u201cInstead of having a large company decide what your model knows, InstructLab lets you dictate through its taxonomy what knowledge and skills your model should have,\u201d said Akash Srivastava, the IBM researcher who led the team that developed LAB.<\/p>\n<h2>Community Collaboration<\/h2>\n<p>InstructLab encourages community participation by allowing users to experiment with local versions of IBM\u2019s Granite-7B and Merlinite-7B models, and submit improvements as pull requests to the InstructLab taxonomy on GitHub. Project maintainers review the proposed skills, and if they meet community guidelines, the data is generated and used to fine-tune the base model. Updated versions are then released back to the community on Hugging Face.<\/p>\n<p>IBM has dedicated its AI supercomputer, Vela, to updating InstructLab models weekly. As the project scales, other public models may be included. The Apache 2.0 license governs all data and code generated by the project.<\/p>\n<h2>The Power of Open Source<\/h2>\n<p>Open-source software has been a cornerstone of the internet, driving innovation and security. InstructLab aims to bring these benefits to generative language models by providing transparent, collaborative tools for model customization. This initiative follows IBM and Red Hat\u2019s long history of open-source contributions, including projects like PyTorch, Kubernetes, and the Red Hat OpenShift platform.<\/p>\n<p>\u201cThis breakthrough innovation unlocks something that was next to impossible before \u2014 the ability for communities to contribute to models and improve them together,\u201d said M\u00e1ir\u00edn Duffy, software engineering manager of the Red Hat Enterprise Linux AI team.<\/p>\n<p>For more details, visit the official <a rel=\"nofollow\" href=\"https:\/\/research.ibm.com\/blog\/instruct-lab\">IBM Research blog<\/a>.<\/p>\n<p><span><i>Image source: Shutterstock<\/i><\/span>                    <!-- Divider --><\/p>\n<p>. . .<\/p>\n<h4>Tags<\/h4>\n<p>                    <!-- Divider --><\/p>\n<p>                    <!-- Author info START --><\/p>\n<p>                    <!-- Author info END --><br \/>\n                    <!-- Divider -->\n                <\/div>\n<p>[ad_2]<br \/>\n<br \/><a href=\"https:\/\/blockchain.news\/news\/ibm-red-hat-instructlab-llm-customization\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>[ad_1] &#13; &#13; &#13; &#13; &#13; IBM Research, in collaboration with Red Hat, has launched InstructLab, an innovative open-source project designed to facilitate the collaborative<\/p>\n","protected":false},"author":1,"featured_media":229838,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"_uf_show_specific_survey":0,"_uf_disable_surveys":false,"footnotes":""},"categories":[171],"tags":[],"_links":{"self":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/posts\/229837"}],"collection":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/comments?post=229837"}],"version-history":[{"count":0,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/posts\/229837\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/media\/229838"}],"wp:attachment":[{"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/media?parent=229837"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/categories?post=229837"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/michigandigitalnews.com\/index.php\/wp-json\/wp\/v2\/tags?post=229837"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}