AI inference

WasmEdge at KubeCon NA 2024: AI-Driven Video Translation

Second State is thrilled to bring its open source work around LLMs, WasmEdge and Gaia, to KubeCon + CloudNativeCon North America 2024, November 12-15, 2024 in Salt Lake City, Utah. This year, Second State unveils VideoLangua.com—a groundbreaking platform using the open source WasmEdge and Gaia tech stack to deliver high-quality video translation, dubbing, and subtitling. This innovation highlights Second State’s commitment to democratizing access to global communication through advanced open-source solutions.…
LLM AI inference Rust WebAssembly
WebAssembly Devroom at FOSDEM 2025 – Call for Speakers Open

We're excited to announce that the WebAssembly Devroom will be held on 2nd February 2025 at FOSDEM 2025 in Brussels, Belgium. WebAssembly is expanding its use cases from browsers to the cloud, and this devroom is a fantastic opportunity for the community to meet and discuss the latest developments in the WebAssembly ecosystem. This is cohosted by WasmEdge and NTT software innovation lab engineers. About FOSDEM: FOSDEM is a free event for software developers to meet, share ideas, and collaborate.…
LLM AI inference Rust WebAssembly
Lightweight and cross-platform LLM agents on Ascend 910B

The Ascend 910B is a popular alternative to the Nvidia H100 in China. While it is a powerhouse for AI training workloads, we are mostly interested in its inference performance. That is especially relevant as new Ascend NPUs are released for edge devices. Recently, Huawei generously donated 5 bare metal servers with 8x Ascend 910B each to support the GOSIM Super Agent hackathon event. Those machines are truly beasts costing well over $100k USD each.…
LLM AI inference Rust WebAssembly
Run FLUX.1 [schnell] on your MacBook

FLUX.1 is an open-source image generation model developed by Black Forest Labs, the creators of Stable Diffusion. They recently released FLUX.1 [schnell], a lightweight, high-speed variant designed for local use, ideal for personal projects, and licensed under Apache 2.0. With WasmEdge's release of version 0.14.1, which includes Stable Diffusion plugin support, you can use LlamaEdge (the Rust + Wasm stack) to run the FLUX.1 [schnell] model and Stable Diffusion model and generate images directly on your machine without needing to install complex Python packages or C++ toolchains!…
LLM AI inference Rust WebAssembly
Getting started with Qwen2.5-14B

The Qwen 2.5 series includes models ranging from 0.5B to 110B parameters, optimized for diverse tasks like coding, logical reasoning, and natural language understanding. These models, including smaller ones (0.5B, 1.8B, 4B, 7B, 14B) for edge devices and larger ones (72B, 110B) for enterprise use, have seen significant improvements in instruction-following, logic, and over 29 languages support. They have long-context support (up to 128K input tokens and over 8k token generation), and can generate structured outputs like JSON.…
LLM AI inference Rust WebAssembly
Tutorial: Run Yi-Coder as a private coding assistant

Yi-Coder is an open-source, high-performance code language model designed for efficient coding. It supports 52 programming languages and excels in tasks requiring long-context understanding, such as project-level code comprehension and generation. The model comes in two sizes—1.5B and 9B parameters—and is available in both base and chat versions. In this tutorial, you’ll learn how to Run the Yi-coder model locally with an OpenAI-compatible API Use Yi-coder to power Cursor Cursor is one of the hottest AI code editors.…
LLM Llama AI inference Rust WebAssembly
10 More Free Linux Foundation Certification Exam or Course Vouchers

Follow-Up Offer: 10 More Free Vouchers Up for Grabs! We have reached out to the winners for the last 10 vouchers-new contributors during the past 6 months. Check your inbox and confirm by replying to our email! If you think you're eligible and have not received an email, please reach out to furao@secondstate.io by sending your merged Pull Request! After the success of our previous giveaway, Second State is thrilled to announce a new round of opportunities for open-source contributors.…
LLM Llama AI inference Rust WebAssembly
Getting Started with Phi-3.5-mini-instruct

Phi-3.5-mini is a cutting-edge, lightweight version of the renowned Phi-3 model, designed to handle extensive contexts up to 128K tokens with unparalleled efficiency. Built from a mix of synthetic data and meticulously filtered web content, this model excels in high-quality, reasoning-intensive tasks. The development of Phi-3.5-mini involved advanced techniques such as supervised fine-tuning and innovative optimization strategies like proximal policy optimization and direct preference optimization. These rigorous enhancements guarantee exceptional adherence to instructions and robust safety protocols, setting a new standard in the AI landscape.…
LLM Llama AI inference Rust WebAssembly
Win Gifts from Second State/WasmEdge at KubeCon+CloudNativeCon+ OSSummit+AI_dev 2024

The GenAI, cloud-native and pen source community is eagerly anticipating the upcoming KubeCon + CloudNativeCon + Open Source Summit + AI_dev China 2024, set to take place in Hong Kong 21-23rd August. This event promises to be a remarkable gathering of open-source luminaries, including the legendary Linus Trovalds and other star speakers. Developers will have the rare opportunity for face-to-face interactions with these influential figures, as well as with Jim Zemlin, the CEO of the Linux Foundation, and other industry leaders.…
LLM Gemma AI inference Rust WebAssembly
Advance Your Skills with WasmEdge LFX Mentorship 2024 Fall: LLMs, Trading Bots and More

The 2024 Term 3 of the LFX Mentorship program is here, and it's packed with exciting opportunities! Running from September to November, this program invites passionate developers to contribute to open source while boosting their CV. WasmEdge has 4 projects that offer an exciting opportunity for aspiring developers to work on cutting-edge projects within the WasmEdge ecosystem, with a focus on enhancing WebAssembly (WASM) capabilities, improving software reliability, and integrating modern AI techniques to build innovative applications.…
LLM Gemma AI inference Rust WebAssembly

1
2
3
4
5