MAGIA

This repo contains MAGIA (Mesh Architecture for Generative Intelligence Acceleration), an open-source large-scale accelerator designed for Generative Artificial Intelligence (GenAI). MAGIA is a network of tiles that have at their heart RedMulE for General Matrix Multiply (GeMM) acceleration, iDMA for fast and efficient data movement, Spatz Core Complex (Spatz CC) for vector workloads acceleration, and an L1 scratchpad memory (SPM). Tiles are connected to a mesh Network-on-Chip (NoC) - FlooNoC - used for communication, and a dedicated network for synchronization - FractalSync. Each tile is equipped with an Event Unit for tile synchronization and event aggregation. MAGIA is designed to support matrices of sizes that vary by orders of magnitude, and also sparse matrix multiplication.

MAGIA is developed as part of the PULP (Parallel Ultra-Low Power) project, a joint effort between ETH Zurich and the University of Bologna.

⭐ Getting Started

Pre-requisites

MAGIA uses bender to manage its dependencies and to automatically generate compilation scripts.

We use a virtual python environment which requires python>=3.6.8. To create the envrionment use (MAGIA folder):

make python_venv

By default, the python in your $PATH is used. You can specify the version by optionally exporting the BASE_PYTHON environment variable.

Simulation

The following optional parameters can be specified:

mesh_dv: 0|1 (Default: 1). 0 simulation of a single tile; 1 simulation of the entire mesh.

fast_sim: 0|1 (Default: 0). 0 faster simulation that does not track signals; 1 simulation that tracks signals (for debugging).

gui: 0|1 (Default: 0). 0 simulation without GUI; 1 simulation with GUI.

test: tile_test|mesh_test (Default: mesh_test). Specifies which tests should be run. More fine-grain tests are available, see sw/tests.

Instructions to build HW/SW and run simulations:

1) Setup the environment (MAGIA folder):

source setup_env.sh

2) Install python dependencies (MAGIA folder):

make python_deps

3) Download Bender (MAGIA folder):

make bender

4) Clone the dependencies and generate the compilation script (MAGIA folder):

make update-ips > update-ips.log <mesh_dv>

4*) Apply FlooNoC patch - currently FlooNoC requires this step but should not need it in the future (MAGIA folder):

make floonoc-patch

5) Build the hardware (MAGIA folder):

make build-hw > build-hw.log <mesh_dv> <fast_sim>

6) Compile the test code (MAGIA folder):

make all <test>

7) Run test (MAGIA folder):

make run <test> <gui> <mesh_dv>

Full example:

make python_venv
source setup_env.sh
make python_deps
make bender
make update-ips > update-ips.log 
make build-hw > build-hw.log fast_sim=1
make all test=fsync_test
make run test=fsync_test

⚙️ Architecture

Tile

The central piece of the architecture is the MAGIA tile containing a GeMM accelerator, a Vector Processor, a DMA engine, a multi-banked L1 SPM, an Event Unit, and a lightweight control core. The L1 features interleaved memory banks that compose the Tightly-Coupled Data Memory (TCDM). Each tile has access to the global L2 and to a subset of other tiles' L1, accessing the latter via on-chip remote direct memory access (RDMA). Inter-tile and global communication is carried out through AXI-based narrow (32-bit) and wide (256-bit) NoC channels in FlooNoC. External tiles and the core access the L1 through an OpenBus Interface (OBI) XBAR.

Each tile is controlled by a cv32e40p. Control of iDMA, RedMulE, FractalSync, and Spatz CC control registers follows a memory-mapped model, with the Event Unit handling event aggregation for system control.

Mesh

Replicating the MAGIA tile, we scale up to a homogeneous two-dimensional (2D) mesh of compute tiles. The NoC allows access to the global west-side L2 through row-side interfaces, while tiles exchange traffic through FlooNoC router. The mesh uses XY routing and carries both AXI narrow channels (32-bit) and AXI wide channels (256-bit), with protocol conversion handled by per-tile Network Interfaces (NIs).

Rendez-vous among tiles are managed through the FractalSync (FS) mechanism and the dedicated network.

Memory map

This map reflects the RTL memory-mapped layout defined in hw/tile/magia_tile_pkg.sv.

tile_base = mhartid * 0x0010_0000

Per-tile local map (offset from tile_base, starts at 0x0000_0000):

Region	Local Range	Global Range (`tile_base + offset`)
RedMulE CTRL	`0x0000_0100-0x0000_01FF`	`tile_base + 0x0000_0100 ... 0x0000_01FF`
iDMA CTRL	`0x0000_0200-0x0000_05FF`	`tile_base + 0x0000_0200 ... 0x0000_05FF`
FractalSync CTRL	`0x0000_0600-0x0000_06FF`	`tile_base + 0x0000_0600 ... 0x0000_06FF`
Event Unit	`0x0000_0700-0x0000_16FF`	`tile_base + 0x0000_0700 ... 0x0000_16FF`
Spatz CTRL	`0x0000_1700-0x0000_17FF`	`tile_base + 0x0000_1700 ... 0x0000_17FF`
Reserved	`0x0000_0000-0x0000_FFFF`	`tile_base + 0x0000_0000 ... 0x0000_FFFF`
Stack	`0x0001_0000-0x0001_FFFF`	`tile_base + 0x0001_0000 ... 0x0001_FFFF`
L1 SPM	`0x0002_0000-0x000F_FFFF`	`tile_base + 0x0002_0000 ... 0x000F_FFFF`

Shared/global map:

Region	Range	Notes
Spatz BootROM	`0x1000_0000-0x1000_00FF`	Tile AXI xbar bootrom target
L2	`0xC000_0000-0xFFFF_FFFF`	Global L2 window
Instructions	`0xCC00_0000-0xCC00_7FFF`	Instruction sub-region inside L2

Software/test utility addresses (used by SW runtime and testbench VIP):

Region	Address	Notes
Test End	`0xCCFF_0000`	Exit code location used by SW runtime/tests
String (utoa)	`tile_base + 0x0000_1800`	String scratch location (`RESERVED_START + STR_OFFSET`)
Print (stderr)	`0xFFFF_0000`	Memory-mapped stderr sink in simulation VIP
Print (stdio)	`0xFFFF_0004`	Memory-mapped stdio sink in simulation VIP
Synch.	`tile_base + 0x0000_F000`	Derived from `RESERVED_START + SYNC_OFFSET`

🖥️ Programming model

The flow is memory-mapped (MM): software configures and starts accelerators by writing control registers in each tile address space.

Execution model: SPMD over tiles, with mhartid selecting tile_base = mhartid * 0x0010_0000.
Control path: CV32E40P accesses RedMulE, iDMA, FractalSync, Event Unit, and Spatz control registers via MMIO.
Data path: iDMA moves data between L1 and external memory, while compute engines consume/produce data in L1.
Synchronization: Event Unit and FractalSync provide interrupt/event and barrier mechanisms for inter-tile coordination.

Software APIs for MM control are under sw/utils/ (for example redmule_mm_utils.h, idma_mm_utils.h, fsync_mm_api.h, magia_spatz_utils.h and event_unit_utils.h). For Spatz Core Complex programming flow (runtime handshake, task loading, and execution model), see spatz/README.md.

🧰 Changing number of tiles

Supported Mesh Configurations: 2x2, 4x4, 8x8, 16x16, 32x32

Scripts: The num_cores parameter in the Makefile specifies for how many core stack traces should be generated.

Tests : The MESH_X_TILES and MESH_Y_TILES parameters in sw/utils/magia_utils.h adapt the software stack to the specific mesh configuration.

RTL/TB : The N_TILES_X and N_TILES_Y parameters in hw/mesh/magia_pkg.sv specifie the number of tiles and allows the derivation of the appropriate data and syncrhonization networks.

🔏 License

MAGIA is an open-source project with a permissive license. All software sources are licensed under the Apache License 2.0 (LICENSE.APACHE). All hardware sources are licensed under the Solderpad Hardware License 0.51 (LICENSE.SHL).

Name		Name	Last commit message	Last commit date
Latest commit History 183 Commits
.github/workflows		.github/workflows
doc		doc
hw		hw
magia-sdk @ 6f1b483		magia-sdk @ 6f1b483
scripts		scripts
sim		sim
spatz		spatz
sw		sw
target/sim/src		target/sim/src
.gitignore		.gitignore
.gitlab-ci.yml		.gitlab-ci.yml
.gitmodules		.gitmodules
Bender.local		Bender.local
Bender.yml		Bender.yml
LICENSE.APACHE		LICENSE.APACHE
LICENSE.SHL		LICENSE.SHL
Makefile		Makefile
README.md		README.md
bender_common.mk		bender_common.mk
bender_profile.mk		bender_profile.mk
bender_sim.mk		bender_sim.mk
bender_synth.mk		bender_synth.mk
floonoc.patch		floonoc.patch
requirements.txt		requirements.txt
setup_env.sh		setup_env.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MAGIA

⭐ Getting Started

Pre-requisites

Simulation

⚙️ Architecture

Tile

Mesh

Memory map

🖥️ Programming model

🧰 Changing number of tiles

🔏 License

About

Licenses found

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

MAGIA

⭐ Getting Started

Pre-requisites

Simulation

⚙️ Architecture

Tile

Mesh

Memory map

🖥️ Programming model

🧰 Changing number of tiles

🔏 License

About

Resources

License

Licenses found

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages