#[device]
Marks a function as a device entry point for launch().
Generates a unit struct implementing DeviceFn with execute().
cargo <subcommand>: execute() calls the original function body and runs it on the CPU.
cargo furiosa-opt <subcommand>: execute() loads the compiled EDF and runs it on the NPU.
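
As a rough illustration of the CPU path described above, the following hand-written sketch approximates what the attribute might generate for a small function. Everything in it is an assumption made for illustration: the shape of the DeviceFn trait (associated Input/Output types and the execute() signature), the launch() signature, and the example function add_one are hypothetical stand-ins, not the crate's actual definitions. The NPU path (loading the compiled EDF) is not modeled.

```rust
// Hypothetical stand-in for the crate's DeviceFn trait. The real trait's
// signature is not shown on this page and may differ.
pub trait DeviceFn {
    type Input;
    type Output;
    fn execute(&self, input: Self::Input) -> Self::Output;
}

// What a user might write (illustrative only):
//
//     #[device]
//     fn add_one(x: i32) -> i32 { x + 1 }
//
// Hand-written approximation of the CPU-path expansion: a unit struct
// named after the function, whose execute() runs the original body.
#[allow(non_camel_case_types)]
pub struct add_one;

impl DeviceFn for add_one {
    type Input = i32;
    type Output = i32;

    fn execute(&self, x: i32) -> i32 {
        // Original function body, executed on the CPU.
        x + 1
    }
}

// Hypothetical stand-in for launch(): it only dispatches to execute().
pub fn launch<F: DeviceFn>(f: F, input: F::Input) -> F::Output {
    f.execute(input)
}
```

In this sketch, launch(add_one, 41) passes the unit struct as a value and forwards the argument to execute(), which is why the generated type can be used at the call site much like the original function name.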