Fast6502Mult

This project aims to assess the fastest way of multiplying two numbers on the 6502 architecture, on a standard, flat 64k address space (so no pre-calculated 8x8 multiplication table, though other kinds of tables are taken into account). Contibutions and suggestions are more than welcome!

The speed will be judged initially by the average number of cycles required to perform all possible multiplications (so 64k for an 8x8 or 4.4x4.4 bit, 16M for 8.8x8.8 or 4.12x4.12 bit, and so on). If we can also manage to generate more accurate timing information, the decision criterion will later be changed to the analysis of histogram plots for the number of cycles. There are plans for later extensions to fixed-point math and perhaps some small, practical 3D demos using the 6502asm platform. Speed is a priority, but of course correctness is a must. Speed-size tradeoffs will be studied later, especially in regards to unrolled vs. looped code. All compiling and simulating is done with the help of asm6 and fake6502, respectively.

Current project status

8x8->16 bit	Avg. Cycles	Max Cycles	Min Cycles	Size [bytes]
Naive Implementation	317	389	245	39
Naive Implementation w/ Early Exit	305.25	409	41	42
Naive Implementation Unrolled	249	321	177	185
Swapping argument with less zeroes using table	??	??	??	512
4x4 bit precalc + rotation table	174	174	174	768

MVittiS/Fast6502Mult

Fast6502Mult

Current project status

TODOs