floating_point_operation

Build: $ g++ main.cpp fp32.cc FP16G.cc fp16.cc -o main

Run: $ ./main

C-based, bit-level FP16/FP32 arithmetic operators for designing and verifying 16-bit/32-bit floating-point hardware operators.
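
As a rough illustration of what "bit-level" means here, the sketch below splits an FP32 value and an FP16 bit pattern into sign, exponent, and mantissa fields, assuming the IEEE 754 binary32 and binary16 layouts. The names `Fields`, `unpack_fp32`, `unpack_fp16`, and `is_subnormal_fp16` are hypothetical and are not taken from this repository's sources.

```cpp
#include <cstdint>
#include <cstring>

// Hypothetical helper type for illustration; not part of this repository.
struct Fields {
    uint32_t sign;      // 1 bit
    uint32_t exponent;  // biased exponent field
    uint32_t mantissa;  // fraction bits, without the hidden leading 1
};

// IEEE 754 binary32: 1 sign bit, 8 exponent bits, 23 fraction bits.
Fields unpack_fp32(float value) {
    uint32_t bits;
    std::memcpy(&bits, &value, sizeof bits);  // copy out the raw bit pattern
    return Fields{bits >> 31, (bits >> 23) & 0xFFu, bits & 0x7FFFFFu};
}

// IEEE 754 binary16: 1 sign bit, 5 exponent bits, 10 fraction bits.
// The value is passed as a raw 16-bit pattern because standard C++
// (before C++23) has no portable half-precision type.
Fields unpack_fp16(uint16_t bits) {
    uint32_t wide = bits;
    return Fields{wide >> 15, (wide >> 10) & 0x1Fu, wide & 0x3FFu};
}

// A subnormal has an all-zero exponent field and a non-zero fraction.
bool is_subnormal_fp16(uint16_t bits) {
    Fields f = unpack_fp16(bits);
    return f.exponent == 0 && f.mantissa != 0;
}
```

For example, `unpack_fp32(1.0f)` yields sign 0, exponent 127, and mantissa 0, while the FP16 pattern `0x0001` (the smallest positive subnormal) has exponent 0 and mantissa 1.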

FP32 Addition
FP32 Subtraction
FP32 Multiplication
FP32 Division (a bit error can occur in the last bit of the mantissa due to round-up)
FP16 Addition/Subtraction (supports normal and subnormal values)
FP16 Multiplication (supports normal and subnormal values)

TODO list
Add FP16 Division operator
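
One way to quantify a last-bit discrepancy, such as the one noted for FP32 division, is to measure the distance between two results in units in the last place (ULP). The sketch below assumes IEEE 754 binary32 bit patterns and does not handle NaN inputs. The helpers `float_bits`, `ordered`, and `ulp_distance` are hypothetical and do not come from this repository.

```cpp
#include <cstdint>
#include <cstring>

// Reinterpret a float as its raw 32-bit pattern.
uint32_t float_bits(float value) {
    uint32_t bits;
    std::memcpy(&bits, &value, sizeof bits);
    return bits;
}

// Map a sign-magnitude bit pattern onto a signed integer line so that
// neighbouring floats differ by exactly 1 and +0/-0 both map to 0.
int64_t ordered(uint32_t bits) {
    int64_t magnitude = static_cast<int64_t>(bits & 0x7FFFFFFFu);
    return (bits >> 31) ? -magnitude : magnitude;
}

// Number of representable floats between two bit patterns (NaNs not handled).
int64_t ulp_distance(uint32_t a, uint32_t b) {
    int64_t diff = ordered(a) - ordered(b);
    return diff < 0 ? -diff : diff;
}
```

A check such as `ulp_distance(result_bits, float_bits(a / b)) <= 1`, where `result_bits` stands for the output of the operator under test, would tolerate a difference in the last mantissa bit while still flagging larger errors. This assumes the host's native `float` division is an acceptable reference.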
