Paper: https://arxiv.org/abs/2311.05908

Code: https://github.com/HazyResearch/flash-fft-conv

Blog post: https://hazyresearch.stanford.edu/blog/2023-11-13-flashfftconv

Abstract:

Convolution models with long filters have demonstrated state-of-the-art reasoning abilities in many long-sequence tasks but lag behind the most optimized Transformers in wall-clock time. A major bottleneck is the Fast Fourier Transform (FFT), which allows long convolutions to run in O(N log N) time in sequence length N but has poor hardware utilization. In this paper, we study how to optimize the FFT convolution. We find two key bottlenecks: the FFT does not effectively use specialized matrix multiply units, and it incurs expensive I/O between layers of the memory hierarchy. In response, we propose FlashFFTConv. FlashFFTConv uses a matrix decomposition that computes the FFT using matrix multiply units and enables kernel fusion for long sequences, reducing I/O. We also present two sparse convolution algorithms, 1) partial convolutions and 2) frequency-sparse convolutions, which can be implemented simply by skipping blocks in the matrix decomposition, enabling further opportunities for memory and compute savings. FlashFFTConv speeds up exact FFT convolutions by up to 7.93× over PyTorch and achieves up to 4.4× speedup end-to-end. Given the same compute budget, FlashFFTConv allows Hyena-GPT-s to achieve 2.3 points better perplexity on the PILE and M2-BERT-base to achieve 3.3 points higher GLUE score, matching models with twice the parameter count. FlashFFTConv also achieves 96.1% accuracy on Path-512, a high-resolution vision task where no model had previously achieved better than 50%. Furthermore, partial convolutions enable longer-sequence models, yielding the first DNA model that can process the longest human genes (2.3M base pairs), and frequency-sparse convolutions speed up pretrained models while maintaining or improving model quality.
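For intuition, here is a minimal NumPy sketch of the two ideas the abstract leans on (not the FlashFFTConv kernels or API; the function names and the 32×32 factorization are just illustrative). `fft_conv` is the O(N log N) long convolution that replaces a direct O(N²) convolution, and `fft_via_matmul` is the Cooley-Tukey identity behind the paper's matrix decomposition: a length N = N1·N2 FFT becomes two rounds of small dense matrix multiplies plus a twiddle correction, which is the kind of work matrix multiply units are built for.

```python
import numpy as np

def fft_conv(u: np.ndarray, k: np.ndarray) -> np.ndarray:
    """Causal convolution of a length-N signal with a length-N filter in
    O(N log N): zero-pad to 2N, multiply in frequency, transform back."""
    n = u.shape[-1]
    u_f = np.fft.rfft(u, n=2 * n)
    k_f = np.fft.rfft(k, n=2 * n)
    return np.fft.irfft(u_f * k_f, n=2 * n)[..., :n]

def dft_matrix(n: int) -> np.ndarray:
    """Dense n-by-n DFT matrix F, so that F @ x == np.fft.fft(x)."""
    idx = np.arange(n)
    return np.exp(-2j * np.pi * np.outer(idx, idx) / n)

def fft_via_matmul(x: np.ndarray, n1: int, n2: int) -> np.ndarray:
    """Length n1*n2 FFT written as two rounds of small dense matmuls plus a
    twiddle correction (the classic Cooley-Tukey factorization), i.e. the
    shape of computation that maps onto matrix multiply units."""
    assert x.shape[-1] == n1 * n2
    X = x.reshape(n1, n2)                       # X[a, b] = x[a*n2 + b]
    Y = dft_matrix(n1) @ X                      # length-n1 DFTs down the columns
    twiddle = np.exp(-2j * np.pi
                     * np.outer(np.arange(n1), np.arange(n2)) / (n1 * n2))
    Z = (Y * twiddle) @ dft_matrix(n2)          # length-n2 DFTs along the rows
    return Z.T.reshape(-1)                      # output index k = k1 + n1*k2

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    u, filt = rng.standard_normal(1024), rng.standard_normal(1024)
    assert np.allclose(fft_conv(u, filt), np.convolve(u, filt)[:1024])

    x = rng.standard_normal(1024) + 1j * rng.standard_normal(1024)
    assert np.allclose(fft_via_matmul(x, 32, 32), np.fft.fft(x))
    print("FFT convolution and matmul-based FFT match the NumPy references")
```

The partial and frequency-sparse convolutions described in the abstract then correspond to skipping blocks of these matrices, which is why they fall out of the same decomposition.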

  • currentscurrents@alien.top · 10 months ago

    Then you lose the 2D grid structure of the image, which is why you want to use a CNN in the first place.

    I think it’s possible to apply many of these optimizations to 2D convs as well though. This group is just more interested in language modeling than images.
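
    As a sanity check that nothing about the frequency-domain trick is 1D-specific, here is a minimal NumPy sketch of a 2D FFT convolution (circular boundaries; `fft_conv2d` is just an illustrative name, not anything from the repo):

    ```python
    import numpy as np

    def fft_conv2d(img: np.ndarray, filt: np.ndarray) -> np.ndarray:
        """Circular 2D convolution via the 2D FFT: pointwise multiply in
        frequency over both spatial axes, then transform back."""
        h, w = img.shape
        return np.real(np.fft.ifft2(np.fft.fft2(img) * np.fft.fft2(filt, s=(h, w))))
    ```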

    • Raion17@alien.top · 10 months ago

      Yeah, they built this project because their previous work uses FFT convolutions for sequence modeling.