Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • Home
  • SEARCH
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 6722623
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 26, 20262026-05-26T09:28:09+00:00 2026-05-26T09:28:09+00:00

I have a computational intensive task which I used CUDA to implement it and

  • 0

I have a computational intensive task which I used CUDA to implement it and now I want to make it even faster with FPGAs (if possible)

The system I want to implement is a series of computations each similar to matrix multiplication in sense of being parallel. It also has some non-parallel parts in between. It works with big amounts of data.

Although I want it as fast as possible, I have enough time to learn and explore with FPGAs.

here I’m asking for suggestions on how I start my path? Which FPGA to choose and where to learn about it. any website or online class or books? I’ve decided to do this anyway but your idea of whether this will be faster on FPGA or not would be helpful too.

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-26T09:28:10+00:00Added an answer on May 26, 2026 at 9:28 am

    The big wins from an FPGA over using a GPU come from:

    • Using non-standard word widths optimised to your application. This allows denser logic, which allows more parallel processing blocks
    • using your knowledge of the required accesses to external RAM to schedule them in hardware more efficiently than a general purpose memory controller can.

    The downside is getting data to and from the FPGA. Draw a data-transfer diagram before you start. Even if the FPGA provides infinite speedup, you might still find it’s not worth the effort if there’s loads of data to be shuffled to and fro!

    It’s likely you’ll be wanting a PCI express based board. Which is (I imagine) a whole new learning-curve before you get to do anything with the FPGA – but if you’re up for it, it’ll be a very interesting task!

    In terms of choosing FPGAs, have a play with the software tools from the various vendors – at the learning stage that’s much more important than the chips themselves. You won’t find (at this early learning-stage) a show-stopper feature in any of the various chips. Also take into account the availability of boards with your required interfaces on, and any IP-core you might need to do the high-speed interfacing (eg PCIe)

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

We have a computationally intensive service which is used to do a number of
I have a computational intensive project that is highly parallelizable: basically, I have a
I have an application written in native C++ which intends intensive computation. In fact
I have a computationally intensive task that users can perform using a Silverlight app.
I have a computational algebra task I need to code up. The problem is
I have a nested, mutual recursive data structure , and want to associate computational
I'm taking a course in computational complexity and have so far had an impression
I have a computationally expensive task in perl, and would like to inform the
I have to write a program that performs highly computationally intensive calculations. The program
I would like to implement a thread pool in Java, which can dynamically resize

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.