10 - Architectures of Supercomputers [ID:10234]
50 von 1373 angezeigt

Welcome to the second to last lecture on architectural supercomputers.

Before we begin with the actual lecture, I would like to thank you guys for the evaluation.

We will discuss that tomorrow in detail, but I'm very happy that eight people did submit

the evaluation.

Oh, it's even nine people right now.

Eight people, yeah.

If we include the cameraman, then nine.

So last time, we did stop after a brief historical overview

of GPUs and how the development went,

basically from entertainment devices

to programmable devices, which can be used

for actual scientific computations.

I would like now to continue with a more scientific view on GPUs,

not so much a historical view.

Who of you knows Michael Flynn's taxonomy?

Okay, at least some of you.

Basically Flynn tried to classify computers

and also parallel computers.

So his taxonomy basically distinguishes

how many instruction streams and data streams a machine has.

For instance, a classic single core, non-vectorized,

non-simultaneous multi-threading enabled CPU

would be a single instruction, single data machine.

Probably the microcontroller in my toaster is a SysD device.

SIMD, SIMD is probably the most used abbreviation.

SIMD stands for single instruction multiple data,

and that's for instance used for describing

vector instruction sets.

For instance, if you have SSE or AVX in your CPU,

then you could say that core is a SIMD device.

Although I have to say that this MD

originally had a different meaning.

Originally this was really,

it referred to multiple data streams,

and classic short vector instruction sets we have in,

for instance, the x86 CPUs,

don't have multiple data streams for their vectorized,

for the vectorized instructions.

They typically assume that you're reading a large chunk,

for instance half a cache line,

depending on how large your vector word is,

and they say this is our multiple data,

but it's not multiple data streams.

Originally SIMD did refer, for instance, to vector CPUs,

where they could use different offsets

for the individual elements in a vector register.

So for instance, they were able to fetch

one data element from here to the vector register,

and another element could be fetched from here.

They were able to do some sort of gather,

Teil einer Videoserie :

Zugänglich über

Offener Zugang

Dauer

01:19:51 Min

Aufnahmedatum

2015-01-20

Hochgeladen am

2019-04-03 13:19:04

Sprache

en-US

Einbetten
Wordpress FAU Plugin
iFrame
Teilen