[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: scatter/gather op ( was:Re: [f-cpu] New EU_SHL Instruction)

To: f-cpu@seul.org
Subject: Re: scatter/gather op ( was:Re: [f-cpu] New EU_SHL Instruction)
From: Yann Guidon <whygee@f-cpu.org>
Date: Fri, 10 Jan 2003 00:34:34 +0100
Delivered-to: archiver@seul.org
Delivered-to: f-cpu-outgoing@seul.org
Delivered-to: f-cpu@seul.org
Delivery-date: Thu, 09 Jan 2003 18:19:38 -0500
Organization: Freedom CPU Project
References: <20030108084730.18996.qmail@web14910.mail.yahoo.com> <3E1CC977.7070803@f-cpu.org> <20030109140151.50113@thrai.stud.uni-hannover.de> <20030110205936.16c2fba1.nicolas.boulay@ifrance.com>
Reply-to: f-cpu@seul.org
Sender: owner-f-cpu@seul.org
User-agent: Mozilla/5.0 (Windows; U; Win95; en-US; rv:1.0.0) Gecko/20020530

hi,

nico wrote:

On Thu, 9 Jan 2003 14:01:51 +0100
Michael Riepe <michael@stud.uni-hannover.de> wrote:
On Thu, Jan 09, 2003 at 01:59:35AM +0100, Yann Guidon wrote:
[...]

and_reduce (or "combine" as written in ROP2) is not possible
for very wide data.

Furthermore, the xorn.and trick is useful for "detecting" that a
byte corresponds, but if you need to find the index of the
character, the "obvious" answer is to loop over the register.
if you have a result of 0x00FF000000000000, it's not a good
solution. So the idea is to "transpose" the bits in the word, that
would become 0x4040404040404040 and the last byte can then ben
binary encoded in INC (if it's implemented).

Wouldn't it be sufficient to `collapse' each chunk into a single bit?

that's a gather intra-chunk operation. (Such gather op are a lack in
all the f-cpu ISA because inter-chunk operation are maid in 64 bits cpu
instead of thinking about a 256 bits version.)

A add gather could be usefull too !

gather.add.64 V1 V2 R3

R3 = V1[0]+V1[1]+V1[2]+V1[3]
+V2[0]+V2[1]+V2[2]+V2[3]

(big tree adder ?)

This is easily "emulated" with a logarithmic shift/add sequence :
srhi 8, r1, r2
add.8 r1, r2, r1
srhi 16, r1, r2
add.16 r1, r2, r1
srhi 32, r1, r2
add.16 r1, r2, r1
and it works with any kind of instructions (boolean, arithmetic, FP etc.)

any comment ? (except "it is slow")

YG

*************************************************************
To unsubscribe, send an e-mail to majordomo@seul.org with
unsubscribe f-cpu       in the body. http://f-cpu.seul.org/

Follow-Ups:
- Re: scatter/gather op ( was:Re: [f-cpu] New EU_SHL Instruction)
  - From: nico <nicolas.boulay@ifrance.com>

References:
- Re: [f-cpu] New EU_SHL Instruction
  - From: Just an Illusion <illusion_to_net@yahoo.fr>
- Re: [f-cpu] New EU_SHL Instruction
  - From: Yann Guidon <whygee@f-cpu.org>
- Re: [f-cpu] New EU_SHL Instruction
  - From: Michael Riepe <michael@stud.uni-hannover.de>
- scatter/gather op ( was:Re: [f-cpu] New EU_SHL Instruction)
  - From: nico <nicolas.boulay@ifrance.com>

Prev by Date: Re: [f-cpu] Are 8 bits SIMD mode usefull ?
Next by Date: Re: [f-cpu] Are 8 bits SIMD mode usefull ?
Previous by thread: scatter/gather op ( was:Re: [f-cpu] New EU_SHL Instruction)
Next by thread: Re: scatter/gather op ( was:Re: [f-cpu] New EU_SHL Instruction)
Index(es):
- Date
- Thread