cid:
"bafyreiahnmjkz467hoehesavappg6hd5m2bxxenuzc7ueoketgel5vtdcy"
value:
text:
"Yes-pairwise is a significantly smaller calculation- one line versus a matrix! Interesting how it āholdsā up. A few small matrices randomly chosen to do column wise attention breaks the problem down. Moving pairwise means the problem is āembarrassingly parallelisableā and 16 GPU can be used."
$type:
"app.bsky.feed.post"
langs:
"en"
reply:
root:
cid:
"bafyreifb4mvheu5kl43pv5bc6b37omkyhqu47jocahjiuezl4dm74wjlsa"
parent:
cid:
"bafyreid3hn6lrkdfnw34yztju5fmb3svjp5776o5q6zkpfqx7skq2v5ieu"
createdAt:
"2024-05-19T22:17:29.961Z"