Easton Man's Channel

中国共产党第十七届、十八届、十九届中央政治局常委，国务院原总理李克强同志，近日在上海休息，2023年10月26日因突发心脏病，经全力抢救无效，于10月27日0时10分在上海逝世，享年68岁。讣告后发。

https://content-static.cctvnews.cctv.com/snow-book/index.html?item_id=17611136792319939361

13:29 · Oct 26, 2023 · Thu

Easton Man's Channel

#今日看了什么 #学习 https://youtu.be/JzTrDyoLHTg?si=j9Jyc_BSY2yIAjEc

感觉从微架构设计的角度来说，ARM把发射宽度从6拉到10是不成熟的设计，比苹果还差了非常多。
给人一种香山的感觉🤯

13:23 · Oct 26, 2023 · Thu

#今日看了什么
#学习
https://youtu.be/JzTrDyoLHTg?si=j9Jyc_BSY2yIAjEc

YouTube

小米14性能分析：骁龙8Gen3到底有多强？

今日看了什么学习

10:37 · Oct 24, 2023 · Tue

Cascade: CPU Fuzzing via Intricate Program Generation https://comsec.ethz.ch/research/hardware-design-security/cascade-cpu-fuzzing-via-intricate-program-generation/

21:36 · Oct 23, 2023 · Mon

Daniel Lemire's blog
Appending to an std::string character-by-character: how does the capacity grow?

In C++, suppose that you append to a string one character at a time:

while(my_string.size() <= 10'000'000) {
  my_string += "a";
}

In theory, it might be possible for the C++ runtime library to implement this routine as the creation of a new string with each append: it could allocate a new memory region that contains just one extra character, and copy to the new region. It would be very slow in the worst case. Of course, the people designing the runtime libraries are aware of such potential problem. Instead of allocating memory and copying with each append, they will typically grow the memory usage in bulk. That is, every time new memory is needed, they double the memory usage (for example).

Empirically, we can measure the allocation. Starting with an empty string, we may add one character at a time. I find that GCC 12 uses capacities of size 15 × 2 k for every increasing integers k, so that the string capacities are 15, 30, 60, 120, 240, 480, 960, 1920, etc. Under macOS (LLVM 15), I get that clang doubles the capacity and add one, except for the initial doubling, so you get capacities of 22, 47, 95, 191, 383, 767, etc. So the string capacity grows exponentially.

If you omit the cost of writing the character, what is the cost of these allocations and copy for long strings? Assume that allocating N bytes costs you N units of work. Let us consider the GCC 12 model : they both lead to the same conclusion. To construct a string of size up to 15 × 2n, it costs you 15 + 15 × 21 + 15 × 22 + … + 15 × 2n which is 15 × (2n + 1 – 1). Generally speaking, you find that this incremental doubling approach costs you no more than 2N units of work to construct a string of size N. In computer science parlance, the complexity is linear. In common sense parlance, it scales well.

A consequence of how strings allocate memory is that you may find that many of your strings have excess capacity if you construct them by repeatedly appending characters. To save memory, you may call the method shrink_to_fit() to remove this excess capacity.

source

16:52 · Oct 23, 2023 · Mon

#XiangShan
频道香山内鬼+1
是不是已有三位了

XiangShan

15:34 · Oct 23, 2023 · Mon

https://t.me/loongson_users/72574
========
创车

白铭骢 in Loongson

3A6000 正式版主板 (XA61200) 订购信息

近日龙芯武汉通知我可以订购 3A6000 正式版主板 (XA61200) 且可以分享此消息，所以在这里扩散一下：

— 板型为 DTX（可理解为窄版 mATX，203mm × 244mm）
— 桥片依然为 7A2000
— 扩展槽有：2 * DDR4 UDIMM, 1 * PCIe 3.0 x16 (x8), 1* PCIe 3.0 x8 (x8), 1* PCIe 3.0 x4 (x4), 1 * m.2 E.Key (Wi-Fi), 1* mPCIe…

10:29 · Oct 23, 2023 · Mon

Chips and Cheese
Cinebench 2024: Reviewing the Benchmark
#ChipAndCheese

Telegraph | source
(author: clamchowder)

Telegraph

Cinebench 2024: Reviewing the Benchmark

Maxon’s Cinebench is a perennial benchmark favorite. It’s free, easy to run, and scales across as many cores as you can give it. Its $0 cost allows the internet to provide plenty of results for reference. Consumers and tech reviewers alike therefore heavily…

ChipAndCheese

15:43 · Oct 21, 2023 · Sat

#龙芯 #JamesAslan
https://zhuanlan.zhihu.com/p/662561990

龙芯 JamesAslan

12:12 · Oct 19, 2023 · Thu

Easton Man's Channel

#今日看了什么专业的 https://kurnal.xlog.app/Hi36A0V120fenxi

11:56 · Oct 19, 2023 · Thu

#今日看了什么
专业的
https://kurnal.xlog.app/Hi36A0V120fenxi

Kirin 9000s Chip-Level Analysis - Kurnal

This article is an analysis of Hi36a0V120 This article was written three days after the release of Mate60, with an unknown publication date.…

今日看了什么

09:59 · Oct 19, 2023 · Thu

Daniel Lemire's blog
For processing strings, streams in C++ can be slow

Telegraph | source

Telegraph

For processing strings, streams in C++ can be slow

The C++ library has long been organized around stream classes, at least when it comes to reading and parsing strings. But streams can be surprisingly slow. For example, if you want to parse numbers, then this C++ routine is close to being the worst possible…

22:19 · Oct 18, 2023 · Wed

#XiangShan
过度封装害人啊
修 Area 有感

XiangShan

21:59 · Oct 18, 2023 · Wed

Daniel Lemire's blog
How many billions of transistors in your iPhone processor?

In about 10 years, Apple has multiplied by 19 the number of transistors in its mobile processors. It corresponds roughly to a steady rate of improvement of 34% per year on the number of transistors, or a doubling every 2.5 years. In real dollars, an iPhone has roughly a constant price: the price tag of a new iPhone increases every year, but it does so while tracking the inflation. Thus you are getting ever more transistors in your iPhone for the same price.

source

23:07 · Oct 17, 2023 · Tue

Chips and Cheese
Arm Announces Total Design: Taking Neoverse CSS Forward
#ChipAndCheese

Telegraph | source
(author: Nexus)

Telegraph

Arm Announces Total Design: Taking Neoverse CSS Forward

We recently covered the announcement of Arm’s Neoverse CSS Genesis N2 platform, a near off-the-shelf compute subsystem design created to accelerate the time to market for custom accelerators in leading edge infrastructure. We commented at the time that we…

ChipAndCheese

18:56 · Oct 17, 2023 · Tue

杰哥的{运维，编程，调板子}小笔记

Clang 如何支持 CUDA 程序

前言

编译 CUDA 程序的主要工具是 NVIDIA 提供的闭源编译器 NVCC，但实际上，NVCC 是基于 LLVM 开发的（来源：NVIDIA CUDA Compiler），NVIDIA 也把 NVCC 其中一部分逻辑贡献给了 LLVM 上游，使得 Clang 也可以在 CUDA 的配合下编译 CUDA 程序。这篇博客尝试研究 Clang/LLVM 如何实现 CUDA 程序的编译，主要是 Clang 前端部分，后端部分，也就是从 LLVM IR 到 NVPTX 的这一步还没有进行深入的研究。

source

Before

After