If you only read one thingMeasuring developer productivity? A response to McKinsey (15 minute read) The McKinsey report about developer productivity Iâve linked to last time made some waves on the internet. Kent Beck, who thought about developer productivity before most current developers were born, has something to add. And there are many great points about productivity in his article. Go read it up, so you have some ammunition when being called out. The core argument against the McKinsey view is about âInput < Output < Outcome < Impactâ. I use that as a categorizer of goals all the time myself and you should too. Input (what he calls effort) is the worst, only use it if you have nothing else to measure success with. Lines of code is an input metric. Features are an outcome metric. Impact is the best category. Aim here. And McKinsey is mostly about âInputâ metrics, whereas SPACE and DORA are Outcome and Impact metrics. And input is easier to measure, and managers donât know what those numbers mean, so of course they will tend towards input metrics. MUST READ https://tidyfirst.substack.com/p/measuring-developer-productivity Image of the week(Via XKCD) Once this happened when I was at eBay. Someone sent an email to everyone. Then everyone wrote âStop!â and âGet me off that listâ and then the mail server crashed with 35.000 employees.
AIAsking 60+ LLMs a set of 20 questions (3 minute read) Ask the same questions to many LLMs. Not only is this very interesting to understand differences between models, but also between sizes of the same model, like Falcon 40B vs Falcon 7B. I did like *âSally (a girl) has 3 brothers. Each brother has 2 sisters. How many sisters does Sally have? Letâs think step by step.â Especially *âSo the number of sisters Sally has is 6.â HA take that logic!âfunniest is âerm.. 3â - but there quite are a lot funny ones for 9, 12, 24. But there is the right answer in the results (1) somewhere. https://benchmarks.llmonitor.com/ A GPT-4 Capability Forecasting Challenge (12 minute read) It is difficult to know what at this point in time, AI can do and canât do. This small game challenges you to forecast what tasks GPT-4 can succeed with. It helped me understand that the power of GPT-4 is nuanced. Must read https://nicholas.carlini.com/writing/llm-forecast/question/Capital-of-Paris Apple is reportedly spending âmillions of dollars a dayâ training AI (3 minute read) âThose involved in its development tell The Information that Appleâs most advanced LLM, known internally as Ajax GPT, has been trained on âmore than 200 billion parametersâ and is more powerful than OpenAIâs GPT-3.5â 200 billion is a large model, larger than the 180B Falcon model. https://www.theverge.com/2023/9/6/23861763/apple-ai-language-models-ajax-gpt-training-spending Exclusive: ChatGPT traffic slips again for third month in a row (6 minute read) Do people move to the app? Do people use IDE integrations? Do people use the GPT API? Or did the novelty wear off? The article bases its conclusion on data from Similarweb, a company that estimates web traffic. âWorldwide desktop and mobile website visits to the ChatGPT website decreased by 3.2% to 1.43 billion in August, following approximately 10% drops from each of the previous two months.â https://www.reuters.com/technology/chatgpt-traffic-slips-again-third-month-row-2023-09-07/ Casually running a 180B parameter LLM on M2 Ultra (1 minute read) It is a little slow on tokens/sec, and quantized down, but seems to work nicely. It does this by using the CPU and RAM it seems. Not clear if inference (not training) will be a CPU or GPU task in the future. Or if GPUs get massively more RAM, because LLM models get bigger by the week. https://twitter.com/ggerganov/status/1699791226780975439 Microsoft announces new Copilot Copyright Commitment for customers (10 minute read) Startups move aside, here comes a company with deep pockets. Is copyright the factor which decides about SaaS AI success? âAs customers ask whether they can use Microsoftâs Copilot services and the output they generate without worrying about copyright claims, we are providing a straightforward answer: yes, you can, and if you are challenged on copyright grounds, we will assume responsibility for the potential legal risks involved.â https://blogs.microsoft.com/on-the-issues/2023/09/07/copilot-copyright-commitment-ai-legal-concerns/ Stories Iâve enjoyed this weekWhy startups do need strategy â despite what youâve heard (10 minute read) Iâd argue most of the talk in the articleââForce yourself to write down a single clear goal which is stretching but achievable in five years.â is about visions not strategy, but the points are valid. I talk to startup CEOs and CTOs, and they have no strategy. Therefore, features are all over the place, work is chaos, and success seems random. âWhy startups do need strategyâ gives some good step by step advice on how tyo achieve a strategy - and as CTO you should have a tech strategy too, where is tech in 5 years? https://sifted.eu/articles/startups-need-strategy The end of the Googleverse (8 minute read) Similarweb says GPT traffic drops (see other article), The Verge sees the end of Google. Google dominated the way we saw the internet - everything through the Google lens. Then through a social media lens. Is AI next? But more to you the CTO: What if SEO traffic of your company drops 90%. Can it survive? What is plan B? Not your problem, but what if marketing comes screaming? https://www.theverge.com/23846048/google-search-memes-images-pagerank-altavista-seo-keywords Signs that youâre working as a âfirst teamâ and why it matters (16 minute read) Most CTOs I know neglect working with their peers on the management team. âWorking as a âfirst teamâ with your fellow leaders is a mindset and operational mode in which you prioritize supporting your peers instead of your direct reports.â First team looks extreme when looking at it on a management level, but there are good insights for CTOs. First, youâre in a management team, youâre no longer just a techie. The VP of Marketing and the CFO are your peers. Second, create a management team of your direct reports. And do âfirst teamâ with them. This was the most successful thing to make my CTO jobs easier, happier and more successful. https://www.lenareinhard.com/articles/first-team-signs-why-it-matters UK Air-Traffic Software Misread Spots on Map to Cause Outage (1 minute read) âThe UKâs worst air-traffic outage in a decade was caused by an anomaly in the airspace managerâs software system, which confused two geographical checkpoints separated by some 4,000 nautical miles. [..] The glitch triggered a shutdown of the software system run by NATS for safety reasons, according to a preliminary reportâ Again a crisis happens because two problems interact. And how does your system work with bad input? And do you Fuzzing to find bad input? No? Start. All You Need to Become a Servant Leader (11 minute read) Iâm a fan of being a servant leaderânot. Well again I am. Can you make up your mind, please? What is a servant leader? âServant leadership is a leadership style that empowers employees based on the premise that if employees are satisfied and cared for, they will be more motivated to give their best.â Agreed. But the terminology is manager. I do agree as a manager people on your team are your biggest lever for success. And if you care for them (and respect them, and youâre loyal to them), they will give their best. But we need to stop throwing around manager and leader interchangeably. A leader is someone who leads. Doh. A leader points out where to go, and then leads people there. But yes, managers should be servant managers. https://effyai.substack.com/p/all-you-need-to-become-a-servant Move Over, Software Developers â In The Name Of Cybersecurity, The Government Wants To Drive (108 minute read) Engineers neglected security for too long. This will become cumbersome: âat its essence it is no small thing that the White House proposes: legislation instructing people on how to code their software and requiring adherence to those instructions.â* Toyota Shut Down 14 Factories Due to âInsufficient Disk Spaceâ (8 minute read) Largest car manufacturer in the world. Doh. âThe system malfunction was caused by the unavailability of some multiple servers that process parts orders. As for the circumstances, regular maintenance work was performed on August 27, the day before the malfunction occurred. During the maintenance procedure, data that had accumulated in the database was deleted and organized, and an error occurred due to insufficient disk space, causing the system to stop. Since these servers were running on the same system, a similar failure occurred in the backup function, and a switchover could not be made.â Tell this your CEO so she is mentally prepared when you make the same mistake I know, you wonât, until you do. https://global.toyota/en/newsroom/corporate/39732568.html How Universal Are Our Emotions? (23 minute read) We assume emotions are universal. They are not. Emotions are those for which we have word. Germans have different emotions than people in the US-perceived ones. With more international teams due to remote work, keep this in mind. People on your team might have emotions you donât have. And you have some, they canât express. https://www.newyorker.com/magazine/2022/08/08/how-universal-are-our-emotions Join the CTO newsletter! | |