Search results for: “Malaysia”

  • Intelligence per watt

    My thinking on the concept of intelligence per watt started as bullets in my notebook. It was more of a timeline than anything else at first and provided a framework of sorts from which I could explore the concept of efficiency in terms of intelligence per watt. 

    TL;DR (too long, didn’t read)

    Our path to the current state of ‘artificial intelligence’ (AI) has been shaped by the interplay and developments of telecommunications, wireless communications, materials science, manufacturing processes, mathematics, information theory and software engineering. 

    Progress in one area spurred advances in others, creating a feedback loop that propelled innovation.  

    Over time, new use cases have become more personal and portable – necessitating a focus on intelligence per watt as a key parameter. Energy consumption directly affects industrial design and end-user benefits. Small low-power integrated circuits (ICs) facilitated fuzzy logic in portable consumer electronics like cameras and portable CD players. Low power ICs and power management techniques also helped feature phones evolve into smartphones.  

    A second-order effect of optimising for intelligence per watt is reducing power consumption across multiple applications. This spurs yet more new use cases in a virtuous innovation circle. This continues until the laws of physics impose limits. 

    Energy storage density and consumption are fundamental constraints, driving the need for a focus on intelligence per watt.  

    As intelligence per watt improves, there will be a point at which the question isn’t just what AI can do, but what should be done with AI? And where should it be processed? Trust becomes less about emotional reassurance and more about operational discipline. Just because it can handle a task doesn’t mean it should – particularly in cases where data sensitivity, latency, or transparency to humans is non-negotiable. A highly capable, off-device AI might be a fine at drafting everyday emails, but a questionable choice for handling your online banking. 

    Good ‘operational security’ outweighs trust. The design of AI systems must therefore account not just for energy efficiency, but user utility and deployment context. The cost of misplaced trust is asymmetric and potentially irreversible.

    Ironically the force multiplier in intelligence per watt is people and their use of ‘artificial intelligence’ as a tool or ‘co-pilot’. It promises to be an extension of the earlier memetic concept of a ‘bicycle for the mind’ that helped inspire early developments in the personal computer industry. The upside of an intelligence per watt focus is more personal, trusted services designed for everyday use. 

    Integration

    In 1926 or 27, Loewe (now better known for their high-end televisions) created the 3NF[i].

    While not a computer, but instead to integrate several radio parts in one glass envelope vacuum valve. This had three triodes (early electronic amplifiers), two capacitors and four resistors. Inside the valve the extra resistor and capacitor components went inside their own glass tubes. Normally each triode would be inside its own vacuum valve. At the time, German radio tax laws were based on the number of valve sockets in a device, making this integration financially advantageous. 

    Post-war scientific boom

    Between 1949 and 1957 engineers and scientists from the UK, Germany, Japan and the US proposed what we’d think of as the integrated circuit (IC). These ideas were made possible when breakthroughs in manufacturing happened. Shockley Semiconductor built on work by Bell Labs and Sprague Electric Company to connect different types of components on the one piece of silicon to create the IC. 

    Credit is often given to Jack Kilby of Texas Instruments as the inventor of the integrated circuit. But that depends how you define IC, with what is now called a monolithic IC being considered a ‘true’ one. Kilby’s version wasn’t a true monolithic IC. As with most inventions it is usually the child of several interconnected ideas that coalesce over a given part in time. In the case of ICs, it was happening in the midst of materials and technology developments including data storage and computational solutions such as the idea of virtual memory through to the first solar cells. 

    Kirby’s ICs went into an Air Force computer[ii] and an onboard guidance system for the Minuteman missile. He went on to help invent the first handheld calculator and thermal printer, both of which took advantage of progress in IC design to change our modern way of life[iii]

    TTL (transistor-to-transistor logic) circuitry was invented at TRW in 1961, they licensed it out for use in data processing and communications – propelling the development of modern computing. TTL circuits powered mainframes. Mainframes were housed in specialised temperature and humidity-controlled rooms and owned by large corporates and governments. Modern banking and payments systems rely on the mainframe as a concept. 

    AI’s early steps 

    Science Museum highlights

    What we now thing of as AI had been considered theoretically for as long as computers could be programmed. As semiconductors developed, a parallel track opened up to move AI beyond being a theoretical possibility. A pivotal moment was a workshop was held in 1956 at Dartmouth College. The workshop focused on a hypothesis ‘every aspect of learning or any other feature of intelligence can be so precisely described that a machine can be made to simulate it’. Later on, that year a meeting at MIT (Massachusetts Institute of Technology) brought together psychologists and linguists to discuss the possibility of simulating cognitive processes using a computer. This is the origin of what we’d now call cognitive science. 

    Out of the cognitive approach came some early successes in the move towards artificial intelligence[iv]. A number of approaches were taken based on what is now called symbolic or classical AI:

    • Reasoning as search – essentially step-wise trial and error approach to problem solving that was compared to wandering through a maze and back-tracking if a dead end was found. 
    • Natural language – where related phrases existed within a structured network. 
    • Micro-worlds – solving for artificially simple situations, similar to economic models relying on the concept of the rational consumer. 
    • Single layer neural networks – to do rudimentary image recognition. 

     By the time the early 1970s came around AI researchers ran into a number of problems, some of which still plague the field to this day:

    • Symbolic AI wasn’t fit for purpose solving many real-world tasks like crossing a crowded room. 
    • Trying to capture imprecise concepts with precise language.
    • Commonsense knowledge was vast and difficult to encode. 
    • Intractability – many problems require an exponential amount of computing time. 
    • Limited computing power available – there was insufficient intelligence per watt available for all but the simplest problems. 

    By 1966, US and UK funding bodies were frustrated with the lack of progress on the research undertaken. The axe fell first on a project to use computers on language translation. Around the time of the OPEC oil crisis, funding to major centres researching AI was reduced by both the US and UK governments respectively. Despite the reduction of funding to the major centres, work continued elsewhere. 

    Mini-computers and pocket calculators

    ICs allowed for mini-computers due to the increase in computing power per watt. As important as the relative computing power, ICs made mini-computers more robust, easier to manufacture and maintain. DEC (Digital Equipment Corporation) launched the first minicomputer, the PDP-8 in 1964. The cost of mini-computers allowed them to run manufacturing processes, control telephone network switching and control labouratory equipment. Mini-computers expanded computer access in academia facilitating more work in artificial life and what we’d think of as early artificial intelligence. This shift laid the groundwork for intelligence per watt as a guiding principle.

    A second development helped drive mass production of ICs – the pocket calculator, originally invented at Texas Instruments.  It demonstrated how ICs could dramatically improve efficiency in compact, low-power devices.

    LISP machines and PCs

    AI researchers required more computational power than mini-computers could provide, leading to the development of LISP machines—specialised workstations designed for AI applications. Despite improvements in intelligence per watt enabled by Moore’s Law, their specialised nature meant that they were expensive. AI researchers continued with these machines until personal computers (PCs) progressed to a point that they could run LISP quicker than LISP machines themselves. The continuous improvements in data storage, memory and processing that enabled LISP machines, continued on and surpassed them as the cost of computing dropped due to mass production. 

    The rise of LISP machines and their decline was not only due to Moore’s Law in effect, but also that of Makimoto’s Wave. While Gordon Moore outlined an observation that the number of transistors on a given area of silicon doubled every two years or so. Tsugio Makimoto originally observed 10-year pivots from standardised semiconductor processors to customised processors[v]. The rise of personal computing drove a pivot towards standardised architectures. 

    PCs and workstations extended computing beyond computer rooms and labouratories to offices and production lines. During the late 1970s and 1980s standardised processor designs like the Zilog Z80, MOS Technology 6502 and the Motorola 68000 series drove home and business computing alongside Intel’s X86 processors. 

    Personal computing started in businesses when office workers brought a computer to use early computer programmes like the VisiCalc spreadsheet application. This allowed them to take a leap forward in not only tabulating data, but also seeing how changes to the business might affect financial performance. 

    Businesses then started to invest more in PCs for a wide range of uses. PCs could emulate the computer terminal of a mainframe or minicomputer, but also run applications of their own. 

    Typewriters were being placed by word processors that allowed the operator to edit a document in real time without resorting to using correction fluid

    A Bicycle for the Mind

    Steve Jobs at Apple was as famous for being a storyteller as he was for being a technologist in the broadest sense. Internally with the Mac team he shared stories and memetic concepts to get his ideas across in everything from briefing product teams to press interviews. As a concept, a 1990 filmed interview with Steve Jobs articulates the context of this saying particularly well. 

    In reality, Jobs had been telling the story for a long time through the development of the Apple II and right from the beginning of the Mac. There is a version of the talk that was recorded some time in 1980 when the personal computer was still a very new idea – the video was provided to the Computer History Museum by Regis McKenna[vi].

    The ‘bicycle for the mind’ concept was repeated in early Apple advertisements for the time[vii] and even informed the Macintosh project codename[viii]

    Jobs articulated a few key concepts. 

    • Buying a computer creates, rather than reduces problems. You needed software to start solving problems and making computing accessible. Back in 1980, you programmed a computer if you bought one. Which was the reason why early personal computer owners in the UK went on to birth a thriving games software industry including the likes of Codemasters[ix]. Done well, there should be no seem in the experience between hardware and software. 
    • The idea of a personal, individual computing device (rather than a shared resource).  My own computer builds on my years of how I have grown to adapt and use my Macs, from my first sit-up and beg Macintosh, to the MacBook Pro that I am writing this post on. This is even more true most people and their use of the smartphone. I am of an age, where my iPhone is still an appendage and emissary of my Mac. My Mac is still my primary creative tool. A personal computer is more powerful than a shared computer in terms of the real difference made. 
    • At the time Jobs originally did the speech, PCs were underpowered for anything but data processing (through spreadsheets and basic word processor applications). But that didn’t stop his idea for something greater. 

    Jobs idea of the computer as an adjunct to the human intellect and imagination still holds true, but it doesn’t neatly fit into the intelligence per watt paradigm. It is harder to measure the effort developing prompts, or that expended evaluating, refining and filtering generative AI results. Of course, Steve Jobs Apple owed a lot to the vision shown in Doug Engelbart’s ‘Mother of All Demos’[x].

    Networks

    Work took a leap forward with office networked computers pioneered by Macintosh office by Apple[xi]. This was soon overtaken by competitors. This facilitated work flow within an office and its impact can still be seen in offices today, even as components from print management to file storage have moved to cloud-based services. 

    At the same time, what we might think of as mobile was starting to gain momentum. Bell Labs and Motorola came up with much of the technology to create cellular communications. Martin Cooper of Motorola made the first phone call on a cellular phone to a rival researcher at Bell Labs. But Motorola didn’t sell the phone commercially until 1983, as a US-only product called the DynaTAC 8000x[xii].  This was four years after Japanese telecoms company NTT launched their first cellular network for car phones. Commercial cellular networks were running in Scandinavia by 1981[xiii]

    In the same way that the networked office radically changed white collar work, the cellular network did a similar thing for self-employed plumbers, electricians and photocopy repair men to travelling sales people. If they were technologically advanced, they may have had an answer machine, but it would likely have to be checked manually by playing back the tape. 

    Often it was a receptionist in their office if they had one. Or more likely, someone back home who took messages. The cell phone freed homemakers in a lot of self-employed households to go out into the workplace and helped raise household incomes. 

    Fuzzy logic 

    The first mainstream AI applications emerged from fuzzy logic, introduced by Lofti A. Zadeh in 1965 mathematical paper. Initial uses were for industrial controls in cement kilns and steel production[xiv]. The first prominent product to rely on fuzzy logic was the Zojirushi Micom Electric Rice Cooker (1983), which adjusted cooking time dynamically to ensure perfect rice. 

    Rice Cooker with Fuzzy Logic 3,000 yen avail end june

    Fuzzy logic reacted to changing conditions in a similar way to people. Through the 1980s and well into the 1990s, the power of fuzzy logic was under appreciated outside of Japanese product development teams. In a quote a spokesperson for the American Electronics Association’s Tokyo office said to the Washington Post[xv].

    “Some of the fuzzy concepts may be valid in the U.S.,”

    “The idea of better energy efficiency, or more precise heating and cooling, can be successful in the American market,”

    “But I don’t think most Americans want a vacuum cleaner that talks to you and says, ‘Hey, I sense that my dust bag will be full before we finish this room.’ “

    The end of the 1990s, fuzzy logic was embedded in various consumer devices: 

    • Air-conditioner units – understands the room, the temperature difference inside-and-out, humidity. It then switches on-and-off to balance cooling and energy efficiency.
    • CD players – enhanced error correction on playback dealing with imperfections on the disc surface.
    • Dishwashers – understood how many dishes were loaded, their type of dirt and then adjusts the wash programme.
    • Toasters – recognised different bread types, the preferable degree of toasting and performs accordingly.
    • TV sets – adjust the screen brightness to the ambient light of the room and the sound volume to how far away the viewer is sitting from the TV set. 
    • Vacuum cleaners – vacuum power that is adjusted as it moves from carpeted to hard floors. 
    • Video cameras – compensate for the movement of the camera to reduce blurred images. 

    Fuzzy logic sold on the benefits and concealed the technology from western consumers. Fuzzy logic embedded intelligence in the devices. Because it worked on relatively simple dedicated purposes it could rely on small lower power specialist chips[xvi] offering a reasonable amount of intelligence per watt, some three decades before generative AI. By the late 1990s, kitchen appliances like rice cookers and microwave ovens reached ‘peak intelligence’ for what they needed to do, based on the power of fuzzy logic[xvii].

    Fuzzy logic also helped in business automation. It helped to automatically read hand-written numbers on cheques in banking systems and the postcodes on letters and parcels for the Royal Mail. 

    Decision support systems & AI in business

    Decision support systems or Business Information Systems were being used in large corporates by the early 1990s. The techniques used were varied but some used rules-based systems. These were used in at least some capacity to reduce manual office work tasks. For instance, credit card approvals were processed based on rules that included various factors including credit scores. Only some credit card providers had an analyst manually review the decision made by system.  However, setting up each use case took a lot of effort involving highly-paid consultants and expensive software tools. Even then, vendors of business information systems such as Autonomy struggled with a high rate of projects that failed to deliver anything like the benefits promised. 

    Three decades on, IBM had a similar problem with its Watson offerings, with particularly high-profile failure in mission-critical healthcare applications[xviii]. Secondly, a lot of tasks were ad-hoc in nature, or might require transposing across disparate separate systems. 

    The rise of the web

    The web changed everything. The underlying technology allowed for dynamic data. 

    Software agents

    Examples of intelligence within the network included early software agents. A good example of this was PapriCom. PapriCom had a client on the user’s computer. The software client monitored price changes for products that the customer was interested in buying. The app then notified the user when the monitored price reached a price determined by the customer. The company became known as DealTime in the US and UK, or Evenbetter.com in Germany[xix].  

    The PapriCom client app was part of a wider set of technologies known as ‘push technology’ which brought content that the netizen would want directly to their computer. In a similar way to mobile app notifications now. 

    Web search

    The wealth of information quickly outstripped netizen’s ability to explore the content. Search engines became essential for navigating the new online world. Progress was made in clustering vast amounts of cheap Linux powered computers together and sharing the workload to power web search amongst them.  As search started to trying and make sense of an exponentially growing web, machine learning became part of the developer tool box. 

    Researchers at Carnegie-Mellon looked at using games to help teach machine learning algorithms based on human responses that provided rich metadata about the given item[xx]. This became known as the ESP game. In the early 2000s, Yahoo! turned to web 2.0 start-ups that used user-generated labels called tags[xxi] to help organise their data. Yahoo! bought Flickr[xxii] and deli.ico.us[xxiii]

    All the major search engines looked at how deep learning could help improve search results relevance. 

    Given that the business model for web search was an advertising-based model, reducing the cost per search, while maintaining search quality was key to Google’s success. Early on Google focused on energy consumption, with its (search) data centres becoming carbon neutral in 2007[xxiv]. This was achieved by a whole-system effort: carefully managing power management in the silicon, storage, networking equipment and air conditioning to maximise for intelligence per watt. All of which were made using optimised versions of open-source software and cheap general purpose PC components ganged together in racks and operating together in clusters. 

    General purpose ICs for personal computers and consumer electronics allowed easy access relatively low power computing. Much of this was down to process improvements that were being made at the time. You needed the volume of chips to drive innovation in mass-production at a chip foundry. While application-specific chips had their uses, commodity mass-volume products for uses for everything from embedded applications to early mobile / portable devices and computers drove progress in improving intelligence-per-watt.

    Makimoto’s tsunami back to specialised ICs

    When I talked about the decline of LISP machines, I mentioned the move towards standardised IC design predicted by Tsugio Makimoto. This led to a surge in IC production, alongside other components including flash and RAM memory.  From the mid-1990s to about 2010, Makimoto’s predicted phase was stuck in ‘standardisation’. It just worked. But several factors drove the swing back to specialised ICs. 

    • Lithography processes got harder: standardisation got its performance and intelligence per watt bump because there had been a steady step change in improvements in foundry lithography processes that allowed components to be made at ever-smaller dimensions. The dimensions are a function wavelength of light used. The semiconductor hit an impasse when it needed to move to EUV (extreme ultra violet) light sources. From the early 1990s on US government research projects championed development of key technologies that allow EUV photolithography[xxv]. During this time Japanese equipment vendors Nikon and Canon gave up on EUV. Sole US vendor SVG (Silicon Valley Group) was acquired by ASML, giving the Dutch company a global monopoly on cutting edge lithography equipment[xxvi]. ASML became the US Department of Energy research partner on EUV photo-lithography development[xxvii]. ASML spent over two decades trying to get EUV to work. Once they had it in client foundries further time was needed to get commercial levels of production up and running. All of which meant that production processes to improve IC intelligence per watt slowed down and IC manufacturers had to start about systems in a more holistic manner. As foundry development became harder, there was a rise in fabless chip businesses. Alongside the fabless firms, there were fewer foundries: Global Foundries, Samsung and TSMC (Taiwan Semiconductor Manufacturing Company Limited). TSMC is the worlds largest ‘pure-play’ foundry making ICs for companies including AMD, Apple, Nvidia and Qualcomm. 
    • Progress in EDA (electronic design automation). Production process improvements in IC manufacture allowed for an explosion in device complexity as the number of components on a given size of IC doubled every 18 months or so. In the mid-to-late 1970s this led to technologists thinking about the idea of very large-scale integration (VLSI) within IC designs[xxviii]. Through the 1980s, commercial EDA software businesses were formed. The EDA market grew because it facilitated the continual scaling of semiconductor technology[xxix]. Secondly, it facilitated new business models. Businesses like ARM Semiconductor and LSI Logic allowed their customers to build their own processors based on ‘blocs’ of proprietary designs like ARM’s cores. That allowed companies like Apple to focus on optimisation in their customer silicon and integration with software to help improve the intelligence per watt[xxx]
    • Increased focus on portable devices. A combination of digital networks, wireless connectivity, the web as a communications platform with universal standards, flat screen displays and improving battery technology led the way in moving towards more portable technologies. From personal digital assistants, MP3 players and smartphone, to laptop and tablet computers – disconnected mobile computing was the clear direction of travel. Cell phones offered days of battery life; the Palm Pilot PDA had a battery life allowing for couple of days of continuous use[xxxi]. In reality it would do a month or so of work. Laptops at the time could do half a day’s work when disconnected from a power supply. Manufacturers like Dell and HP provided spare batteries for travellers. Given changing behaviours Apple wanted laptops that were easy to carry and could last most of a day without a charge. This was partly driven by a move to a cleaner product design that wanted to move away from swapping batteries. In 2005, Apple moved from PowerPC to Intel processors. During the announcement at the company’s worldwide developer conference (WWDC), Steve Jobs talked about the focus on computing power per watt moving forwards[xxxii]

    Apple’s first in-house designed IC, the A4 processor was launched in 2010 and marked the pivot of Makimoto’s wave back to specialised processor design[xxxiii].  This marked a point of inflection in the growth of smartphones and specialised computing ICs[xxxiv]

    New devices also meant new use cases that melded data on the web, on device, and in the real world. I started to see this in action working at Yahoo! with location data integrated on to photos and social data like Yahoo! Research’s ZoneTag and Flickr. I had been the Yahoo! Europe marketing contact on adding Flickr support to Nokia N-series ‘multimedia computers’ (what we’d now call smartphones), starting with the Nokia N73[xxxv].  A year later the Nokia N95 was the first smartphone released with a built-in GPS receiver. William Gibson’s speculative fiction story Spook Country came out in 2007 and integrated locative art as a concept in the story[xxxvi]

    Real-world QRcodes helped connect online services with the real world, such as mobile payments or reading content online like a restaurant menu or a property listing[xxxvii].

    I labelled the web-world integration as a ‘web-of-no-web’[xxxviii] when I presented on it back in 2008 as part of an interactive media module, I taught to an executive MBA class at Universitat Ramon Llull in Barcelona[xxxix]. In China, wireless payment ideas would come to be labelled O2O (offline to online) and Kevin Kelly articulated a future vision for this fusion which he called Mirrorworld[xl]

    Deep learning boom

    Even as there was a post-LISP machine dip in funding of AI research, work on deep (multi-layered) neural networks continued through the 1980s. Other areas were explored in academia during the 1990s and early 2000s due to the large amount of computing power needed. Internet companies like Google gained experience in large clustered computing, AND, had a real need to explore deep learning. Use cases include image recognition to improve search and dynamically altered journeys to improve mapping and local search offerings. Deep learning is probabilistic in nature, which dovetailed nicely with prior work Microsoft Research had been doing since the 1980s on Bayesian approaches to problem-solving[xli].  

    A key factor in deep learning’s adoption was having access to powerful enough GPUs to handle the neural network compute[xlii]. This has allowed various vendors to build Large Language Models (LLMs). The perceived strategic importance of artificial intelligence has meant that considerations on intelligence per watt has become a tertiary consideration at best. Microsoft has shown interest in growing data centres with less thought has been given on the electrical infrastructure required[xliii].  

    Google’s conference paper on attention mechanisms[xliv] highlighted the development of the transformer model. As an architecture it got around problems in previous approaches, but is computationally intensive. Even before the paper was published, the Google transformer model had created fictional Wikipedia entries[xlv]. A year later OpenAI built on Google’s work with the generative pre-trained transformer model better known as GPT[xlvi]

    Since 2018 we’ve seen successive GPT-based models from Amazon, Anthropic, Google, Meta, Alibaba, Tencent, Manus and DeepSeek. All of these models were trained on vast amounts of information sources. One of the key limitations for building better models was access to training material, which is why Meta used pirated copies of e-books obtained using bit-torrent[xlvii]

    These models were so computationally intensive that the large-scale cloud service providers (CSPs) offering these generative AI services were looking at nuclear power access for their data centres[xlviii]

    The current direction of development in generative AI services is raw computing power, rather than having a more energy efficient focus of intelligence per watt. 

    Technology consultancy / analyst Omdia estimated how many GPUs were bought by hyperscalers in 2024[xlix].

    CompanyNumber of Nvidia GPUs boughtNumber of AMD GPUs boughtNumber of self-designed custom processing chips bought
    Amazon196,0001,300,000
    Alphabet (Google)169,0001,500,000
    ByteDance230,000
    Meta224,000173,0001,500,000
    Microsoft485,00096,000200,000
    Tencent230,000

    These numbers provide an indication of the massive deployment on GPT-specific computing power. Despite the massive amount of computing power available, services still weren’t able to cope[l] mirroring some of the service problems experienced by early web users[li] and the Twitter ‘whale FAIL’[lii] phenomenon of the mid-2000s. The race to bigger, more powerful models is likely to continue for the foreseeable future[liii]

    There is a second class of players typified by Chinese companies DeepSeek[liv] and Manus[lv] that look to optimise the use of older GPT models to squeeze the most utility out of them in a more efficient manner. Both of these services still rely on large cloud computing facilities to answer queries and perform tasks. 

    Agentic AI

    Thinking on software agents went back to work being done in computer science in the mid-1970s[lvi]. Apple articulated a view[lvii]of a future system dubbed the ‘Knowledge Navigator’[lviii] in 1987 which hinted at autonomous software agents. What we’d now think of as agentic AI was discussed as a concept at least as far back as 1995[lix], this was mirrored in research labs around the world and was captured in a 1997 survey of research on intelligent software agents was published[lx]. These agents went beyond the vision that PapriCom implemented. 

    A classic example of this was Wildfire Communications, Inc. who created a voice enabled virtual personal assistant in 1994[lxi].  Wildfire as a service was eventually shut down in 2005 due to an apparent decline in subscribers using the service[lxii]. In terms of capability, Wildfire could do tasks that are currently beyond Apple’s Siri. Wildfire did have limitations due to it being an off-device service that used a phone call rather than an internet connection, which limited its use to Orange mobile service subscribers using early digital cellular mobile networks. 

    Almost a quarter century later we’re now seeing devices that are looking to go beyond Wildfire with varying degrees of success. For instance, the Rabbit R1 could order an Uber ride or groceries from DoorDash[lxiii]. Google Duplex tries to call restaurants on your behalf to make reservations[lxiv] and Amazon claims that it can shop across other websites on your behalf[lxv]. At the more extreme end is Boeing’s MQ-28[lxvi] and the Loyal Wingman programme[lxvii]. The MQ-28 is an autonomous drone that would accompany US combat aircraft into battle, once it’s been directed to follow a course of action by its human colleague in another plane. 

    The MQ-28 will likely operate in an electronic environment that could be jammed. Even if it wasn’t jammed the length of time taken to beam AI instructions to the aircraft would negatively impact aircraft performance. So, it is likely to have a large amount of on-board computing power. As with any aircraft, the size of computing resources and their power is a trade-off with the amount of fuel or payload it will carry. So, efficiency in terms of intelligence per watt becomes important to develop the smallest, lightest autonomous pilot. 

    As well as a more hostile world, we also exist in a more vulnerable time in terms of cyber security and privacy. It makes sense to have critical, more private AI tasks run on a local machine. At the moment models like DeepSeek can run natively on a top-of-the-range Mac workstation with enough memory[lxviii].  

    This is still a long way from the vision of completely local execution of ‘agentic AI’ on a mobile device because the intelligence per watt hasn’t scaled down to that level to useful given the vast amount of possible uses that would be asked of the Agentic AI model. 

    Maximising intelligence per watt

    There are three broad approaches to maximise the intelligence per watt of an AI model. 

    • Take advantage of the technium. The technium is an idea popularised by author Kevin Kelly[lxix]. Kelly argues that technology moves forward inexorably, each development building on the last. Current LLMs such as ChatGPT and Google Gemini take advantage of the ongoing technium in hardware development including high-speed computer memory and high-performance graphics processing units (GPU).  They have been building large data centres to run their models in. They build on past developments in distributed computing going all the way back to the 1962[lxx]
    • Optimise models to squeeze the most performance out of them. The approach taken by some of the Chinese models has been to optimise the technology just behind the leading-edge work done by the likes of Google, OpenAI and Anthropic. The optimisation may use both LLMs[lxxi] and quantum computing[lxxii] – I don’t know about the veracity of either claim. 
    • Specialised models. Developing models by use case can reduce the size of the model and improve the applied intelligence per watt. Classic examples of this would be fuzzy logic used for the past four decades in consumer electronics to Mistral AI[lxxiii] and Anduril’s Copperhead underwater drone family[lxxiv].  

    Even if an AI model can do something, should the model be asked to do so?

    AI use case appropriateness

    We have a clear direction of travel over the decades to more powerful, portable computing devices –which could function as an extension of their user once intelligence per watt allows it to be run locally. 

    Having an AI run on a cloud service makes sense where you are on a robust internet connection, such as using the wi-fi network at home. This makes sense for general everyday task with no information risk, for instance helping you complete a newspaper crossword if there is an answer you are stuck on and the intellectual struggle has gone nowhere. 

    A private cloud AI service would make sense when working, accessing or processing data held on the service. Examples of this would be Google’s Vertex AI offering[lxxv]

    On-device AI models make sense in working with one’s personal private details such as family photographs, health information or accessing apps within your device. Apps like Strava which share data, have been shown to have privacy[lxxvi] and security[lxxvii] implications. ***I am using Strava as an example because it is popular and widely-known, not because it is a bad app per se.***

    While businesses have the capability and resources to have a multi-layered security infrastructure to protect their data most[lxxviii]of[lxxix] the[lxxx] time[lxxxi], individuals don’t have the same security. As I write this there are privacy concerns[lxxxii] expressed about Waymo’s autonomous taxis. However, their mobile device is rarely out of physical reach and for many their laptop or tablet is similarly close. All of these devices tend to be used in concert with each other. So, for consumers having an on-device AI model makes the most sense. All of which results in a problem, how do technologists squeeze down their most complex models inside a laptop, tablet or smartphone? 


    [i] Radiomuseum – Loewe (Opta), Germany. Multi-system internal coupling 3NF

    [ii] (1961) Solid Circuit(tm) Semiconductor Network Computer, 6.3 Cubic inches in Size, is Demonstrated in Operation by U.S. Air Force and Texas Instruments (United States) Texas Instruments news release

    [iii] (2000) The Chip that Jack Built Changed the World (United States) Texas Instruments website

    [iv] Moravec H (1988), Mind Children (United States) Harvard University Press

    [v] (2010) Makimoto’s Wave | EDN (United States) AspenCore Inc.

    [vi] Jobs, S. (1980) Presentation on Apple Computer history and vision (United States) Computer History Museum via Regis McKenna

    [vii] Sinofsky, S. (2019) ‘Bicycle for the Mind’ (United States) Learning By Shipping

    [viii] Hertzfeld, A. (1981) Bicycle (United States) Folklore.org

    [ix] Jones, D. (2016) Codemasters (United Kingdom) Retro Gamer – Future Publishing

    [x] Engelbert, D. (1968) A Research Center For Augmenting Human Intellect (United States) Stanford Research Institute (SRI)

    [xi] Hormby, T. (2006) Apple’s Worst business Decisions (United States) OSnews

    [xii] Honam, M. (2009) From Brick to Slick: A History of Mobile Phones (United States) Wired

    [xiii] Ericsson History: The Nordics take charge (Sweden) LM Ericsson.

    [xiv] Singh, H., Gupta, M.M., Meitzler, T., Hou, Z., Garg, K., Solo, A.M.G & Zadeh, L.A. (2013) Real-Life Applications of Fuzzy Logic – Advances in Fuzzy Systems (Egypt) Hindawi Publishing Corporation

    [xv] Reid, T.R. (1990) The Future of Electronics Looks ‘Fuzzy’. (United States) Washington Post

    [xvi] Kushairi, A. (1993). “Omron showcases latest in fuzzy logic”. (Malaysia) New Straits Times

    [xvii] Watson, A. (2021) The Antique Microwave Oven that’s Better than Yours (United States) Technology Connections

    [xviii] Durbhakula, S. (2022) IBM dumping Watson Health is an opportunity to reevaluate artificial intelligence (United States) MedCity News

    [xix] (1998) PapriCom Technologies Wins CommerceNet Award (Israel) Globes

    [xx] Von Ahn, L., Dabbish, L. (2004) Labeling Images with a Computer Game (United States) School of Computing, Carnegie-Mellon University

    [xxi] Butterfield, D., Fake, C., Henderson-Begg, C., Mourachov, S., (2006) Interestingness ranking of media objects (United States) US Patent Office

    [xxii] Delaney, K.J., (2005) Yahoo acquires Flickr creator (United States) Wall Street Journal

    [xxiii] Hood, S., (2008) Delicious is 5 (United States) Delicious blog

    [xxiv] (2017) 10 years of Carbon Neutrality (United States) Google

    [xxv] Bakshi, V. (2018) EUV Lithography (United States) SPIE Press

    [xxvi] Wade, W. (2000) ASML acquires SVG, becomes largest litho supplier (United States) EE Times

    [xxvii] Lammers, D. (1999) U.S. gives ok to ASML on EUV effort (United States) EE Times

    [xxviii] Meade, C., Conway, L. (1979) Introduction to VLSI Systems (United States) Addison-Wesley

    [xxix] Lavagno, L., Martin, G., Scheffer, L., et al (2006) Electronic Design Automation for Integrated Circuits Handbook (United States) Taylor & Francis

    [xxx] (2010) Apple Launches iPad (United States) Apple Inc. website

    [xxxi] (1997) PalmPilot Professional (United Kingdom) Centre for Computing History

    [xxxii] Jobs, S. (2005) Apple WWDC 2005 keynote speech (United States) Apple Inc.

    [xxxiii] (2014) Makimoto’s Wave Revisited for Multicore SoC Design (United States) EE Times

    [xxxiv] Makimoto, T. (2014) Implications of Makimoto’s Wave (United States) IEEE Computer Society

    [xxxv] (2006) Nokia and Yahoo! add Flickr support in Nokia Nseries Multimedia Computers (Germany) Cision PR Newswire

    [xxxvi] Gibson, W. (2007) Spook Country (United States) Putnam Publishing Group

    [xxxvii] The O2O Business In China (China) GAB China

    [xxxviii] Carroll, G. (2008) Web Centric Business Model (United States) Waggener Edstrom Worldwide for LaSalle School of Business, Universitat Ramon Llull, Barcelona

    [xxxix] Carroll, G. (2008) Web of no web (United Kingdom) renaissance chambara

    [xl] Kelly, K. (2018) AR Will Spark the Next Big Tech Platform – Call It Mirrorworld (United States) Wired

    [xli] Heckerman, D. (1988) An Empirical Comparison of Three Inference Methods (United States) Microsoft Research

    [xlii] Sze, V., Chen, Y.H., Yang, T.J., Emer, J. (2017) Efficient Processing of Deep Neural Networks: A Tutorial and Survey (United States) Cornell University

    [xliii] Webber, M. E. (2024) Energy Blog: Is AI Too Power-Hungry for Our Own Good? (United States) American Society of Mechanical Engineers

    [xliv] Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L., Polosukhin, I. (2017) Attention Is All You Need (United States) 31st Conference on Neural Information Processing Systems (NIPS 2017)

    [xlv] Marche, S. (2024) Was Linguistic A.I. Created By Accident? (United States) The New Yorker.

    [xlvi] Radford, A. (2018) Improving language understanding with unsupervised learning (United States) OpenAI

    [xlvii] Heath, N. (2025) Authors outraged to discover Meta used their pirated work to train its AI systems (Australia) ABC (Australian Broadcast Corporation)

    [xlviii] Morey, M., O’Sullivan, J. (2024) In-brief analysis: Data center owners turn to nuclear as potential energy source (United States) Today in Energy published by U.S. Energy Information Administration

    [xlix] Bradshaw, T., Morris, S. (2024) Microsoft acquires twice as many Nvidia AI chips as tech rivals (United Kingdom) Financial Times

    [l] Smith, C. (2025) ChatGPT’s viral image-generation upgrade is ruining the chatbot for everyone (United States) BGR (Boy Genius Report)

    [li] Wayner, P. (1997) Human Error Cripples the Internet (United States) The New York Times

    [lii] Honan, M. (2013) Killing the Fail Whale with Twitter’s Christopher Fry (United States) Wired

    [liii] Mazarr, M. (2025) The Coming Strategic Revolution of Artificial Intelligence (United States) MIT (Massachusetts Institute of Technology)

    [liv] Knight, W. (2025) DeepSeek’s New AI Model Sparks Shock, Awe, and Questions from US Competitors (United States) Wired

    [lv] Sharwood, S. (2025) Manus mania is here: Chinese ‘general agent’ is this week’s ‘future of AI’ and OpenAI-killer (United Kingdom) The Register

    [lvi] Hewitt, C., Bishop, P., Steiger, R. (1973). A Universal Modular Actor Formalism for Artificial Intelligence. (United States) IJCAI (International Joint Conference on Artificial Intelligence).

    [lvii] Sculley, J. (1987) Keynote Address On The Knowledge Navigator at Educom (United States) Apple Computer Inc.

    [lviii] (1987) Apple’s Future Computer: The Knowledge Navigator (United States) Apple Computer Inc.

    [lix] Kelly, K. (1995) Out of Control: The New Biology of Machines (United States) Fourth Estate

    [lx] Nwana, H.S., Azarmi, N. (1997) Software Agents and Soft Computing: Towards Enhancing Machine Intelligence Concepts and Applications (Germany) Springer

    [lxi] Rifkin, G. (1994) Interface; A Phone That Plays Secretary for Travelers (United States) The New York Times

    [lxii] Richardson, T. (2005) Orange kills Wildfire – finally (United Kingdom) The Register

    [lxiii] Spoonauer, M. (2024) The Truth about the Rabbit R1 – your questions answered about the AI gadget (United States) Tom’s Guide

    [lxiv] Garun, N. (2019) One year later, restaurants are still confused by Google Duplex (United States) The Verge

    [lxv] Roth, E. (2025) Amazon can now buy products from other websites for you (United States) The Verge

    [lxvi] MQ-28 microsite (United States) Boeing Inc.

    [lxvii] Warwick, G. (2019) Boeing Unveils ‘Loyal Wingman’ UAV Developed In Australia (United Kingdom) Aviation Week Network – part of Informa Markets

    [lxviii] Udinmwen, E. (2025) Apple Mac Studio M3 Ultra workstation can run Deepseek R1 671B AI model entirely in memory using less than 200W, reviewer finds (United Kingdom) TechRadar

    [lxix] Kelly, K. (2010) What Technology Wants (United States) Viking Books

    [lxx] Andrews, G.R. (2000) Foundations of Multithreaded, Parallel, and Distributed Programming (United States) Addison-Wesley

    [lxxi] Criddle, C., Olcott, E. (2025) OpenAI says it has evidence China’s DeepSeek used its model to train competitor (United Kingdom) Financial Times

    [lxxii] Russell, J. (2025) China Researchers Report Using Quantum Computer to Fine-Tune Billion Parameter AI Model (United States) HPC Wire

    [lxxiii] Mistral AI home page (France) Mistral AI

    [lxxiv] (2025) High-Speed Autonomous Underwater Effects. Copperhead (United States) Anduril Industries

    [lxxv] Vertex AI with Gemini 1.5 Pro and Gemini 1.5 Flash (United States) Google Cloud website

    [lxxvi] Untersinger, M. (2024) Strava, the exercise app filled with security holes (France) Le Monde

    [lxxvii] Nilsson-Julien, E. (2025) French submarine crew accidentally leak sensitive information through Strava app (France) Le Monde

    [lxxviii] Arsene, Liviu (2018) Hack of US Navy Contractor Nets China 614 Gigabytes of Classified Information (Romania) Bitdefender

    [lxxix] Wendling, M. (2024) What to know about string of US hacks blamed on China (United Kingdom) BBC News

    [lxxx] Kidwell, D. (2020) Cyber espionage for the Chinese government (United States) U.S. Air Force Office of Special Investigations

    [lxxxi] Gorman, S., Cole, A., Dreazen, Y. (2009) Computer Spies Breach Fighter-Jet Project (United States) The Wall Street Journal

    [lxxxii] Bellan, R. (2025) Waymo may use interior camera data to train generative AI models, but riders will be able to opt out (United States) TechCrunch

  • PHNX awards jury interview

    I am fortunate to be an awards juror for the second time. This is for the Adforum PHNX advertising awards which attracts entries from around the world.

    Proud to be a juror again this year for Adforms PHNX awards

    As part of the process I responded to some interview questions. I hope that my old gaffer Tony Gresty appreciated my quoting of him decades later and was surprised that there wasn’t pushback about my assertion of a ‘post-social’ marketing era.

    What motivates you to be part of the PHNX Jury, and what do you hope to bring to the judging process?

    Before I worked in advertising, I served an apprenticeship in plant process engineering. My old gaffer who was responsible for me had a few sayings. One of which was practice sharpens skill. By being a judge, I hope that I am helping people within the industry around the world to sharpen their skill. Seeing great challenging work and asking myself how it fits the customer and client needs in turn, helps further sharpen my skill as a strategist. TL;DR (too long; didn’t read) altruist generosity?

    PHNX has always been about celebrating creativity in all its forms. What new perspectives or disciplines do you think deserve more recognition in award shows today?

    Strategy has the Effies, BUT its focus is larger than creativity with a major focus on efficiency and effectiveness rather the creative process. Strategy always provides the ‘assist’ with single-minded insight and creative JOTBD (job to be done), but is never the ‘goal scorer’ to use an American sports stats metaphor. I think that creativity is not only about the creative, but also the context where the creative is placed – which brings in the disciplines of project management’s orchestration, production’s craft and media planning. I think this is going to become far more important as we go towards a post-social era.

    Which countries or regions do you think are leading the creative field right now? And which emerging markets should we look out for?

    A really interesting question. Leading is probably the wrong phrase to use, but there are markets that are under-estimated. Thailand and the Philippines have well-deserved reputations for emotional storytelling. Year-after-year when I look at lunar new year adverts Malaysia hits well above its weight given the size of the market. 

    Japan has been consistently delighting advertising folk for the past five decades. 

    Probably a better question to ask me after I have been through this year’s award entries.

    What trends or cultural shifts do you think will define the most impactful creative work this year?

    With everything that’s going on, I think we’ll need more humour. Trends within the advertising industry are also leaning towards a better mix of formats. As an industry we over-index on social vs. attention, efficiency and effectiveness for large brands. So we’ve seen a renaissance in OOH amongst other formats.

    There is also a return to basics: creative consistency, fluent objects, the power of storytelling and humour. Finally consumers are more interested in consuming more longer form audio and video content, so what a creative execution might look like I hope is very different.

    If you could give one piece of advice to agencies and creatives submitting their work, what would it be?

    Be single-minded in terms of category consideration. My biggest criticism of last year’s PHNX award entries was not about the quality of the work per se. Many of the entries had a given creative was put in for consideration for the wrong category. And it was the same entries doing it over-and-over again. If it isn’t relevant it’s just going to get ignored or get under the skin of the judges. In the same way that a poorly-placed ad that is slapped all over the place without consideration would have a similar effect in the real world.

    Rant over: I wish everyone the best of luck, finally don’t be disheartened. All of the work was of a high standard, choosing winners is hard.

    Which creative minds are inspiring you the most right now?

    In the widest creative sense I am working my way through veteran Hong Kong film director Johnnie To‘s back catalogue; some of his works like Breaking News feel like exceptionally contemporary given our media environment. A couple of creatives using AI in a really smart way are Omar Karim aka @arthur_chance on instagram and the Dor Brothers. Agency work-wise VCCP’s immersive installation for Transport for London and a small Malaysian shop called Days Studios (whose bread-and-butter work is usually weddings!), yet, did a fantastic job producing a Chinese New Year ad for a cosmetic treatment clinic Aglow Studio – not what you’d call a big client yet it felt like a bigger production than many large brands. Go Google the ad ‘hiss of prosperity’ and watch it on YouTube.

    It’s your last chance to enter for free here.

  • CNY 2025

    CNY 2025 or Chinese new year 2025 is shorthand often used as a hashtag on social media to circulate songs, sales promotions and advertisements from across China, Hong Kong, Indonesia, Singapore and Malaysia. I started off this post into gathering some of the best examples of CNY 2025 advertising just after Christmas and there was a poor range of adverts just a month out from CNY 2025. Imagine if there were no Christmas adverts appearing by the third week in November?

    Small businesses like the Davely Bakery Café in Malaysia had started promoting organic social content on their Facebook page by November 19. (In markets such as the Philippines, Hong Kong and Malaysia, Facebook is still big business.)

    CNY 2025 - Davely Bakery Café

    But where were the large company promotions this close to the festival? Brand campaigns only really started to appear from the second week in January onwards.

    CNY 2025 themes that I took away from researching this post:

    • Increased emphasis on demand generation and sales promotions.
    • Less big brands advertising than previous years.
    • Campaigns were run over a shorter period. Roughly half the six weeks I would have expected for successful brand building campaigns.
    • Less of a focus on storytelling and deep emotional cues than previous years.
    • Lower production values as a whole than previous years.
    • A move towards bus wraps in Singapore for CNY 2025 campaigns. These were replicated in ‘bus simulator’ games popular amongst transport fans in Hong Kong and Singapore. This replication was less about a ‘brand gaming strategy’ and more about fan curated bus skins for absolute fidelity to their favourite bus routes.
    • Less emphasis on creative consistency than in previous years.
    • Shorter ads, each with a lot of 15-second edits.
    • Increased use of humour.
    • Increased use of songs, presumably to gain earned and shared media support – very hard to do successfully as a strategy when there are so many songs to choose from.
    • Lazy use of celebrities – I hadn’t see this in previous years doing this.

    As a marketer, I saw things in CNY 2025 that I thought was good and things that I worried about in these changes between CNY 2025 and previous years:

    • Smarter memory structure building: fluent objects such as Kevin the First Pride nugget, the use of jingles and ear worm songs, the use of humour
    • Red flags for brand mental availablility: a lack of creative consistency, shorter ads and lazy use of celebrities. Shorter ads can, if done right be used to build brand, BUT, there are a number of factors to consider when doing it successfully. These include variety of formats, reach / marketing penetration, repetition, single-minded creative execution and the thumb-stopping factor.

    Reading the ‘tea leaves’ I suspect that marketing budgets have been cut, and brands might not be expecting as much of an uplift this year as China’s poor economic performance affects its neighbours.

    China

    Apple

    Apple continued its shot on an iPhone series. The Chinese New Year film is run in lots of markets but primarily made for China. I am surprised that this got past the censors. Time travel is usually a a no-no. It also reminds China’s currency economically challenged consumers of the 1990s go-go years of year-on-year double digit growth. The core aspect of the creative is the direct questions that younger family members receive.

    CNY 2025 is the first time that Apple didn’t have a Chinese film maker shot its film. Finally, Apple’s film comes in at a whooping 11 minutes 59 seconds although a good minutes is the credits.

    Bottega Veneta

    Bottega Veneta’s Chinese New Year film is all about vibes. There were some interesting styling choices in the film. The older guy with the women’s hand bag. That most of it seemed to be around older alleyways that have been refurbished. The lady in the 1980s era Jaguar. Pre-1997, a number of more anglophile Hong Kong businessmen used to get driven around in Jaguar and Daimler cars with a large V12 engine – that spoke to old money in this film.

    I was stuck by the lack of explicit references to new year, which you can also see in the Miu Miu film – what there is are more subtle cues.

    All of which is a world away from many luxury brands slapping a snake on everything this year.

    Gucci

    Gucci taps into the traditional multi-generational party and memories of ‘snake’ new years of the past. It’s probably the strongest bit of storytelling and the most cinematic of all the films that I have looked at this year.

    Miu Miu

    Prada sub-brand Miu Miu is one of the few stand out brands in a tough 2024 for the luxury sector. This Chinese New Year film is playful, borrowing from Asian mid-century set design and 1990s era Chinese electronica to tell a small story.

    Hong Kong

    Coca-Cola

    Coca-Cola has a dominant position in the soft drinks market thanks to its dominance in distribution. The only places I could buy Pepsi was in my local Pizza Hut when I lived there. This year they focused on out of home posters to reinforce memory structures. The unusual aspect to the campaign was that it went up in early February at the end of Chinese New Year. That’s a bit like launching your Christmas advertising on New Year’s Eve. Not sure why that’s happened.

    coca cola hong kong

    Giordano

    Multinational clothing brand Giordano promoted a CNY 2025 collaboration featuring the Kung Fu Panda character on its social media accounts. The preponderance of red in the clothing isn’t only about it being a seasonal colour, but also you are supposed to wear new red clothing for the new year.

    This social media film was run on channels in Hong Kong, Malaysia and other countries where Giordano has a presence.

    Malaysia

    100PLUS

    100PLUS is an isotonic drink similar in function to Gatorade or Lucozade Sport popular in Malaysia and Singapore. Its advert for Malaysia promotes the drink as alternative to colas during new year celebrations. A secondary aspect is the opportunity to win a free prize draw. The blue in the outfits is to presumably signal the blue in the brand and packaging.

    It’s slightly unusual in that it doesn’t feature multi-generational family members, which I suspect is down to a single-minded focus on teens and young adults.

    Aeon

    Japanese supermarket Aeon highlighted their CNY themed collaboration with Italian artist TokiDoki as a music video format that you could sing along too. It’s a little too mild to be an aggressive earworm of a tune.

    Aglow Clinic

    Aglow Clinic is an aesthetics clinic in Malaysia that treats a range of skin conditions including sun spots. They partnered with social media personality Roderic Chan to make this film. Considering the small size of the brand they hit well above their weight in terms of production values.

    Aiken

    Aiken is a Malaysian based beauty brand. The creative was done by the media buying agency and features Malaysian influencers as the talent in the advertisement.

    Aiken wishes you Double the Brightness for a Brighter Year! is clever word play that implicitly links feeling beautiful and the promise of good fortune. This advert went out very late into the market for 2025.

    Carina

    Carina is a household tissue brand in Malaysia, similar to Kleenex in the UK and Ireland. It has gone down the ear worm route with its song. The montage of footage feels crowdsourced.

    Eu Yan Sang

    Eu Yan Sang did separate creative for Malaysia. There are higher production values than their Singapore creative and storytelling that ties back to creating memories and tradition being a key part of Chinese New Year. The advert sought to show that the family weren’t wealthy, but had food on their plate, good manners and retained their cultural roots. As a first-generation emigrant myself this one spoke to me.

    First Pride

    Tyson Foods First Pride range of processed chicken product including chicken nuggets and satay slices featured a simple sales promotion with a sweepstake format. The advert also introduced a fluent object ‘Kevin’ the chicken nugget on a TV advert.

    Kevin had previously been shared only on out of home formats. It would be interesting to see if and how they make future use of Kevin.

    Guardian

    Guardian is the Malaysian brand of the better known Asian pharmacy retail chain better known as Mannings in Hong Kong and China. A UK analogue would be Boots. It has higher production values and evokes togetherness, good fortune and memory-making for our young protagonist. Click here to see on YouTube.

    guardian cny 2025

    Haier

    Chinese white goods manufacturer took an unconventional storytelling approach. it’s the kind of creative concept that could be used year on year, just changing the product line-up.

    Harvey Norman

    Electrical retailer Harvey Norman ties into the fact that bargains are a constant discussion around the table during Chinese New Year (and any other family gathering). The production feels rather low rent compared to other adverts here.

    HongLeong Bank

    HongLeong Bank took the story of two customers that fitted neatly with the festivities around Chinese New Year. It gives a good old tug on the heart strings.

    Julie’s

    Julie’s a is a biscuit brand that tries to focus on the human side of food. Given the visiting and gifting culture for Chinese new year – the opportunity is ideal for its brand. I was surprised by the high production values of the advert. The 3d animation is creatively consistent with work that they’ve put out over the past year. As a direction the CNY 2025 campaign is very different from their last festival campaign for CNY 2022.

    Julie’s can continue to run this campaign after CNY 2025 is over due to the lack of overt seasonal themes in the advert.

    KitKat

    KitKat Malaysia have attached the Chinese New Year creative back to ‘have a break, have a KitKat’ for creative consistency. There is enough in here to say new year. But a sufficiently light touch that they could use it year-in, year-out – so long as the brand uses the same promotional packaging design.

    If they had used snake imagery, it would be one-and-done.

    Knife

    Knife are a food flavourings brand from Malaysia. Their main advertising push is for Chinese New Year and they have made a constant effort to bring creative consistency and storytelling into their work. CNY 2025 is no exception to this approach.

    https://youtu.be/Oxo8jP-67tE?si=aSnwKB5YVxoT96z_

    Lay’s crisps

    Lay’s (known as Walkers in the UK) highlight their role as a snack at new year’s gatherings. The ad promotes a new year themed sweepstake including mahjong sets.

    Lotus’s

    Lotus’s is a supermarket market chain. In Malaysia, the shops were formerly Tesco Malaysia and sold on to a Thai retail group. This film focuses on the stress of preparing for new year, together with sales promotions. Aside from holding red t-shirts with the ‘Fu’ symbol on them, this sales promotion video could be for any time of the year. The 1970s called and wants it’s ad creative back from this Malaysian supermarket chain.

    Melinda Looi

    Malaysian fashion designer Melinda Looi came up with a homage to Wong Ka wai’s In The Mood For Love. The advert nails the mid-century elegance but struggles to get the cinematic richness and tension of the original.

    I respect that they gave it a good try and love their ambition; but it’s like Ted Baker trying to pull off the introduction to The Italian Job.

    Mr DIY

    Mr DIY is a hardware chain similar to Lowe’s in the US or B&Q in the UK. Their advert riffs on the heightened tensions of family get togethers and the relative popularity in Hong Kong film making of court room dramas – to add a bit of cultural relevance. It taps into the stressor of very direct questions similar to BRANDS Singapore campaign.

    Mr Muscle

    Household cleaner brand Mr Muscle had a Korean celebrity record a CNY 2025 specific message for their Facebook page viewers.

    The advert features Korean drama and film actor Kim Seon Ho. In common with other Korean celebrities he endorses a variety of brands in Korea and other Asian countries. For some of the brands endorsed, they have had record sales which they attribute to working with Kim. It’s not sophisticated but will appeal to his many fans in Malaysia.

    Munchy’s

    Munchy Food Brands is a Malaysian snack brand. The advert itself is pretty self explanatory. Like Watson’s they are leaning hard into trying to create an ear worm to aid long term brand recall that’s complete with an EDM-style drop.

    Nivea

    Nivea looked to promote their men’s products as a way to solve for the stress of direct family feedback on how you look. It has been shot for mobile.

    Pantai Hospital

    Pantai Medical Group runs a private hospital in Malaysia that caters to more well-off Malaysians. The emphasis on healthy food in the advert relates to the central role that food plays in Chinese New Year celebrations.

    Their elective treatments are likely to be quiet during CNY 2025, so they have provided the option for health-focused external catering. It’s an interesting product innovation for those close to their hospital in Penang. The behind the scenes clips at the end draws on Korean and Hong Kong productions. The best known in the West would be the blooper reels that used to appear at the end of Jackie Chan films.

    https://youtu.be/2tKxHrCldts?si=WIQqF1PRPsyzdKEG

    Petronas

    Petronas is the Malaysian national oil company. There is a natural fit with CNY 2025 because children go home to see their parents and siblings. Later on during the celebrations they will drive to visit relatives. On the Malaysian peninsula you could be a long time in heavy traffic, so pit-stops for fuel and refreshments are pretty much obligatory.

    Ribena

    Brutally short creative with the tagline left right at the end. ‘Ooo Juicy Fu’ – the fu is a reference to the Chinese character fu symbolising ‘fortune’. It is creatively consistent with campaigns that Ribera ran for Ramadan and the previous CNY in Malaysia.

    Shopee

    Shopee is a mobile marketplace think Shopify, Depop or Uber Eats in an app. Like Watsons Malaysian campaign it relies on a ‘new years’ song. Why a song? Entertainment during Chinese new year features newly composed catchy earworms. These may come from film series put out as family entertainment for the new year like the All’s Well, That Ends Well series of Hong Kong comedies, or television and adverts.

    Watsons

    Watsons is a Hong Kong-headquartered pharmacy chain with stores across Asia and a strong focus on health and beauty products. It’s parent company AS Watson is a set of diversified retail brands including:

    • Superdrug and Savers in the UK
    • Rossmann
    • Fortress (a PC World or Best Buy analogue)
    • PARKnSHOP, Taste, FUSION, GREAT FOOD HALL – grocery stores
    • Watson’s Wine

    They have been teasing a song related Chinese New Year campaign for Malaysia to embed in your memory structures, but were only showcasing the song 2 1/2 weeks before CNY 2025. Rapid screening of sales promotions drown out the ‘Happy Beautiful Year’ themed brand building effort.

    https://youtu.be/KpAXOYxxGvc?si=jzwNGGW5HXz8pbHk

    Yakult

    The Japanese yoghurt drink brand used some good fortune themed imagery to promote a brand sweepstake. A very simple execution that could be used again in future years.

    Singapore

    BRANDS

    BRANDS is a food and supplement business. Traditional Chinese Medicine often recommends eating particular foods to treat different ailments, which is why BRANDS essence of chicken sits in a kind of ‘wellness’ space.

    Their advert draws on the universal experience of very direct questions that people have to field from relatives when they go home for Chinese new year.

    Eu Yan Sang

    Eu Yan Sang run traditional Chinese medicine and related wellness foods shops and clinics across Asia. This Singapore ad focuses on the challenge of gift giving and the close link between good fortune and good health. Unusually, they’ve also run a second lot of creative promoting their CNY themed hamper designs as well.

    https://youtu.be/dGc3_cDjtCA?si=pTA3fXpeL481jw-P

    FairPrice

    FairPrice is a Singapore institution. Like the UK’s Co-op, it is a supermarket owned by the National Trade Union Congress and is the largest grocery chain in Singapore owning both supermarkets and convenience stores.

    The ad focuses on everyday Singaporeans with many of the shots modelled on HDB flats – Singapore’s public housing. The colour grading and small moments designed to evoke different types of nostalgia from the rituals of family and the Chinese New Year.

    Hockhua tonic

    Hockhua is a Singaporean local wellness foods brand who did a simple sales promotion for their hampers to be provided for the new year. The cut-off time then gave the brand a few weeks to assemble to the appropriate amount of hampers.

    Lazada

    E-tailer Lazada leads with sales promotions. The imagery draws on Fu xing, the god of good fortune who you would pray to in order to get a prosperous new year.

    Ministry of Digital Development and Information

    The government of Singapore used Chinese new year to reinforce a common Singaporean identity and celebrate the 60th anniversary of the city state. Sing-a-longs are a part of Chinese new year. The video featured a 1980s song that was originally recored by the artists in 1998 re-recorded by them for the government department encouraging t he citizens to look out for each other. The video was published just days before new year and relied primarily on the reach of the former prime minister’s Instagram account. It shares a common theme of small but joyful moments with the FairPrice CNY 2025 advert.

    Thailand

    This is the first year that I have covered a Thai market campaign. Thailand has a significant ethnic Chinese minority (between 10 – 15% of the population depending on which estimates you reference). Like Indonesia, Thailand integrated them for political reasons and many of them no longer have Chinese sounding family names – but the traditions live on. A second aspect is the increased role in the Thai economy that Chinese expats and tourists now play.

    Central

    Central is a premium department store in Thailand (think Peter Jones in London) and has a mid-tier brand called Robinsons (think Debenhams or House of Fraser). You have a stylistic version of the new year dinner and a cool grandfather who owes a lot to mature Japanese hipsters and The Sartorialist. The film has high production values and leans on vibes rather than storytelling, but is distinctive.

    You can find my previous reviews of Chinese New Year ads here.

  • Interpublic acquisition by Omnicom

    Interpublic disclosure

    I have worked at Interpublic twice during my career. Once at the very start of my career and more recently at McCann Health. I was never vested in Interpublic stock and I don’t own any Interpublic or Omnicom shares. This is not financial advice I am not telling you what you should do.

    This post is not intended to be, and shall not constitute, an offer to buy or sell or the solicitation of an offer to buy or sell any securities, or a solicitation of any vote or approval, nor shall there be any sale of securities in any jurisdiction in which such offer, solicitation or sale would be unlawful prior to registration or qualification under the securities laws of any such jurisdiction.

    I am pointing out the bits in discussions that I found interesting, and some bits that I found deathly dull, but pertinent.

    The shape of it

    The acquisition would be done by issuing stock. It wouldn’t involve Omnicom’s cash reserves or raising debt to make the purchase. Following the deal, the new Omnicom would be owned by

    • 60.6% of former existing Omnicom shareholders
    • 39.4% of former Interpublic shareholders

    Deal expected to close in the second half of 2025. Once it is closed Omnicom expected to get $750 million in cost savings over the following two years. Combined cashflow of more than $3 billion a year.

    Investment analyst call

    The investment analyst call was led by Omnicom’s John Wren and featured Phillippe Krakowsky. One of the main factors raised on the call by Wren was the reduction on debt to EBITDA of Omnicom from 2.5x to 2.1x. The combined organisation also had a more balanced maturity profile on debt.

    The deal impacted scale in two ways:

    • Efficiencies due to scale.
    • Increased capacity to borrow and fund future purchases.

    What was less clear from the call was the value to customers. Healthcare was cited as an area of opportunity as both businesses had a substantial healthcare marketing offering. But nothing on how to capitalise on the opportunity.

    What I didn’t hear was how the combined business was going to get to 750 million of savings, but that they were confident that they could hit that number in two years after the deal closed.

    I also didn’t hear a clear position on how the combined firm would deal with the drain of advertising revenue from marketing conglomerates and media companies to platforms. There was some lip service given to being able to better address generative AI related change as a larger group.

    Finally there was no analysis, or consideration about how Omnicom and Interpublic would surpass their competitors innovation. Instead the focus was purely on existing combined size.

    Shareholder value

    At the time of the announcement, the deal was said to offer a premium in terms of value to Interpublic shareholders.

    As for Omnicom shareholders, they claimed: The transaction will be accretive to adjusted earnings per share for both Omnicom and Interpublic shareholders.

    Slow gains – which might make taking that money out of their existing shares and instead putting it in a S&P 500 tracker ETF seem more attractive.

    Industry animal spirits (aka what people were saying in my feeds and op-eds)

    The reaction on social platforms was shrill and overwhelmingly negative. The reasons given included:

    • The inevitable job cuts.
    • The internal preoccupation that comes from two large organisations coming together.
    • The lack of clarity about unique benefit that the new company would provide.
    • The two-year inward focus on consolidation would allow more innovative competitors (depending who you listened to this would be Accenture, Brandtech, Dentsu, Publicis, Stagwell) gain further ground.

    Later on, the discussion moved on towards the reactionary nature of the discussion itself.

    From within Interpublic itself, I heard concern about the future from people in different parts of the business. This was down to a lack of internal communication rather than anything specific in nature.

    Left unchecked, it could be morale sapping and might encourage some of the best talent to leave for more stable environs.

    Update: January 17, 2025Campaign magazine podcast. The most interesting argument made in the podcast was that the media buying and creative arms of Interpublic are seen as having little-to-no-value and that deal from Omnicom’s perspective was all about Interpublic’s data platform.

    Any self-respecting investment banker worth their salt would be able to break the conglomerate down into constituent parts and sell it off (as what has happened with Interpublic agencies R/GA and Huge already).

    • In the PR and social / influence sector Golin and Weber Collective would make natural groupings to be spun off and still with enough scale to compete on the global stage.
    • From a creative agency perspective, it would be a similar situation with Mullen Lowe and McCann World Group.
    • IPG Health looks like it had already been pre-packaged for private equity when it was carved away from its advertising groups and nominally has a full suite of offerings to provide the pharmaceutical sector clients.
    • For bits of networks that you can’t sell. For instance if the purchaser doesn’t want to have an agency office in Malaysia (Malaysia is only in here hypothetically, in reality I have no idea why more global corporate headquarters aren’t located in the Cameron highlands); you can recoup some of your money by facilitating a management buyout. These are more common than you realise.

    Instead the podcast participants think that clients are just all about first and third party data platforms. I would argue that’s a simplistic view that ignores:

    • The relative complementary nature of the Interpublic and Omnicom networks in terms of product spread and geographical reach. In most markets, one or the other network has an appreciably stronger position. Where there is consolidation needed, this would most likely result in redundancies in the Asia Pacific and European regions.
    • Client brands need for continued brand building and the current chaos in the major platforms pivoting to the new presidential administration’s direction.
    • ‘Bad neighbourhoods’ for brand content will adversely affect the ability of brands to advertise or promote themselves effectively. It’s harder to build effective brand memory structures in what consumers are likely to perceive as a hateful, or hostile environment.
    • Finally there is the the little acknowledged fact that social platform advertising is disproportionally supported by D2C marketing and varying forms of hucksterism from Temu to get-rich schemes. This isn’t the kind of businesses that fill up the client ranks of large marketing conglomerates like Omnicom and Interpublic.

    What business thinking says

    Harvard Business Review claims that 70 to 90 percent of mergers and acquisitions fail. By comparison, anywhere between 25 and 80 percent of large IT projects fail. 70 to 85 percent of new consumer product launches fail. TL;DR running a business is tough.

    Secondly, Omnicom and Interpublic grew historically through acquisitions. Which would mean that they understand how to move a business forward and integrate their new acquisition.

    The business model that marketing services conglomerates historically worked on was a mix of an arbitrage play, driving integration and efficiencies.

    Arbitrage

    Omnicom and Interpublic both relied on a few ways to gain an arbitrage benefit:

    • Private companies are generally cheaper to buy than publicly listed firms. It’s a matter of economics, publicly listed firms list in a closer to perfect market. Secondly, buyout contracts to get the management to meet financial targets that facilitate either a faster financial payback or a cheaper price on the business.
    • Larger companies like Omnicom can borrow money at more favourable terms than a small to medium-sized business. Larger companies that have lower levels of leverage will be able to get money in a more favourable format than more highly leveraged business of the same size.

    Driving integration

    Historically these groups take a light touch on integration for agencies where the capabilities are common to more than one agency, WHERE the acquired agency is hitting the ambitious financial targets set by the holding company. Integration in terms of integrated new business pitches and common selling of new products or capabilities.

    This might be where the client is looking for an integrated solution. Or it might be where it makes sense to pool resources to deal with a new area like Amazon advertising and retail media or generative AI services.

    Once a newly acquired business has become ‘part of the furniture’ and the founders have stepped away, you are more likely to see it become more deeply knitted into the holding group business fabric. This is likely to include common systems and processes: time-tracking software, HR and talent management software, accounting software, cloud services and productivity software.

    Efficiencies

    Sources of efficiencies overlap integration through standardisation and being able to buy in bulk. A second source of efficiency is consolidation of common business functions:

    • Accounting / finance
    • Business development
    • Freelance staff pool
    • Human resources and recruitment
    • IT
    • Knowledge management
    • Legal services

    Open questions

    Both Omnicom and Interpublic have experience of integrating and spinning off parts of their businesses. What’s different about the Interpublic acquisition is that the scale involved is different from anything else that’s been undertaken in the sector.

    • How will this be done successfully?
    • What (additional) value is in the resulting business for clients?

    ADWEEK polled marketers to better understand their attitude to the merger. On balance they weren’t supportive of the deal. Twice as many respondents were negative about the deal compared to those who felt positively about it. The good news was that almost 60 percent either hadn’t made their mind up or were on balance neutral. At this point I need to caveat the results with the note that there wasn’t a breakdown on the types of respondents in terms of their role and seniority.

    Omnicom IPG

    But it implied that Omnicom had a serious communication job to be done convincing wider stakeholders on the merits of the deal.

    The problem might be greater than telling a better story. By some estimates 60% of Interpublic and Omnicom scopes of work are allegedly already understaffed – if true, likely putting customer satisfaction at risk. And that’s before the reduction in headcount to match the need for cost savings.

    More information

    Omnicom to Acquire Interpublic Group to Create Premier Marketing and Sales Company – Omnicom Group Inc. Newsroom

    Omnicom SEC filings – Omnicom Group Inc. Investor Relations

    IPG Mediabrands To Lay Off 103 Staffers | AdWeek – this is fast, if related to the Omnicom acquisition announcement

    Things to Consider During Blackout and Quiet Periods | Gilmartin Group

    CAGR S&P500 calculator

    Don’t Make This Common M&A Mistake | Harvard Business Review

    More Marketers Disapprove of Omnicom Acquiring IPG Than Approve | AdWeek

    3 Main Reasons Why Big Technology Projects Fail – & Why Many Companies Should Just Never Do Them | Forbes

    The Merger Mystery: Why Spend Ever More on Mergers When so Many Fail? by Geoff Meeks and J. Gay Meeks

    Most new products fail: Implicit sensory testing can help beat the odds | Food Navigator Europe

  • AE86 + more things

    Toyota AE86

    The rear-wheel drive AE86 model generation of the humble Toyota Corolla has a dedicated following. The cars were light, had twin-cam engines, a very balanced weight split and a limited slip differential.

    11_01

    Back in the 1980s in Ireland they were a steady performer on the local rally scene. The AE86 because of its simplicity became very adaptable for street and motorsport tuning. The AE86 popularised car culture internationally, turning up across media formats and supported by a vibrant cottage industry of parts manufacturers who exported their parts around the world.

    The car entered popular culture across Asia and beyond through the manga and anime adaptions of Initial-D, which told the tale of Takumi – a student holding down two jobs – a petrol station attendant and delivery driver for the family’s tofu business in a Toyota AE86 Sprinter Trueno.

    Takumi’s adventures in the family AE86 went on to be portrayed in an 18-year long manga series, a Hong Kong film featuring Taiwanese entertainer Jay Chou as Takumi, 27 game adaptions at the time of writing and at least 12 anime series or feature length films.

    This soft culture footprint gave the AE86 an impact across Asia, hence the Malaysian meet-up that Hagerty shot in Kuala Lumpur. Will the popularity of the AE86 die off with this generation of young adults? It’s possible given that over a quarter of them in the US don’t drive.

    Toyota / Hyundai motorsport collaboration

    In advance of Rally Japan, Toyota’s Gazoo Racing and Hyundai’s N Sport held a joint event in Korea. It’s quite rare to see rival manufacturers partner in this way.

    How to read a compass

    This took me back to my 12-year old self away at scout camp (which I did only once) doing the activity for my map-making activity badge. Taking bearings from multiple locations and triangulating them allowed me to plot out my map. I need to dig out my Silva compass that my Mam and Dad probably still have somewhere in their attic.

    I found myself using the basics of reading a compass when living in urban Hong Kong and Shenzhen as the extremely tall buildings stopped GPS from working that well. Sharing here, partly out of nostalgia and the the life skills benefits.

    Hello Kitty and an adoption mindset.

    Japan popular culture commentator Matt Alt put out a video about the history of Hello Kitty and Sanrio. One of the interesting things that came out of the video was how adult women embracing the playfulness of Hello Kitty, rather than ‘adulting up’ then became on the leading edge of technology adoption. I thought the idea of a ‘playful mindset’ and adoption was very interesting – yet something that we don’t often think about. I used to think about it as curiosity, but it’s more specific and it can be fostered regardless of age.