A couple of years ago, most LLMs were bad: lots of hallucination, no ability to do simple math or reason, limited context windows, no sense of intent, etc. Fast forward to today, and the latest models have addressed all of those problems: they post perfect scores on PhD-level tests, write flawless code with minimal oversight, build financial models, book tickets and make reservations. The step change is exponential, and one can’t help but wonder how much better they are going to get a year or two from now.

Mine can't find an email based on simple criteria like who it was from or the subject matter. It was really good at giving me a five-paragraph explanation of what other kinds of emails it might be able to show me, though.

They still can’t make tacos

That’s a clanker taco

lol. sure.

Oh sweet summer child

Well, that’s the entire point of “generative” ai

The industry is not ready to hand over the keys to AI. Period.

Most companies flaunting AI are posturing.

I just want to know who you work for and what clown university you got your degree from at this point

but my water :(

Not sure if you are being facetious or serious, but we should be worried about our water quality, air quality, etc., specifically long term.

Nice try, AI

If you think LLMs write "flawless code," I have some serious doubts about your own ability.

I've worked with several of the mainstream LLMs and they all need a lot of hand-holding to keep the level of technical debt low. Even then, they still require very regular code reviews to catch stupid assumptions/hallucinations.

As a force-multiplier, they are practical. As a standalone expert, not even close.

Force multiplier is a great way to describe them. As a developer, they can make me a better developer because I understand what I’m building. I’m not a particularly good writer. If I use an LLM to help me write a novel, I’ll write a novel, but it won’t be very good because it’s multiplying a weakness instead of a strength.

Technically, that's slightly overstated at present. Relevant benchmarks for hard agentic terminal-use problems and non-hallucination rates max out in the 50% range for the top-tier models. Given the established rate of change, though, we could easily see the mid-90% range by the end of the year. Between Gemini 3 Pro in November 2025 and Gemini 3.1 Pro, released this week, the non-hallucination rate increased from 12% to 50% on the relevant benchmark.

The quality of the core LLM flow (take input, produce plausible tokens) has pretty much peaked, and now it’s mainly about building harnesses around the models. Hallucinations, lack of transparency, and context length/attention limitations are a feature, not a bug, and different tools and workflows need to be built to get around them and make LLMs useful for actual serious work.

It’s a shame that LLMs are superficially convincing. For example, I see people ask “Analyse all the research in the world to determine if X is true” and then actually believe the chatbot when it says seconds later “Based on analysis of all publications…”. In reality, of course, it did NOT analyse anything, as that would have taken hours :) Even “Deep Research” is mainly 100-200 Google searches and very little critical analysis.

Winners will be those who take a pragmatic approach to AI and learn to use a few great tools fit for their workflows. Check out NotebookLM, Claude Cowork, Skimle, Gamma, etc. for typical consulting use cases. And spend time learning how LLMs work beyond the “it’s magic” hype :)

While I agree that an agentic harness and tool use greatly enhance models' capabilities, it is inaccurate to say the models themselves have peaked. Objective benchmarks demonstrate they continue to make rapid progress on multiple fronts, including on tasks that people typically cite as their shortcomings. This is why Google, OpenAI and Anthropic have increased their pace in releasing new SOTA models--the gains are evident at a faster and faster rate. Check out Artificial Analysis if you don't already.

There are still a ton of things it can’t do, but in about two years 😳

We use different LLMs. The answers I get from Claude/Gemini/ChatGPT range from D+ to C+.

Plenty of leverage for sure, but even using the latest models and rigorous prompt design and criteria, I still have to pressure test and iterate anything I get through multiple cycles before I consider it of even moderately decent quality. This is for data analytics tasks or text generation. And even then I end up making significant adjustments and revisions before it’s client ready. As someone who actually holds a PhD, I question your estimation of the models’ capabilities. I’d put them at “over-eager undergrad intern” or worse yet “average foreigner paying the full cost of a 1-year Master’s Degree to access the U.S. job market”

What are you on? AI is still fecked up 😆 Can’t lie, it’s great in certain industries & use cases, but I can only use it to take minutes, which I then tend to spend time auditing for accuracy… like a job in itself
