A couple of years ago, most LLMs were bad - lots of hallucination, couldn’t do simple math, couldn’t reason, limited context window & lacked intent etc. Fast forward to today and the latest models have addressed all those problems, have perfect scores on PhD level tests, can write flawless code with minimal oversight, do financial models, book tickets and make reservations. The step change is exponential and one can’t help but think how much better they are going to get a year or two from now.

likefunnysmart
Posting as :
works at
You are currently posting as works at

Mine can't find an email based on some simple criteria like who it was from or the subject matter. It was really good at giving me a five paragraph explanation on what kind of other emails it might be able to show me

like

They still can’t make tacos

likehelpful

That’s a clanker taco

funny

lol. sure.

like

Oh sweet summer child

likefunny

Well, that’s the entire point of “generative” ai

likefunny

The industry is not ready to handover the keys to AI. Period.

Most companies flaunting AI are posturing.

like

I just want to know who you work for and what clown university you got your degree from at this point

like

but my water :(

like

Not sure if you are being facetious or serious but we should be worried about our water quality, air quality, etc specifically long term

like

Nice try, AI

like

If you think LLMs write "flawless code," I have some serious doubts about your own ability.

I've worked with several of the mainstream LLMs and they all need a lot of hand-holding to keep the level of technical debt low. Even then, theu still requires very regular code reviews to catch stupid assumptions/hallucinations.

As a force-multiplier, they are practical. As a standalone expert, not even close.

like

Force multiplier is a great way to describe them. As a developer they can make me a better developer because I understand what I’m building. I’m not a particularly good writer. If I use an llm to help me write a novel, I’ll write a novel but it won’t be very good because it’s multiplying a weakness instead of a strength.

Technically slightly overstated at present. Relevant benchmarks for hard agentic terminal use problems and non-hallucination rates max out in the 50% range for the top tier models presently. Given the established rate of change though, we could easily see mid-90% range by the end of the year. Between Gemini 3 Pro in November 2025 and Gemini 3.1 Pro released this week, the non-hallucination rate increased from 12% to 50% on the relevant benchmark.

like

The core LLM flow (take input, produce plausible tokens) quality has pretty muched peaked and now it’s mainly about building harnesses around the models. Hallucinations, lack of transparency, context length/ attention limitations are a feature not a bug, and different tools and workflows need to be built to get around them and make LLMs useful for actual serious work.

It’s a shame the LLMs are superficially convincing. For example I see people ask “Analyse all the research in the world to define if X is true” and then actually believe the chatbot when it says seconds later “Based on analysis of all publications…”. In reality of course it did NOT analyse anything as that would have taken hours :) Even “Deep Research” is mainly 100-200 google searches and very little critical analysis.

Winners will be those who take a pragmatic approach to AI and learn to use a few great tools fit for their workflows. Check out NotebookLM, Claude Cowork, Skimle, Gamma etc for typical consulting use cases. And spend time learning how LLMs works beyond the “it’s magic” hype :)

like

While I agree that an agentic harness and tool use greatly enhance models' capabilities, it is inaccurate to say the models themselves have peaked. Objective benchmarks demonstrate they continue to make rapid progress on multiple fronts, including on tasks that people typically cite as their shortcomings. This is why Google, OpenAI and Anthropic have increased their pace in releasing new SOTA models--the gains are evident at a faster and faster rate. Check out Artificial Analysis if you don't already.

There is still a ton of things it can’t do but in about two years 😳

We use different LLMs. The answers I get from Claude/Gemini/ChatGPT range from D+ to C+.

Plenty of leverage for sure, it even using the latest models and rigorous prompt design and criteria, I still have to pressure test and iterate anything I get through multiple cycles before I consider it even of even moderately decent quality - this is for data analytics tasks or text generation. And even then I end up making significant adjustments and revisions before it’s client ready. As someone who actually holds a PhD, I question your estimation of the models’ capabilities. I’d put them at “over-eager undergrad intern” or worse yet “average foreigner paying the full cost of a 1-year Master’s Degree to access the U.S. job market”

What you on? AI still fecked-up 😆 Can’t lie that it’s great in certain industry & use cases - can only use it take minutes which i tend spend time auditing for accuracy…like a job in itself

Related Posts

I would love to get my hand on one of these bad boys. Just wondering if anyone has ever driven one or gotten one in an auction. Definitely high on my list if I ever get the chance to start a real collection. Mazda RX-7 (FD).

Post Photo
like

I want to start learning DevOps, what is the right path to learn DevOps? Does TCS has good projects for DevOps.. Suggest me any combination with DevOps.

like

Whenever I am trying to apply for job from TCS i begin portal it gives an error like
'You cannot Apply against other jobs as you are a Placement Agency Referred candidate '

how can I fix this?


PLEASE HELP 🙏🙏🙏🙏🙏

like

I’m sure this has been covered but I need a recommendation for a cheap pair of noise cancelling headphones. I have AirPods which are unbelievable and have changed my life but they aren’t great...

Someone I’ve started seeing uses they/them pronouns. I’m trying my best but still feel like I make mistakes. When I want to address them is it ok to say how are you or should I say how are they?

Hello all,

Since I am new to this Portal, need 11 likes to Unlock DM.

Thankyou in Advance!

like

Hi, any idea about WFO,?

like

How to get released from current project in Wipro.

like

Hello Fishes,

I have total 4years of experience (joined my current company last year). Current package is 7.5LPA.
I am happy with my work life right now but need the hike. Also i am working as frontend developer (react.js) but my tech stack is MERN and i also have about 1year experience with springboot. In my current company my ctc will be revised on January,2023  as i am not completing 365 days before July 1(only missing for a couple of days). What should i do right now? What should be my ctc

like

Could someone please refer me at Deloitte USI ?? I have the Job Ids with me

like

Got the back to office mail today.
What to do?
I don't want to leave home.

Hi

I have got an opportunity for interview in mearsk for sap atr fico S4 Hana consultant , please let me know the interview process.

I am having total experience of 8+ yrs and relevant of 6+ years what salary can I expect and which level will be suitable for joining .

Genuine suggestions needed please.

Thanks in advance .

What is the starting salary for a tax senior? 😅

like

Hello Guys

Is it okay to share offer letters that we have with other company recruiters? Aren't offer letters strict and confidential. Can we refuse to show the same or what can we do in these situations.

like
like

Do you guys have Any good books on business? Not those self help books

like

Benchmarking: what firms have holiday shutdowns (e.g., Christmas break, July 4th, etc) and do these count against personal PTO?

like

Has anyone moved from Deloitte to Emerson Consulting? Hoping to hear some first hand experiences to know what to expect!

like

How do you single 🐟 find time to meet people and get to know them when working these crazy hours? Feel like I'm putting that part of my life on hold due to my job, and it's kind of freaking me out 😣

like

Hi folks! My interview at Google scheduled in next two weeks. I do my best at preparation, but I am so anxious, I feel like my destiny is depend on this interview and I will fail because of my emotional instability. Ask for advice how to overcome this and do not take denial too personal

like

Additional Posts in Consulting

Did anyone read the top consulting firms to work for? 😂😂😂

like

How risky is it to bring CBD/THC gummies back from CA to NYC? My friend wants some for her anxiety and I know it’s not 100% legal but does anyone really care?

like

To those starting the year on the beach: office or home office ...or actual beach?

like

What's your story of malicious compliance either externally or internally?

like

Hired as SA1. if my utilization is low for round table but I’ve gotten good Snapshots will I go up to SA2 or SA1B, which I don’t want of course.

like

Hearing rumors about Huron layoffs. Any deets? Friend of mine said he got the "email".

like

I am currently seeking a role in Consulting, my experience has been in in the intersection of Healthcare & Social Services in both the Private and NonProfit sector. I have a BA in Forensic Psych, an MPA, an MPhil in Public Policy and currently in the last leg of my PH.D in Public Policy with a specialization in Health Policy.

I would appreciate any advice anyone could give that might be helpful to help get my foot in the door.

like

I'm almost at the point of an offer, and also just started a 6 month long engagement since my utilization was taking a hit. How do you fish recommend dealing with the situation once the offer comes?

like

Best resources/books on getting personal finance in order?

like

If goal now is to get into MBB (personally prefer McK) - which of the following will be the best option that’ll help me get there -

1. SC at Big4
2. Engineer role at Startup (salary +50% higher than big 4 SC offer)
3. Amazon/Amazon Ops role (salary similar if not higher than option 2)

Would greatly appreciate any insight from the community.

like

Best consulting companies with exposure to the blockchain / digital assets space?

like

Anyone have good book recommendations that I can learn from and are entertaining? Trying to survive a 10 hour flight this weekend 🆘🆘🆘

like

Do any strategy arms of Big 4 or MBB do pre-MBA internships?

like

My MD had me drive around taking pictures of the downed power lines and trees in my area to prove I missed the client meeting due to the tropical storm

funnylikeupliftingsmart

BCG folks, I hear rumors of layoffs happening given the slow pipeline. Is it true?

like

PPMDs & SMs, what tips you would give if your client is a Member Firm? I am an incoming project manager, lateral hired from industry. Please share your thoughts about how to make an impact from day 1.

like

Does anybody have good electric standing desk recommendations? Ideally in the $200-300 range.

like

I do believe EY is d kindest when it comes to putting people first.Thank U Kelly.Resume travel based on CDC & facts, no layoffs in foreseeable future, extending trust when wfh& personal accountability

like

Losing motivation and drive as time goes on , and I’m only 27 . Nothing I’m passionate about and money just seems like it’s ever enough ($160k total comp) . I was lucky enough to be making that much at McK while solving social problems within education , Econ dev , and public health

However , social good isn’t pleasing my soul neither is the income . I’ve switched jobs to healthcare where I’m impacting people’s physical and mental health . Still no passion / drive

I feel lost / bored :(

like

What resources (online prep, books, case books, peers, mental math, etc) did you use when getting ready for case interviews? I’ve seen several online — Management Consulted, Hacking the Case Interview, MConsultingPrep, etc — and would appreciate any insights on what was helpful!

like

New to Fishbowl?

Download the Fishbowl app to
unlock all discussions on Fishbowl.
That was just a preview…
Sign Up to see all discussions
  • Discover what it’s like to work at companies from real professionals
  • Get candid advice from people in your field in a safe space
  • Chat and network with other professionals in your field
Sign up in seconds to unlock all discussions on Fishbowl.

Already a user?
Login here

Share

Embed this post

Copy and paste embed code on your site

Preview

Download the
Fishbowl app

See what’s happening in your industry
from the palm of your hand.

A phone with Fishbowl app

Scan your QR code to download
Fishbowl app on your mobile

Download app

Sign up for free to view this conversation on Fishbowl

By continuing you agree to Terms of Use and Privacy Policy

Already have an account? Log in

Sign up for free to continue using Fishbowl

By continuing you agree to Terms of Use(New) and Privacy Policy(New)
Messaging rates may apply

Already have an account? Log in

For account settings, visit Fishbowl on Desktop Browser or

General

Legal