What are the best imputation techniques to replace missing data without introducing too much bias in a model?

likehelpful
Posting as :
works at
You are currently posting as works at

If the column has many missing values then do not use the column.

If there are enough samples to build a model then remove the rows with missing data.

If there are few samples and you can’t afford to remove rows with missing data then use k-NN or MICE.

likehelpful

If there are sales data for two products and the user never bought the second product, it can be NULL for product 2. You don’t want to drop row or column. In that case, the only sense is to impute zeros. Any other imputation will create bias.
There’s no such rule that one must never impute zero. It all depends on what kind of data there is and your business problem.

Get rid of the observation

likefunnysmart

In the olden days of OLS and logistic regression, we used to either (a) bin that variable and let missing be its own bin or (b) impute to a default value (eg median, mode or zero) and then include a missing value indicator paired with the underlying variable. A bit simplistic but at least both can handle MNAR. But also, before choosing a method you should seek to understand the data capture process and why the data is missing.

like

Yeah I still do this. Have yet to see an example where doing fancy imputation methods actually made a meaningful difference to model performance. Your time/energy are probably better spent elsewhere.

Caveat that I agree with others that if the data in a column is very sparse, it shouldn’t be used for modeling.

What level/portfolio are you that you actually get to do data modeling?

like

If time series, then average of observations before and after weighted by day of week or whatever makes sense

Really depends

Related Posts

How big usually is y’all’s bonuses?

Received an offer for Director of BD comp. What's the typical base for a private clinical stage company?

like

Currently found myself with a lot of free time and looking to learn a useful tech skill. I have experience in solution architecture, but nothing more technical than that. Any recs on a hot skill that would be useful to know (e.g., analytics, cloud, AI, ML, etc.)? Don’t know where to start..

like

Anyone had pay review conversation. How much is hike/bonus

like

Has anyone actually gone through with accepting an offer from the Siegfried Group? I hear from a recruiter representing them probably once a week. How is it?

like

Anyone else absolutely *loathe* being assigned new client matters at the end of the month when you know your billable time won’t count bc conflicts haven’t cleared yet? Just me?

like

So many people on PTO. Job search has really come to a halt. I’m just waiting on a bunch of people to get back from their PTO after labor day. I HATE doing nothing. I feel so useless. Ive been trying to work on extensions for stuff in portfolio but my CW partner on the projects has been busy with actual work (she works in a diff state). What do i do? Lol. Anyone need help? Wanna chat?

like

𝑯𝑿𝑴 𝒐𝒏 𝑴𝒐𝒃𝒊𝒍𝒆 - 2𝑯 2021 𝑯𝒊𝒈𝒉𝒍𝒊𝒈𝒉𝒕𝒔! 

#SAP #SuccessFactors #Mobile #HXM

Post Photo

I’ve been at BCG in the west coast for 3 years out of undergrad. I am looking to join a Series B/C start-up in some strategy/operations role. Any rough estimates for what I can expect for comp (knowing it’s variable)? TYIA

like

Hi guys 👋🏻
This is regarding the interest amount of our EPF.

For you guys,is the interest details showing any amount in the EPFO member passbook? Or is it showing '0' like me(As seen in the photo below)?

Is the government yet to credit the interest amounts of our PF balances? Or is something wrong with my PF account only?

Any inputs will be really appreciated 🙂

Post Photo
like

PE ops companies in Singapore?

like

Hi , is there any opportunity in BIM Feild anywhere in India. I am having 1 year of revit experience, 3 years of Architectural experience and more than of 10 years of Visualization experience (3ds max ). Please let me know freinds i am desperately looking for the job. Please help!!!

like

Hi All,
I have been working as mainframe developer for around 8 years. From last year, working on requirement analysis and team management activities. I want to move into java full stack developer profile as I am not interested in management work. I know that being 8+ YOE guy and considering my tech background, it would be difficult to transition, but I am really depressed with my work profile. Please advise how should I transition my career path to a java full stack developer.

Best exit opp out of the following three with pay being equal: Senior Financial Reporting Analyst, Internal Audit Senior, or Senior Accountant?

like

I have no credit history and am just out of undergrad making ~$100k. Any chance of me being approved for Citi Double Cash or Chase Freedom Unlimited? Do I need to get a “starter” card to build some credit first?

like

I am on 5 projects right now all due within the next week-3 weeks. I feel like I’m drowning and might quit if I reach a breaking point, what should I do? Only work on the projects worth a shit? Blow off first round check ins to work on stuff that’s due?

like

Hi,

Need combinations of helpful, uplifting smart and funny.


Please help🙏

like

Should I opt for online 2 yrs MBA from DY patil university? It is good or should I look for alternative. I am getting alot of calls from them. Please suggest. I have around 3 yrs experience in IT.

like

HI Fishes, Anyone in Infosys working in client location ( Bfsi ) what clients to Infosys has in Gurgaon location. Any idea Infosys

like

Additional Posts in Data & Analytics Consultants

Looking for technical data analyst in Austin, tx (also have remote available) to join a series E startup. Need strong python & sql experience. A lot of data munging and cross-functional work. Love to chat if this makes sense. Cheers!

helpful

New data offering: name.co, or namesoftware.com for the main domain? I’m leaning toward name.co, but I worry about not having the .com.

like

Anyone have recommendations on the best app/site to learn Python? 🐍
I’m a complete beginner.

like

What are the top sales & marketing metrics that tech companies track? I have an interview for a Data architect role and the data aspect is my strength but need to brush up on the business insight part

like

Anyone here work or has interviewed for a data science role at DataRobot? Would love to get some insight. Third interview coming up.

like

Anyone in the KPMG Lighthouse practice that would be willing to take a look at my husband’s resume and put in a referral? He’s interested in the Associate, Data Scientist position.

Newish to data analytics. Have some experience in power BI, sql, and R. I wanted to get some exposure to aws and saw the AWS cloud practitioner as an available course at my job. Is this cert worth it? Or should I try one of the more advanced ones ? The data analytics one for example but wasn’t sure if it would be difficult to get through as a beginner. Will it look good on my resume?

like

Got Amazon career essentials (ACE) assessment to complete for Business Analyst role
To anyone who has experience with it- What kind of work simulation questions should I be expecting?
Is there a way I can prepare for it or any resources one must know of?

like

Could anyone recommend a set of indices that are publicly available and is a good predictor for consumer spend in home renovations ?

like

If you were to hang your own shingle as a consultant what BI tool would you use to deliver dashboarding to clients?

Assume that you aren’t plugging in to source systems as I primarily work in CDD.

like
like

Hello, Anyone here from Deloitte Omnia AI who could refer me for an open opportunity.

like

What tech companies are best for Data Analysts? Looking for $120k base + equity that’s gonna be worth a lot + other great benefits + possible remote.

funny

What are some interesting non-tech/insurance/advertising/consulting careers where data science is applied? Thinking of epidemiology and economics

like

Has anyone else begun to resent data science?

like

Anyone who has done masters in business analytics Singapore? Could you help on courses, Scholarships and careers and jobs. Looking for help.

from God import pandemic
From China import coronavirus, mortality_rate

covid_19 = pandemic.coronavirus.set_mortality(mortality = 1.)

like

Folks applying for Data and Analytics roles in the industry at M, SM, D Levels, what technical/functional topics do you prepare for on the Analytics side?

like

Fellow data 🐟, plz recommend me something to do tomorrow. Existing weekend plans went up in smoke and I'm feeling like trying something new

like

New to Fishbowl?

Download the Fishbowl app to
unlock all discussions on Fishbowl.
That was just a preview…
Sign Up to see all discussions
  • Discover what it’s like to work at companies from real professionals
  • Get candid advice from people in your field in a safe space
  • Chat and network with other professionals in your field
Sign up in seconds to unlock all discussions on Fishbowl.

Already a user?
Login here

Share

Embed this post

Copy and paste embed code on your site

Preview

Download the
Fishbowl app

See what’s happening in your industry
from the palm of your hand.

A phone with Fishbowl app

Scan your QR code to download
Fishbowl app on your mobile

Download app

Sign up for free to view this conversation on Fishbowl

By continuing you agree to Terms of Use and Privacy Policy

Already have an account? Log in

Sign up for free to continue using Fishbowl

By continuing you agree to Terms of Use(New) and Privacy Policy(New)
Messaging rates may apply

Already have an account? Log in

For account settings, visit Fishbowl on Desktop Browser or

General

Legal