anyone ever work w stop words using en code web sm(pipeline from spaCY)? i’ve added my own stopwords but sometimes they’re working and sometimes they’re not, any recs?

like
Posting as :
works at
You are currently posting as works at

When you say sometimes working sometimes not, I am guessing you mean that some of SpaCy's stopwords are not stopwords in your domain and vice versa? One possibility to generate stopwords for your own domain is to calculate the IDF of all words across your corpus and use a sensible threshold to mark words above the threshold as stopwords for your domain.

like

Sorry words below the IDF threshold. Stopwords are basically words with high document frequency (i.e. they occur in so many docs that they lose their discriminative power).

I've worked a little with this. Not extensively. If you put code on github and want me to look at it I will.

Related Posts

Which ERP has the highest potential for a new Analyst? Feel lost between Salesforce, ServiceNow, SAP, Workday, etc. Currently staffed on a major SAP project.
Want to make an informed career decision.

like

What is the fastest way to sponsor a pup?

like

Hi, all looking for a job change with good package.

I am 6 years experienced. A cyber security and information security professional with skill sets in, Vulnerability assessment and management, endpoint security, SAST, DAST, DevSecOps (1.5 years), AWS and Azure, ISO 27001 internal audits and OSINT.

Let me know if anyone can refer me.

Where do people go after Visa?

like

Hi all, new-ish associate here at a Virginia firm - quick question: if a pleadings filing deadline for a GDC case ends up being on a Saturday or Sunday, does that mean that it can actually be filed by that following Monday since the court and clerk's office will be open then?

like

Hi Fishes,

What's the salary bracket for Manager in technology consulting in EY India.

Also would of great help if someone can highlight the responsibilities and WLB. Thanks

like

Does having a CVA (certified valuation analyst) useful? have heard mixed reviews

like

How much annual hike in citi india on average at c11 level??

like

Any experience with GRC practice within the Big4? If so, which one would be the better choice as far as maturity & pay?

like

Looking to specialize in privacy and was considering getting the CIPP/CIPM/CIPT. I already have my CISSP and CISA. Is it worth it?

like

Any agencies hiring? (Planning on staying away from Rogers and pharma)

like

Welp, see yeah later KFC!

like

I understand A&M comp is primarily derived from the annual bonus based on collections/performance. However, given the significant increase in Big 4 FDD comp this year, are there any plans to raise the base salary for A&M? I have to pay rent every month not just annually 🤣

likesmart

Does anyone have an email address of the recruiter at Trailer Park?

"Use the approved language."

"Why'd you use this language?"

likefunny
like

got a CIS form filling link from cognizant, what it is.. i am experirenced java developer...interview clreared at cts and also had salary discussion

like

Last week I got automated call from infosys and asked few details . After that I got call from hr and said my interview is scheduled on next week and very next day one person called me and asked few technical questions .she said she's calling from infosys and this is technical screening round . Today same hr called and said she will be scheduling a client interview.

My doubt is till now I didn't received any mail from infosys. So this is scam or part of infosys interview process?

like

Additional Posts in Data & Analytics Consultants

Mεntal mαth in iηtervieωs. Thoughts, tips, rants, grievances?

like

Business practice: "can you guys build a model that overlays where Coronavirus is going to break out with client sites"

likefunny

What would you choose between palantir and a small startup with great growth opportunities?

like

Anyone who has done masters in business analytics Singapore? Could you help on courses, Scholarships and careers and jobs. Looking for help.

How does everyone in this bowl feel towards master’s degrees in data science/analytics? I didn’t study this in undergrad so I’m looking to get an analytics degree but I’ve heard mixed reviews.

like

I need to do a timeline with lots of information over a long period of time. I wanted to that in Tableau, but partner will not pay license and data is sensitive so I cannot use Tableau public.
Do you have suggestions?
The requirements are the following:
- No online input data upload (so local editing)
- The produced output file must be local and easily exportable and readable
-timeline must be interactive (filter/click/zoom in/out/view detals etc.)
- free

Any kind of help is really appreciated

like

today I choose violence

Post Photo
funnylikesmartuplifting

New data offering: name.co, or namesoftware.com for the main domain? I’m leaning toward name.co, but I worry about not having the .com.

like

I am currently working mostly in Data Analytics, but I want to transition into Data Engineering. I've applied to a couple Data engineer positions but it seems I don't have the qualifications for them. Should I maybe look for Jr. Data Engineer positions to get experience and try to move up from there? If so anybody know of any Jr. Data Engineer positions available?

like

Got Amazon career essentials (ACE) assessment to complete for Business Analyst role
To anyone who has experience with it- What kind of work simulation questions should I be expecting?
Is there a way I can prepare for it or any resources one must know of?

like

Anyone know of an alternative to StackOverflow for asking tech questions? The community is super toxic and I'm looking for a place I can casually ask questions that might seem dumb without being hrangued for not providing an essay proof of the research I've already done. Or some sort of buddy system where I can ping a person of relevant experience those questions. It's better to poke someone's brain for a minute than to spin in circles reading docs/dead end articles sometimes.

like

I'm non US citizen and thinking of moving back to EU. My plan is getting remote opportunities across the world and work as a freelancer. My speciality is Data analytics and BI (Qlikview, Tableau). 1) How likely I can get a gig from the US? 2) if a job looking for a remote 2-3 months contract do they care what country I'm in? 3) how can I compete Indian rates as I'm in Europe 60-70$ per hour as a BI consultant is a good rate but I see online rates as low 10$😫?

like

If you were to hang your own shingle as a consultant what BI tool would you use to deliver dashboarding to clients?

Assume that you aren’t plugging in to source systems as I primarily work in CDD.

like

Is it worth it to do a PhD in Statistics or Epidemiology or Health Informatics if I am interested in being a data science director in Pharma or a Health system? I am working remotely a a data scientist right now. I am plan on learning more ML models for now. However, is there a certain point where it makes sense to get a PhD? I want to stay in the healthcare space but not sure what I should do next besides taking coursera classes.

likeuplifting
like

Coming to the realization that consulting is not for me (tech person) feeling angry that salaries at D will never be on pair with FANG and that leadership is not tech focused. For example you could be a soul contributor at fang making 240k and people at Deloitte wait 10+ years to earn that as Senior Managers because they need to climb the corporate ladder. It seems super clear cut to me now, if you want to make money in tech don’t join consulting firms. Am I wrong?

like

Any cyber security folks here? or those interested in systems engineering? Potential role with TC 120-180, awesome team and wlb, at least compared to consulting.

like

Is there a good online course on storytelling with data? Something that can help structure your thought process and design choices when creating visualizations and Dashboards in Tableau, Power BI etc.

like

Does your company split data science roles into different practices like data & analytics and software engineering? Does this inhibit collaboration & Where does a data engineer fall?

like

In the most unnecessarily complicated phrasing worthy a dissertation, what are you working on this week?

like

New to Fishbowl?

Download the Fishbowl app to
unlock all discussions on Fishbowl.
That was just a preview…
Sign Up to see all discussions
  • Discover what it’s like to work at companies from real professionals
  • Get candid advice from people in your field in a safe space
  • Chat and network with other professionals in your field
Sign up in seconds to unlock all discussions on Fishbowl.

Already a user?
Login here

Share

Embed this post

Copy and paste embed code on your site

Preview

Download the
Fishbowl app

See what’s happening in your industry
from the palm of your hand.

A phone with Fishbowl app

Scan your QR code to download
Fishbowl app on your mobile

Download app

Sign up for free to view this conversation on Fishbowl

By continuing you agree to Terms of Use and Privacy Policy

Already have an account? Log in

Sign up for free to continue using Fishbowl

By continuing you agree to Terms of Use(New) and Privacy Policy(New)
Messaging rates may apply

Already have an account? Log in

For account settings, visit Fishbowl on Desktop Browser or

General

Legal