• Lie factor in ad graphs

    Lie factor in ad graphs

    June 9, 2020

    It’s fun to look at the visit statistics and to discover old stories. I wrote this post in 2016. For a reason I don’t know, this post has been one of the most viewed posts in my blogs during the last week.

    So, I decided to publish it again. I won’t add any new examples, but if you want to see more stuff, type [lying with data visualization] in your favorite search engine

    Lie factor in ad graphs

    What do you do when you have spare time? I tend to throw graphs from ads to a graph digitizer to compute the “lie factor”. Take the following graph for example. It appeared in an online ad campaign a couple of years ago. In this campaign, one of the four Israeli health care providers bragged about the short waiting times in their phone customer support. According to the Meuheded (the health care provider who run the campaign), their customers had to wait for one minute and one second, compared to 1:03, 1:35, and 2:39 in the cases of the competitors. Look how dramatic the difference is:

    Screen Shot 2018-02-16 at 18.34.38

    The problem?

    If the orange bar represents 61 seconds, then the dark blue one stands for 123 seconds, almost twice as much, compared to the actual numbers, the green bar is 4:20 minutes, and the light-blue one is approximately seven minutes, and not 2:39, as the number says.

    Screen Shot 2018-02-16 at 18.32.53

    I can’t figure out what guided the Meuhedet creative team in selecting the bar heights. What I do know that they lied. And this lie can be quantified.

    June 9, 2020 - 2 minute read -
    data visualisation Data Visualization dataviz lie lie-factor blog
  • StellarGraph — another promising network analysis library for Python and Scala

    StellarGraph — another promising network analysis library for Python and Scala

    June 8, 2020

    Network (graph) analysis is a complicated topic. There are several tools available for this task with different pros and cons. Recently, I stumbled upon another tool StellarGraph. StellarGraph authors claim to provide excellent performance; NumPy, Pandas, TensorFlow integration, an impressive set of algorithms, inter compatibility with Neo4j (THE graph database); and much more. The documentation looks very clear and extensive too.

    I didn’t use it yet, but I certainly plan to.

    https://www.stellargraph.io

    June 8, 2020 - 1 minute read -
    igraph neo4j network-analysis networkx social-network-analysis stellargraph blog
  • The hazard of being a wizard. On balance between specialization and the risk to become obsolete.

    The hazard of being a wizard. On balance between specialization and the risk to become obsolete.

    June 3, 2020

    A wizard is a person who continually improves his or her professional skill in a particular and defined field. I learned about this definition of wizardness from the book “Managing project, people and yourself” by Nikolay Toverosky (the book is in Russian).

    Recently, Nikolay published an interesting post about the hazards of becoming a wizard. The gist of the idea is that while you are polishing your single skill to perfection, the world changes. You may find your super-skill irrelevant anymore (see my Soviet Shoemaker story).

    Nikolay doesn’t give any suggestions. Neither do I.

    Below is the link to the original post. The post is in Russian, and you can use Google Translate to read it.

    Страница о магах У меня в книге есть глава про полководцев и магов. В её конце я подвожу итог: Несмотря на свою кру­тость, маг уяз­вим. Он поле­зен, только если его навык под­хо­дит к задаче. 658 more words

    Почему опасно быть магом — Об управлении проектами и дизайне

    June 3, 2020 - 1 minute read -
    diversity obsolete skills blog Career advice
  • Nice but useless data visualization

    Nice but useless data visualization

    June 2, 2020

    Network visualization can mesmerize and hypnotize. Chord diagrams are especially cool because they are so colorful and smooth. The problem is that sometimes, the result doesn’t provide any actual value, and serves as a cute illustration. Cute illustrations are cute; they help put some “easiness” to the text without the risk of looking too unprofessional.

    Take the two examples below.

    One example (taken from here) shows worldwide migration patterns in a clear and useful way. You can take a look at the graph and make real conclusions.

    The other example (taken from here) is mostly a useless illustration.

    The only “conclusion” that a viewer can make out of this graph is “everything is connected with everything.”

    This type of conclusion is OK for an ad or a general overview of a problem, but it is NOT a valid way to end a discussion.

    June 2, 2020 - 1 minute read -
    data visualisation Data Visualization dataviz
  • Bioinformatics career advice and a story about a Soviet shoemaker

    Bioinformatics career advice and a story about a Soviet shoemaker

    June 1, 2020

    When I was in elementary school (back in the USSR of the mid 80’s), I had a friend whose father was a shoemaker. Due to the crazy stupid way the Soviet economy worked, a Soviet shoemaker was much richer than a physician or an engineer. But this is not the story. The story is that one day this friend’s father had a chat with me about selecting a profession. This man’s point was that for as long as people have feet and need shoes on their feet, a shoemaker would be required and well-earning occupation. Guess what? People still have feet, and still, ware shoes, but I don’t see too many successful shoemakers anymore.

    Common wisdom says, “It is very hard to predict, especially the future.” And I will add “even more especially, about the job market.”. Nevertheless, people need to decide what to do with their lives, how to live, and what career paths to pursue. Some of them ask me, and I’m glad to answer. If you have any career-related questions, don’t be shy! Write to boris@gorelik.net, and I’ll see what wisdom I will be able to share with you.

    Anyhow, this is a letter that I got from another pharmacist looking for a data science career.

    Hope you are doing well. I saw your posts on Quora and thought of asking a doubt.
    First let me tell my background. I am from India, I completed my Doctor of Pharmacy program (Pharm D). I am familiar with computer programming. I have intermediate knowledge in python and R programming. So I thought taking up Bioinformatics and computational biology Masters program so that I can connect Pharma industry and my knowledge in computer science. What do you think? I have applied to University XYZ and got offer letter. I have to take a decision within 2 weeks.
    Please let me know your thoughts on this.

    To which I replied

    Obviously, since the path you are describing similar to the one I took, I will think that it is a good idea. Moreover, as you might have read in my blog (for example, here), my opinion is that advanced degrees give much more stable foundations, compared to the “fast and easy” courses. Having said that, your life is yours, not mine, and the job market today is not the job market in 2001 when I graduated my B.Pharm.

    Thank you so much for replying to my silly question. I am honoured to get a response from you.

    First of all, I don’t believe in “there are no silly questions” bullshit, but asking a silly question is better than not asking at all. Secondly, these questions are not silly at all.

    I have a question, in your post dated 2017, you have mentioned that Bioinformatics was booming in 2001 and now it has lost its significance. Are you still have the same thoughts?

    I think that this person refers to the most visited post of mine “Don’t study data science as a career move; you’ll waste your time!”. There is also a 2019 follow-up.

    If that is the case then me taking a master’s in bioinformatics and computational genomics would be a bad idea, right ?

    Here’s what I responded. Keep in mind that I wrote this before the COVID-19 outbreak.

    Look, the markets in different countries are different.

    Back in the old days, there was a worldwide wave of closing bioinfo companies. All the Israeli ones were either closing or counting weeks before closing. One anecdote: I was interviewing at a company. Two weeks later, I called the person who interviewed me to ask whether I got the job or not, and the secretary told me that that person was fired due to layoffs.

    Right now, Israel sees a renaissance of bioinformatics companies, but I don’t know what will happen in the future. These companies live mostly out of investors’ money and are subject to strict regulations. However, if you get a good education, your head will be full of useful mental models, relevant basic knowledge, and good practices.

    End of quote. One of The COVID-19 madness side effects is the massive influx of money into biotech companies. Is this a short-term anecdote, or will it become a sustainable trend? I have no idea.

    Do you have any career-related questions to me? You don’t have to be a pharmacist to ask :-). Write to boris@gorelik.net. I promise to respond, even if by sending a link to my blog posts.

    June 1, 2020 - 4 minute read -
    bioinformatics data science careers blog Career advice
  • The difference between statistically meaningful and practically meaningful. An interview with me

    The difference between statistically meaningful and practically meaningful. An interview with me

    May 28, 2020

    Recently, I gave an interview to the Techie Leadership site. Andrei Crudu, the interviewer, made a helpful outline of the conversation. I marked the most important parts in bold.

    • Academic views on leadership;
    • Managing people isn’t for everyone;
    • Lessons from a practical approach;
    • Data Science is predominantly about data cleaning;
    • The difference between statistically meaningful and practically meaningful;
    • How sometimes companies tweak results to match expectations;
    • Bad managers make you appreciate the good managers;
    • Giving credit, being decent and not cheating;
    • All good teamwork starts with effective communication;
    • You don’t know that the stuff that you know is unknown to others;

    Overall, I enjoyed chatting with Andrei, and I hope you’ll enjoy listening to the interview. If you have any comments, feel free sharing them here or on the Techie Leadership size

    https://techieleadership.com/the-difference-between-statistically-meaningful-and-practically-meaningful-with-boris-gorelik-009/

    May 28, 2020 - 1 minute read -
    interview leadership podcast significance blog Career advice
  • Is Distributed Work a Divide and Conquer Strategy?

    Is Distributed Work a Divide and Conquer Strategy?

    May 27, 2020

    Before becoming a freelance data scientist, I used to work at Automattic, which I used to regard as my dream job. Not every current and ex-Automattician share that rosy point of view. Antimattic is an anonymous blog that allows ex-Automattic employees to vent their feelings about what used to be their workplace. One recent post on that blog raises a fascinating question about distributed (or work from home, or remote) companies. “Is Distributed Work a Divide and Conquer Strategy?” I have to admit that I haven’t thought about this perspective before. It looks like we will see more and more companies switching to remote work. It’s an interesting interpretation of the “future of work.”

    Obviously this site exists because people have had negative experiences at Automattic. But many people have also had very positive experiences at the company. Could it be that the distributed nature of Automattic allows for such varying experiences? 45 more words

    Is Distributed Work a Divide and Conquer Strategy? — Antimattic

    May 27, 2020 - 1 minute read -
    antimattic automattic distributed work blog
  • Logarithmic scale misinforms. Period

    Logarithmic scale misinforms. Period

    May 26, 2020

    Being a data scientist and a self-proclaimed data visualization expert, I like using log scale graphs when I find them appropriate. However, as a speaker and a communicator, I refrain from using them in presentations as much as possible. From my experience as a data visualization lecturer, I noticed that even “technical” struggle grasping the concept of log scale graphs.

    One of the Coronavirus side effects was the introduction of the term “exponential growth” to every living room. Naturally (to some of us), exponential growth is best presented using a semi-log graph, where the X-axis represents the time (linear), and the Y-axis represents the degree of magnitude of a value (log scale).

    A recent study (link) tested and demonstrated how bad log-scale is. The research title is “The Logarithmic Scale Misinforms the Public and Affects Policy Preferences.” From my experience, log scale graphs misinform everybody. Except for experienced data scientists. Nothing can confuse or misinform us, obviously ;-)

    It is a bummer though that data visualization in that paper sucks so much.

    Don’t publish graphs like this. Especially not in data visualization papers.

    Thanks to Bella Graph who pointed me to the original study.

    May 26, 2020 - 1 minute read -
    data visualisation Data Visualization dataviz log-scale blog
  • Book review: The Year Without Pants. WordPress.com and the future of work by Scott Berkun

    Book review: The Year Without Pants. WordPress.com and the future of work by Scott Berkun

    May 26, 2020

    TL;DR Interesting “history of work” book (definitely not “future of work”) with insights on transition-state organizations. Read it if history of work is your thing, or if you work in a small company that grows rapidly. 4.5/5 (due to the personal connection)


    **

    I got The Year Without Pants in 2014 as an onboarding present when I joined Automattic. The author, Scott Berkun, used to work as a manager at Microsoft (and maybe more places) before he quit and became a career of an adviser and an author. In 2011, the Automattic founder brought Scott to work at the company. About seventy people were working in the company back then and the company was growing rapidly. Automattic has just introduced a concept of teams, and the idea was that Scott will work as a team leader, consulting the management on how to deal with the transition.

    Being an ex-Microsoft manager, Scott was fascinated by the small distributed company, and wrote a book on it, proclaiming that the way Automattic worked was “the future of work”.

    The book was published in 2012. Today, in post-COVID 2020, nobody is surprised by people who don’t need to go to the office every day. Automattic has now more than 1,000 employees and has adopted many of the rituals big companies have, such as endless meetings, tedious coordination, name tags, and corporate speak.

    Why, then, did I enjoy the book? First, for me, it was a pleasant “time travel.” I enjoyed reading about people I knew, teams I worked with, and practices I used to love or hate. Secondly, this book provides insights on a transition from a small group of like-thinkers to a formalized organization.

    May 26, 2020 - 2 minute read -
    berkun book book review distributed work remote working the-year-without-pants work-from-home blog

  • "Why it burns when you P" and other statistics rants

    May 19, 2020

    Do you sometimes Google for something only to find stuff written by yourself?
    I teach a course called “data-based decision making.” While googling for examples of statistics misuse, I stumbled upon an interesting blog post that I wrote about one and a half years ago.

    The post is so good; I decided to post it again.


    “Sunday grumpiness” is an SFW translation of Hebrew phrase that describes the most common state of mind people experience on their first work weekday. My grumpiness causes procrastination. Today, I tried to steer this procrastination to something more productive, so I searched for some statistics-related terms and stumbled upon a couple of interesting links in which people bitch about p-values.

    Why it burns when you P” is a five-years-old rant about P values. It’s funny, informative and easy to read

    Everything Wrong With P-Values Under One Roof” is a recent rant about p-values written in a form of a scientific paper. William M. Briggs, the author of this paper, ends it with an encouraging statement: “No, confidence intervals are not better. That for another day.”

    Everything wrong with statistics (and how to fix it)” is a one-hour video lecture by Dr. Kristin Lennox who talks about the same problems. I saw this video, and two more talks by Dr. Lennox on a flight I highly recommend all her videos on YouTube.

    Do You Hate Statistics as Much as Everyone Else?” – A Natan Yau’s (from flowingdata.com) attempt to get thoughtful comments from his knowledgeable readers.

    This list will not be complete without the classics:

    Why Most Published Research Findings Are False”, “Mindless Statistics”, and “Cargo Cult Science”. If you haven’t read these three pieces of wisdom, you absolutely should, they will change the way you look at numbers and research.

    *The literal meaning of שביזות יום א is Sunday dick-brokenness.

    May 19, 2020 - 2 minute read -
    p-value statistics blog
  • Visualising Odds Ratio — Henry Lau

    Visualising Odds Ratio — Henry Lau

    May 18, 2020

    Besides being a freelancer data scientist and visualization expert, I teach. One of the toughest concepts to teach and to visualize is odds ratio. Today, I stumbled upon a very interesting post that deals exactly with that

    On Thursday 7 May, the ONS published analysis comparing deaths involving COVID-19 by ethnicity. There’s an excellent summary on twitter but the headline is that when taking into account age and other socio-demographic factors, such as deprivation, household composition, education, health and disability, there is higher risk for some ethnic groups of a COVID related…

    Visualising Odds Ratio — Henry Lau

    May 18, 2020 - 1 minute read -
    data visualisation Data Visualization dataviz odds-ratio reblog blog
  • Calling bullshit on

    Calling bullshit on "persistence leads to success"

    May 14, 2020

    Did you know that J.K. Rowling, the author of Harry Potter, submitted her books 13 times before it was accepted? Did you know that Thomas Edison tried again and again, even though his teachers thought he was “too stupid to learn anything?” Did you know that Lior Raz (Fauda’s creator and lead actor) was an anonymous actor for more than ten years before he broke the barrier of anonymity? What do these all people have in common? They persisted, and they succeeded. BUT, and there is a big but.

    girl wearing pink framed sunglasses

    People keep telling us: follow your dream, and if you persist, it will come true. You will learn from your mistakes, improve, and adapt, and finally, will reach your goal. I call bullshit

    Think of the Martingale betting strategy. In theory, it works. Why doesn’t it work in practice? Because nobody has infinite time and infinite pockets. The same is right with chasing your dream. We need to pay for the shelter above our heads, the food on our tables, the clothes that we wear. Often other people depend on us. Time passes by. I had to be a party pooper, but some people who chase their dreams will eat all their savings and will either have to give up or declare bankruptcy (and then give up).

    Survivorship bias

    Read the story, it’s very educational

    But what about all those successful failers? What we see a typical example of survivorship bias, the logical error of concentrating on the people or things that made it past some selection process and overlooking those that did not, typically because of their lack of visibility. We know the names Rowling, Edison, Raz, and others not because of their multiple failures but DESPITE them. For every Rowling, Edison, and Raz, there are thousands of failed writers, engineers, and actors who ended up broke and caused sorrow to their families.

    So, should I quit?

    I don’t know. Maybe. Maybe not. It’s your life, your decision.

    May 14, 2020 - 2 minute read -
    career career-advise professional-success blog Career advice
  • COVID-19 vs. influenza dataviz. The order is now correct

    COVID-19 vs. influenza dataviz. The order is now correct

    May 12, 2020

    May

    March

    February

    Note about the numbers. While the COVID-19 casualties are based on more-or-less accurate live reports, the flu information is an estimate based on yearly average numbers.

    The code is here

    May 12, 2020 - 1 minute read -
    corona covid covid-19 blog
  • On a person that falls into the water. Or why thinking short-time is a good strategy in times of crisis

    On a person that falls into the water. Or why thinking short-time is a good strategy in times of crisis

    May 11, 2020

    At the beginning of the COVID-19 crisis, I tried to explain to my daughter (and to myself) the rationale behind the draconic measures the governments take to fight with the crisis. One rationalization that I found was an analogy of a person that falls into the water. In this situation, the person needs to act FAST to stabilize the situation. Only than, he or she can start planning their steps.

    I have been very vocal criticizing the dramatic measures that many governments took in the beginning of this crisis. It looks like these measures were more-or-less correct, and that the countries that didn’t implement them are now in a much worse situation, compared to the countries that did impose severe limitations. But even if in the retrospective it will turn out that one could do much better without the many “hammers,” I tend to think that those hammers were inevitable.

    The conclusion? One day or another, we will all need to act very fast. This means that we need to be prepared, have plan B’s work on resilience, and maybe perform emergency drills.

    May 11, 2020 - 1 minute read -
    covid covid-19 crisis blog
  • Inbox Zero

    Inbox Zero

    May 11, 2020

    May 11, 2020 - 1 minute read -
    blog
  • Bad advice from a reputable source is bad advice.

    Bad advice from a reputable source is bad advice.

    May 5, 2020

    Would you buy a grammar book with a clear spelling mistake on its cover? I hope not. That’s what happened to IBM when it published it’s new data visualization guide. I didn’t bother reading the manual because of what IBM decided to use as the first image of their guide.

    We use graphs to transfer information into images that are supposed to be later transformed in our brains to information. What visual attributes do we use to interpret the information behind a pie chart? It is the segment angle, its area, or maybe the arc length? Most probably, the answer is “all of the above” (see Robert Kosara’s works for more info). When done right, the three attributes of pie segments are linearly connected one to another, which allows synergism between the visual clues.
    But what did our friends at IBM do? The deliberately distorted the data! I took the screenshot from the guide homepage and made some measurements.
    The purple segment has the angle of 182 degrees, and the angle of the black segment is 75 degrees, which gives us the ratio of 2.42. However, while the radius of the purple segment is 135 pixels, the radius of the black one is only 110 pixels. Why is this a problem? Well, due to the radius differences, the ratio between the arc lengths is 2.91, and the ratio between the areas is 3.66. So now, let me ask you: what is the ratio between the numbers represented by the purple and the black segments?
    It is correct that the colors that IBM people used in their guide are neat, but data visualization that distorts information is not visualization but a piece of garbage. I assume that IBM produces decent computers, but don’t learn data visualization from them

    May 5, 2020 - 2 minute read -
    bad-practice critique data visualisation Data Visualization dataviz ibm blog
  • Why is it (almost) impossible to set deadlines for data science projects?

    Why is it (almost) impossible to set deadlines for data science projects?

    May 1, 2020

    I wrote this post in 2017. For some reason, it started gaining traffic in the last two weeks. I reviewed this post and couldn’t find any new insights. But maybe you can help me.

    May 1, 2020 - 1 minute read -
    blog
  • Online data science conference on May, 28

    Online data science conference on May, 28

    April 30, 2020

    NDR is a family of machine learning/data science conferences. Their next conference will be held online on May, 28 and the agenda looks great.

    Now, I’m not super objective here, because I’m presenting at NDR July event. But look at the topics, what an impressive selection!

    April 30, 2020 - 1 minute read -
    conference data science machine learning ndr romania blog
  • The quintessence of data visualization usefulness

    The quintessence of data visualization usefulness

    April 27, 2020

    I have to admit, I was skeptical at the beginning of the COVID-19 crisis. I started becoming skeptical now when it seems that the crisis didn’t hit my country too hard. But then I saw the graphs in this Financial Times article, and the skepticism disapeared. The graphs are accompanied by hundreds of words, but there is no need for reading the text to understand almost everything.

    These graphs are so good, so convincing, so well performed, they don’t leave any place for doubt or misunderstanding of the message the author wants to convey.

    If you study data visualization, look at these graphs. Look at the color choice, legend location, and design. Look at the ticks on the X- and Y-axes, how they are spaced and typeset. Note the amount of details on the axes, specifically how sparse these details are.

    April 27, 2020 - 1 minute read -
    covid-19 data visualisation Data Visualization dataviz blog
  • Finally We May Have a Path to the Fundamental Theory of Physics…  and It’s Beautiful — Stephen Wolfram Blog

    Finally We May Have a Path to the Fundamental Theory of Physics… and It’s Beautiful — Stephen Wolfram Blog

    April 27, 2020

    OK, so Stephen Wolfram (a mega celebrity in the computational intelligence world and, among other things a physicist) claims that he may have found a path to the Fundamental Theory of Physics. The blog post is long, and I hope to be able to finish reading it in a week or two. The accompanying technical text is a 450-page tome available on a dedicated site.

    Also, it turns out that Stephen Wolfram has a Twitch.tv channel in which he talks about science.

    Website: Wolfram Physics Project Technical Intro: A Class of Models with the Potential to Represent Fundamental Physics How We Got Here: The Backstory of the Wolfram Physics Project… 26,455 more words

    Finally We May Have a Path to the Fundamental Theory of Physics… and It’s Beautiful — Stephen Wolfram Blog

    April 27, 2020 - 1 minute read -
    physics reblog wolfram blog
  • Book review: Never Split the Difference by Chris Voss

    Book review: Never Split the Difference by Chris Voss

    April 25, 2020

    TL;DR: Dull on the surface but has a lot of good points

    Never_Split_3D_Jacket_copy.png

    I read Never Split the Difference following a friend’s recommendation. While reading the book, I kept feeling a constant sense of disappointment and mental eye-rolling. The author, Chris Voss, is a former FBI negotiator. The book is full of FBI war stories and pieces of advice that, on the top of it, sound either trivial or well known. HOWEVER, when the book was over, I sat summarizing my Kindle notes. Forty-five minutes later, I found myself staring at six pages of handwritten text of notes and takeaways. Which, surely, is a good sign.

    What I didn’t like: too many “war stories” from the author’s past as an FBI negotiator; their connection to the business world sometimes seems too far-fetched.

    What I liked: I liked the overall approach. Sometimes, the author cites academic research. Again, the fact that I took so many notes, is very impressive (to me).

    The bottom line: 4/5 Read it, even if you already read a negotiation book.

    April 25, 2020 - 1 minute read -
    book review netotiations blog
  • The missing graves

    The missing graves

    April 20, 2020

    Today, Israel marks Holocaust Day. Many words have been written about the Holocaust, and I want to write about missing graves.
    If you visit a Jewish cemetery, you might see a lot of gravestones with additional memorial plates.

    I took this picture in the Chișinău (Kishinev) Jewish cemetery. Burial of the deceased is considered the final act of kindness a person can perform to the dead. Erecting a “reminder and a name” (Yad-va-Shem), i.e a gravestone, is an intrinsic part of the burial. The Hebrew term for this act of kindness is “Chesed shel emet” – the truthful kindness. Many people died during the Holocaust without a grave, without a gravestone, and without any sign of kindness around them. That is why, when the Holocaust survivors started passing away after the war, their relatives decided to perform this final act of kindness by adding names of those who did not have the fortune to have their own grave.

    This is the gravestone of my grandmother’s sister Etl (Ester). The lower plate is a list of eleven relatives who never had a grave

    April 20, 2020 - 1 minute read -
    chisinau gravestone holocaust kishinev blog
  • Why is forecasting s-curves hard?

    Why is forecasting s-curves hard?

    April 19, 2020

    Constance Crozier (@clcrozier on Twitter) shared an interesting simulation in which she tried to fit a sigmoid curve (s-curve) to predict a plateau in a time-series. The result was a very intuitive and convincing animation that shows how wrong her initial forecasts were.

    The matter of fact is that this phenomenon is not new at all. My first post-University job involved fitting numerous pharmacodynamics models. We always had to keep in mind that if the available data does not account for at least 95% of the maximum effect, the model will be very much suboptimal. It took me a while, but I managed to find the reference for this phenomenon [here]. Maybe, when I have some time, I will repeat Constance Crozier’s analysis, and add confidence intervals to emphasize the point.

    EDIT: I came the conclusion that the most important takaway message of this demonstration is the necessity of reporting uncertainty with any forecast, and how small the value of a forecast is without uncertainty estimations.

    https://player.vimeo.com/video/408599958?dnt=1&app_id=122963

    S-curves (or sigmoid functions) are commonly used to model the evolution of social or biological systems over time [1]. These functions start with exponential growth, then increase linearly, and finally level off (therefore end up looking like a wonky s). Many things that we think of as exponential functions will actually follow an s-curve (otherwise […]

    Forecasting s-curves is hard — Constance Crozier

    April 19, 2020 - 1 minute read -
    curve-fitting data science forecast forecasting modelling pk-pd repost blog
  • On oranizing a data org in a company, job titles, and more

    On oranizing a data org in a company, job titles, and more

    April 16, 2020

    My colleague, Simon Ouderkik, recorded a REALLY interesting interview with Stephen Levin of Zapier and Emilie Schario of Gitlab on organizing data org in a company, job titles, career ladders, and other important stuff.

    April 16, 2020 - 1 minute read -
    reblog simon blog Career advice
  • If there is only one document you can read about data visualization, this is the one

    If there is only one document you can read about data visualization, this is the one

    April 7, 2020

    I’m sorting my teaching material, and I found this gem. The UK Government Statistical Service published a guideline for effective data visualization and tables. If you know a busy person who doesn’t have time to study data visualization and can only read one document, this document is for them (it has less than 40 pages full of examples). Click o the image above to go to the guideline

    April 7, 2020 - 1 minute read -
    data visualisation Data Visualization dataviz documentation guidelines blog
  • Older posts Newer posts