• Meet me at the online data science / AI conference

    Meet me at the online data science / AI conference

    July 16, 2020

    I will be talking about data visualization at the next NDR conference on July 28. All the conferences organized by the NDR team are well organized and of a very high value. I hope to keep the level high.

    And here’s the brief description of my talk

    See you

    July 16, 2020 - 1 minute read -
    conference data visualisation Data Visualization ndr speaking blog
  • 35 (and more) Ways Data Go Bad — Stats With Cats Blog

    35 (and more) Ways Data Go Bad — Stats With Cats Blog

    July 14, 2020

    If you plan working data analysis or processing, read the excellent post in the “stats with cats blog” titled “35 Ways Data Go Bad” post. I did experience each and every one of the 35 problems. However, this list is far from being complete. One should add the comprehensive list of Falsehoods Programmers Believe About Time.

    When you take your first statistics class, your professor will be a kind person who cares about your mental well-being. OK, maybe not, but what the professor won’t do is give you real-world data sets. The data may represent things you find in the real world but the data set will be free of errors. […]

    35 Ways Data Go Bad — Stats With Cats Blog

    July 14, 2020 - 1 minute read -
    data science statistics blog
  • Unexpected hitch of working in a distributed team

    Unexpected hitch of working in a distributed team

    July 13, 2020

    It has been about half a year after I became a freelance data scientist. Before my career change, I worked in a distributed team for more than five years. Today, I suddenly realized that working in a distributed team has a significant problem, inherent to its distributed, multinational, nature.

    My team was always spread over multiple time zones. Sometimes, the time zone span was so broad, that we could never find a time slot where all the team members were ordinarily awake. Automattic, the company I used to work for, is a firm believer in asynchronous communication, but from time to time, you HAVE to meet over a Zoom/Slack/Whatever call. Since I wasn’t a manager, the number of live calls that I had to attend was kept to a minimum, and yet, I found myself at least twice a week in a 10 pm Zoom call. I don’t know what about you, but my brain keeps working for at least two outs after log off. Thus, twice a week, I would find myself going to bed after one o’clock at night. As a result, I was sleep deprived for the majority of the week.

    Only now have I noticed the fact that my sleep has improved so much after the career change. I know that people who work in “colocated” teams also find themselves in late night phone calls, but working in a distributed group means that you’ll do it regularly.

    July 13, 2020 - 1 minute read -
    distributed work remote working sleep work-from-home blog
  • Hybrid digital/analog tangible week planning

    Hybrid digital/analog tangible week planning

    July 12, 2020

    Here’s a neat method that helps me organize my week, increase my productivity and fight procrastination.

    Being a freelancer data scientist, I’m involved in three hands-on projects for two clients. I also manage/mentor two data scientists in two other projects, and participate in strategic discussions for a customer of mine, and in a startup in which I invest. Oh, I am also in the final stages of writing a paper. I never imagined I would be in the situation with so many balls that I need to keep in the air. How do I manage to keep sanity?

    This is what I do. Following the advice in “15 Secrets Successful People Know About Time Management”, I try to keep as many items in my calendar as possible. When my workweek starts, I print out the weekly schedule on a sheet of paper. Then, I apply the tangible GTD hack that I learned from another book [link] and write out all my projects on a bunch of small post-it notes. These notes allow me “dumping” all my brain contents into an external medium, which frees up my brain to spend more CPU cycles on processing, rather than remembering and worrying.

    Next comes the fun part, I get to play with my cards by arranging them on the weekly schedule. The geometry of the post-it notes and the sheet of paper ensures that I allocate reasonably larget chunks of time for each “big thing.” It also reminds me that the amount of time each day is limited, and I can’t stick too many plans into a day or a week. (No, I won’t be able to finalize the paper, complete the analysis for a retail shop, learn a chapter in Bayesian statistics book, before the end of today).

    After I’m done, I copy each post-it note into my calendar. Thanks to the integration with Todoist (an excellent productivity tool), all these tasks end up in my todo list, where I can further work with them.

    To sum up:

    • Global week overview - check
    • Prioritization and honesty - check.
    • Fun playing with sticky notes - check.
    • Work gets done - (I wish!).

    Oh, did you notice the appointments between 5 and 6 am? This is my sports activity. Sometimes working out charges me for the entire day. Sometimes, all I want to do for the entire day is to have a nap :-)

    July 12, 2020 - 2 minute read -
    gtd planning procrastination productivity time-management blog Productivity & Procrastination
  • Before and after. Even excellent graphs can be improved

    Before and after. Even excellent graphs can be improved

    June 30, 2020

    Being a data visualization consultant, I can’t help looking for dataviz problems in graphs that I see. Even if the graph is good. Even if I know that I would not be able to create a graph that good. Even if the overall graph is excellent, and the problems are minor, or maybe especially when the graph is excellent, and the problems are minor.

    This is a nice graph published by Nevo Benita on Linkedin.

    The graph presents the gap between the men and the women in the Israeli job market. As I said, the graph is excellent. However, there are several small problems that, like grains of sand in a chocolate mousse, stand in the way. Let’s take a look at them.

    The time-series line in the upper right part of the graph shows good use of the real estate. The problem is that the X-axis ticks (the years) look as if they belong to the chart below. It takes some time to realize that the numbers are years of the upper graph, and not the X-axis of the graph below. Moving the numbers upwards by several pixels would have fixed that.

    *Now, it is more clear that “1990” and “2018” relate to the time-series graph above.
    Before (left) and after (right). *

    Let’s talk about the left-side bar chart. It took me a while to understand what it is. As a matter of fact, I managed to write a critique paragraph about that bar chart, how it is unclear what the percentages are, and how they were computed. Only then had I noticed the explanation below. Such confusion isn’t the viewer’s fault. Since we usually scan images from top to bottom, moving the title to the top of the chart will reduce this confusion. The word “percent” is also redundant in that title since it comes after the percent sign.

    Moving the explanation to the top makes it easier to notice. Before (left) and after (right)

    The last point that is worth optimizing is the color order. Consistent element order in an image makes navigation and comprehension much easier. When the order is preserved, our brain can use mental shortcuts without losing much information. When these shortcuts are broken, the brain has to work harder. What am I talking about? The graph author made the correct decision to use different font colors in the graph title to specify which color stands for which gender. This way, we don’t need a separate legend, and this is good. The title is an ordered sequence of words. The visualizer could use this order to create the order heuristic that is so helpful. Such a heuristic isn’t always possible. Fortunately for the visualizer (and sadly for the society), the salary gap in all the occupations in this graph have the same direction: men earn more than women. As a result, the rightmost part has all the green dots on the right, and the purple dots are on the left. This direction is opposite to the gender direction in the title and the color direction in the bar chart. To fix this situation, I made sure that the color that stands for the women (purple) is always to the left of the color that designates the men (green).

    Keeping the color order. Before (left) and after (right)

    So, this is the final result. I hope you can see why I like it better.

    That’s how I took and excellent graph and made it even more awesome.

    June 30, 2020 - 3 minute read -
    before-after data visualisation Data Visualization dataviz blog
  • Data visualization is not only dots, bars, and pies

    Data visualization is not only dots, bars, and pies

    June 28, 2020

    Look at this wonderful piece of data visualization (taken from here). If you know the terms “tertiary structure” and “glycan”, there is NO way you miss the message that the author of this figure wanted to convey.

    Also, note how using appropriate colors in the title, the authors got rid of graph legend.

    June 28, 2020 - 1 minute read -
    data visualisation Data Visualization dataviz legend molecular-structure blog
  • Multilingual protest in Acre, Israel

    Multilingual protest in Acre, Israel

    June 27, 2020

    June 27, 2020 - 1 minute read -
    blog
  • How to become a Python professional in 42 hours?

    How to become a Python professional in 42 hours?

    June 25, 2020

    Here’s an appealing ad that I saw

    This image has an empty alt attribute; its file name is image-2.png

    How to become a Python professional in 42 hours? I’ll tell you how. There is no way. I don’t know any field of knowledge in which one can become professional after 42 hours. Certainly not Python. Not even after 42 days. Maybe after 42 weeks if that’s mostly what you do and you already a programmer.

    June 25, 2020 - 1 minute read -
    ad career python blog
  • Standardizing bidirectional language support in interfaces and visualization

    Standardizing bidirectional language support in interfaces and visualization

    June 24, 2020

    I’m honored to take part in standardizing bidirectional language support in interfaces and visualization, as a part of an expert group formed for the Hebrew Support in Computerized Systems Committee at the SII-the standards institution of Israel.

    The Committee is led by Gilad Almosnino. Below is Gilad’s project announcement.

    June 24, 2020 - 1 minute read -
    RTL sii blog
  • Book review. Five Stars by Carmine Gallo

    Book review. Five Stars by Carmine Gallo

    June 22, 2020

    TL;DR Good motivation to improve communication. Inadequate source of information on how to achieve that

    The central premise of Five Stars Communication Secrets to Get from Good to Great by Carmine Gallo is that professionals who don’t invest in communication skills are at high risk of being replaced by computers and robots. One of the book’s sections bares the title that summarises this premise very well “Storytelling isn’t a soft skill; it’s the equivalent of hard cash.” I firmly believe in these premises. That is why I invest so much time in learning and teaching data visualization, in public speaking, and blogging.

    When I started reading this book, I got excited. I kept marking one passage after another. Gallo packed the first part of the book with numerous citations and explanations on how a lack of communication skills is the most severe risk factor in the career of a modern professional, team, or company. One example leads to another one, and one smart conclusion followed another one.

    Then, I started noticing that the book tries to convince me more and more, but I didn’t need that convincing in the first place. More than half of the book is evangelism. The author tells you how essential communication skills are, then he gives you some examples of people who did it right, and then again talks on importance. Again, and again, and again. Where are all those “secrets to get from good to great”???

    When, finally, we get to the practical parts, the reader is left mostly with shallow, almost trivial bits of advice.

    Some of the most important points I took from this book

    Slight feeling of a hamster-wheel while reading this book

    Adopt the three-act storytelling approach to presentations. The three-act storytelling approach worked for Homerus, Shakespear, Tarantino, and there is no reason it should fail you in your technical presentations. Fair enough. On the other hand, this 2012 article by Nancy Duarte, provides more depth and more actionable information on this approach (follow Duarte’s blog if presentation skills are something you are interested in).

    “In the first two to three minutes of a presentation, I want people to lean forward in their chairs.” I like this citation by Avinash Kaushik, Google’s digital marketing evangelist. I will undoubtedly try this approach in my next presentations.

    Should you read this book?

    If you read these lines, your job depends on your communication and presentation skills. If you believe this premise, you can skip the first 60% of the book. If you want to improve your communication skills, I suggest reading Jean-luc Doumont’s “Trees, Maps, and Theorems,” which is much shorter, but also much denser in methods and practical advice.

    The bottom line

    3.5/5

    June 22, 2020 - 2 minute read -
    book review presentation-tip public speaking blog
  • The delicate art of fine trolling

    The delicate art of fine trolling

    June 15, 2020

    I’m reading the a 1991 paper by Barbara Tversky that deals with the directional representation of time. One sentence in the paper interview says

    “There does not seem to be strong universal cognitive associations of quantity or quality to left or right”

    Whenever I make a similar statement in the context of data visualization, I frequently get a self-assured response “of course there is - smaller numbers appear on the left!”. To answer this remark, Barbara Tversky added a small footnote that says

    “Anyone in doubt should consult politicians on both the left and the right.”

    Photo by Pixabay on Pexels.com

    So gentle, yet so powerful.

    June 15, 2020 - 1 minute read -
    paper RTL trolling blog
  • Lie factor in ad graphs

    Lie factor in ad graphs

    June 9, 2020

    It’s fun to look at the visit statistics and to discover old stories. I wrote this post in 2016. For a reason I don’t know, this post has been one of the most viewed posts in my blogs during the last week.

    So, I decided to publish it again. I won’t add any new examples, but if you want to see more stuff, type [lying with data visualization] in your favorite search engine

    Lie factor in ad graphs

    What do you do when you have spare time? I tend to throw graphs from ads to a graph digitizer to compute the “lie factor”. Take the following graph for example. It appeared in an online ad campaign a couple of years ago. In this campaign, one of the four Israeli health care providers bragged about the short waiting times in their phone customer support. According to the Meuheded (the health care provider who run the campaign), their customers had to wait for one minute and one second, compared to 1:03, 1:35, and 2:39 in the cases of the competitors. Look how dramatic the difference is:

    Screen Shot 2018-02-16 at 18.34.38

    The problem?

    If the orange bar represents 61 seconds, then the dark blue one stands for 123 seconds, almost twice as much, compared to the actual numbers, the green bar is 4:20 minutes, and the light-blue one is approximately seven minutes, and not 2:39, as the number says.

    Screen Shot 2018-02-16 at 18.32.53

    I can’t figure out what guided the Meuhedet creative team in selecting the bar heights. What I do know that they lied. And this lie can be quantified.

    June 9, 2020 - 2 minute read -
    data visualisation Data Visualization dataviz lie lie-factor blog
  • StellarGraph — another promising network analysis library for Python and Scala

    StellarGraph — another promising network analysis library for Python and Scala

    June 8, 2020

    Network (graph) analysis is a complicated topic. There are several tools available for this task with different pros and cons. Recently, I stumbled upon another tool StellarGraph. StellarGraph authors claim to provide excellent performance; NumPy, Pandas, TensorFlow integration, an impressive set of algorithms, inter compatibility with Neo4j (THE graph database); and much more. The documentation looks very clear and extensive too.

    I didn’t use it yet, but I certainly plan to.

    https://www.stellargraph.io

    June 8, 2020 - 1 minute read -
    igraph neo4j network-analysis networkx social-network-analysis stellargraph blog
  • The hazard of being a wizard. On balance between specialization and the risk to become obsolete.

    The hazard of being a wizard. On balance between specialization and the risk to become obsolete.

    June 3, 2020

    A wizard is a person who continually improves his or her professional skill in a particular and defined field. I learned about this definition of wizardness from the book “Managing project, people and yourself” by Nikolay Toverosky (the book is in Russian).

    Recently, Nikolay published an interesting post about the hazards of becoming a wizard. The gist of the idea is that while you are polishing your single skill to perfection, the world changes. You may find your super-skill irrelevant anymore (see my Soviet Shoemaker story).

    Nikolay doesn’t give any suggestions. Neither do I.

    Below is the link to the original post. The post is in Russian, and you can use Google Translate to read it.

    Страница о магах У меня в книге есть глава про полководцев и магов. В её конце я подвожу итог: Несмотря на свою кру­тость, маг уяз­вим. Он поле­зен, только если его навык под­хо­дит к задаче. 658 more words

    Почему опасно быть магом — Об управлении проектами и дизайне

    June 3, 2020 - 1 minute read -
    diversity obsolete skills blog Career advice
  • Nice but useless data visualization

    Nice but useless data visualization

    June 2, 2020

    Network visualization can mesmerize and hypnotize. Chord diagrams are especially cool because they are so colorful and smooth. The problem is that sometimes, the result doesn’t provide any actual value, and serves as a cute illustration. Cute illustrations are cute; they help put some “easiness” to the text without the risk of looking too unprofessional.

    Take the two examples below.

    One example (taken from here) shows worldwide migration patterns in a clear and useful way. You can take a look at the graph and make real conclusions.

    The other example (taken from here) is mostly a useless illustration.

    The only “conclusion” that a viewer can make out of this graph is “everything is connected with everything.”

    This type of conclusion is OK for an ad or a general overview of a problem, but it is NOT a valid way to end a discussion.

    June 2, 2020 - 1 minute read -
    data visualisation Data Visualization dataviz
  • Bioinformatics career advice and a story about a Soviet shoemaker

    Bioinformatics career advice and a story about a Soviet shoemaker

    June 1, 2020

    When I was in elementary school (back in the USSR of the mid 80’s), I had a friend whose father was a shoemaker. Due to the crazy stupid way the Soviet economy worked, a Soviet shoemaker was much richer than a physician or an engineer. But this is not the story. The story is that one day this friend’s father had a chat with me about selecting a profession. This man’s point was that for as long as people have feet and need shoes on their feet, a shoemaker would be required and well-earning occupation. Guess what? People still have feet, and still, ware shoes, but I don’t see too many successful shoemakers anymore.

    Common wisdom says, “It is very hard to predict, especially the future.” And I will add “even more especially, about the job market.”. Nevertheless, people need to decide what to do with their lives, how to live, and what career paths to pursue. Some of them ask me, and I’m glad to answer. If you have any career-related questions, don’t be shy! Write to boris@gorelik.net, and I’ll see what wisdom I will be able to share with you.

    Anyhow, this is a letter that I got from another pharmacist looking for a data science career.

    Hope you are doing well. I saw your posts on Quora and thought of asking a doubt.
    First let me tell my background. I am from India, I completed my Doctor of Pharmacy program (Pharm D). I am familiar with computer programming. I have intermediate knowledge in python and R programming. So I thought taking up Bioinformatics and computational biology Masters program so that I can connect Pharma industry and my knowledge in computer science. What do you think? I have applied to University XYZ and got offer letter. I have to take a decision within 2 weeks.
    Please let me know your thoughts on this.

    To which I replied

    Obviously, since the path you are describing similar to the one I took, I will think that it is a good idea. Moreover, as you might have read in my blog (for example, here), my opinion is that advanced degrees give much more stable foundations, compared to the “fast and easy” courses. Having said that, your life is yours, not mine, and the job market today is not the job market in 2001 when I graduated my B.Pharm.

    Thank you so much for replying to my silly question. I am honoured to get a response from you.

    First of all, I don’t believe in “there are no silly questions” bullshit, but asking a silly question is better than not asking at all. Secondly, these questions are not silly at all.

    I have a question, in your post dated 2017, you have mentioned that Bioinformatics was booming in 2001 and now it has lost its significance. Are you still have the same thoughts?

    I think that this person refers to the most visited post of mine “Don’t study data science as a career move; you’ll waste your time!”. There is also a 2019 follow-up.

    If that is the case then me taking a master’s in bioinformatics and computational genomics would be a bad idea, right ?

    Here’s what I responded. Keep in mind that I wrote this before the COVID-19 outbreak.

    Look, the markets in different countries are different.

    Back in the old days, there was a worldwide wave of closing bioinfo companies. All the Israeli ones were either closing or counting weeks before closing. One anecdote: I was interviewing at a company. Two weeks later, I called the person who interviewed me to ask whether I got the job or not, and the secretary told me that that person was fired due to layoffs.

    Right now, Israel sees a renaissance of bioinformatics companies, but I don’t know what will happen in the future. These companies live mostly out of investors’ money and are subject to strict regulations. However, if you get a good education, your head will be full of useful mental models, relevant basic knowledge, and good practices.

    End of quote. One of The COVID-19 madness side effects is the massive influx of money into biotech companies. Is this a short-term anecdote, or will it become a sustainable trend? I have no idea.

    Do you have any career-related questions to me? You don’t have to be a pharmacist to ask :-). Write to boris@gorelik.net. I promise to respond, even if by sending a link to my blog posts.

    June 1, 2020 - 4 minute read -
    bioinformatics data science careers blog Career advice
  • The difference between statistically meaningful and practically meaningful. An interview with me

    The difference between statistically meaningful and practically meaningful. An interview with me

    May 28, 2020

    Recently, I gave an interview to the Techie Leadership site. Andrei Crudu, the interviewer, made a helpful outline of the conversation. I marked the most important parts in bold.

    • Academic views on leadership;
    • Managing people isn’t for everyone;
    • Lessons from a practical approach;
    • Data Science is predominantly about data cleaning;
    • The difference between statistically meaningful and practically meaningful;
    • How sometimes companies tweak results to match expectations;
    • Bad managers make you appreciate the good managers;
    • Giving credit, being decent and not cheating;
    • All good teamwork starts with effective communication;
    • You don’t know that the stuff that you know is unknown to others;

    Overall, I enjoyed chatting with Andrei, and I hope you’ll enjoy listening to the interview. If you have any comments, feel free sharing them here or on the Techie Leadership size

    https://techieleadership.com/the-difference-between-statistically-meaningful-and-practically-meaningful-with-boris-gorelik-009/

    May 28, 2020 - 1 minute read -
    interview leadership podcast significance blog Career advice
  • Is Distributed Work a Divide and Conquer Strategy?

    Is Distributed Work a Divide and Conquer Strategy?

    May 27, 2020

    Before becoming a freelance data scientist, I used to work at Automattic, which I used to regard as my dream job. Not every current and ex-Automattician share that rosy point of view. Antimattic is an anonymous blog that allows ex-Automattic employees to vent their feelings about what used to be their workplace. One recent post on that blog raises a fascinating question about distributed (or work from home, or remote) companies. “Is Distributed Work a Divide and Conquer Strategy?” I have to admit that I haven’t thought about this perspective before. It looks like we will see more and more companies switching to remote work. It’s an interesting interpretation of the “future of work.”

    Obviously this site exists because people have had negative experiences at Automattic. But many people have also had very positive experiences at the company. Could it be that the distributed nature of Automattic allows for such varying experiences? 45 more words

    Is Distributed Work a Divide and Conquer Strategy? — Antimattic

    May 27, 2020 - 1 minute read -
    antimattic automattic distributed work blog
  • Logarithmic scale misinforms. Period

    Logarithmic scale misinforms. Period

    May 26, 2020

    Being a data scientist and a self-proclaimed data visualization expert, I like using log scale graphs when I find them appropriate. However, as a speaker and a communicator, I refrain from using them in presentations as much as possible. From my experience as a data visualization lecturer, I noticed that even “technical” struggle grasping the concept of log scale graphs.

    One of the Coronavirus side effects was the introduction of the term “exponential growth” to every living room. Naturally (to some of us), exponential growth is best presented using a semi-log graph, where the X-axis represents the time (linear), and the Y-axis represents the degree of magnitude of a value (log scale).

    A recent study (link) tested and demonstrated how bad log-scale is. The research title is “The Logarithmic Scale Misinforms the Public and Affects Policy Preferences.” From my experience, log scale graphs misinform everybody. Except for experienced data scientists. Nothing can confuse or misinform us, obviously ;-)

    It is a bummer though that data visualization in that paper sucks so much.

    Don’t publish graphs like this. Especially not in data visualization papers.

    Thanks to Bella Graph who pointed me to the original study.

    May 26, 2020 - 1 minute read -
    data visualisation Data Visualization dataviz log-scale blog
  • Book review: The Year Without Pants. WordPress.com and the future of work by Scott Berkun

    Book review: The Year Without Pants. WordPress.com and the future of work by Scott Berkun

    May 26, 2020

    TL;DR Interesting “history of work” book (definitely not “future of work”) with insights on transition-state organizations. Read it if history of work is your thing, or if you work in a small company that grows rapidly. 4.5/5 (due to the personal connection)


    **

    I got The Year Without Pants in 2014 as an onboarding present when I joined Automattic. The author, Scott Berkun, used to work as a manager at Microsoft (and maybe more places) before he quit and became a career of an adviser and an author. In 2011, the Automattic founder brought Scott to work at the company. About seventy people were working in the company back then and the company was growing rapidly. Automattic has just introduced a concept of teams, and the idea was that Scott will work as a team leader, consulting the management on how to deal with the transition.

    Being an ex-Microsoft manager, Scott was fascinated by the small distributed company, and wrote a book on it, proclaiming that the way Automattic worked was “the future of work”.

    The book was published in 2012. Today, in post-COVID 2020, nobody is surprised by people who don’t need to go to the office every day. Automattic has now more than 1,000 employees and has adopted many of the rituals big companies have, such as endless meetings, tedious coordination, name tags, and corporate speak.

    Why, then, did I enjoy the book? First, for me, it was a pleasant “time travel.” I enjoyed reading about people I knew, teams I worked with, and practices I used to love or hate. Secondly, this book provides insights on a transition from a small group of like-thinkers to a formalized organization.

    May 26, 2020 - 2 minute read -
    berkun book book review distributed work remote working the-year-without-pants work-from-home blog

  • "Why it burns when you P" and other statistics rants

    May 19, 2020

    Do you sometimes Google for something only to find stuff written by yourself?
    I teach a course called “data-based decision making.” While googling for examples of statistics misuse, I stumbled upon an interesting blog post that I wrote about one and a half years ago.

    The post is so good; I decided to post it again.


    “Sunday grumpiness” is an SFW translation of Hebrew phrase that describes the most common state of mind people experience on their first work weekday. My grumpiness causes procrastination. Today, I tried to steer this procrastination to something more productive, so I searched for some statistics-related terms and stumbled upon a couple of interesting links in which people bitch about p-values.

    Why it burns when you P” is a five-years-old rant about P values. It’s funny, informative and easy to read

    Everything Wrong With P-Values Under One Roof” is a recent rant about p-values written in a form of a scientific paper. William M. Briggs, the author of this paper, ends it with an encouraging statement: “No, confidence intervals are not better. That for another day.”

    Everything wrong with statistics (and how to fix it)” is a one-hour video lecture by Dr. Kristin Lennox who talks about the same problems. I saw this video, and two more talks by Dr. Lennox on a flight I highly recommend all her videos on YouTube.

    Do You Hate Statistics as Much as Everyone Else?” – A Natan Yau’s (from flowingdata.com) attempt to get thoughtful comments from his knowledgeable readers.

    This list will not be complete without the classics:

    Why Most Published Research Findings Are False”, “Mindless Statistics”, and “Cargo Cult Science”. If you haven’t read these three pieces of wisdom, you absolutely should, they will change the way you look at numbers and research.

    *The literal meaning of שביזות יום א is Sunday dick-brokenness.

    May 19, 2020 - 2 minute read -
    p-value statistics blog
  • Visualising Odds Ratio — Henry Lau

    Visualising Odds Ratio — Henry Lau

    May 18, 2020

    Besides being a freelancer data scientist and visualization expert, I teach. One of the toughest concepts to teach and to visualize is odds ratio. Today, I stumbled upon a very interesting post that deals exactly with that

    On Thursday 7 May, the ONS published analysis comparing deaths involving COVID-19 by ethnicity. There’s an excellent summary on twitter but the headline is that when taking into account age and other socio-demographic factors, such as deprivation, household composition, education, health and disability, there is higher risk for some ethnic groups of a COVID related…

    Visualising Odds Ratio — Henry Lau

    May 18, 2020 - 1 minute read -
    data visualisation Data Visualization dataviz odds-ratio reblog blog
  • Calling bullshit on

    Calling bullshit on "persistence leads to success"

    May 14, 2020

    Did you know that J.K. Rowling, the author of Harry Potter, submitted her books 13 times before it was accepted? Did you know that Thomas Edison tried again and again, even though his teachers thought he was “too stupid to learn anything?” Did you know that Lior Raz (Fauda’s creator and lead actor) was an anonymous actor for more than ten years before he broke the barrier of anonymity? What do these all people have in common? They persisted, and they succeeded. BUT, and there is a big but.

    girl wearing pink framed sunglasses

    People keep telling us: follow your dream, and if you persist, it will come true. You will learn from your mistakes, improve, and adapt, and finally, will reach your goal. I call bullshit

    Think of the Martingale betting strategy. In theory, it works. Why doesn’t it work in practice? Because nobody has infinite time and infinite pockets. The same is right with chasing your dream. We need to pay for the shelter above our heads, the food on our tables, the clothes that we wear. Often other people depend on us. Time passes by. I had to be a party pooper, but some people who chase their dreams will eat all their savings and will either have to give up or declare bankruptcy (and then give up).

    Survivorship bias

    Read the story, it’s very educational

    But what about all those successful failers? What we see a typical example of survivorship bias, the logical error of concentrating on the people or things that made it past some selection process and overlooking those that did not, typically because of their lack of visibility. We know the names Rowling, Edison, Raz, and others not because of their multiple failures but DESPITE them. For every Rowling, Edison, and Raz, there are thousands of failed writers, engineers, and actors who ended up broke and caused sorrow to their families.

    So, should I quit?

    I don’t know. Maybe. Maybe not. It’s your life, your decision.

    May 14, 2020 - 2 minute read -
    career career-advise professional-success blog Career advice
  • COVID-19 vs. influenza dataviz. The order is now correct

    COVID-19 vs. influenza dataviz. The order is now correct

    May 12, 2020

    May

    March

    February

    Note about the numbers. While the COVID-19 casualties are based on more-or-less accurate live reports, the flu information is an estimate based on yearly average numbers.

    The code is here

    May 12, 2020 - 1 minute read -
    corona covid covid-19 blog
  • On a person that falls into the water. Or why thinking short-time is a good strategy in times of crisis

    On a person that falls into the water. Or why thinking short-time is a good strategy in times of crisis

    May 11, 2020

    At the beginning of the COVID-19 crisis, I tried to explain to my daughter (and to myself) the rationale behind the draconic measures the governments take to fight with the crisis. One rationalization that I found was an analogy of a person that falls into the water. In this situation, the person needs to act FAST to stabilize the situation. Only than, he or she can start planning their steps.

    I have been very vocal criticizing the dramatic measures that many governments took in the beginning of this crisis. It looks like these measures were more-or-less correct, and that the countries that didn’t implement them are now in a much worse situation, compared to the countries that did impose severe limitations. But even if in the retrospective it will turn out that one could do much better without the many “hammers,” I tend to think that those hammers were inevitable.

    The conclusion? One day or another, we will all need to act very fast. This means that we need to be prepared, have plan B’s work on resilience, and maybe perform emergency drills.

    May 11, 2020 - 1 minute read -
    covid covid-19 crisis blog
  • Older posts Newer posts