• Career advice. Becoming a freelancer immediately after finishing a masters degree

    Career advice. Becoming a freelancer immediately after finishing a masters degree

    September 8, 2020

    Will Cray [link] is a fresh M.Sc. in Computer Science and considers becoming a freelancer in the Machine Learning / Artificial Intelligence / Data Science field. Will asked for advice on the LocallyOptimistic.com community Slack channel. Here’s will question (all the names in this post are used with people’s permissions).

    Read more career advices [here].

    Let’s begin.

    Will Cray

    I’m hoping to start a career as a freelancer in the AI space after finishing my Master’s in CS with a focus in AI. I don’t, however, have any industry experience in AI or data science. Do you all think it’s feasible to start a freelancing career without any industry experience? If so, do you have any tips on how to do it successfully?
    [I worked for] two years at a major tech company, but I was a systems engineer. It was experience that isn’t necessarily relevant to what I want to work on as a freelancer.

    Let’s divide the response to Will’s questions into two parts that correspond to Slack’s two discussion threads.

    Thread #1 - Michael Kaminsky

    This is a copy/paste from Slack.

    Michael Kaminsky

    LocallyOptimistic.com – a valuable source for data folks

    My hunch is that it’s going to be pretty tough to get started, though not impossible. You’re probably looking at a pretty lean year or two to build up a reputation out of the gate

    Michael Kaminsky

    AI work in general is sort of difficult to contract out — so you might have more luck if you team up with a larger consulting outfit that can handle the other non-AI parts of the work

    Michael Kaminsky

    very rarely is someone like “we have all of the data pipeline and pieces working, now we just need to hire someone to do the AI part” — in general, the model-fitting part of an AI project is the easiest and fastest

    Will Cray

    Thank you so much for the info–it’s really helping me getting a better understanding of the landscape. Would your opinion, especially regarding that last message, change if the AI work I was doing was more custom model/agent design and training, rather than doing something quick like .fit() in sklearn?

    Michael Kaminsky

    ummm maybe? but like who needs custom model/agent design and training that doesn’t already have in-house data scientists working on it?

    Michael Kaminsky

    I don’t want to dissuade you, but my point is that you should think about who your customers are, and how you can market your services in such a way that it will provide them value. If you don’t have a clear map of the three concepts in italics, it could get rough — you can definitely figure it out by doing it, but that’s what you’ll be up against

    Will Cray

    You mentioned “larger consulting outfits” earlier–do you have any examples of organizations that you think could be a good fit?

    Michael Kaminsky

    so Brooklyn Data Company and 4 mile consulting are the two that jump to my mind — they specialize in BI and data but might want flex capacity into DS — they might be able to give you deal flow, etc. I know there are a number of others, maybe even folks in this channel

    Thread #2 - Boris Gorelik

    This is a copy/paste from Slack with some later edits and additions.

    Boris Gorelik

    Another thing to consider is what your risks are. If there are people who depend on you financially, starting with a freelance career might be too risky, especially if you don’t have 1-2 (better 2) customers who already committed to paying you for your services.

    If you can afford several months without a steady income, or no income at all, being a freelancer might expose you to a larger variety of companies and business models in the market. I know some people who used to work as freelancers and gradually “adopted” one customer and moved to full employment. In these cases, freelance projects were, in fact, mutual trial periods where both sides decided whether there is a good fit.

    Will Cray

    I greatly appreciate this insight. I have little risks. I’m single, my living expenses are low, and I have some financial runway. Part of the reason I like the idea of freelancing is for the reason you stated–I’ll get to see many different business models. As an aspiring entrepreneur, I think diversity of experiences and exposure would be useful to me. I also think being flexible in how many hours I work will allow me to allocate more time to developing my own ideas/projects; although, I understand that’s a luxury that comes with being an established freelancer. I don’t have any clients currently. Do you have any recommendations for channels to try and garner clients?

    Boris Gorelik

    As an aspiring entrepreneur, I think ….

    Even though a freelancer and an entrepreneur’s legal status may be the same, they are different occupations and careers. An entrepreneur creates and realizes business models; a freelancer sells their time and expertise to fulfill someone else’s ideas. That’s true that most of the time (not always), combining freelance with entrepreneurship is easier than combining entrepreneurship with being a full-time employee in a traditional company.

    Do you have any recommendations for channels to try and garner clients?

    Nothing except the regular facebook/linkedin/ but mostly friends and former coworkers and, in your case, teachers/lecturers. I got my first job interview via my Ph.D. advisor. Later, when I helped in hiring processes, I asked him and other professors to refer me to proper candidates. So yeah, make sure your professors know your status.

    September 8, 2020 - 4 minute read -
    career career-advise freelance blog Career advice
  • Exploring alternatives to population pyramids

    Exploring alternatives to population pyramids

    September 2, 2020

    A population pyramid also called an “age-gender-pyramid”, is a graphical illustration that shows the distribution of various age groups in a population (typically that of a country or region of the world), which forms the shape of a pyramid when the population is growing [citation from Wikipedia].

    In some cases, the pyramid provides interesting insights into the entire population. In this post, I will explore ways to make some of these insights more visible.

    The basic case

    Let’s start with the basic case. If you have two-three hours of spare time, you can go to the site devoted to population pyramids – https://www.populationpyramid.net. There, you will find population pyramids for every country in the world. The site provides present and past data, as well as future forecasts. To understand how insightful age pyramids can be, look at the graph that represents the entire world.

    (this and most other images in this post are from the site http://populationpyramid.net/)

    You can clearly see that the world is mostly young, that the amount of people declines as the age progresses, and that there is a rough balance between men and women in the world, at least before the ages of 70+.

    Now, examine the stark difference between the populations of Western Africa and Western Europe. Citing the late professor Hans Rosling, we can still see two worlds, one with large families and short lives, and one with small families and long lives.

    Another starking example of an age pyramid is the following

    Do you want to guess what country is that? This particular graph shows the age distribution of the United Arab Emirates. Such a vast distortion in symmetry and age distribution stems from the fact that more than 80% of the UAE’s population is composed of expats who come to this rich country to work. The pyramid below (taken from [this article]) sheds some light to the population composition of UAE. (Note that the genders in this graph are reversed).

    Whose bar is longer?

    The male-female disbalance in the UAE and some other Gulf countries is very striking and cannot be missed. But what about other, more subtle cases? Take a look at the world graph above. If you follow the numbers on the bars, you will notice that more boys are born than girls, but there are more old ladies than old gents in the world. Can we make such differences less subtle?

    To answer this question, we need to understand why we find it hard to compare almost equal bars. The reason for that is that our eyes (or brains) are not so good at comparing sizes. They do, however, do a much better job comparing positions. Thus, if we overlap these bars, we will see the small differences in a much more precise manner.

    (I thank the data visualization expert Bella Graf from InfoServiz.co.il for the idea of this graph).

    Now, the subtle differences in gender composition are more visible.

    What am I looking at?

    When I teach data visualization, I always tell my students to add a meaningful title to the graph. By “meaningful,” I mean a title that does not answer the question “what” but rather “so what”? (See my posts “How to suck less in data visualization,” and “C for conclusion”). What would a good title for this graph be? Let’s try the following

    OK, so now, when we have a title, we can ask ourselves, “does the graph show what it says it shows”? And the answer is no. Right now, the title talks about differences, but we don’t see the differences. We see the differences and other stuff. Let’s look only at the differences.

    I don’t like this.

    What about this?

    Now, this is not an age pyramid. That’s for sure. This graph doesn’t show the wealth of data that the classical pyramid shows. On the other hand, it does offer one thing, and it does it very well. Look, for example, at the male/female distortion in China in 1990.

    You may find the code I used to create the graphs in this post [on GitHub].

    September 2, 2020 - 3 minute read -
    age-pyramid data visualisation Data Visualization dataviz blog
  • The Mysterious Status of .blog Domains

    The Mysterious Status of .blog Domains

    September 1, 2020

    When the .blog TLD was started by Automattic, employees were given the option to reserve a domain for free. In return […], they asked that the domain be used as a primary domain (no forwarding to a different site), and that the site be updated with new content at least once a month. This requirement was the last argument for me NOT taking boris.blog – I didn’t want to make this commitment, plus I like gorelik.net a lot.

    Recently, there were some not so nice developments about .blog names that were given away to Automatticians. The complains about this situation are usually anonymously, but I think that in this case, anonymity isn’t the right approach. That is why, I decided to share here an anonymous post from the Antimattic blog. Although I am not the author of this original post, and I don’t share the views of some of the posts written there, I do share the concerns expressed in this particular article. Posting in return for a domain name might have been a reasonable request at the beginning of the .blog TLD to help promoting its adaptation. But now, several years after this TLD is active, this requirement is simply not OK. To read the original post, click the screenshot below.

    The first paragraph of this post is a verbatim copy from Antimattic.

    September 1, 2020 - 1 minute read -
    antimattic automattic blog
  • ASCII histograms are quick, easy to use and to implement

    ASCII histograms are quick, easy to use and to implement

    August 31, 2020

    From time to time, we need to look at a distribution of a group of values. Histograms are, I think, the most popular way to visualize distributions. “Back in the old days,” when most of my work was done in the console, and when creating a plot from Python was required too many boilerplate code lines, I found a neat function that produced histograms using ASCII characters.

    Recently, I updated the python function that I use to create ASCII histograms. The updated function [link] uses more modern formatting and includes several signal-to-noise improvements. One can also use it with custom output functions, such as logging.info.

    August 31, 2020 - 1 minute read -
    data visualisation Data Visualization dataviz distribution histogram blog
  • A short compilation of productivity blog posts

    A short compilation of productivity blog posts

    August 27, 2020

    This post contains a bunch of links to blogs that write about productivity.

    1. Musings of Brown Girls

    This is not an exclusively productivity blog. The authors of this collective effort write about other interesting things. I read some posts, and I liked them

    2. Self care

    Do you know that feeling when you feel bad and don’t have the energy to do anything about that? This post is for you.

    3. Saying NO

    Being a freelancer, I have to practice saying NO. Saying NO isn’t only good for productivity but also for your mental health. Interesting post.

    August 27, 2020 - 1 minute read -
    productivty repost blog
  • Many is not enough: Counting simulations to bootstrap the right way — Yanir Seroussi

    Many is not enough: Counting simulations to bootstrap the right way — Yanir Seroussi

    August 25, 2020

    An interesting post by my former coworker, Yanir Seroussi.

    Previously, I encouraged readers to test different approaches to bootstrapped confidence interval (CI) estimation. Such testing can done by relying on the definition of CIs: Given an infinite number of independent samples from the same population, we expect a ci_level CI to contain the population parameter in exactly ci_level percent of the samples. Therefore, we […]

    Many is not enough: Counting simulations to bootstrap the right way — Yanir Seroussi

    August 25, 2020 - 1 minute read -
    blog
  • There are three things one can watch forever: fire burning, water falling, and computation progress bars

    There are three things one can watch forever: fire burning, water falling, and computation progress bars

    August 23, 2020

    https://videopress.com/v/OxcrfxZ2?autoPlay=true&controls=false&loop=true&muted=true&persistVolume=false&preloadContent=metadata

    August 23, 2020 - 1 minute read -
    video blog
  • Book review: The Abyss: Bridging the Divide between Israel and the Arab World

    Book review: The Abyss: Bridging the Divide between Israel and the Arab World

    August 20, 2020

    TL;DR If you are an Israeli and don’t feel like learning the behind the scenes stories, skip it. Otherwise, I do recommend reading this book. I enjoyed it a lot 4.5/5

    The Abyss: Bridging the Divide between Israel and the Arab World went to print slightly after the outbreak of the “Arab Spring.” The author, Eli Avidar, is a former Israeli intelligence officer and diplomat. Among other things, Eli Avidar served as the head of the Israeli diplomatic mission to Qatar in 1999. Today, Eli Avidar is a Knesset member for the right-wing Yisrael Beiteinu party. Even though so many things have changed since the book was published, I didn’t find any claim that Eli Avidar made, and that turned out to be wrong, nine years after the publication.

    I enjoyed reading this book a lot despite the fact that most of Eli Avidar’s claims are not new to me. Most of them are widely known to all the Israelis, and the real question is not whether you are aware of these claims, but whether you agree with them and what conclusions you make out of them.

    On the other hand, The Abyss is an interesting storybook full of behind the scenes anecdotes and gossip. All who know me know how much I like gossips. It also provides a great introspection of how the (Jewish-)Israeli society sees the Arab-Israeli conflict, and what it feels towards it.

    Should you read the book? If you are an Israeli and don’t feel like learning the behind the scenes stories, you may skip it. Otherwise, I do recommend reading this book. I don’t know how accurate is Avidar’s description of the Arab world, but his analysis of the Israeli behavior and attitude is very accurate. If you ever cough yourself wondering “What the fuck do the Israelis think?”, this book might shed some light for you. That is why I write this review in English, despite my tendency to review Hebrew books in my Hebrew blog.

    Fun fact. I finished reading this book on August the 13th. I closed the book, opened Twitter, and saw my feed FULL with news about the upcoming normalization treaty between Israel and UAE.

    August 20, 2020 - 2 minute read -
    avidar book book review Israel israeli-arab-conflict qatar blog
  • What is the biggest problem of the Jet and Rainbow color maps, and why is it not as evil as I thought?

    What is the biggest problem of the Jet and Rainbow color maps, and why is it not as evil as I thought?

    August 17, 2020

    There was a consensus among the data visualization purists that the rainbow color map, and it’s close cousin Jet are bad. Really bad. These colormaps used to be popular at the beginning of the computational data visualization era. However, their popularity decreased in the last five years or so. The sentiment isn’t as bad as it used to be a couple of years ago, but still.

    A screenshot from circa 2016. Today we are less fanatic than that

    What is the biggest problem of the rainbow colormap? The most apparent problem with this particular colormap is that it not perceptually uniform. By “perceptually uniform,” I mean that equal changes in the value that we encode using a colormap should correspond to same changes in the color perception. This is not the case with the rainbow or the Jet colormaps. They have distinct bright and dark stripes within the number range, making them the wrong choice to encode numerical data. The situation is even worse for people with impaired color vision.

    Can you be less perceptually uniform?

    The solution to this problem was proposed in the form of better colormaps. The first one that I know of is Parula by Matlab, and it’s opensource alternative Viridis that is available in matplotlib and many other plotting libraries. (Watch this video about viridis to get a good introduction to color perception and color maps).

    Viridis, the new rainbow

    Everything was nice and good, and I was trashing the rainbow colormap whenever I could. Until yesterday, when I read about Turbo, the improved rainbow colormap developed by Google.

    In the long and interesting blog post that describes Turbo, Anton Mikhailov, a software engineer in Google, describes several relevant applications of a “good rainbow” scheme.

    According to Anton, “Because of rapid color and lightness changes, Jet accentuates detail in the background that is less apparent with Viridis** **and even Inferno. Depending on the data, some detail may be lost entirely to the naked eye. The background in the following images is barely distinguishable with Inferno (which is already punchier than Viridis), but clear with Turbo.”

    I must admit that I’m convinced.

    The biggest problem with that is mentioned concerning the original rainbow scheme that its brightness varies too much. However, it turns out that the color saturation and hue attract our attention more than the lightness (here’s the reference which I haven’t read yet). As such, it makes sense to construct a colormap that relies more on color and hue changes.

    Moreover, in many cases, the interesting details appear in the extreme values of the data range, not in the middle. In thes cases, a properly applied rainbow-like color scheme becomes a valid choice.

    The bottom line is that one should not refrain from using rainbow(-like) color maps in their visualizations anymore, provided that they use a modern implementation. Luckily, it’s even available in matplotlib

    August 17, 2020 - 3 minute read -
    colormap colors data visualisation Data Visualization dataviz jet turbo blog
  • If you don't teach yet, start! It will make you a better professional.

    If you don't teach yet, start! It will make you a better professional.

    August 12, 2020

    Many people know me as a data scientist. However, I also teach, which is sort of unnoticed to many of my friends and colleagues. I created a page dedicated to my teaching activity. Talk to me if you want to organize a course or a workshop.

    I also highly recommend teaching as way of learning. So, if you don’t teach yet, start! It will make you a better professional.

    August 12, 2020 - 1 minute read -
    teaching blog
  • How to suck less in data visualization and professional communication

    How to suck less in data visualization and professional communication

    July 28, 2020

    In technical communication, the main thing is to keep the main thing the main thing. There are multiple ways to ensure this principle. Some of these ways require careful chart fine-tuning. However, there is one tool that is easy to master, fast to apply, and that provides a high return on the investment rate. I refer to chart titles. In this talk, I had two main theses. My first thesis is that most of you suck in communication (and not only data visualization).

    My second thesis is that you can quickly improve your graphs by merely adding a good title. The importance of good titles is not new to my preaching, but I thought it was an excellent thing to formalize this thesis a bit, and I’m thankful to the NDR organizers for giving me this opportunity.

    Following is the slide stack from my NDR presentation.

    https://www.slideshare.net/borisgorelik/the-biggest-missed-opportunity-in-data-visualization

    July 28, 2020 - 1 minute read -
    communication conference data visualisation Data Visualization ndr presentation speaking blog
  • Meet me at the online data science / AI conference

    Meet me at the online data science / AI conference

    July 16, 2020

    I will be talking about data visualization at the next NDR conference on July 28. All the conferences organized by the NDR team are well organized and of a very high value. I hope to keep the level high.

    And here’s the brief description of my talk

    See you

    July 16, 2020 - 1 minute read -
    conference data visualisation Data Visualization ndr speaking blog
  • 35 (and more) Ways Data Go Bad — Stats With Cats Blog

    35 (and more) Ways Data Go Bad — Stats With Cats Blog

    July 14, 2020

    If you plan working data analysis or processing, read the excellent post in the “stats with cats blog” titled “35 Ways Data Go Bad” post. I did experience each and every one of the 35 problems. However, this list is far from being complete. One should add the comprehensive list of Falsehoods Programmers Believe About Time.

    When you take your first statistics class, your professor will be a kind person who cares about your mental well-being. OK, maybe not, but what the professor won’t do is give you real-world data sets. The data may represent things you find in the real world but the data set will be free of errors. […]

    35 Ways Data Go Bad — Stats With Cats Blog

    July 14, 2020 - 1 minute read -
    data science statistics blog
  • Unexpected hitch of working in a distributed team

    Unexpected hitch of working in a distributed team

    July 13, 2020

    It has been about half a year after I became a freelance data scientist. Before my career change, I worked in a distributed team for more than five years. Today, I suddenly realized that working in a distributed team has a significant problem, inherent to its distributed, multinational, nature.

    My team was always spread over multiple time zones. Sometimes, the time zone span was so broad, that we could never find a time slot where all the team members were ordinarily awake. Automattic, the company I used to work for, is a firm believer in asynchronous communication, but from time to time, you HAVE to meet over a Zoom/Slack/Whatever call. Since I wasn’t a manager, the number of live calls that I had to attend was kept to a minimum, and yet, I found myself at least twice a week in a 10 pm Zoom call. I don’t know what about you, but my brain keeps working for at least two outs after log off. Thus, twice a week, I would find myself going to bed after one o’clock at night. As a result, I was sleep deprived for the majority of the week.

    Only now have I noticed the fact that my sleep has improved so much after the career change. I know that people who work in “colocated” teams also find themselves in late night phone calls, but working in a distributed group means that you’ll do it regularly.

    July 13, 2020 - 1 minute read -
    distributed work remote working sleep work-from-home blog
  • Hybrid digital/analog tangible week planning

    Hybrid digital/analog tangible week planning

    July 12, 2020

    Here’s a neat method that helps me organize my week, increase my productivity and fight procrastination.

    Being a freelancer data scientist, I’m involved in three hands-on projects for two clients. I also manage/mentor two data scientists in two other projects, and participate in strategic discussions for a customer of mine, and in a startup in which I invest. Oh, I am also in the final stages of writing a paper. I never imagined I would be in the situation with so many balls that I need to keep in the air. How do I manage to keep sanity?

    This is what I do. Following the advice in “15 Secrets Successful People Know About Time Management”, I try to keep as many items in my calendar as possible. When my workweek starts, I print out the weekly schedule on a sheet of paper. Then, I apply the tangible GTD hack that I learned from another book [link] and write out all my projects on a bunch of small post-it notes. These notes allow me “dumping” all my brain contents into an external medium, which frees up my brain to spend more CPU cycles on processing, rather than remembering and worrying.

    Next comes the fun part, I get to play with my cards by arranging them on the weekly schedule. The geometry of the post-it notes and the sheet of paper ensures that I allocate reasonably larget chunks of time for each “big thing.” It also reminds me that the amount of time each day is limited, and I can’t stick too many plans into a day or a week. (No, I won’t be able to finalize the paper, complete the analysis for a retail shop, learn a chapter in Bayesian statistics book, before the end of today).

    After I’m done, I copy each post-it note into my calendar. Thanks to the integration with Todoist (an excellent productivity tool), all these tasks end up in my todo list, where I can further work with them.

    To sum up:

    • Global week overview - check
    • Prioritization and honesty - check.
    • Fun playing with sticky notes - check.
    • Work gets done - (I wish!).

    Oh, did you notice the appointments between 5 and 6 am? This is my sports activity. Sometimes working out charges me for the entire day. Sometimes, all I want to do for the entire day is to have a nap :-)

    July 12, 2020 - 2 minute read -
    gtd planning procrastination productivity time-management blog Productivity & Procrastination
  • Before and after. Even excellent graphs can be improved

    Before and after. Even excellent graphs can be improved

    June 30, 2020

    Being a data visualization consultant, I can’t help looking for dataviz problems in graphs that I see. Even if the graph is good. Even if I know that I would not be able to create a graph that good. Even if the overall graph is excellent, and the problems are minor, or maybe especially when the graph is excellent, and the problems are minor.

    This is a nice graph published by Nevo Benita on Linkedin.

    The graph presents the gap between the men and the women in the Israeli job market. As I said, the graph is excellent. However, there are several small problems that, like grains of sand in a chocolate mousse, stand in the way. Let’s take a look at them.

    The time-series line in the upper right part of the graph shows good use of the real estate. The problem is that the X-axis ticks (the years) look as if they belong to the chart below. It takes some time to realize that the numbers are years of the upper graph, and not the X-axis of the graph below. Moving the numbers upwards by several pixels would have fixed that.

    *Now, it is more clear that “1990” and “2018” relate to the time-series graph above.
    Before (left) and after (right). *

    Let’s talk about the left-side bar chart. It took me a while to understand what it is. As a matter of fact, I managed to write a critique paragraph about that bar chart, how it is unclear what the percentages are, and how they were computed. Only then had I noticed the explanation below. Such confusion isn’t the viewer’s fault. Since we usually scan images from top to bottom, moving the title to the top of the chart will reduce this confusion. The word “percent” is also redundant in that title since it comes after the percent sign.

    Moving the explanation to the top makes it easier to notice. Before (left) and after (right)

    The last point that is worth optimizing is the color order. Consistent element order in an image makes navigation and comprehension much easier. When the order is preserved, our brain can use mental shortcuts without losing much information. When these shortcuts are broken, the brain has to work harder. What am I talking about? The graph author made the correct decision to use different font colors in the graph title to specify which color stands for which gender. This way, we don’t need a separate legend, and this is good. The title is an ordered sequence of words. The visualizer could use this order to create the order heuristic that is so helpful. Such a heuristic isn’t always possible. Fortunately for the visualizer (and sadly for the society), the salary gap in all the occupations in this graph have the same direction: men earn more than women. As a result, the rightmost part has all the green dots on the right, and the purple dots are on the left. This direction is opposite to the gender direction in the title and the color direction in the bar chart. To fix this situation, I made sure that the color that stands for the women (purple) is always to the left of the color that designates the men (green).

    Keeping the color order. Before (left) and after (right)

    So, this is the final result. I hope you can see why I like it better.

    That’s how I took and excellent graph and made it even more awesome.

    June 30, 2020 - 3 minute read -
    before-after data visualisation Data Visualization dataviz blog
  • Data visualization is not only dots, bars, and pies

    Data visualization is not only dots, bars, and pies

    June 28, 2020

    Look at this wonderful piece of data visualization (taken from here). If you know the terms “tertiary structure” and “glycan”, there is NO way you miss the message that the author of this figure wanted to convey.

    Also, note how using appropriate colors in the title, the authors got rid of graph legend.

    June 28, 2020 - 1 minute read -
    data visualisation Data Visualization dataviz legend molecular-structure blog
  • Multilingual protest in Acre, Israel

    Multilingual protest in Acre, Israel

    June 27, 2020

    June 27, 2020 - 1 minute read -
    blog
  • How to become a Python professional in 42 hours?

    How to become a Python professional in 42 hours?

    June 25, 2020

    Here’s an appealing ad that I saw

    This image has an empty alt attribute; its file name is image-2.png

    How to become a Python professional in 42 hours? I’ll tell you how. There is no way. I don’t know any field of knowledge in which one can become professional after 42 hours. Certainly not Python. Not even after 42 days. Maybe after 42 weeks if that’s mostly what you do and you already a programmer.

    June 25, 2020 - 1 minute read -
    ad career python blog
  • Standardizing bidirectional language support in interfaces and visualization

    Standardizing bidirectional language support in interfaces and visualization

    June 24, 2020

    I’m honored to take part in standardizing bidirectional language support in interfaces and visualization, as a part of an expert group formed for the Hebrew Support in Computerized Systems Committee at the SII-the standards institution of Israel.

    The Committee is led by Gilad Almosnino. Below is Gilad’s project announcement.

    June 24, 2020 - 1 minute read -
    RTL sii blog
  • Book review. Five Stars by Carmine Gallo

    Book review. Five Stars by Carmine Gallo

    June 22, 2020

    TL;DR Good motivation to improve communication. Inadequate source of information on how to achieve that

    The central premise of Five Stars Communication Secrets to Get from Good to Great by Carmine Gallo is that professionals who don’t invest in communication skills are at high risk of being replaced by computers and robots. One of the book’s sections bares the title that summarises this premise very well “Storytelling isn’t a soft skill; it’s the equivalent of hard cash.” I firmly believe in these premises. That is why I invest so much time in learning and teaching data visualization, in public speaking, and blogging.

    When I started reading this book, I got excited. I kept marking one passage after another. Gallo packed the first part of the book with numerous citations and explanations on how a lack of communication skills is the most severe risk factor in the career of a modern professional, team, or company. One example leads to another one, and one smart conclusion followed another one.

    Then, I started noticing that the book tries to convince me more and more, but I didn’t need that convincing in the first place. More than half of the book is evangelism. The author tells you how essential communication skills are, then he gives you some examples of people who did it right, and then again talks on importance. Again, and again, and again. Where are all those “secrets to get from good to great”???

    When, finally, we get to the practical parts, the reader is left mostly with shallow, almost trivial bits of advice.

    Some of the most important points I took from this book

    Slight feeling of a hamster-wheel while reading this book

    Adopt the three-act storytelling approach to presentations. The three-act storytelling approach worked for Homerus, Shakespear, Tarantino, and there is no reason it should fail you in your technical presentations. Fair enough. On the other hand, this 2012 article by Nancy Duarte, provides more depth and more actionable information on this approach (follow Duarte’s blog if presentation skills are something you are interested in).

    “In the first two to three minutes of a presentation, I want people to lean forward in their chairs.” I like this citation by Avinash Kaushik, Google’s digital marketing evangelist. I will undoubtedly try this approach in my next presentations.

    Should you read this book?

    If you read these lines, your job depends on your communication and presentation skills. If you believe this premise, you can skip the first 60% of the book. If you want to improve your communication skills, I suggest reading Jean-luc Doumont’s “Trees, Maps, and Theorems,” which is much shorter, but also much denser in methods and practical advice.

    The bottom line

    3.5/5

    June 22, 2020 - 2 minute read -
    book review presentation-tip public speaking blog
  • The delicate art of fine trolling

    The delicate art of fine trolling

    June 15, 2020

    I’m reading the a 1991 paper by Barbara Tversky that deals with the directional representation of time. One sentence in the paper interview says

    “There does not seem to be strong universal cognitive associations of quantity or quality to left or right”

    Whenever I make a similar statement in the context of data visualization, I frequently get a self-assured response “of course there is - smaller numbers appear on the left!”. To answer this remark, Barbara Tversky added a small footnote that says

    “Anyone in doubt should consult politicians on both the left and the right.”

    Photo by Pixabay on Pexels.com

    So gentle, yet so powerful.

    June 15, 2020 - 1 minute read -
    paper RTL trolling blog
  • Lie factor in ad graphs

    Lie factor in ad graphs

    June 9, 2020

    It’s fun to look at the visit statistics and to discover old stories. I wrote this post in 2016. For a reason I don’t know, this post has been one of the most viewed posts in my blogs during the last week.

    So, I decided to publish it again. I won’t add any new examples, but if you want to see more stuff, type [lying with data visualization] in your favorite search engine

    Lie factor in ad graphs

    What do you do when you have spare time? I tend to throw graphs from ads to a graph digitizer to compute the “lie factor”. Take the following graph for example. It appeared in an online ad campaign a couple of years ago. In this campaign, one of the four Israeli health care providers bragged about the short waiting times in their phone customer support. According to the Meuheded (the health care provider who run the campaign), their customers had to wait for one minute and one second, compared to 1:03, 1:35, and 2:39 in the cases of the competitors. Look how dramatic the difference is:

    Screen Shot 2018-02-16 at 18.34.38

    The problem?

    If the orange bar represents 61 seconds, then the dark blue one stands for 123 seconds, almost twice as much, compared to the actual numbers, the green bar is 4:20 minutes, and the light-blue one is approximately seven minutes, and not 2:39, as the number says.

    Screen Shot 2018-02-16 at 18.32.53

    I can’t figure out what guided the Meuhedet creative team in selecting the bar heights. What I do know that they lied. And this lie can be quantified.

    June 9, 2020 - 2 minute read -
    data visualisation Data Visualization dataviz lie lie-factor blog
  • StellarGraph — another promising network analysis library for Python and Scala

    StellarGraph — another promising network analysis library for Python and Scala

    June 8, 2020

    Network (graph) analysis is a complicated topic. There are several tools available for this task with different pros and cons. Recently, I stumbled upon another tool StellarGraph. StellarGraph authors claim to provide excellent performance; NumPy, Pandas, TensorFlow integration, an impressive set of algorithms, inter compatibility with Neo4j (THE graph database); and much more. The documentation looks very clear and extensive too.

    I didn’t use it yet, but I certainly plan to.

    https://www.stellargraph.io

    June 8, 2020 - 1 minute read -
    igraph neo4j network-analysis networkx social-network-analysis stellargraph blog
  • The hazard of being a wizard. On balance between specialization and the risk to become obsolete.

    The hazard of being a wizard. On balance between specialization and the risk to become obsolete.

    June 3, 2020

    A wizard is a person who continually improves his or her professional skill in a particular and defined field. I learned about this definition of wizardness from the book “Managing project, people and yourself” by Nikolay Toverosky (the book is in Russian).

    Recently, Nikolay published an interesting post about the hazards of becoming a wizard. The gist of the idea is that while you are polishing your single skill to perfection, the world changes. You may find your super-skill irrelevant anymore (see my Soviet Shoemaker story).

    Nikolay doesn’t give any suggestions. Neither do I.

    Below is the link to the original post. The post is in Russian, and you can use Google Translate to read it.

    Страница о магах У меня в книге есть глава про полководцев и магов. В её конце я подвожу итог: Несмотря на свою кру­тость, маг уяз­вим. Он поле­зен, только если его навык под­хо­дит к задаче. 658 more words

    Почему опасно быть магом — Об управлении проектами и дизайне

    June 3, 2020 - 1 minute read -
    diversity obsolete skills blog Career advice
  • Older posts Newer posts