Automating Image clean up with Python

28. July 2020 11:24 by Jay Grossman
If you have been reading this blog, you'll know I collect sports cards. It's fun to share what I have with other collectors (by posting scans on sportscollectors.net, Facebook, etc.). A few years ago I bought a Brother MFC-9130CW all-in-one printer/scanner that I use to do my scanning. I usually set it to scan documents in legal format so I can fit 9 cards on a scan (3 rows by 3 columns). And since I want to do this as efficiently as possible, I generally save them as a 200dpi PDF file with multiple page... [More]
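As a rough illustration of the scan layout described in this excerpt, the sketch below computes crop boxes for a 3 by 3 grid on a legal-size page (8.5 x 14 inches) at 200 dpi. The full post is not shown here, so this is not the author's method: the function name `grid_boxes` is hypothetical, and it assumes the cards sit in evenly sized cells, which a real scan may not guarantee.

```python
def grid_boxes(page_width_in, page_height_in, dpi, rows, cols):
    """Return (left, upper, right, lower) pixel boxes for an even grid.

    Assumption: the page divides into equally sized cells. Real scans
    usually need extra trimming or deskewing per card.
    """
    width_px = round(page_width_in * dpi)
    height_px = round(page_height_in * dpi)
    boxes = []
    for r in range(rows):
        for c in range(cols):
            # Integer division keeps the boxes contiguous with no gaps.
            left = c * width_px // cols
            upper = r * height_px // rows
            right = (c + 1) * width_px // cols
            lower = (r + 1) * height_px // rows
            boxes.append((left, upper, right, lower))
    return boxes


# Legal paper at 200 dpi is 1700 x 2800 pixels.
boxes = grid_boxes(8.5, 14, 200, rows=3, cols=3)
print(len(boxes))   # 9
print(boxes[0])     # (0, 0, 566, 933)
```

The boxes use the (left, upper, right, lower) order that imaging libraries such as Pillow accept for cropping, so each tuple could be passed to a crop call once a page has been loaded as an image.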

Open Source Data Tools I like

17. May 2020 22:38 by Jay Grossman
At a recent meetup for the NYC Snowflake user group, the moderator asked the panelists which open source data tools they use. I thought it would be nice to share my answers here. [More]

Your Dreams must be bigger than your fear

3. November 2019 20:44 by Jay Grossman
Last year I was eating lunch at a local Asian restaurant, and I pulled an awesome message out of my fortune cookie. It said, “Your dream must be bigger than your fear.” This message hits home for me, and the fortune is still attached to my monitor as a reminder. [More]

Fantasy Football Retirement Letter

1. September 2019 23:01 by Jay Grossman
I wrote a ridiculous retirement email to members of my day job's fantasy league. It includes a fun comparison of myself and recently retired NFL star Rob Gronkowski (AKA Gronk). Hope y'all enjoy. [More]

Apache Superset Test Drive

2. March 2019 11:44 by Jay Grossman
I have lately been playing with some commercial BI & Dashboard tools. There is certainly quite a broad range when it comes to features, price, scalability, administration capabilities, how they can access data, and setup complexity. For a good-sized enterprise (with 100+ users), some of these solutions can run you several hundred thousand dollars per year. As part of my diligence, I felt I needed to look at the best-of-breed open source offering - Apache Superset, created by Airbnb. So I... [More]

User Manual for Jay at Work

23. May 2018 19:43 by Jay Grossman
I thought it would be a fun exercise to write a short document about what I index on for my day job and the folks I work with. What matters most to me in a role: adding value to the world; building a sustainable business (legit P&L); learning personally; growing others; working with great people. How I think the manager relationship works best (from the manager perspective): You are responsible for your career progression! I want to know your goals (if you can't express them... [More]

Snowflake Testimonial - Rent The Runway: Reinventing Retail with Data Driven Insights

15. March 2018 14:40 by Jay Grossman
In August, I spoke at the Snowflake analytics tour on the topic "Rent The Runway: Reinventing Retail with Data Driven Insights". I was also asked to provide an account of our experiences migrating data warehouse vendors - going from Vertica to Snowflake. Below is the video:  

#1 news story on espn.com details something that happened on SportsCollectors.Net

21. September 2017 08:28 by Jay Grossman
In my 20s, I taught myself to code and created what has become a fairly popular community for sports autograph collectors - SportsCollectors.Net. I have mentioned it a few times before on this blog. There is a really passionate collector who also happens to be a Major League Baseball player named Pat Neshek. Pat is a really effective middle reliever and made his second All-Star Game this season. He is also known as one of the most generous players in giving his time and autograph to ... [More]

Simulating Vertica's conditional_change_event

14. September 2017 20:19 by Jay Grossman
Lately my team has spent a bunch of time migrating our data warehouse from Vertica to Snowflake. While Snowflake has excellent support for analytic functions, Vertica has some functions that no other columnar database supports.  The conditional change event function "assigns an event window number to each row, starting from 0, and increments by 1 when the result of evaluating the argument expression on the current row differs from that on the previous row". CONDITIONAL_CHANGE_EVENT (... [More]
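The quoted definition can be sketched in a few lines of Python. This is only an illustration of the numbering rule over an already ordered list of values, not the SQL from the full post and not Vertica's implementation. The function name and sample data are made up for the example, and partitioning, ordering, and NULL handling are not modeled.

```python
from typing import Hashable, Iterable, List


def conditional_change_event(values: Iterable[Hashable]) -> List[int]:
    """Return an event window number for each value, in input order.

    The counter starts at 0 and increments by 1 whenever the current
    value differs from the previous one. The first row never increments
    because it has no previous row to compare against.
    """
    events: List[int] = []
    current = 0
    previous = None
    for index, value in enumerate(values):
        if index > 0 and value != previous:
            current += 1
        events.append(current)
        previous = value
    return events


# Hypothetical sample: a status column already sorted by time.
statuses = ["open", "open", "closed", "closed", "open"]
print(conditional_change_event(statuses))  # [0, 0, 1, 1, 2]
```

Each run of identical consecutive values shares one window number, which makes the result useful as a grouping key for consecutive runs.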

Snowflake's lateral flatten function on variant data type

25. August 2017 22:20 by Jay Grossman
Snowflake is a really interesting new data warehouse built on top of AWS. I like their architecture because they had the interesting idea to separate data storage (backed by small files on S3) from the compute that runs queries (EC2 instances running their API). I inherited a project where we would store complex JSON as a string in a varchar(64000) field. Then we would use regex patterns to get the values we wanted from it. Sometimes these regexes would get really involved, yuck. The Variant data... [More]
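To show why regexes over JSON strings get painful, here is a small Python analogy that contrasts the two approaches. It is not Snowflake SQL and not the code from the full post. The sample document and its field names (`order_id`, `items`, `sku`, `qty`) are invented for the example, and `flatten_items` only loosely mimics the idea of producing one row per array element.

```python
import json
import re

# Hypothetical JSON document stored as a plain string.
raw = '{"order_id": 42, "items": [{"sku": "A1", "qty": 2}, {"sku": "B7", "qty": 1}]}'

# Regex approach: works for one flat field, but loses the link between
# each sku and its qty, and breaks easily if the formatting changes.
skus_by_regex = re.findall(r'"sku":\s*"([^"]+)"', raw)


def flatten_items(doc_text):
    """Parse the document and return one (order_id, sku, qty) row per item."""
    doc = json.loads(doc_text)
    return [(doc["order_id"], item["sku"], item["qty"]) for item in doc["items"]]


rows = flatten_items(raw)
print(skus_by_regex)  # ['A1', 'B7']
print(rows)           # [(42, 'A1', 2), (42, 'B7', 1)]
```

Parsing the structure once keeps related fields together, which is the main thing the regex approach gives up.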

About the author

Jay Grossman

techie / entrepreneur that enjoys:
 1) my kids + awesome wife
 2) building software projects/products
 3) digging for gold in data sets
 4) my various day jobs
 5) rooting for my Boston sports teams:
    New England Patriots, Boston Red Sox, Boston Celtics, Boston Bruins
