Fantastic YouTube channels with top-quality learning for software engineers

Video training is undeniably one of the most popular learning methods for self-taught programmers and, according to Forbes, the most popular among millennials.

Anyone who stops learning is old, whether at twenty or eighty. Anyone who keeps learning stays young. The greatest thing in life is to keep your mind young. (Henry Ford)

As far as I know, many people rely on one or more of the most popular video training platforms like Pluralsight, Udemy, or Coursera, but they use accounts funded by the company they work for.

But what about those who don’t have that great benefit and cannot allocate a budget of around $300/year for such a purpose? Thankfully, YouTube hosts quite a few channels that offer top-quality material for software engineers. Below are my favorites, in no particular order.

Even Master Yoda agrees

  1. Derek Banas: Derek provides a diverse set of tech courses, from assembly and design patterns to Django, Rails, and UML. His extremely clear accent makes subtitles unnecessary. What I like about this channel is that it is not purely technical; it also has a good grasp of the soft-skills side of a developer’s life, like negotiation and sales techniques.
  2. thenewboston: With 4,350 videos at the time of writing this article, this guy is literally recording while he is sleeping. Like Derek Banas, thenewboston (or Bucky Roberts, as he is known in the real world) provides tutorials on a wide variety of subjects. Programming courses take the lion’s share of the channel’s library, but there are a few on math, biology, and gaming.
  3. FreeCodeCamp talks: A great, and relatively recent, initiative from FreeCodeCamp, where various passionate people give talks on modern technical topics. You can even submit your own talk to be published. There is no reason for me to consume more of your time here, given there is already a great article answering all the questions.
  4. Success in Tech: Ramon Lopez, the owner of this channel, is a great software engineer I have been following for some time now. Initially, I was attracted by his system design videos, but later I came to love his career advice playlist too. The channel has looked paused for a while now; I hope it gets refreshed soon.
  5. Traversy Media: A channel whose owner is heavily involved with web programming; a quick scroll through his playlists will leave you with no doubt. :) He mostly focuses on Node.js and vanilla JavaScript, but there is a fair share of PHP, Python, and CSS as well. My favorite part here is the long videos he has created, where you can get a good grasp of the tech he is covering in about an hour, like this one.
Source: https://dev.to/perigk/fantastic-youtube-ch...

Data science with Python: Turn your conditional loops to Numpy vectors

The vectorization trick is fairly well known to data scientists and is used routinely in coding to speed up overall data transformation, wherever simple mathematical transformations are performed over an iterable object, e.g. a list. What is less appreciated is that it even pays to vectorize non-trivial code blocks such as conditional loops.

Python is fast emerging as the de facto programming language of choice for data scientists. But unlike R or Julia, it is a general-purpose language and does not have a functional syntax to start analyzing and transforming numerical data right out of the box. So it needs a specialized library.

Numpy, short for Numerical Python, is the fundamental package required for high-performance scientific computing and data analysis in the Python ecosystem. It is the foundation on which nearly all of the higher-level tools such as Pandas and scikit-learn are built. TensorFlow uses NumPy arrays as the fundamental building block on top of which it builds its Tensor objects and computation graphs for deep learning tasks (which make heavy use of linear algebra operations on long lists/vectors/matrices of numbers).

Many NumPy operations are implemented in C, avoiding the general cost of loops in Python, such as pointer indirection and per-element dynamic type checking. The speed boost depends on which operations you are performing. For data science and modern machine learning tasks, this is an invaluable advantage.
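
As a quick illustration of this point (my own sketch, not part of the original benchmark), the snippet below compares a pure-Python loop with a single vectorized NumPy call over a hypothetical array of one million random numbers. Exact timings will vary with your hardware, but the compiled NumPy version is typically orders of magnitude faster.

import time
from math import sin
import numpy as np

# Hypothetical test array of one million standard-normal samples
arr = np.random.randn(1_000_000)

# Pure-Python loop: every element passes through the interpreter
t1 = time.time()
loop_result = [sin(v) for v in arr]
t2 = time.time()
print("Python loop:      {:.1f} ms".format(1000*(t2-t1)))

# Vectorized call: the per-element loop runs in compiled C inside NumPy
t1 = time.time()
vec_result = np.sin(arr)
t2 = time.time()
print("Vectorized NumPy: {:.1f} ms".format(1000*(t2-t1)))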

My recent story demonstrating the advantage of NumPy-based vectorization for a simple data transformation task caught some attention and was well received by readers. There was some interesting discussion on the utility of vectorization versus code simplicity and such.

Now, mathematical transformations based on some predefined condition are fairly common in data science tasks. And it turns out one can easily vectorize simple blocks of conditional loops by first turning them into functions and then using the numpy.vectorize method. In my previous article I showed an order-of-magnitude speed boost from NumPy vectorization of a simple mathematical transformation. For the present case, the speedup is less dramatic, as the internal conditional looping is still somewhat inefficient. However, there is at least a 20–50% improvement in execution time over plain vanilla Python code.

Here is the simple code to demonstrate it:

import numpy as np
from math import sin as sn
import matplotlib.pyplot as plt
import time

# Number of test points
N_point = 1000

# Define a custom function with if-elif-else branching
def myfunc(x,y): 
  if (x>0.5*y and y<0.3): return (sn(x-y)) 
  elif (x<0.5*y): return 0 
  elif (x>0.2*y): return (2*sn(x+2*y)) 
  else: return (sn(y+x))

# List of stored elements, generated from a Normal distribution
lst_x = np.random.randn(N_point)
lst_y = np.random.randn(N_point)
lst_result = []

# Optional plots of the data
plt.hist(lst_x,bins=20)
plt.show()
plt.hist(lst_y,bins=20)
plt.show()

# First, plain vanilla for-loop
t1=time.time()
for i in range(len(lst_x)):
    x = lst_x[i]
    y= lst_y[i]
    if (x>0.5*y and y<0.3):
        lst_result.append(sn(x-y))
    elif (x<0.5*y):
        lst_result.append(0)
    elif (x>0.2*y):
        lst_result.append(2*sn(x+2*y))
    else:
        lst_result.append(sn(y+x))
t2=time.time()

print("\nTime taken by the plain vanilla for-loop\n----------------------------------------------\n{} us".format(1000000*(t2-t1)))

# List comprehension
print("\nTime taken by list comprehension and zip\n"+'-'*40)
%timeit lst_result = [myfunc(x,y) for x,y in zip(lst_x,lst_y)]

# Map() function
print("\nTime taken by map function\n"+'-'*40)
%timeit list(map(myfunc,lst_x,lst_y))

# Numpy.vectorize method
print("\nTime taken by numpy.vectorize method\n"+'-'*40)
vectfunc = np.vectorize(myfunc,otypes=[float],cache=False)
%timeit list(vectfunc(lst_x,lst_y))

# Results
Time taken by the plain vanilla for-loop
----------------------------------------------
2000.0934600830078 us

Time taken by list comprehension and zip
----------------------------------------
1000 loops, best of 3: 810 µs per loop

Time taken by map function
----------------------------------------
1000 loops, best of 3: 726 µs per loop

Time taken by numpy.vectorize method
----------------------------------------
1000 loops, best of 3: 516 µs per loop

Notice that I have used the %timeit Jupyter magic command everywhere I could write the evaluated expression in one line. That way I am effectively running at least 1000 loops of the same expression and averaging the execution time to avoid any random effects. Consequently, if you run this whole script in a Jupyter notebook, you may see a slightly different result for the first case, i.e. the plain vanilla for-loop execution, but the next three should show a very consistent trend (depending on your computer hardware).

We see evidence that, for this data transformation task based on a series of conditional checks, the vectorization approach using NumPy routinely gives a 20–50% speedup compared to general Python methods.

It may not seem a dramatic improvement, but every bit of time saving adds up in a data science pipeline and pays back in the long run! If a data science job requires this transformation to happen a million times, that may result in a difference between 2 days and 8 hours.

In short, wherever you have a long list of data and need to perform some mathematical transformation over it, strongly consider turning those Python data structures (lists, tuples, or dictionaries) into numpy.ndarray objects and using NumPy’s inherent vectorization capabilities.
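
Going one step further than numpy.vectorize (which still calls the Python function once per element), the same conditional logic can often be expressed entirely with array operations. The sketch below is my own illustration rather than code from the original benchmark: it rewrites the logic of myfunc with nested numpy.where calls. All branches are computed for every element, so it trades some redundant arithmetic for fully compiled execution, and is usually faster still.

import numpy as np

# Hypothetical inputs, mirroring lst_x and lst_y from the benchmark above
x = np.random.randn(1000)
y = np.random.randn(1000)

# Nested np.where mirrors the if/elif/else chain of myfunc:
# each condition is evaluated as a boolean mask over the whole array,
# and the matching branch value is selected element by element
result = np.where((x > 0.5*y) & (y < 0.3), np.sin(x - y),
         np.where(x < 0.5*y, 0.0,
         np.where(x > 0.2*y, 2*np.sin(x + 2*y),
                  np.sin(y + x))))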

NumPy provides a C-API for even faster code execution, but it takes away the simplicity of Python programming. This SciPy lecture note shows all the related options you have in this regard.

There is an entire open-source, online book on this topic by a French neuroscience researcher. Check it out here.

Source: https://www.codementor.io/tirthajyotisarka...