{"id":81,"date":"2021-08-20T19:44:32","date_gmt":"2021-08-20T17:44:32","guid":{"rendered":"https:\/\/dmalabs.net\/blog\/?p=81"},"modified":"2021-08-20T19:44:34","modified_gmt":"2021-08-20T17:44:34","slug":"is-web-scraping-legal-how-to-scrape-data-legally","status":"publish","type":"post","link":"https:\/\/dmalabs.net\/blog\/is-web-scraping-legal-how-to-scrape-data-legally\/","title":{"rendered":"Is web scraping legal? How to scrape data legally?"},"content":{"rendered":"\n<p>Web scraping was a breakthrough for a number of businesses around the globe. As it\u2019s a cheap and automatic way to collect online data, it is frequently used by both startups and mature organizations. However, the use of bots remains controversial. Is it actually legal to scrape, collect and process online data that we don\u2019t own? In this article, we will explain it.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Are bots legal?<\/h2>\n\n\n\n<p>Web scraping relies on bots. Although it\u2019s hard to imagine the Internet without them, they are usually not something people are enthusiastic about. Bots are commonly associated with fraudulent activities, shady data collection practices, abusive e-mail campaigns, and bans on social media. Well, that\u2019s all true.&nbsp;<\/p>\n\n\n\n<p>However, web scraping has nothing to do with \u201cbad bots\u201d, that is, those used to conduct harmful activities such as online fraud, data theft, intellectual property theft, or spam. On the contrary, \u201cgood bots\u201d exist and can help businesses improve their operations. Scraping bots can help to automate price comparison, build databases, reach an audience or generate quality leads. 
In the case of carefully targeted campaigns that bring value to potential customers and do no harm to competitors, it\u2019s hard to call this harassment.<\/p>\n\n\n\n<p>To read more about the benefits of using web scraping, check out our article: <a href=\"https:\/\/dmalabs.net\/blog\/web-scraping-for-business\/\">5 Ideas of How To Use Web Scraping For Business<\/a>.&nbsp;<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">How to scrape data legally?<\/h2>\n\n\n\n<p>So, is web scraping legal? Can we scrape online data without limitations? We wish, but sadly, it\u2019s not that simple.<\/p>\n\n\n\n<p>It\u2019s a fact that big companies commonly use data scraping bots for their own gain. However, they also tend to protect their data from being scraped by other players. After all, if you found out that someone was using bots against you for competitive advantage, you would probably not be happy about it.&nbsp;<\/p>\n\n\n\n<p>Generally speaking, the question of web scraping is usually <strong>not about whether it is legal, but whether it is ethical<\/strong>. To put it simply, as long as scraping is not carried out for harmful purposes, such as stealing data, reusing content, or sending spam, you can generally assume it is legal. Remember that the technology itself is only a tool, and it depends on us how we use it and what for. However, it\u2019s a two-way street. You need to be aware that even though you don\u2019t want to be scraped by competitors, it is perfectly legal in many cases.&nbsp;<\/p>\n\n\n\n<p>So how can you tell when using bots is fine? Where is the line between scraping for gain and harassing competitors? We will discuss it in a second.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">When is web scraping illegal?<\/h2>\n\n\n\n<p>Although we already know that web scraping is not illegal in itself, it can be, depending on the purpose for which you intend to use it. 
There are a few cases when using data scraping bots might be illegal &#8211; let\u2019s take a look at them.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Scraping non-public content<\/strong><\/h3>\n\n\n\n<p>If the data is displayed for public consumption, there\u2019s nothing illegal in copying it to a file on your computer. However, if you\u2019re trying to scrape information that was not meant to be seen by the public, that\u2019s a different thing. Copying protected data for financial gain is prohibited by the CFAA (Computer Fraud and Abuse Act), a US law on accessing computers and data without authorization. So, don\u2019t cross that line, and make sure that you do not break into a source of sensitive data.&nbsp;<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Copyright abuse<\/strong><\/h3>\n\n\n\n<p>Scraping data is not illegal in itself, but re-publishing it on your website may be. Publicly available data might be protected by copyright, which means it cannot be used for just any purpose. In fact, re-using copyrighted content without permission, no matter if scraped automatically or copy-pasted manually, is generally illegal. 
So make sure that your text is unique and that you have the right to use it in your channels.<\/p>\n\n\n\n<div class=\"wp-block-image\"><figure class=\"aligncenter size-large\"><a href=\"https:\/\/dmalabs.net\/blog\/wp-content\/uploads\/2021\/08\/15.png\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"538\" src=\"https:\/\/dmalabs.net\/blog\/wp-content\/uploads\/2021\/08\/15-1024x538.png\" alt=\"Is web scraping legal?\" class=\"wp-image-83\" srcset=\"https:\/\/dmalabs.net\/blog\/wp-content\/uploads\/2021\/08\/15-1024x538.png 1024w, https:\/\/dmalabs.net\/blog\/wp-content\/uploads\/2021\/08\/15-300x158.png 300w, https:\/\/dmalabs.net\/blog\/wp-content\/uploads\/2021\/08\/15-768x403.png 768w, https:\/\/dmalabs.net\/blog\/wp-content\/uploads\/2021\/08\/15-830x436.png 830w, https:\/\/dmalabs.net\/blog\/wp-content\/uploads\/2021\/08\/15-230x121.png 230w, https:\/\/dmalabs.net\/blog\/wp-content\/uploads\/2021\/08\/15-350x184.png 350w, https:\/\/dmalabs.net\/blog\/wp-content\/uploads\/2021\/08\/15-480x252.png 480w, https:\/\/dmalabs.net\/blog\/wp-content\/uploads\/2021\/08\/15.png 1200w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/a><\/figure><\/div>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Abusing Robots.txt<\/strong><\/h3>\n\n\n\n<p>Authorized ways of using online data can also be regulated by robots.txt. It\u2019s a file that tells bots which parts of the website may be crawled or scraped. If the file prohibits scraping, the only option to do it legally is to ask the website owner for official permission.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Abusing Terms of Service<\/strong><\/h3>\n\n\n\n<p>In addition to robots.txt, most websites have their own Terms of Service (ToS). So, before scraping a particular website, you should make sure that it\u2019s allowed. If there\u2019s nothing about web scraping being prohibited, you\u2019re safe. 
Otherwise, you should obtain permission in writing.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\"><strong>Exceeding crawl rate<\/strong><\/h3>\n\n\n\n<p>Websites are made for humans. This is why most bots try to imitate human behavior in order to avoid exceeding the crawl rate with overly intensive scraping. On the other hand, advanced, human-like bots tend to process much more data and can significantly slow down the page. So, use bots that will help you to keep the right balance between maintaining the crawl rate at a reasonable level and preventing server overload.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Examples of legal and illegal web scraping<\/h2>\n\n\n\n<p>To provide you with a better understanding of the legal issues of web scraping, let\u2019s look at a few cases in which web scraping can be used in either the right or the wrong way.<\/p>\n\n\n\n<p>For example, bots can be used to search for YouTube video titles or descriptions. This data can be legally scraped, downloaded, and saved into a file. However, the videos cannot be re-posted on our own site, as this would be copyright abuse.<\/p>\n\n\n\n<p>To give you another example, a web crawler is able to scrape the names of users available publicly on social media. However, you can\u2019t log in to their Facebook or LinkedIn accounts to retrieve protected data, as it\u2019s not permitted by the rules of the service.&nbsp;<\/p>\n\n\n\n<p>What are the other tools that companies can use to protect their data from being scraped? For example, they can implement CAPTCHA verification technology. They can also use \u201crate-throttling\u201d to protect their websites from bots downloading too many web pages at once. This helps to keep out malicious bots which can overload the site.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Legal case: LinkedIn vs HiQ<\/h2>\n\n\n\n<p>Although cases in which web scraping is legal are fairly well defined, a few companies still decide to protect their data by legal means. 
An example is LinkedIn, whose dispute with HiQ ended up in court.<\/p>\n\n\n\n<p>HiQ is a company that scrapes public information from LinkedIn profiles to provide businesses with data and insights on employees. As LinkedIn decided to launch a similar tool themselves, in 2017 they demanded that HiQ stop collecting the data, and the dispute went to court. Long story short, in late 2019 the US Court of Appeals for the Ninth Circuit found that the scraping likely did not violate the CFAA, and LinkedIn was ordered to stop applying blocking measures against HiQ.<\/p>\n\n\n\n<p>This case suggests that data which is publicly available and not copyrighted can be scraped. However, it still cannot be used for unlimited commercial purposes.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Summary<\/h2>\n\n\n\n<p>To sum up, web scraping is generally legal unless you use it unethically. It\u2019s just a tool that is not harmful in itself but can become illegal when used with the wrong intention. However, if you have any doubts, you can always ask a lawyer for professional advice.<\/p>\n\n\n\n<p>If you need to scrape data for business and receive it in a processed, ready-to-use form, feel free to contact us. We will analyze your case and let you know what kind of data you can scrape and what is recommended in your case.&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Web scraping was a breakthrough for a number of businesses around the [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":82,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[3],"tags":[15,5],"class_list":["post-81","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-webscraping","tag-legal-web-scraping","tag-web-scraping"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v16.9 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Is web scraping legal? How to scrape data legally? 
- DMALabs Blog<\/title>\n<meta name=\"description\" content=\"Is it actually legal to scrape, collect and process online data that we don\u2019t owe? In this article, we will finally explain it.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/dmalabs.net\/blog\/is-web-scraping-legal-how-to-scrape-data-legally\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Is web scraping legal? How to scrape data legally?\" \/>\n<meta property=\"og:description\" content=\"Is it actually legal to scrape, collect and process online data that we don\u2019t owe? In this article, we will finally explain it.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/dmalabs.net\/blog\/is-web-scraping-legal-how-to-scrape-data-legally\/\" \/>\n<meta property=\"og:site_name\" content=\"DMALabs Blog\" \/>\n<meta property=\"article:published_time\" content=\"2021-08-20T17:44:32+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2021-08-20T17:44:34+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/dmalabs.net\/blog\/wp-content\/uploads\/2021\/08\/14-1.png\" \/>\n\t<meta property=\"og:image:width\" content=\"1200\" \/>\n\t<meta property=\"og:image:height\" content=\"630\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"DMALabs\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"6 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebSite\",\"@id\":\"https:\/\/dmalabs.net\/blog\/#website\",\"url\":\"https:\/\/dmalabs.net\/blog\/\",\"name\":\"DMALabs Blog\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/dmalabs.net\/blog\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/dmalabs.net\/blog\/is-web-scraping-legal-how-to-scrape-data-legally\/#primaryimage\",\"inLanguage\":\"en-US\",\"url\":\"https:\/\/dmalabs.net\/blog\/wp-content\/uploads\/2021\/08\/14-1.png\",\"contentUrl\":\"https:\/\/dmalabs.net\/blog\/wp-content\/uploads\/2021\/08\/14-1.png\",\"width\":1200,\"height\":630,\"caption\":\"Is web scraping legal?\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/dmalabs.net\/blog\/is-web-scraping-legal-how-to-scrape-data-legally\/#webpage\",\"url\":\"https:\/\/dmalabs.net\/blog\/is-web-scraping-legal-how-to-scrape-data-legally\/\",\"name\":\"Is web scraping legal? How to scrape data legally? - DMALabs Blog\",\"isPartOf\":{\"@id\":\"https:\/\/dmalabs.net\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/dmalabs.net\/blog\/is-web-scraping-legal-how-to-scrape-data-legally\/#primaryimage\"},\"datePublished\":\"2021-08-20T17:44:32+00:00\",\"dateModified\":\"2021-08-20T17:44:34+00:00\",\"author\":{\"@id\":\"https:\/\/dmalabs.net\/blog\/#\/schema\/person\/e9534d3dc5e7385f177a98e6b27e9fe0\"},\"description\":\"Is it actually legal to scrape, collect and process online data that we don\\u2019t owe? 
In this article, we will finally explain it.\",\"breadcrumb\":{\"@id\":\"https:\/\/dmalabs.net\/blog\/is-web-scraping-legal-how-to-scrape-data-legally\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/dmalabs.net\/blog\/is-web-scraping-legal-how-to-scrape-data-legally\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/dmalabs.net\/blog\/is-web-scraping-legal-how-to-scrape-data-legally\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Is web scraping legal? How to scrape data legally?\"}]},{\"@type\":\"Person\",\"@id\":\"https:\/\/dmalabs.net\/blog\/#\/schema\/person\/e9534d3dc5e7385f177a98e6b27e9fe0\",\"name\":\"DMALabs\",\"image\":{\"@type\":\"ImageObject\",\"@id\":\"https:\/\/dmalabs.net\/blog\/#personlogo\",\"inLanguage\":\"en-US\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/a4fe2907cb3e0939f7450aa9dfdf7df365f54cbc03b44a27e6782c5990cde625?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/a4fe2907cb3e0939f7450aa9dfdf7df365f54cbc03b44a27e6782c5990cde625?s=96&d=mm&r=g\",\"caption\":\"DMALabs\"},\"sameAs\":[\"https:\/\/dmalabs.net\/blog\"],\"url\":\"https:\/\/dmalabs.net\/blog\/author\/dmalabs\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Is web scraping legal? How to scrape data legally? - DMALabs Blog","description":"Is it actually legal to scrape, collect and process online data that we don\u2019t owe? In this article, we will finally explain it.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/dmalabs.net\/blog\/is-web-scraping-legal-how-to-scrape-data-legally\/","og_locale":"en_US","og_type":"article","og_title":"Is web scraping legal? 
How to scrape data legally?","og_description":"Is it actually legal to scrape, collect and process online data that we don\u2019t owe? In this article, we will finally explain it.","og_url":"https:\/\/dmalabs.net\/blog\/is-web-scraping-legal-how-to-scrape-data-legally\/","og_site_name":"DMALabs Blog","article_published_time":"2021-08-20T17:44:32+00:00","article_modified_time":"2021-08-20T17:44:34+00:00","og_image":[{"width":1200,"height":630,"url":"https:\/\/dmalabs.net\/blog\/wp-content\/uploads\/2021\/08\/14-1.png","path":"\/var\/www\/public\/blog\/wp-content\/uploads\/2021\/08\/14-1.png","size":"full","id":"82","alt":"Is web scraping legal?","pixels":756000,"type":"image\/png"}],"twitter_card":"summary_large_image","twitter_misc":{"Written by":"DMALabs","Est. reading time":"6 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebSite","@id":"https:\/\/dmalabs.net\/blog\/#website","url":"https:\/\/dmalabs.net\/blog\/","name":"DMALabs Blog","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/dmalabs.net\/blog\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"ImageObject","@id":"https:\/\/dmalabs.net\/blog\/is-web-scraping-legal-how-to-scrape-data-legally\/#primaryimage","inLanguage":"en-US","url":"https:\/\/dmalabs.net\/blog\/wp-content\/uploads\/2021\/08\/14-1.png","contentUrl":"https:\/\/dmalabs.net\/blog\/wp-content\/uploads\/2021\/08\/14-1.png","width":1200,"height":630,"caption":"Is web scraping legal?"},{"@type":"WebPage","@id":"https:\/\/dmalabs.net\/blog\/is-web-scraping-legal-how-to-scrape-data-legally\/#webpage","url":"https:\/\/dmalabs.net\/blog\/is-web-scraping-legal-how-to-scrape-data-legally\/","name":"Is web scraping legal? How to scrape data legally? 
- DMALabs Blog","isPartOf":{"@id":"https:\/\/dmalabs.net\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/dmalabs.net\/blog\/is-web-scraping-legal-how-to-scrape-data-legally\/#primaryimage"},"datePublished":"2021-08-20T17:44:32+00:00","dateModified":"2021-08-20T17:44:34+00:00","author":{"@id":"https:\/\/dmalabs.net\/blog\/#\/schema\/person\/e9534d3dc5e7385f177a98e6b27e9fe0"},"description":"Is it actually legal to scrape, collect and process online data that we don\u2019t owe? In this article, we will finally explain it.","breadcrumb":{"@id":"https:\/\/dmalabs.net\/blog\/is-web-scraping-legal-how-to-scrape-data-legally\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/dmalabs.net\/blog\/is-web-scraping-legal-how-to-scrape-data-legally\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/dmalabs.net\/blog\/is-web-scraping-legal-how-to-scrape-data-legally\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Is web scraping legal? 
How to scrape data legally?"}]},{"@type":"Person","@id":"https:\/\/dmalabs.net\/blog\/#\/schema\/person\/e9534d3dc5e7385f177a98e6b27e9fe0","name":"DMALabs","image":{"@type":"ImageObject","@id":"https:\/\/dmalabs.net\/blog\/#personlogo","inLanguage":"en-US","url":"https:\/\/secure.gravatar.com\/avatar\/a4fe2907cb3e0939f7450aa9dfdf7df365f54cbc03b44a27e6782c5990cde625?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/a4fe2907cb3e0939f7450aa9dfdf7df365f54cbc03b44a27e6782c5990cde625?s=96&d=mm&r=g","caption":"DMALabs"},"sameAs":["https:\/\/dmalabs.net\/blog"],"url":"https:\/\/dmalabs.net\/blog\/author\/dmalabs\/"}]}},"_links":{"self":[{"href":"https:\/\/dmalabs.net\/blog\/wp-json\/wp\/v2\/posts\/81","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dmalabs.net\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dmalabs.net\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dmalabs.net\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/dmalabs.net\/blog\/wp-json\/wp\/v2\/comments?post=81"}],"version-history":[{"count":1,"href":"https:\/\/dmalabs.net\/blog\/wp-json\/wp\/v2\/posts\/81\/revisions"}],"predecessor-version":[{"id":84,"href":"https:\/\/dmalabs.net\/blog\/wp-json\/wp\/v2\/posts\/81\/revisions\/84"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dmalabs.net\/blog\/wp-json\/wp\/v2\/media\/82"}],"wp:attachment":[{"href":"https:\/\/dmalabs.net\/blog\/wp-json\/wp\/v2\/media?parent=81"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dmalabs.net\/blog\/wp-json\/wp\/v2\/categories?post=81"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dmalabs.net\/blog\/wp-json\/wp\/v2\/tags?post=81"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}