Skip to main content

mod rewrite - How to check is canonicalized domains are being used? Apache 301 redirect does not preserve referrer



I have multiple domains which are setup to redirect (301) to my main domain. However I know some of these domains have little to no value in terms of SEO and I would like to get rid of them. But a concern of mine is that there may exist backlinks under these domains.



I checked google analytics and none of these domains came up, but I decided to confirm they would registered if they were used. Unfortunately in testing, my Apache 301 redirect does not seem to preserve the referring URL. I know this is largely dependent on the client, but it seems the consensus is that most of the time this is preserved.




  1. Are there any settings in modern browsers which instruct them to remove a referrer when redirected? I'm getting this behavior in Firefox, Chrome and IE.


  2. Is there anything I can do on the server side which may influence a client to preserve the referrer?

  3. If this is a dead end, what other methods are there to check if there are any backlinks or usages of these aliased domains?



Here is my redirect:



## Redirect non www to www
RewriteCond %{HTTPS} off [OR]
RewriteCond %{HTTP_HOST} !^www\.example\.com$ [NC]
RewriteRule ^(.*)$ https://www.example.com$1 [R=301,L]


Answer



An referrer is not the same as a redirect.



If you call a page e.g. http://www.example.com and on the page you have one or more resource like Images, CSS and JavaScript files, the browser will get them as well. If the Browser do so he send the original page, in our case this is http://www.example.com as a referrer to the server. Even this is optimal all modern browser do so. There is only one exception if the origin page is a https URL but the resources is http.



Now a redirect is something completely different. If you request is going to a server and the server responds with a 301 redirect the browser understand that the location has changed and therefor requesting the new location.



But if a 301 is for a resource (image,css,js,etc.) the refer will usually send again to the new location. The same exception applies here with https and http (see above).




A referrer will not be send by the Browser if a user enter a URL into the browser e.g http://example.com and this URL will be redirected to http://www.example.com, because http://example.com is not the referring page it was just redirected to a new location.



Now to the possible solution: you could add some UTM Parameters to your redirect https://en.wikipedia.org/wiki/UTM_parameters . This will be tract by Google Analytics. So you see if a page was called with this UTM Parameters and that means that it was called by a redirect. You can do statistic of how many times that page is called with this parameter or what source was the most used etc. Of course if someone have disabled JS or have any Anti-Tracking plug in than you will not see this call in your statistic.



## Redirect non www to www
RewriteCond %{HTTPS} off [OR]
RewriteCond %{HTTP_HOST} !^www\.example\.com$ [NC]
RewriteRule ^(.*)$ https://www.example.com$1?utm_source=%{HTTP_HOST}/%{REQUEST_URI}%?{QUERY_STRING}&utm_campaign=redirect [R=301,QSA,L]

Comments

Popular posts from this blog

iLO 3 Firmware Update (HP Proliant DL380 G7)

The iLO web interface allows me to upload a .bin file ( Obtain the firmware image (.bin) file from the Online ROM Flash Component for HP Integrated Lights-Out. ) The iLO web interface redirects me to a page in the HP support website ( http://www.hp.com/go/iLO ) where I am supposed to find this .bin firmware, but no luck for me. The support website is a mess and very slow, badly categorized and generally unusable. Where can I find this .bin file? The only related link I am able to find asks me about my server operating system (what does this have to do with the iLO?!) and lets me download an .iso with no .bin file And also a related question: what is the latest iLO 3 version? (for Proliant DL380 G7, not sure if the iLO is tied to the server model)

linux - Awstats - outputting stats for merged Access_logs only producing stats for one server's log

I've been attempting this for two weeks and I've accessed countless number of sites on this issue and it seems there is something I'm not getting here and I'm at a lost. I manged to figure out how to merge logs from two servers together. (Taking care to only merge the matching domains together) The logs from the first server span from 15 Dec 2012 to 8 April 2014 The logs from the second server span from 2 Mar 2014 to 9 April 2014 I was able to successfully merge them using the logresolvemerge.pl script simply enermerating each log and > out_putting_it_to_file Looking at the two logs from each server the format seems exactly the same. The problem I'm having is producing the stats page for the logs. The command I've boiled it down to is /usr/share/awstats/tools/awstats_buildstaticpages.pl -configdir=/home/User/Documents/conf/ -config=example.com awstatsprog=/usr/share/awstats/wwwroot/cgi-bin/awstats.pl dir=/home/User/Documents/parced -month=all -year=all...

linux - How can I get my mediawiki to stop thinking I have cookies disabled?

I've searched half a day for how to resolve this issue, and can't figure it out. Shortly after I made my wiki a simple private wiki according to the instructions at Mediawiki's website, it started giving me this weird login error message: Wiki uses cookies to log in users. You have cookies disabled. Please enable them and try again. If I remove those private wiki settings, the error disappears, even if I try logging in. But I need it to be a private wiki for only my team. So what do I do? Here's what I've done so far. Just to be safe, after ever change, I try rebooting Apache using: sudo /etc/init.d/apache2 restart In my php.ini file, I have the following set: session.save_path = "/var/lib/php5" session.cookie_secure = secure session.cookie_path = /tmp session.cookie_domain = my server's internal URL (should I even set this? this field was blank before, but not commented out) session.referer_check = Off I ran the following to ensure that the fold...