cassandra 2.0 - CQL SELECT with lower bound -

- May 15, 2013

Let's say I have Cassandra DB and I need to process a larger group data which I can query with a selection. The problem is that the processing is very slow and I want to use the distributed system to work in.

I know that by using marginal capacity of CLL I can get a limited number of rows, but I will need something like LIMIT and OFFSET so that each process can get an independent share of the data. (What is the offset that will eventually be implemented in the CQ? I have read that it would be inefficient, is the reason for this not applicable?)

I would like to avoid waiting for the end to start the next question For, as suggested in this, these procedures are useless while waiting for the previous questions to be completed.

For example, suppose I would like to process the weather data and for the moment, my table is visible (I could use it to store other data types, such as time For timewid, this is just a dummy problem):

  Make Weather Weather_data (station worker, date varchar, time varchar, value double, primary key (station, date), time);    For a given station and date, I want to make the segment of data (based on time). I think I know how many measures I have for each station and date.  
 If the correct answer is "Changing the structure of the table", then I would be happy to see how to modify it. / P>  
 
  I change my answer because I misunderstood the original problem. What I will do is information about station and date related to other sub-sections, for example, for the day or whatever is the appropriate division for you.  
  Create table Weather_data (station verarchar, date varchar , Dayhour int, time varchar, value double, primary key ((station, date), day, time));    In this way, you can divide your data into 24 parts and allow parallel execution as I said earlier. In this way you can divide only in the first 2 hours for example - The downside is that you will always hit the same nodes, there may be an option to make a primary key:  
  Primary key ((station, date, day-night), time)    This will split your data on a day-to-day basis, if the side effect is given to you from a specific station Needed to get all the measurements from the station You must complete the 24 questions then. The last but least solution can not be denied (arrange the hours to adjust the data in a new table and leave the original).  
 HH, Carlo   

 




  



















Get link





Facebook





X





Pinterest





Email





Other Apps




Comments





Post a Comment



Popular posts from this blog




java - ImportError: No module named py4j.java_gateway -



-



August 15, 2015








    I'm trying to call the Python using the Java program  py4j . I've been installing the plug-in Eclipse and test name Piidvi project. I'm trying to execute the following part of the code found on py4j webpage:  Import from py4j.java_gateway to JavaGateway, java_import gateway = JavaGateway () jvm = gateway.jvm java_import (jvm, '' Org.eclipse Kkorkrisorsej. * ") Vrkspes_rut = Jvankresourkesplginkgetvrkspas (). GetRoot (Gateway .help (workspace_root, '* Projects *') project_names = [project.getName () (for projects workpace_root.getProjects))] print (Projekt_nam)    But I There is an error in import. I have checked that the P4JJ is present in the Jar Eclipse plugin directory. Can anyone help please?      I had to install  the py4j application   





Read more





python - Receiving "KeyError" after decoding json result from url -



-



May 15, 2012








    I am new to Python I am trying to parse JSON result from a URL. Basically, I was using the following:    response = urllib.request.urlopen (url) json_obj = json.load (response)    It should be a stroke "str 'not' bytes' in the lines of a given" JSON object, so after searching on the StackoverView Flo, I decode the response in this way:    F = urllib.request.urlopen (Url) charset = f.info (). Get_param ('charset', 'utf8') data = f.read () decoded = json.loads (data.decode (charset))    If I print "decode" I is as follows:    { 'link': { 'summary data': 'https: // localhost / piwebapi / streams / p0_7qHaW4UHU-RlCaz8tpasAAQAAAAU0hJTExNQU42NDIwXFNJTlVTT0lE / summary' 'value': 'https: // localhost / Piwebapi / streams / P0_7qHaW4UHU-RlCaz8tpasAAQAAAAU0hJTExNQU42NDIwXFNJTlVTT0lE / price ',' InterpolatedData ':' https: // localhost / Piwebapi / streams / P0_7qHaW4UHU-RlCaz8tpasAAQAAAAU0hJTE...





Read more





.net - Creating a new Queue Manager and Queue in Websphere MQ (using
C#) -



-



March 15, 2010








    "itemprop =" text ">  I am writing applications that use WebSphere MQ to send messages. My unittests (for flow), I want to verify that I have put the correct message on the response queue. I am trying to figure out how to do this. My main obstacle is that I think it might be scary to clear the queue before running in my queue because the same queue can be used by other applications, I thought a decent solution would be a new line The manager will remove the queue for my solidarity and after using it. So my question is: Is it possible to use C # by using queue manager and queue?    After the   For future reference and future people, who want to make queues. I thought how to make and remove IBM MQ QE (not quenhers) with PCM messaging. It is not very simple, but it can be done.   We have implemented it in a library and it is being used to create and delete commands before and after integration tests. The most important part of the code in this library is shown in the...





Read more

Search This Blog

States

cassandra 2.0 - CQL SELECT with lower bound -

Comments

Post a Comment

Popular posts from this blog

java - ImportError: No module named py4j.java_gateway -

python - Receiving "KeyError" after decoding json result from url -

.net - Creating a new Queue Manager and Queue in Websphere MQ (using C#) -